...In the last post we saw how to configure MPSD (Management Pack for Storage Devices) with vRealize Operations. Lets' see once configured, what do we get out off it.
With the addition of MPSD to vROPS, we get some very useful and simple Dashboards added to the Home View.
For this post, i will showing some Dashboard views of a Hybrid Virtual SAN setup in my lab.
Dashboard 1 : Virtual SAN Troubleshooting
Below are the views from Virtual SAN Troubleshooting Dashboard. Let's have a closer look at them
Clicking any Virtual Machine would highlight the VSAN Datastore its hosted on and the associated membership to other objects in the hierarchy.
If i highlight one of the objects showing a Red Fault (EsxPnic in this case) a respective trendline shows up on the right side, showcasing the pattern and the reason for the fault.
Likewise we can switch to the "Workload" view and highlight a Red colored object reflecting an anomaly.
In this specific case its a Virtual Machine experiencing an increase in Write Latency.
Dashboard 2: Virtual SAN Heatmap
This dashboard provide a heatmap view of the object performance against the predefined limits vs. its own trended behavior in past.
Dashboard 3: Virtual SAN Entity Usage
This dashboard provides extremely useful data points for all Physical subcomponents of Virtual SAN subsystem, i.e Host Adapter, SSD Cache Drive, Mechanical/SSD Capacity Drives.
At a glance you can make out which specific host and it components is more worked than others in the Virtual cluster.
It also gives us host level performance metrics for SSD/Mechanical disk, for both Read/Write IO patterns. Mostly information which contributes to the performance of the Virtual SAN can be found on this page.
Dashboard 4: Virtual SAN Device Insights
This is a very important Dashboard as it provides insights to disk level counters like reported errors, media endurance indicators etc. which determines the health of the disks.
Besides this is gives provide details like Capacity utilization values across hosts. If we notice that the data is not distributed uniformly across hosts, we can initiate a rebalance operation to have data equally spread.
Other counters like CPU/memory Utilization, SSD Cache hit ratios etc. are useful for gauging the overall host performance.
Dashboard 5: Virtual SAN Cluster Insights
This dashboard is suitable to gauge disk group level details holistically. If you want to know that which disk group within a host/cluster has the most amount of data, which is most utilized in catering to the IOs, which has most errors or latency...you can check this dashboard.
Its a good starting point for Virtual SAN troubleshooting at a disk group level.
I believe that this configuration in vRealize Operations is a must for any VMware administrator using Virtual SAN. With the help of this specially crafted Management Pack one can easily isolate problem caused at the Storage subsystem and connected SAN devices.