Supported Versions
Validated on VMware VxRail 7.0 and 8.0.

Introduction

  • The VxRail appliance hardware is composed of Dell EMC PowerEdge servers.
  • The virtualization and software layers are composed of vSphere and vSAN, which is integrated into the vSphere kernel.
  • The VxRail Manager plug-in for vCenter Server allows you to:
    • Manage all VxRail Manager features and configurations.
    • Manage the physical hardware of the VxRail cluster.

Monitoring Use cases

  • Enable end-to-end visibility of the full VxRail HCI stack.
  • Achieve this through a combination of vSphere, vSAN, and VxRail monitoring, all through VMware vCenter.
  • Discover and monitor physical components of VxRail appliances:
    • Chassis
    • Disk drives
    • Boot devices
    • Power Supplies
  • Monitor VxRail software and hardware events through the vCenter event forwarding mechanism.
  • Support multiple VxRail clusters managed through vCenter.

Pre-requisites

  • REST APIs must be enabled on the VxRail Manager appliance.
  • The configured VxRail IP address must be reachable from the OpsRamp gateway.
  • Configuration with the VxRail Manager hostname is not allowed. Use only the VxRail Manager IP address.
  • Virtual machine discovery must be enabled in the OpsRamp portal for VxRail components to be discovered.
  • To update information on the integration page, resave the discovery profile: modify the information, click Save, and then click Update on the discovery profile page to apply the changes to the VxRail configuration.
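
The IP-only and reachability requirements above can be checked before saving the configuration. The following is a minimal sketch, not part of the product: the function names are hypothetical, and port 443 is an assumption (HTTPS); use whichever port your VxRail Manager actually listens on.

```python
import ipaddress
import socket


def is_ip_address(value: str) -> bool:
    """Return True for a literal IPv4/IPv6 address, False for a hostname."""
    try:
        ipaddress.ip_address(value)
        return True
    except ValueError:
        return False


def is_reachable(ip: str, port: int = 443, timeout: float = 3.0) -> bool:
    """Attempt a TCP connection to ip:port (port 443 is an assumption)."""
    try:
        with socket.create_connection((ip, port), timeout=timeout):
            return True
    except OSError:
        return False
```

Run `is_reachable` from the gateway host itself, since reachability from any other machine does not prove the gateway can connect.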

Integration mechanism

  • Integration is done through REST APIs exposed by VxRail Manager.
  • VxRail entities are mapped as components to existing vCenter resources (cluster, host).
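
As an illustration of REST-based integration, the sketch below builds (but does not send) a GET request to a VxRail Manager. It is not the integration's actual implementation. The base path `/rest/vxrail/v1/`, the resource name `hosts`, and the use of HTTP Basic authentication are assumptions; confirm them against the VxRail API documentation for your version. The function name is hypothetical.

```python
import base64
import urllib.request


def build_vxrail_request(manager_ip: str, resource: str,
                         user: str, password: str) -> urllib.request.Request:
    """Build a GET request for a VxRail Manager REST resource."""
    # Assumed base path; verify against your VxRail API guide.
    url = f"https://{manager_ip}/rest/vxrail/v1/{resource.lstrip('/')}"
    # Assumed authentication scheme: HTTP Basic.
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    headers = {
        "Authorization": f"Basic {token}",
        "Accept": "application/json",
    }
    return urllib.request.Request(url, headers=headers, method="GET")
```

The request can then be sent with `urllib.request.urlopen(request, timeout=10)`, and the JSON response parsed with `json.loads`. Note that the IP address, not a hostname, is used in the URL, as the prerequisites require.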

Integration configuration

  1. From All Clients, select a client.
  2. Go to Setup > Account.
  3. Select the Integrations and Apps tab.
  4. The Installed Integrations page appears, displaying all installed applications.
    Note: If there are no installed applications, the Available Integrations and Apps page is displayed instead.
  5. Click + ADD on the Installed Integrations page. The Available Integrations and Apps page displays all available applications, along with the newly created application and its version.
    Note: You can also search for the application using the search option or the All Categories option.
  6. Click ADD in VMware. In the vCenter Plug-ins Configurations section, select the VxRail checkbox and enter the VxRail Manager IP address in the provided field.
  • Discover physical components of VxRail appliances:
    • Network Components
    • Chassis
    • Disk drives
    • Boot devices
  • Discovered VxRail chassis, boot devices, and disks can be seen under Infrastructure → vCenter → VxRail Components.
  • Discovered VxRail network components can be seen under Infrastructure → vCenter → DataCenter → Cluster → Host → Attributes → Hardware → Network Information.
  • Discovered VxRail boot devices and disks can be seen under Infrastructure → vCenter → DataCenter → Cluster → Host → VxRail Components.

Monitoring metrics and Templates

vxrail_host_nics_Link_Status
  • Template name: VxRail Host Template
  • Monitor name: VxRail Hosts Monitor
  • Possible metric values: Down:0, Up:1
  • Thresholds (Warning / Critical): Critical when != 1 (operator: !=)
  • Description: link_status (string): The link status of the NIC.
  • Sample value: "link_status": "Up"

vxrail_host_disk_State
  • Template name: VxRail Host Template
  • Monitor name: VxRail Hosts Monitor
  • Possible metric values: Unknown:0, OK:1
  • Thresholds (Warning / Critical): Critical when != 1 (operator: !=)
  • Description: disk_state (string): Health state of the disk.
  • Sample value: "disk_state": "OK"

vxrail_host_disk_Led_Status
  • Template name: VxRail Host Template
  • Monitor name: VxRail Hosts Monitor
  • Possible metric values: Red:Off:0, Blue:Blink:1, Green:On:2
  • Thresholds (Warning / Critical): Warning when == 1; Critical when == 0 (operator: ==)
  • Description: State of the chassis LED indicator for the host.
  • Sample value: "led_status": "Blue:On"

vxrail_host_boot_device_stats_Health
  • Template name: VxRail Host Template
  • Monitor name: VxRail Hosts Monitor
  • Possible metric values: 0 to 100%
  • Thresholds (Warning / Critical): Critical when != 100 (operator: !=)
  • Description: health (string): Health status of the boot device (percentage).
  • Sample value: "health": "100"

vxrail_host_disk_Write_Endurance
  • Template name: VxRail Host Template
  • Monitor name: VxRail Hosts Monitor
  • Possible metric values: 0 to 100%
  • Thresholds (Warning / Critical): Warning when > 70; Critical when > 90
  • Note: Write endurance is the number of program/erase (P/E) cycles that can be applied to a block of flash memory before the storage media becomes unreliable.
  • Description: write_endurance (string): Write endurance of the disk (usage percentage).
  • Sample value: "write_endurance": "3"

vxrail_host_Operational_Status
  • Template name: VxRail Host Template
  • Monitor name: VxRail Hosts Monitor
  • Possible metric values: Normal:1, Unknown:0
  • Thresholds (Warning / Critical): Critical when != 1 (operator: !=)
  • Description: operational_status (string): Operational status of the host.
  • Sample value: "operational_status": "normal"

vxrail_host_Power_Status
  • Template name: VxRail Host Template
  • Monitor name: VxRail Hosts Monitor
  • Possible metric values: On:1, Off:0
  • Thresholds (Warning / Critical): Critical when != 1 (operator: !=)
  • Description: power_status (string): Power supply status of the host.
  • Sample value: "power_status": "on"

vxrail_host_Health_Status
  • Template name: VxRail Host Template
  • Monitor name: VxRail Hosts Monitor
  • Possible metric values: Healthy:1, Unknown:0
  • Thresholds (Warning / Critical): Critical when != 1 (operator: !=)
  • Description: Health status of the VxRail system. Supported values are Critical, Error, Warning, and Healthy.
  • Sample value: "health": "Healthy"

vxrail_host_powersupply_Health_Status
  • Template name: VxRail Host Template
  • Monitor name: VxRail Hosts Monitor
  • Possible metric values: Healthy:1, Unknown:0
  • Thresholds (Warning / Critical): Critical when != 1 (operator: !=)
  • Description: health (string): Health status of the power supply. Supported values are Critical, Error, Warning, and Healthy.
  • Sample value: "health": "Healthy"

vxrail_cluster_chassis_stats_Health
  • Template name: VxRail Cluster Template
  • Monitor name: VxRail Cluster Monitor
  • Possible metric values: Healthy:1, Unknown:0
  • Thresholds (Warning / Critical): Critical when != 1 (operator: !=)
  • Description: health (string): Health status of the chassis. Supported values are Critical, Error, Warning, and Healthy.
  • Sample value: "health": "Healthy"

vxrail_cluster_Operational_Status
  • Template name: VxRail Cluster Template
  • Monitor name: VxRail Cluster Monitor
  • Possible metric values: ok:1, Unknown:0
  • Thresholds (Warning / Critical): Critical when != 1 (operator: !=)
  • Description: operational_status (string): Operational status information.
  • Sample value: "operational_status": "ok"
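
The value encodings and thresholds listed above can be sketched in code for a few of the metrics. This is an illustrative sketch only, not the monitor's actual implementation. The function names and the "OK" severity label are hypothetical, and mapping any unlisted string (for example "Warning" or "Error") to 0 is an assumption.

```python
# Encodings copied from the metric listing; keys are lower-cased so that
# API values such as "Up", "on", or "normal" match regardless of case.
ENCODINGS = {
    "vxrail_host_nics_Link_Status": {"down": 0, "up": 1},
    "vxrail_host_disk_State": {"unknown": 0, "ok": 1},
    "vxrail_host_Operational_Status": {"unknown": 0, "normal": 1},
    "vxrail_host_Power_Status": {"off": 0, "on": 1},
    "vxrail_host_Health_Status": {"unknown": 0, "healthy": 1},
}


def encode(metric: str, raw: str) -> int:
    """Convert an API string to its numeric metric value.

    Assumption: strings not present in the listing are encoded as 0.
    """
    return ENCODINGS[metric].get(raw.strip().lower(), 0)


def status_severity(value: int) -> str:
    """Apply the 'Critical when != 1' rule used by the status metrics."""
    return "Critical" if value != 1 else "OK"


def write_endurance_severity(raw: str) -> str:
    """Apply 'Warning when > 70' and 'Critical when > 90' to a percentage."""
    pct = float(raw)
    if pct > 90:
        return "Critical"
    if pct > 70:
        return "Warning"
    return "OK"
```

For example, `status_severity(encode("vxrail_host_Power_Status", "off"))` evaluates to "Critical", while `write_endurance_severity("3")` evaluates to "OK". Both thresholds are strict, so a write endurance of exactly 90 yields "Warning" rather than "Critical".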