Agent G2 - Active Directory Free System Page Table Entries DotNet v4
Description
Monitors Active Directory Free System Page Table Entries Counters
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Active Directory Free System Page Table Entries DotNet v4 | AD_Free_System_Page_Table_Entries | FreeSystemPageTableEntries | NULL | Free System Page Table Entries is the number of page table entries not currently in used by the system. This counter displays the last observed value only; it is not an average |
Agent G2 - Active Directory Performance Counters DotNet v4
Description
Monitors AD Performance data
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Active Directory Performance Counters DotNet v4 | DRAInboundObjectsPersec | DRAInboundObjectsPersec | NULL | The number of objects received (per second) through inbound replication from replication partners. |
DSServerBindsPersec | DSServerBindsPersec | NULL | Shows the number of DC-to-DC binds per second that are serviced by this DC. | |
DRAInboundBytesTotalPersec | DRAInboundBytesTotalPersec | Bytes per second | It is the sum of the number of bytes (per second) of uncompressed data (never compressed) and compressed data (after compression) received through replication. Lack of activity indicates that the network is slowing down replication. | |
DRAInboundObjectsAppliedPersec | DRAInboundObjectsAppliedPersec | NULL | This counter excludes changes that are received but not applied (for example, when the update is already made) and also how many replication updates are occurring on the server as a result of changes generated on other servers. | |
ABClientSessions | ABClientSessions | NULL | AB Client Sessions is the number of connected Address Book client sessions. | |
LDAPClientSessions | LDAPClientSessions | NULL | The number of sessions of connected LDAP clients. Lack of activity points to network problems. | |
DSDirectoryReadsPersec | DSDirectoryReadsPersec | NULL | Shows the number of directory reads per second. | |
DRAPendingReplicationSynchronizations | DRAPendingReplicationSynchronizations | NULL | The number of directory synchronizations that are queued for this server that are not yet processed. This counter helps in determining replication backlog - the larger the number, the larger the backlog. This value should be low, with a higher value indicating that the hardware is not adequately servicing replication. | |
NTLMAuthentications | NTLMAuthentications | NULL | The number of NTLM authentications (per second) serviced by this domain controller | |
DSDirectoryWritesPersec | DSDirectoryWritesPersec | NULL | Shows the number of directory writes per second. | |
LDAPActiveThreads | LDAPActiveThreads | NULL | LDAP Active Threads is the current number of threads in use by the LDAP subsystem of the local direcotry service. | |
KerberosAuthentications | KerberosAuthentications | NULL | The number of times per second that clients use a client ticket to this domain controller to authenticate to this domain controller. A lack of activity can indicate network problems that are preventing authentication requests from succeeding. | |
DRAOutboundBytesTotalPersec | DRAOutboundBytesTotalPersec | Bytes per second | It is the sum of the number of bytes of uncompressed data (never compressed) and compressed data (after compression) sent per second. Lack of activity indicates that the hardware or network is slowing down replication. | |
LDAPWritesPersec | LDAPWritesPersec | NULL | Shows the rate at which LDAP clients perform write operations. | |
DSNotifyQueueSize | DSNotifyQueueSize | NULL | The number of pending update notifications that have been queued, but not yet transmitted to clients. | |
DSClientBindsPersec | DSClientBindsPersec | NULL | Shows the number of Ntdsapi.dll binds per second serviced by this DC. | |
LDAPUDPoperationsPersec | LDAPUDPoperationsPersec | NULL | Shows the number of UDP operations that the LDAP server is processing per second. | |
LDAPBindTime | LDAPBindTime | Milliseconds | This counter shows the time required for completion of the last LDAP binding, with a higher value pointing to either hardware or network performance problems. | |
LDAPSearchesPersec | LDAPSearchesPersec | NULL | The number of search operations per second performed by LDAP clients. A lack of activity points to network problems. | |
DRAOutboundObjectsPersec | DRAOutboundObjectsPersec | NULL | The number of objects sent (per second) through outbound replication to replication partners. | |
DRAInboundObjectUpdatesRemaininginPacket | DRAInboundObjectUpdatesRemaininginPacket | NULL | This counter tells you whether the monitored server is receiving changes, but is taking a long time applying them to the database. The value should be low, with a higher value indicating that the hardware is incapable of adequately servicing replication (warranting a server upgrade). |
Agent G2 - AD Database Monitoring - v2
Description
Monitor AD database metrics like DBFileSizeGrowth, DBFile_DiskUsage, DiskHealthStatus, FreeDiskSpace.
Note: Previous version “Agent G2 - AD Database Monitoring” template has a bug at the monitor level. We recommend using the latest template (Agent G2 - AD Database Monitoring - v2)
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - AD Database Custom Monitor - v2 | AD_Database_DBFileSizeGrowth | AD Database DBFile Size Growth | MB | It monitors the growth of the Active Directory database file size. It calculates the delta of the database file size from previous poll to current poll. |
AD_Database_FreeDiskSpace | AD Database FreeDiskSpace | MB | It monitors the free disk space in MB for the drives which are having Active Directory Database file / Log file. | |
AD_Database_DiskHealthStatus | AD Database Disk Health Status | null | It monitors the disk health status of the drive which is having an Active Directory DB File. Below are the possible states: 0 - Healthy 1 - Warning 2 - Unhealthy 3 - Unknown | |
AD_Database_DBFile_DiskUsage | AD Database DBFile DiskUsage | % | It monitors the disk usage% of the drive which is having Active Directory DB file. |
Agent G2 - AD Performance Counters DotNet v4
Description
Agent G2 - AD Performance Counters DotNet v4
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - AD Performance Counters DotNet v4 | directory.services.ldap.successful.binds.per.sec | Directory Services LDAPSuccessfulbindspersec | NULL | Number of LDAP Binds per second |
directory.services.dra.outbound.values.dns.only.per.sec | Directory Services DRAOutboundvaluesdnsonlypersec | NULL | Number of object property values containing Distinguished Names sent to outbound replication partners. DN-values, such as group or distribution list memberships, are generally more expensive to read than other kinds of values | |
directory.services.dra.inbound.values.dns.only.per.sec | Directory Services DRAInboundvaluesdnsonlypersec | NULL | Number of object property values received from inbound replication partners that are Distinguished Names; i.e., that reference other objects. DN-values, such as group or distribution list memberships, are generally more expensive to apply than other kind | |
directory.services.threads.in.use | Directory Services Threadsinuse | NULL | DS Threads in Use is the current number of threads in use by the directory service (different than the number of threads in the directory service process). Threads in Use is the number of threads currently servicing client API calls and can be used to in | |
directory.services.dra.inbound.full.sync.objects.remaining | Directory Services DRAInboundfullsyncobjectsremaining | NULL | Number of objects remaining until the full sync completes (when set) |
Agent G2 - Backup Symantec Exec-11-Performance Counters DotNet v4
Description
Template for Symantec backup exec. Monitors total bytes, total directories and total files. Also performs event log monitoring.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Backup Symantec Exec-11-Performance Counters DotNet v4 | TotalDirectories | TotalDirectories | NULL | The total number of directories that have been backed up since the Backup Exec Engine Service last started. |
TotalFiles | TotalFiles | NULL | The total number of files that have been backed up since the Backup Exec Engine Service last started. | |
TotalBytes | TotalBytes | NULL | The total number of bytes that have been backed up since the Backup Exec Engine Service last started. |
Agent G2 - Backup-Symantec Exec-12.5-Performance Counters DotNet v4
Description
12.5_Symantec_Backup_Exec. Monitors total bytes, total directories and total files.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Backup-Symantec Exec-12.5-Performance Counters DotNet v4 | TotalBytes | TotalBytes | NULL | The total number of bytes that have been backed up since the Backup Exec Engine Service last started. |
FailedJobs | FailedJobs | NULL | The number of jobs that have failed since the Backup Exec Engine Service last started. | |
TotalDirectories | TotalDirectories | NULL | The total number of directories that have been backed up since the Backup Exec Engine Service last started. | |
TotalFiles | TotalFiles | NULL | The total number of files that have been backed up since the Backup Exec Engine Service last started. |
Agent G2 - Backup-Veritas-Job Performance Counters DotNet v4
Description
Monitors the AbortedJobs, ActiveJobCount, BackupDeviceWaitTime, InUseSkippedObjects MountTime, TotalExchangeMailboxes, TotalSQLServerDatabases.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Backup-Veritas-Job Performance Counters DotNet v4 | BackupDeviceWaitTime | BackupDeviceWaitTime | Seconds | The total time (in seconds) all backup jobs have spent waiting for a storage device since the Backup Exec Engine Service last started. |
AbortedJobs | AbortedJobs | NULL | The number of jobs that have been aborted since the Backup Exec Engine Service last started. | |
MountTime | MountTime | Seconds | The total time (in seconds) all jobs have spent waiting for media to be mounted in a storage device since the Backup Exec Engine Service last started. | |
InUseSkippedObjects | InUseSkippedObjects | NULL | The number of objects that have been skipped because they were in use during backup since the Backup Exec Engine Service last started. | |
FailedJobs | FailedJobs | NULL | The number of jobs that have failed since the Backup Exec Engine Service last started. | |
ActiveJobCount | ActiveJobCount | NULL | The number of jobs currently active (running or pending) in the Backup Exec Engine Service. |
Agent G2 - Blackberry 501 Performance Counters DotNet v4
Description
Monitor Blackberry 501 Performance Counters
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Blackberry 501 Performance Counters DotNet v4 | MessagesQueuedForDelivery | MessagesQueuedForDelivery | NULL | Blackberry Agent displays queued messages for delivery |
MessagesExpired | MessagesExpired | NULL | Blackberry Agent displays expired messages | |
MessagesSent | MessagesSent | NULL | Blackberry Agent displays sent messages | |
MessagesReceived | MessagesReceived | NULL | Blackberry Agent displays received messages | |
MessagesFiltered | MessagesFiltered | NULL | Blackberry Agent displays filtered messages |
Agent G2 - Blackberry Enterprise Server
Description
Template for BlackBerry enterprise server. Monitors messages expired, messages filtered, messages queued for delivery, messages received and messages sent. Also performs event log monitoring.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Blackberry Enterprise Server | MessagesQueuedForDelivery | MessagesQueuedForDelivery | NULL | Blackberry Agent displays queued messages for delivery |
MessagesExpired | MessagesExpired | NULL | Blackberry Agent displays expired messages | |
MessagesSent | MessagesSent | NULL | Blackberry Agent displays sent messages | |
MessagesReceived | MessagesReceived | NULL | Blackberry Agent displays received messages | |
MessagesFiltered | MessagesFiltered | NULL | Blackberry Agent displays filtered messages |
Agent G2 - Blackberry Performance Counters DotNet v4
Description
Monitors Blackberry Performance Counters
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Blackberry Performance Counters DotNet v4 | MessagesQueuedForDelivery | MessagesQueuedForDelivery | NULL | Blackberry Agent displays queued messages for delivery |
MessagesExpired | MessagesExpired | NULL | Blackberry Agent displays expired messages | |
MessagesSent | MessagesSent | NULL | Blackberry Agent displays sent messages | |
MessagesReceived | MessagesReceived | NULL | Blackberry Agent displays received messages | |
MessagesFiltered | MessagesFiltered | NULL | Blackberry Agent displays filtered messages |
Agent G2 - Cisco Unity Performance Counters DotNet v4
Description
Monitors Cisco Unity Performance data
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Cisco Unity Performance Counters DotNet v4 | IncomingCallsExternalCurrent | IncomingCallsExternalCurrent | NULL | The current number of incoming calls from external callers. |
PortsIdleCurrent | PortsIdleCurrent | NULL | The current number of integration ports that are not in use by the Cisco Unity Connection server. | |
TTSSessionsPersec | TTSSessionsPersec | NULL | The number of active TTS voice sessions per second. | |
MessageStoresOfflineCurrent | MessageStoresOfflineCurrent | NULL | A current total of Cisco Unity Message Stores that are offline. | |
PortsUsedCurrent | PortsUsedCurrent | NULL | The current number of integration ports that are in use by the Cisco Unity Connection server. | |
AverageAuthenticationTime | AverageAuthenticationTime | NULL | Average time taken to authenticate. | |
TTSSessionDurationAverage | TTSSessionDurationAverage | NULL | The average duration of all TTS sessions in seconds. | |
MessageStoresOnlineCurrent | MessageStoresOnlineCurrent | NULL | A current total of Cisco Unity Message Stores that are online. | |
PortsLockedCount | PortsLockedCount | NULL | The current count of the ports that no longer respond or are otherwise unusable by Cisco Unity Connection | |
UnityMTAMessageCountCurrent | UnityMTAMessageCountCurrent | NULL | The number of messages currently queued in the MTA(Message Transfer Agent). | |
IncomingCallsDurationAverage | IncomingCallsDurationAverage | NULL | The average duration in seconds of all incoming calls to the Cisco Unity Connection server. | |
DirectoryResynchronizationDurationAverage | DirectoryResynchronizationDurationAverage | NULL | The average duration of information directory synchronization, in seconds | |
PortsIdleDurationAverage | PortsIdleDurationAverage | NULL | The average time that any port remains idle between incoming calls to the Cisco Unity Connection server in seconds. | |
OutgoingCallsDurationAverage | OutgoingCallsDurationAverage | NULL | The average duration of all outgoing calls from the Cisco Unity Connection server in seconds. |
Agent G2 - Citrix Broker Agent DotNet v4
Description
Monitors Citrix Broker Agent role performance Counters
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Citrix Broker Agent DotNet v4 | citrix.broker.agent.total.sessions | CitrixBrokerAgent TotalSessions | NULL | Total Number of Sessions |
citrix.broker.agent.num.of.registrations | CitrixBrokerAgent NumberofRegistrations | NULL | Total Number of Registrations | |
citrix.broker.agent.total.app.sessions | CitrixBrokerAgent TotalAppSessions | NULL | Total Number of Seamless App Sessions | |
citrix.broker.agent.total.notifications | CitrixBrokerAgent TotalNotifications | NULL | Total Number of Notifications | |
citrix.broker.agent.num.of.deregistrations | CitrixBrokerAgent NumberofDeregistrations | NULL | Total Number of DeRegistrations | |
citrix.broker.agent.total.desktops.session | CitrixBrokerAgent TotalDesktopsSession | NULL | Total Number of Desktop Sessions |
Agent G2 - Citrix Broker Service DotNet v4
Description
Monitors Citrix Broker Service role performance counters
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Citrix Broker Service DotNet v4 | citrix.broker.service.hard.registrations.per.sec | CitrixBrokerService HardRegistrationsPerSec | NULL | Hard Registrations/sec is the rate at which virtual desktop agents hard-register with Citrix Broker Service |
Agent G2 - Citrix Licensing
Description
Template for Citrix Licensing
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Citrix Licensing | Total_License | Total_License | NULL | The sum of the total license count of the citrix concurrent licenses with the common name available in the citrix licensing server. |
License_Usage | License_Usage | NULL | The sum of the license usage number of the citrix concurrent licenses being used currently with the common name. | |
License_Used_Percentage | License_Used_Percentage | NULL | License percent used is the percentage of the license usage of the citrix concurrent licenses with the common name in the citrix licensing server. | |
License_SA_Expiry_in_days | License_SA_Expiry_in_days | NULL | Provides the number of days remaining to expire subscription advantage (SA) of all the possible licenses of the citrix license server. |
Agent G2 - Citrix Licensing Performance Counters
Description
Monitors Citrix Licensing Performance data
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Citrix Licensing Performance Counters | Total_License | Total_License | NULL | The sum of the total license count of the citrix concurrent licenses with the common name available in the citrix licensing server. |
License_Usage | License_Usage | NULL | The sum of the license usage number of the citrix concurrent licenses being used currently with the common name. | |
License_Used_Percentage | License_Used_Percentage | NULL | License percent used is the percentage of the license usage of the citrix concurrent licenses with the common name in the citrix licensing server. | |
License_SA_Expiry_in_days | License_SA_Expiry_in_days | NULL | Provides the number of days remaining to expire subscription advantage (SA) of all the possible licenses of the citrix license server. |
Agent G2 - Citrix Performance Counters DotNet v4
Description
These performance counters should be used to monitor the key performance metrics of the Citrix infrastructure, application servers, and virtual desktops.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Citrix Performance Counters DotNet v4 | citrix.percent.processor.time | Citrix Percent Processor Time | % | Percent Processor Time is the percentage of elapsed time that the processor spends to execute a non-Idle thread. It is calculated by measuring the duration of the idle thread is active in the sample interval, and subtracting that time from interval duration. (Each processor has an idle thread that consumes cycles when no other threads are ready to run). This counter is the primary indicator of processor activity, and displays the average percentage of busy time observed during the sample interval. It is calculated by monitoring the time that the service is inactive and subtracting that value from 100 Percent |
citrix.logicaldisk.avg.disksecpertransfer | Citrix LogicalDisk Avg DiskSecPerTransfer | MS | The Average Disk Second counters show the average time in seconds of a transfer from or to a disk. | |
citrix.logicaldisk.currentdiskqueuelength | Citrix LogicalDisk CurrentDiskQueueLength | NULL | Current disk queue length provides a primary measure of disk congestion. It is an indication of the number of transactions that are waiting to be processed. | |
citrix.logicaldisk.avg.disksecperwrite | Citrix LogicalDisk Avg DiskSecPerWrite | MS | The Average Disk Second counters show the average time in seconds of a write from or to a disk. | |
citrix.logicaldisk.percent.disktime | Citrix LogicalDisk Percent DiskTime | % | Percent Disk Time marks how busy the disk is. | |
citrix.network.interface.bytestotalpersec | Citrix Network Interface BytesTotalPerSec | NULL | Bytes Total/sec shows the rate at which the network adaptor is processing data bytes. This counter includes all application and file data, in addition to protocol information, such as packet headers. | |
citrix.system.processor.queuelength | Citirx System Processor Queue Length | NULL | Processor queue length is the number of threads in the processor queue. Unlike the disk counters, this counter shows ready threads only, not threads that are running. There is a single queue for processor time even on computers with multiple processors. Therefore, if a computer has multiple processors, you need to divide this value by the number of processors servicing the workload. A sustained processor queue of less than ten threads per processor is normally acceptable, dependent of the workload. | |
citrix.paging.file.percentusage | Citrix Paging File Percent Usage | NULL | This is the percentage amount of the Page File instance in use. | |
citrix.logicaldisk.percent.freespace | Citrix LogicalDisk Percent FreeSpace | % | Percent Free Space is the percentage of total usable space on the selected logical disk drive that is free. | |
citrix.memory.available.bytes | Citrix Memory Available Bytes | % | Available memory indicates the amount of memory that is left after nonpaged pool allocations, paged pool allocations, process working sets, and the file system cache have all taken their piece. | |
citrix.logicaldisk.avg.disksecperread | Citrix LogicalDisk Avg DiskSecPerRead | MS | The Average Disk Second counters show the average time in seconds of a read from or to a disk. |
Agent G2 - Citrix XenApp 7.5 DotNet v4
Description
Monitor Citrix XenApp Server 7.5 version performance counters.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Citrix XenApp 7.5 DotNet v4 | Citrix_Conf_Logging_Database_State | Citrix_Conf_Logging_Database_State | NULL | Database Connected indicates whether this service is in contact with its database (1 is connected; 0 is not connected). |
Agent G2 - Citrix XenApp 7.6 DotNet v4
Description
Monitor Citrix XenApp Server 7.6 version performance counters.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Citrix XenApp 7.6 DotNet v4 | citrix.conf.loggingdatabase.avgtransactiontime | Citirx Conf Logging Database Avg Transaction Time | NULL | The time on average, in seconds, taken to execute a database transaction. A baseline needs to be established in the environment in order to accurately establish threshold values. |
citrix.monitor.database.connected | Citrix Monitor Database Connected | NULL | Database Connected indicates whether this service is in contact with its database (1 is connected; 0 is not connected) | |
citrix.conf.loggingdatabase.transactionerrorspersec | Citrix Conf Logging Database Transaction Errors Per sec | NULL | The rate at which database transactions are failing. | |
citrix.env.test.database.connected | Citrix Env Test Database Connected | NULL | Database Connected indicates whether this service is in contact with its database (1 is connected; 0 is not connected) |
Agent G2 - Citrix XenApp Advanced Performance Check
Description
XenApp Advanced Monitoring
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Citrix XenApp Advanced Performance Check | XADisconnectedSessions | XADisconnectedSessions | Count | Lists sessions in disconnected state. This helps identify if there are any sporadic disconnects from the server. |
XAOfflineServers | XAOfflineServers | Count | A server can be taken offline either for maintenance or because the Server is unstable and is removed from available servers. These servers will not service any user requests. | |
XAServerLoad | XAServerLoad | NULL | This monitor displays the load on each Citrix XenApp Servers. These load values can identify issues with your servers in addition to determining which server is the least/most loaded in your farm. A trending value of this monitor helps identify peak usage of the servers. A value greater than 9000 indicates a heavily loaded server | |
XAServerApplication | XAServerApplication | Count | Number of applications hosted on each server. This help you decide if the load is distributed evenly across all your servers. | |
XASessions | XASessions | Count | Number of active sessions per server. |
Agent G2 - Citrix XenApp Performance Check DotNet v4
Description
XenApp Standard Monitoring
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Citrix XenApp Performance Check DotNet v4 | LicenseServerConnectionFailure | LicenseServerConnectionFailure | NULL | The number of minutes that the XenApp server has been disconnected from the License Server. The time returned should be less than 30 minutes. Ideally the returned time should be zero. |
WorkItemQueueExecutingCount | WorkItemQueueExecutingCount | NULL | The number of work items that are ready to be executed. | |
DataStorewritesPersec | DataStorewritesPersec | NULL | The number of times data was written to the data store per second. | |
ZoneElectionsTriggered | ZoneElectionsTriggered | NULL | The number of times a server triggers a zone election. | |
STATicketTimeoutCount | STATicketTimeoutCount | NULL | The total number of ticket time-outs that occur during the lifetime of the STA. | |
AverageLicenseCheckInResponseTimems | AverageLicenseCheckInResponseTimems | NULL | This component monitor returns the average response time for a license check-in operation in milliseconds. | |
LatencySessionDeviation | LatencySessionDeviation | NULL | This component monitor returns the difference between the minimum and the maximum session latency values. This value should be as low as possible. | |
BytesReceivedPersec | Bytes Received Per sec | NULL | This component monitor returns the data rate of incoming Independent Management Architecture(IMA) network traffic. | |
NetworkConnections | NetworkConnections | NULL | This component monitor returns the number of active network IMA connections to IMA servers. | |
LocalHostCachewritesPersec | LocalHostCachewritesPersec | NULL | The number of times data was written to the IMA local host cache per second. | |
ResolutionWorkItemQueueExecutingCount | ResolutionWorkItemQueueExecutingCount | NULL | The number of work items that are currently being executed. | |
WorkItemQueuePendingCount | WorkItemQueuePendingCount | NULL | The number of work items that are not yet ready to be executed. | |
NumberofbusyXMLthreads | NumberofbusyXMLthreads | NULL | The number of busy threads. | |
DataStorereadsPersec | DataStorereadsPersec | NULL | The number of times data was read from the data store per second. | |
CPUEntitlement | CPUEntitlement | NULL | The percentage of CPU resource that Citrix CPU Utilization Management makes available to a user at a given time. | |
CPUUsage | CPUUsage | NULL | The percentage of CPU resource consumed by a user at a given time averaged over a few seconds. | |
STAPeakTicketRequestRate | STAPeakTicketRequestRate | NULL | The maximum rate of ticket generation requests per second during the lifetime of the STA. | |
LongtermCPUUsage | LongtermCPUUsage | NULL | The percentage of CPU resource consumed by a user averaged over a longer period than the CPU Usage counter. | |
ApplicationResolutionsPersec | ApplicationResolutionsPersec | NULL | The number of resolutions completed per second. | |
DynamicStorewritesPersec | DynamicStorewritesPersec | NULL | The number of times data was written to the dynamic store per second. | |
ApplicationEnumerationsPersec | ApplicationEnumerationsPersec | NULL | Enumeration is the process in which a client transmits data to locate servers on the network and retrieves information about the server farms published applications. During enumeration the XenApp Plug-in for Hosted Apps communicates with the Citrix XML Service or the ICA browser depending on the browsing protocol selected in the plug-in. This monitor provides the number of application enumerations per second. | |
DataStoreConnectionFailure | DataStoreConnectionFailure | NULL | The number of minutes that the XenApp server has been disconnected from the data store. This value should be zero at all times. | |
LastRecordedLicenseCheckOutResponseTimems | LastRecordedLicenseCheckOutResponseTimems | NULL | The last recorded license check-out response time in milliseconds. | |
WorkItemQueueReadyCount | WorkItemQueueReadyCount | NULL | The number of work items that are not yet ready to be executed. | |
STAPeakAllRequestRate | STAPeakAllRequestRate | NULL | Secure Ticket Authority (STA) is responsible for issuing session tickets in response to connection requests for published resources on XenApp. These session tickets form the basis of authentication and authorization for access to published resources. | |
LatencySessionAverage | LatencySessionAverage | NULL | The average client latency over the lifetime of a session. | |
LatencyLastRecorded | LatencyLastRecorded | NULL | This component monitor returns the last recorded latency value of the session. | |
ICARoundtripLatencyMedian | ICARoundtripLatencyMedian | NULL | The median time of ICA roundtrip latency for all sessions on the server. | |
BytesSentPersec | BytesSentPersec | NULL | This component monitor returns the data rate of outgoing IMA network traffic. | |
STAPeakTicketRefreshRate | STAPeakTicketRefreshRate | NULL | The maximum rate of refresh requests per second during the lifetime of the STA. | |
ResolutionWorkItemQueueReadyCount | ResolutionWorkItemQueueReadyCount | NULL | The number of work items that are ready to be executed. | |
CPUReservation | CPUReservation | NULL | The percentage of total computer CPU resource reserved for a user, should that user require it. | |
CPUShares | CPUShares | NULL | The proportion of CPU resource assigned to a user. | |
LocalHostCachereadsPersec | LocalHostCachereadsPersec | NULL | The number of times data was read from the IMA local host cache per second. | |
MaximumnumberofXMLthreads | MaximumnumberofXMLthreads | NULL | The maximum number of threads allocated to service Web-based sessions since the server restarted. | |
AverageLicenseCheckOutResponseTimems | AverageLicenseCheckOutResponseTimems | NULL | This component monitor returns the average response time for a license check-out operation in milliseconds. | |
ApplicationResolutionsFailedPersec | ApplicationResolutionsFailedPersec | NULL | The number of application resolutions failed per second. | |
DynamicStorereadsPersec | DynamicStorereadsPersec | NULL | The number of times data was read from the dynamic store per second. | |
STAPeakDataRequestRate | STAPeakDataRequestRate | NULL | The maximum rate of data requests per second during the lifetime of the STA. | |
ZoneElectionsWon | ZoneElectionsWon | NULL | The number of times a server wins a zone election. | |
ApplicationResolutionTimems | ApplicationResolutionTimems | ms | The time in milliseconds that a resolution took to complete. A baseline would be needed in order to establish increases during peak logon times before an accurate threshold can be defined. |
Agent G2 - Citrix XenDesktop Advanced Performance Check
Description
Citrix XenDesktop advanced monitoring based on performance counters.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Citrix XenDesktop Advanced Performance Check | DesktopsWithRegistrationStateAsAgentError | DesktopsWithRegistrationStateAsAgentError | NULL | Lists all Virtual Desktops which have not registered with the controller due to Agent Error. The state of being in communication is referred to as the VDA being registered with a controller. If communication fails for any reason the VDA is said to have failed to register with a controller and it will not be possible for DDC to broker a connection to the VDM in question; the VDM becomes a wasted resource. |
DesktopsUnRegistered | DesktopsUnRegistered | NULL | Lists all Virtual Desktops in the Unregistered state. The state of being in communication is referred to as the VDA being registered with a controller. If communication fails for any reason the VDA is said to have failed to register with a controller and it will not be possible for DDC to broker a connection to the VDM in question; the VDM becomes a wasted resource. | |
DesktopsWithUnknownPowerState | DesktopsWithUnknownPowerState | NULL | The Virtual desktop power state and issuing of power commands to the VM depends on the DDC services being able to communicate with the hypervisor that hosts the VM. The unknown power state is a result of the DDC never having received a notification of the power state of the VM from the hypervisor (hence state unknown). | |
DesktopsNeverRegistered | DesktopsNeverRegistered | NULL | Virtual desktop machine which never registered with the Desktop Controllers. An unregistered desktop cannot be used by an end-user. | |
TotalDesktops | TotalDesktops | NULL | The total number of desktops in a given Desktop Group. | |
BrokerHypervisorAlertSeverityRed | BrokerHypervisorAlertSeverityRed | NULL | Lists the current alerts objects reported by the hypervisors that the controller is monitoring. | |
DesktopsFacingICALatency | DesktopsFacingICALatency | NULL | Desktop Receiver on the client PCs communicates with the Virtual Desktop Agent(VDA) on the Virtual Desktops using the ICA (Independent Computing Architecture). High latency on this communication channel would result in slowness/freezing of sessions. | |
DesktopsDisconnected | DesktopsDisconnected | NULL | Number of Virtual Desktops whose summary state is in disconnected state. | |
DesktopsFacingHighProfileLoadTime | DesktopsFacingHighProfileLoadTime | NULL | Desktop Receiver on the client PCs communicates with the Virtual Desktop Agent(VDA) on the Virtual Desktops using the ICA (Independent Computing Architecture). Profile Load time is directly impacted with the profile size. This results in users experiencing logon delays. | |
DesktopsAvailable | DesktopsAvailable | NULL | The number of desktops available in a desktop group. | |
DesktopGroupsUsage | DesktopGroupsUsage | NULL | % of virtual desktop machines used within a group. | |
DesktopsInUse | DesktopsInUse | NULL | The number of desktops currently in use in a given Desktop Group. | |
ActiveSessions | ActiveSessions | NULL | The number of Desktop sessions currently streamed. | |
DesktopsinMaintenanceMode | DesktopsinMaintenanceMode | NULL | Putting a desktop in maintenance mode temporarily stops connections to the desktop so that maintenance tasks can be carried out. A user trying to connect to a desktop in maintenance mode will receive a message telling them the desktop is currently unavailable and to try reconnecting. XenDesktop has no control over desktops in maintenance mode. No user can log on to a desktop in this state. If a user is already logged on, maintenance mode takes effect as soon as they log off. | |
DesktopsWithImageOutOfDate | DesktopsWithImageOutOfDate | NULL | Shows Desktop Groups that have desktops with Out Of Date Images |
Agent G2 - Citrix XenDesktop Performance Counters
Description
Monitors the XenDesktop System, Power status, Available system count, Delivery Group Desktops available count, unregistered count along with the Controller details like State, Services 7 Licensing details etc., Applicable on the XenApp/Xendesktop 7.x versions.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Citrix XenDesktop Performance Counters | broker.desktop.group.desktops.available | BrokerDesktopGroup DesktopsAvailable | % | Number of available desktops |
broker.desktop.group.desktops.disconnected | BrokerDesktopGroup DesktopsDisconnected | % | Number of disconnected desktops | |
broker.catalog.available.count | BrokerCatalog AvailableCount | NULL | Number of available systems | |
broker.desktop.group.desktops.unregistered | BrokerDesktopGroup DesktopsUnregistered | % | Number of unregistered desktops | |
broker.desktop.group.desktops.never.registered | BrokerDesktopGroup Desktops NeverRegistered | % | Number of desktops never registered | |
broker.desktop.group.desktops.preparing | BrokerDesktopGroup DesktopsPreparing | % | Number of desktops in preparing state |
Agent G2 - Citrix XenDesktop Status and Performance Check DotNet v4
Description
Applicable on XenDesktop - Desktop Delivery Controller
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Citrix XenDesktop Status and Performance Check DotNet v4 | RegistrationRequestsPersec | RegistrationRequestsPersec | NULL | Registration Requests/sec is the rate at which Citrix Broker Service receives registration requests from virtual desktops. |
RegistrationAvgRequestTime | RegistrationAvgRequestTime | NULL | Registration Avg. Request Time is the time on average in seconds taken to process a virtual desktop registration request in Citrix Broker Service. This delay is a one time daily expense | |
BrokeredSessions | BrokeredSessions | NULL | Brokered Sessions is the number of virtual desktop sessions brokered by the Citrix Broker Service. |
Agent G2 - CPU - Run Queue Monitor DotNet v4
Description
Number of threads in queue waiting for processor time. Threshold for this metric is based on the number of processors on the system. Ideal value range from one to three threads per processor.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - CPU - Run Queue Monitor DotNet v4 | cpu.run.queue.monitor | CPU Run Queue Monitor | NULL | Number of threads in queue waiting for processor time. Threshold for this metric is based on the number of processors on the system. Ideal value range from one to three threads per processor. |
Agent G2 - DB-Oracle DotNet v4
Description
Monitors Oracle Performance Counters
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - DB-Oracle DotNet v4 | Oracle_UserRollbacks | Oracle User Rollbacks | Count | Validates the Number of User Roll backs. |
Oracle_DataFileDiskWrites | Oracle Data File Disk Writes | Count | Validates the Number of Data file disk writes to database. | |
Oracle_CacheInvalidations | Oracle Cache Invalidations | Count | Validates the how many Cache invalidations on particular database. | |
Oracle_TableSpaceFree | Oracle Table Space Free | MB | Validates free table space available | |
Oracle_UsersCommit | Oracle Users Commit | Count | Validates Number of User commits | |
Oracle_TableSpaceAllocated | Oracle Table Space Allocated | MB | Validates the Size allocated for the table by database. | |
Oracle_Sessions | Oracle Sessions | Count | Validates how many sessions currently on particular database. | |
Oracle_DataFileDiskReads | Oracle Data File Disk Reads | Count | Validates the Number of Data file disk reads by database. | |
Oracle_LibraryCacheReloads | Oracle Library Cache Reloads | Count | Validates the Number of Library Cache Reloads by database. | |
Oracle_DataFilelesizeAllocated | Oracle Data File size Allocated | MB | Validates the Data File Size Allocated for the database. | |
Oracle_LibraryCacheGets | Oracle Library Cache Gets | Count | Validates the Number of Library Cache gets by database. | |
Oracle_LongRunningQueries | Oracle Long Running Queries | Count | Validates the how many long running queries on particular database. | |
Oracle_TablescanBlocks | Oracle Table scan Blocks | Count | Validates the Number of Table scan blocks by database. | |
Oracle_BlockingLockQueries | Oracle Blocking Lock Queries | Count | Validates the how many block lock queries on particular database. | |
oracle_processes | Oracle Processes | Count | Validates the how many processes on particular database. |
Agent G2 - Dell Hardware Health - WMI DotNet v4
Description
Monitors the Dell hardware health parameters like CPU Status, Memory status, Fan status, Fan reading, Temperature status, Temperature reading, Power consumption watts sensor status, Power consumption watts sensor reading, Power consumption amps sensor status, Power consumption amps sensor reading, voltage status and voltage reading.
Prerequisites
Validated on Dell PowerEdge R710, Microsoft Windows Server 2008 R2 Standard Edition Service Pack 1, 64-bit.
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Dell Hardware Health - WMI DotNet v4 | dell.voltage.sensor.reading | Voltage Reading | NULL | Dell Voltage sensor current reading in millivolts. |
dell.temperature.reading | Temperature Reading | NULL | Dell Temperature sensor current reading in centigrade. | |
dell.power.consumption.ampssensor.reading | Power Consumption Amps Sensor Reading | NULL | Dell Power consumption amps sensor current reading. |
Agent G2 - DFS NameSpace Replication Performance Counters DotNet v4
Description
DFS Monitoring - Namespace and Replication Check
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - DFS NameSpace Replication Performance Counters DotNet v4 | dfsnamespaceserviceapirequests.requestsprocessed | DFSNamespaceServiceAPIRequests RequestsProcessed | NULL | Requests Processed shows the number of requests to one API that were processed by the DFS Namespace service. |
dfsreplicatedfolders.deletedspaceinuse | DFSReplicatedFolders DeletedSpaceInUse | NULL | Deleted Space in Use shows the total size (in bytes) of the deleted files and folders currently in the Conflict and Deleted folder used by the DFS Replication service. The DFS Replication service detects remote deletes from its sending partner and moves the file or folder to the Conflict and Deleted folder. The service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota. | |
dfsreplicatedfolders.deletedfilesgenerated | DFSReplicatedFolders DeletedFilesGenerated | NULL | Deleted Files Generated shows the number of replicated deleted files and folders that were moved to the Conflict and Deleted folder after they were deleted from a replicated folder on a sending member. The DFS Replication service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota. | |
dfsreplicationconnections.totalfilesreceived | DFSReplicationConnections TotalFilesReceived | NULL | Total Files Received shows the number of files that were received on the connection. | |
dfsreplicatedfolders.conflictfoldercleanupscompleted | DFSReplicatedFolders ConflictFolderCleanupsCompleted | NULL | Conflict Folder Cleanups Completed shows the number of times conflict loser files and folders in the Conflict and Deleted folder were deleted by the DFS Replication service. The DFS Replication service automatically detects and resolves conflicts encountered in replicated folders and moves the losing version to the Conflict and Deleted folder. The service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota. | |
dfsreplicationconnections.rdcbytesreceived | DFSReplicationConnections RDCBytesReceived | NULL | RDC Bytes Received shows the bytes that were received on this connection while replicating files using remote differential compression (RDC). This is the actual bytes received over the network without the networking protocol overhead. | |
dfsreplicatedfolders.stagingspaceinuse | DFSReplicatedFolders StagingSpaceInUse | NULL | Staging Space In Use shows the total size (in bytes) of the files and folders currently in the staging folder used by the DFS Replication service. This counter will fluctuate as staging space is reclaimed. The DFS Replication service stages files and folders in the staging folder before they are replicated, and automatically cleans up the staging folder when it exceeds a pre-configured threshold of the quota. | |
dfsreplicationconnections.sizeoffilesreceived | DFSReplicationConnections SizeofFilesReceived | NULL | Size of Files Received shows the uncompressed size (in bytes) of the files received on this connection. This is the number of bytes that would have been received had DFS Replication compression not been used. | |
dfsnamespaceservicereferrals.requestsprocessedpersec | DFSNamespaceServiceReferrals RequestsProcessedPersec | NULL | Requests Per Sec. shows the number of referral requests per second that were processed by the DFS Namespace service. | |
dfsreplicationservicevolumes.usnjournalrecordsread | DFSReplicationServiceVolumes USNJournalRecordsRead | NULL | USN Journal Records Read shows the number of update sequence number (USN) journal records that were read by the DFS Replication service. | |
dfsreplicatedfolders.deletedbytesgenerated | DFSReplicatedFolders DeletedBytesGenerated | NULL | Deleted Bytes Generated shows the total size (in bytes) of replicated deleted files and folders that were moved to the Conflict and Deleted folder after they were deleted from a replicated folder on a sending member. The DFS Replication service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota. | |
dfsreplicatedfolders.stagingfilescleanedup | DFSReplicatedFolders StagingFilesCleanedup | NULL | Staging Files Cleaned up shows the number of files and folders that were cleaned up from the staging folder by the DFS Replication service. The DFS Replication service stages files and folders in the staging folder before they are replicated, and automatically cleans up the staging folder when it exceeds a pre-configured threshold of the quota. | |
dfsreplicationconnections.rdcsizeoffilesreceived | DFSReplicationConnections RDCSizeofFilesReceived | NULL | RDC Size of Files Received shows the uncompressed size (in bytes) of files received with remote differential compression (RDC) for this connection. This is the number of bytes that would have been received had neither compression nor RDC been used. This is not the actual number of bytes received over the network. | |
dfsreplicatedfolders.conflictspaceinuse | DFSReplicatedFolders ConflictSpaceInUse | NULL | Conflict Space in Use shows the total size (in bytes) of the conflict loser files and folders currently in the Conflict and Deleted folder used by the DFS Replication service. The DFS Replication service automatically detects and resolves conflicts encountered in replicated folders and moves the losing version to the Conflict and Deleted folder. The service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota. | |
dfsreplicatedfolders.rdcbytesreceived | DFSReplicatedFolders RDCBytesReceived | NULL | RDC Bytes Received shows the number of bytes that were received in replicating files using remote differential compression (RDC) for this replicated folder. This is the actual bytes received over the network without the networking protocol overhead. | |
dfsreplicationconnections.compressedsizeoffilesreceived | DFSReplicationConnections CompressedSizeofFilesReceived | NULL | Compressed Size of Files Received shows the compressed size of files (in bytes) received on the connection. | |
dfsreplicatedfolders.conflictbytesgenerated | DFSReplicatedFolders ConflictBytesGenerated | NULL | Conflict Bytes Generated shows the total size (in bytes) of the files and folders in this replicated folder that were moved to the Conflict and Deleted folder by the DFS Replication service. The DFS Replication service automatically detects and resolves conflicts encountered in replicated folders and moves the losing version to the Conflict and Deleted folder. The service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota. | |
dfsreplicatedfolders.rdcnumberoffilesreceived | DFSReplicatedFolders RDCNumberofFilesReceived | NULL | RDC Number of Files Received shows the number files that were received for this replicated folder. | |
dfsreplicatedfolders.conflictfilescleanedup | DFSReplicatedFolders ConflictFilesCleanedup | NULL | Conflict Files Cleaned up shows the number the conflict loser files and folders that were deleted from the Conflict and Deleted folder by the DFS Replication service. The DFS Replication service automatically detects and resolves conflicts encountered in replicated folders and moves the losing version to the Conflict and Deleted folder. The service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota. | |
dfsreplicatedfolders.fileinstallssucceeded | DFSReplicatedFolders FileInstallsSucceeded | NULL | File Installs Succeeded shows the number of files that were successfully received from sending members and installed locally on this server. The DFS Replication service replicates staged files into the staging folder, uncompresses them in the Installing folder, and renames them to the target location. The second and third steps of this process are known as installing the file. | |
dfsreplicatedfolders.conflictbytescleanedup | DFSReplicatedFolders ConflictBytesCleanedup | NULL | Conflict Bytes Cleaned up shows the total size (in bytes) of the conflict loser files and folders that were deleted from the Conflict and Deleted folder by the DFS Replication service. The DFS Replication service automatically detects and resolves conflicts encountered in replicated folders and moves the losing version to the Conflict and Deleted folder. The service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota. | |
dfsreplicatedfolders.stagingbytescleanedup | DFSReplicatedFolders StagingBytesCleanedup | NULL | Staging Bytes Cleaned up shows the total size (in bytes) of the files and folders that were cleaned up from the staging folder by the DFS Replication service. The DFS Replication service stages files and folders in the staging folder before they are replicated, and automatically cleans up the staging folder when it exceeds a pre-configured threshold of the quota. | |
dfsreplicationservicevolumes.databasecommits | DFSReplicationServiceVolumes DatabaseCommits | NULL | Database Commits shows the number of database commit operations performed by the DFS Replication service. This counter indicates how intensive the DFS Replication service is from a database perspective. | |
dfsreplicatedfolders.fileinstallsretried | DFSReplicatedFolders FileInstallsRetried | NULL | File Installs Retried shows the number of file installs that are being retried due to sharing violations or other errors encountered when installing the files. The DFS Replication service replicates staged files into the staging folder, uncompresses them in the Installing folder, and renames them to the target location. The second and third steps of this process are known as installing the file. | |
dfsreplicatedfolders.stagingfilesgenerated | DFSReplicatedFolders StagingFilesGenerated | NULL | Staging Files Generated shows the number of times replicated files and folders were staged by the DFS Replication service. The DFS Replication service stages files and folders in a staging folder before they are replicated, and automatically cleans up the staging folder when it exceeds a pre-configured threshold of the quota. | |
dfsreplicatedfolders.totalfilesreceived | DFSReplicatedFolders TotalFilesReceived | NULL | Total Files Received shows the number of files that were received by this replicated folder. | |
dfsnamespaceservicereferrals.requestsprocessed | DFSNamespaceServiceReferrals RequestsProcessed | NULL | Requests Processed shows the number of referral requests that were processed by the DFS Namespace service. | |
dfsreplicatedfolders.deletedbytescleanedup | DFSReplicatedFolders DeletedBytesCleanedup | NULL | Deleted Bytes Cleaned up shows the total size (in bytes) of replicating deleted files and folders (in bytes) that were cleaned up from the Conflict and Deleted folder by the DFS Replication service. The service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota. | |
dfsreplicatedfolders.rdccompressedsizeoffilesreceived | DFSReplicatedFolders RDCCompressedSizeofFilesReceived | NULL | RDC Compressed Size of Files Received shows the compressed size (in bytes) of the files received with remote differential compression (RDC) for this replicated folder. This is the number of bytes that would have been received had RDC not been used. This is not the actual bytes received over the network. | |
dfsreplicationconnections.bytesreceivedpersecond | DFSReplicationConnections BytesReceivedPerSecond | NULL | Bytes Received Per Second shows an estimate of the average number of bytes that were received each second over the past 30 seconds. | |
dfsreplicationservicevolumes.usnjournalunreadpercentage | DFSReplicationServiceVolumes USNJournalUnreadPercentage | NULL | USN Journal Unread Percentage shows the percent of the update sequence number (USN) journal that has not yet been read and processed by the DFS Replication service. A journal wrap will occur if this counter reaches 100. | |
dfsreplicatedfolders.compressedsizeoffilesreceived | DFSReplicatedFolders CompressedSizeofFilesReceived | NULL | Compressed Size of Files Received shows the compressed size of files (in bytes) received for this replicated folder. | |
dfsreplicatedfolders.deletedfilescleanedup | DFSReplicatedFolders DeletedFilesCleanedup | NULL | Deleted Files Cleaned up shows the number of replicated deleted files and folders that were cleaned up from the Conflict and Deleted folder by the DFS Replication service. The service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota. | |
dfsnamespaceservicereferrals.requestsfailed | DFSNamespaceServiceReferrals RequestsFailed | NULL | Requests Failed shows the number of referral requests that were failed by the DFS Namespace service. | |
dfsreplicatedfolders.updatesdropped | DFSReplicatedFolders UpdatesDropped | NULL | Updates Dropped shows the number of redundant file replication update records that were ignored by the DFS Replication service because they did not change the replicated file or folder. For example, dropped updates can occur when access control lists (ACLs) are overwritten with identical ACLs on a file or folder. | |
dfsreplicationconnections.rdccompressedsizeoffilesreceived | DFSReplicationConnections RDCCompressedSizeofFilesReceived | NULL | Compressed Size of Files Received shows the compressed size (in bytes) of files received for this replicated folder. | |
dfsnamespaceserviceapirequests.requestsfailed | DFSNamespaceServiceAPIRequests RequestsFailed | NULL | Requests Failed shows the number of requests to one API that were failed by the DFS Namespace service. | |
dfsreplicatedfolders.bandwidthsavingsusingdfsreplication | DFSReplicatedFolders BandwidthSavingsUsingDFSReplication | NULL | Bandwidth Savings Using DFS Replication shows the percentage of bandwidth that was saved by the DFS Replication service for this replicated folder using a combination of remote differential compression (RDC) and other compression technologies that minimize network bandwidth. For example, a value of 20 indicates that the DFS Replication service used 20% less bandwidth than it would have used if it had transmitted the entire files uncompressed over the network. | |
dfsnamespaceservicereferrals.avgresponsetime | DFSNamespaceServiceReferrals AvgResponseTime | NULL | Avg Response Time shows the average response time to the referral requests that were processed by the DFS Namespace service. | |
dfsreplicationconnections.rdcnumberoffilesreceived | DFSReplicationConnections RDCNumberofFilesReceived | NULL | RDC Number of Files Received shows the number files that were received on this connection. | |
dfsreplicationconnections.bandwidthsavingsusingdfsreplication | DFSReplicationServiceVolumes BandwidthSavingsUsingDFSReplication | NULL | Bandwidth Savings Using DFS Replication shows the percentage of bandwidth that was saved by the DFS Replication service for this connection using a combination of remote differential compression (RDC) and other compression technologies that minimize network bandwidth use. For example, a value of 20 indicates that the DFS Replication service used 20% less bandwidth than it would have used if it had transmitted the entire files uncompressed over the network. | |
dfsnamespaceserviceapirequests.requestsprocessedpersec | DFSNamespaceServiceAPIRequests RequestsProcessedPersec | NULL | Requests Per Sec. Rate shows the number of API requests per second that were processed by the DFS Namespace service. | |
dfsreplicationconnections.totalbytesreceived | DFSReplicationConnections TotalBytesReceived | NULL | Total Bytes Received shows the total number of bytes received on the connection. The bytes received value includes file data and replication metadata. | |
dfsreplicatedfolders.conflictfilesgenerated | DFSReplicatedFolders ConflictFilesGenerated | NULL | Conflict Files Generated shows the number of files and folders in this replicated folder that were moved to the Conflict and Deleted folder by the DFS Replication service. The DFS Replication service automatically detects and resolves conflicts encountered in replicated folders and moves the losing version to the Conflict and Deleted folder. The service automatically cleans up the Conflict and Deleted folder when it exceeds a pre-configured threshold of the quota. | |
dfsreplicationservicevolumes.usnjournalrecordsaccepted | DFSReplicationServiceVolumes USNJournalRecordsAccepted | NULL | USN Journal Records Accepted shows the number of update sequence number (USN) journal records that were processed by the DFS Replication service. The DFS Replication service processes all USN journal records for replicated content on a volume and ignores records for non-replicated files and folders on the volume. | |
dfsnamespaceserviceapirequests.avgresponsetime | DFSNamespaceServiceAPIRequests AvgResponseTime | NULL | Avg Response Time shows the average response time to the requests to one API that were processed by the DFS Namespace service. | |
dfsnamespace.foldercount | DFSNamespace FolderCount | NULL | Folder count shows the number of DFS folders or links in a namespace. | |
dfsreplicatedfolders.sizeoffilesreceived | DFSReplicatedFolders SizeofFilesReceived | NULL | Size of Files Received shows the uncompressed size (in bytes) of the files received for this replicated folder. This is the number of bytes that would have been received had DFS Replication compression not been used. | |
dfsreplicatedfolders.rdcsizeoffilesreceived | DFSReplicatedFolders RDCSizeofFilesReceived | NULL | RDC Size of Files Received shows the uncompressed size (in bytes) of the files received with remote differential compression (RDC) for this replicated folder. This is the number of bytes that would have been received had neither compression nor RDC been used. This is not the actual bytes received over the network. | |
dfsreplicationservicevolumes.databaselookups | DFSReplicationServiceVolumes DatabaseLookups | NULL | Database Lookups shows the number of database search operations performed by the DFS Replication service This counter indicates how intensive the DFS Replication service is from a database perspective. | |
dfsreplicatedfolders.stagingbytesgenerated | DFSReplicatedFolders StagingBytesGenerated | NULL | Staging Bytes Generated shows the total size (in bytes) of replicated files and folders in the staging folder created by the DFS Replication service since last restart and is monotonically increasing counter. The DFS Replication service stages files and folders in the staging folder before they are replicated, and automatically cleans up the staging folder when it exceeds a pre-configured threshold of the quota. |
Agent G2 - Disk Performance Monitoring DotNet v4
Description
Disk Performance Monitoring
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Disk Performance Monitoring DotNet v4 | AverageDiskWriteQueueLength | AverageDiskWriteQueueLength | NULL | Avg. Disk Write Queue Length is the average number of write requests that were queued for the selected disk during the sample interval. |
AvgDisksecPerRead | AvgDisksecPerRead | NULL | Avg. Disk sec/Read is the average time, in seconds, of a read of data from the disk. | |
PercentDiskTime | PercentDiskTime | NULL | % Disk Time is the percentage of elapsed time that the selected disk drive was busy servicing read or write requests. | |
AverageDiskReadQueueLength | AverageDiskReadQueueLength | NULL | Avg. Disk Read Queue Length is the average number of read requests that were queued for the selected disk during the sample interval. | |
AvgDisksecPerWrite | AvgDisksecPerWrite | NULL | Avg. Disk sec/Write is the average time, in seconds, of a write of data to the disk. | |
AvgDiskBytesPerTransfer | AvgDiskBytesPerTransfer | NULL | Avg. Disk Bytes/Transfer is the average number of bytes transferred to or from the disk during write or read operations. The disk is efficient if it transfers large amounts of data relatively quickly. | |
AverageDiskQueueLength | AverageDiskQueueLength | NULL | Avg. Disk Queue Length is the average number of both read and write requests that were queued for the selected disk during the sample interval. |
Agent G2 - DNS Performance Counters DotNet v4
Description
Monitors DNS performance counters
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - DNS Performance Counters DotNet v4 | dns.dynamic.update.requests.received | DNS DynamicUpdateRequestsReceived | NULL | Dynamic Update Received is the total number of dynamic update requests received by the DNS server. |
dns.total.queries.received.per.sec | DNS TotalQueriesReceivedPerSec | NULL | Total Query Received/sec is the average number of queries received by DNS server in each second. | |
dns.dynamic.update.requests.empty.persec | DNS DynamicUpdateRequestsEmptyPerSec | NULL | Dynamic Update NoOperation/sec is the average number of No-operation/Empty dynamic update requests received by the DNS server in each second. | |
dns.nb.stat.memory.used | DNS NbStatMemoryUsed | NULL | Nbstat Memory is the total Nbstat memory used by DNS server. | |
dns.udp.queries.received.per.sec | DNS UDPQueriesReceivedPerSec | NULL | UDP Query Received/sec is the average number of UDP queries received by DNS server in each second. | |
dns.zone.transfers.success | DNS ZoneTransfersSuccess | NULL | Zone Transfer Success is the total number of successful zone transfers of the master DNS server. | |
dns.tcp.queries.received.per.sec | DNS TCPQueriesReceivedPerSec | NULL | TCP Query Received/sec is the average number of TCP queries received by DNS server in each second. | |
dns.caching.memory.used | DNS CachingMemoryUsed | NULL | Caching Memory is the total caching memory used by DNS server. | |
dns.secure.updates.failed | DNS SecureUpdatesFailed | NULL | Secure Update Failure is the total number of secure updates failed of the DNS server. | |
dns.tcp.message.memory.used | DNS TCPMessageMemoryUsed | NULL | TCP Message Memory is the total TCP message memory used by DNS server. | |
dns.database.node.memory.used | DNS DatabaseNodeMemoryUsed | NULL | Database Node Memory is the total database node memory used by DNS server. | |
dns.dynamic.update.requests.rejected | DNS DynamicUpdateRequestsRejected | NULL | Dynamic Update Rejected is the total number of dynamic updates rejected by the DNS server. | |
dns.dynamic.updates.written.to.database | DNS DynamicUpdateswrittentodatabase | NULL | Dynamic Update Written to Database is the total number of dynamic updates written to the database by the DNS server. | |
dns.secure.update.requests.received | DNS SecureUpdateRequestsReceived | NULL | Secure Update Received is the total number of secure update requests received by the DNS server. | |
dns.udp.responses.sent.per.sec | DNS UDPResponsesSentPerSec | NULL | UDP Response Sent/sec is the average number of UDP reponses sent by DNS server in each second. | |
dns.udp.message.memory.used | DNS UDPMessageMemoryUsed | NULL | UDP Message Memory is the total UDP message memory used by DNS server. | |
dns.record.flow.memory.used | DNS RecordFlowMemoryUsed | NULL | Record Flow Memory is the total record flow memory used by DNS server. | |
dns.tcp.responses.sent.per.sec | DNS TCPResponsesSentPerSec | NULL | TCP Response Sent/sec is the average number of TCP reponses sent by DNS server in each second. | |
dns.zone.transfers.failed | DNS ZoneTransfersFailed | NULL | Zone Transfer Failure is the total number of failed zone transfers of the master DNS server. | |
dns.total.responses.sent.per.sec | DNS TotalResponsesSentPerSec | NULL | Total Response Sent/sec is the average number of reponses sent by DNS server in each second. | |
dns.dynamic.update.timeouts | DNS DynamicUpdateTimeouts | NULL | Dynamic Update TimeOuts is the total number of dynamic update timeouts of the DNS server. |
Agent G2 - DNS Recursive Performance Counters DotNet v4
Description
Monitors DNS Recursive Class WMI Performance data
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - DNS Recursive Performance Counters DotNet v4 | DNS_RecursiveTimeOutPerSec | DNS_RecursiveTimeOutPerSec | NULL | Provides the average number of recursive query sending timeouts in each second. |
DNS_RecursiveQueryFailurePerSec | DNS_RecursiveQueryFailurePerSec | NULL | Provides the average number of recursive query failures in each second. | |
DNS_RecursiveQueriesPerSec | DNS_RecursiveQueriesPerSec | NULL | Provides the average number of recursive queries received by DNS server in each second. |
Agent G2 - Fujitsu PRIMERGY Health - Windows
Description
Monitor the following metrics for Fujitsu PRIMERGY Health on Windows servers: fujitsu_primergy_host_raidController_healthState, fujitsu_primergy_host_raidController_primaryStatus, fujitsu_primergy_diskDrive_healthState, fujitsu_primergy_diskDrive_primaryStatus, fujitsu_primergy_temperature_healthState, fujitsu_primergy_temperature_currentState, fujitsu_primergy_temperature_currentReading, fujitsu_primergy_voltage_healthState, fujitsu_primergy_voltage_currentState, fujitsu_primergy_voltage_currentReading, fujitsu_primergy_powerSupply_healthState, fujitsu_primergy_powerConsumptionSensor_healthState, fujitsu_primergy_powerConsumptionSensor_currentState, fujitsu_primergy_fan_healthState, fujitsu_primergy_fan_sensor_healthState, fujitsu_primergy_fan_sensor_currentState, fujitsu_primergy_fan_sensor_currentReading, fujitsu_primergy_managementController_healthState, fujitsu_primergy_processor_healthState, fujitsu_primergy_memory_healthState, fujitsu_primergy_cache_memory_healthState, fujitsu_primergy_physical_memory_healthState
Note: This template will be applicable only for the Agent version 14.0.0 or later.
Prerequisites
This template will be applicable only for the Agent version 14.0.0 or later.
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Fujitsu PRIMERGY Health - Windows | fujitsu_primergy_host_raidController_healthState | fujitsu_primergy_host_raidController_healthState | null | Monitors Fujitsu PGY Raid Controller Current Health State. The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional.. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok |
fujitsu_primergy_host_raidController_primaryStatus | fujitsu_primergy_host_raidController_primaryStatus | null | Monitors Fujitsu PGY Host Raid Controller PrimaryStatus, high-level status value intended to align with Red-Yellow-Green type representation of status. Supported values: 0: Unknown, 2: Warning - Warning; 3: Error - Critical; 1: OK - Ok | |
fujitsu_primergy_diskDrive_primaryStatus | fujitsu_primergy_diskDrive_primaryStatus | null | Monitors Fujitsu PGY DiskDrive Primary Status, high-level status value intended to align with Red-Yellow-Green type representation of status. Supported values: 0: Unknown, 2: Warning - Warning; 3: Error - Critical; 1: OK - Ok | |
fujitsu_primergy_temperature_healthState | fujitsu_primergy_temperature_healthState | null | Monitors Fujitsu PGY Temperature HealthState.The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional.. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok | |
fujitsu_primergy_temperature_currentState | fujitsu_primergy_temperature_currentState | null | Current state indicated by the Sensor. Lower Critical, Upper Critical, Critical - Critical; Unknown, Upper Non-Critical, Non-Critical - Warning; Normal - Ok | |
fujitsu_primergy_temperature_currentReading | fujitsu_primergy_temperature_currentReading | C | Current value indicated by the Sensor. | |
fujitsu_primergy_voltage_healthState | fujitsu_primergy_voltage_healthState | null | Monitors Fujitsu PGY Voltage HealthState.The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional.. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok | |
fujitsu_primergy_voltage_currentState | fujitsu_primergy_voltage_currentState | null | Current state indicated by the Sensor. Lower Critical, Upper Critical, Critical - Critical; Unknown, Upper Non-Critical, Non-Critical - Warning; Normal - Ok | |
fujitsu_primergy_voltage_currentReading | fujitsu_primergy_voltage_currentReading | v | Current value indicated by the Sensor | |
fujitsu_primergy_powerSupply_healthState | fujitsu_primergy_powerSupply_healthState | null | Monitors Fujitsu PGY PowerSupply HealthState.The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional.. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok | |
fujitsu_primergy_powerConsumptionSensor_healthState | fujitsu_primergy_powerConsumptionSensor_healthState | null | Monitors Fujitsu PGY Consumption Sensor HealthState.The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional.. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok | |
fujitsu_primergy_powerConsumptionSensor_currentState | fujitsu_primergy_powerConsumptionSensor_currentState | null | Current state indicated by the Sensor. Lower Critical, Upper Critical, Critical - Critical; Unknown, Upper Non-Critical, Non-Critical - Warning; Normal - Ok | |
fujitsu_primergy_fan_healthState | fujitsu_primergy_fan_healthState | null | Monitors Fujitsu PGY Fan HealthState.The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional.. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok | |
fujitsu_primergy_fan_sensor_healthState | fujitsu_primergy_fan_sensor_healthState | null | Monitors Fujitsu PGY Fan Sensor HealthState.The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional.. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok | |
fujitsu_primergy_fan_sensor_currentState | fujitsu_primergy_fan_sensor_currentState | null | Current state indicated by the Sensor. Lower Critical, Upper Critical, Critical - Critical; Unknown, Upper Non-Critical, Non-Critical - Warning; Normal - Ok | |
fujitsu_primergy_fan_sensor_currentReading | fujitsu_primergy_fan_sensor_currentReading | rpm | Current value indicated by the Sensor | |
fujitsu_primergy_managementController_healthState | fujitsu_primergy_managementController_healthState | null | Monitors Fujitsu PGY Management Controller HealthState.The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional.. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok | |
fujitsu_primergy_processor_healthState | fujitsu_primergy_processor_healthState | null | Monitors Fujitsu PGY Processor HealthState.The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional.. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok | |
fujitsu_primergy_memory_healthState | fujitsu_primergy_memory_healthState | null | Monitors Fujitsu PGY Memory HealthState.The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional.. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok | |
fujitsu_primergy_physical_memory_healthState | fujitsu_primergy_physical_memory_healthState | null | Monitors Fujitsu PGY Physical Memory HealthState.The possible values are 0 to 30, where 5 means that the object is entirely healthy and 30 means that the object is completely non-functional.. Supported values: 0: Unknown, 10: Degraded / Warning - Warning; 15: Predictive Failure, 20: Major failure, 25: Critical failure, 30: Non-functional - Critical; 5: OK - Ok |
Agent G2 - Fujitsu PRIMERGY Health - Linux
Description
Template for Linux environment to monitor Fujitsu PRIMERGY Health Monitoring metrics like: Fujitsu PGY Host Raid Controller HealthState, Fujitsu PGY Host Raid Controller PrimaryStatus, Fujitsu PGY DiskDrive HealthState, Fujitsu PGY DiskDrive PrimaryStatus, Fujitsu PGY Temperature HealthState, Fujitsu PGY Temperature CurrentState, Fujitsu PGY Temperature CurrentReading, Fujitsu PGY Voltage HealthState, Fujitsu PGY Voltage CurrentState, Fujitsu PGY Voltage CurrentReading, Fujitsu PGY PowerSupply HealthState, Fujitsu PGY Power Consumption Sensor HealthState, Fujitsu PGY Power Consumption Sensor CurrentState, Fujitsu PGY Fan HealthState, Fujitsu PGY Fan Sensor HealthState, Fujitsu PGY Fan Sensor CurrentState, Fujitsu PGY Fan Sensor CurrentReading, Fujitsu PGY Management Controller HealthState, Fujitsu PGY Processor HealthState, Fujitsu PGY Memory HealthState, Fujitsu PGY Cache Memory HealthState, Fujitsu PGY Physical Memory HealthState.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Fujitsu PRIMERGY Health - Linux | fujitsu_pgy_management_controller_health_state | PGY Management Controller Health State | NULL | PGY Management Controller Health State |
fujitsu_pgy_temperature_health_state | PGY Temperature Health State | NULL | PGY Temperature Health State | |
fujitsu_pgy_powerConsumption_sensor_health_state | PGY Power Consumption Health State | NULL | PGY Power Consumption Health State | |
fujitsu_pgy_voltage_health_state | PGY Voltage Health State | NULL | PGY Voltage Health State | |
fujitsu_pgy_diskdrive_health_state | PGY Disk Drive Health State | NULL | PGY Disk Drive Health State | |
fujitsu_pgy_temperature_current_state | PGY Temperature Current State | NULL | PGY Temperature Current State | |
fujitsu_pgy_fan_sensor_health_state | PGY Fan Sensor Health State | NULL | PGY Fan Sensor Health State | |
fujitsu_pgy_powersupply_health_state | PGY Power Supply Health State | NULL | PGY Power Supply Health State | |
fujitsu_pgy_cache_memory_health_state | PGY Cache Memory Health State | NULL | PGY Cache Memory Health State | |
fujitsu_pgy_diskdrive_primary_status | PGY Disk Drive Primary Status | NULL | PGY Disk Drive Primary Status | |
fujitsu_pgy_voltage_current_value | PGY Voltage Current Value | v | PGY Voltage Current Value | |
fujitsu_pgy_memory_health_state | PGY Memory Health State | NULL | PGY Memory Health State | |
fujitsu_pgy_fan_sensor_current_state | PGY Fan Sensor Current State | NULL | PGY Fan Sensor Current State | |
fujitsu_pgy_voltage_current_state | PGY Voltage Current State | NULL | PGY Voltage Current State | |
fujitsu_pgy_temperature_value | PGY Temperature Value | NULL | PGY Temperature Value | |
fujitsu_pgy_fan_sensor_current_value | PGY Fan Sensor Current Value | rpm | PGY Fan Sensor Current Value | |
fujitsu_pgy_powerConsumption_sensor_current_state | PGY Power Consumption Current State | NULL | PGY Power Consumption Current State | |
fujitsu_pgy_physical_memory_health_state | PGY Physical Memory Health State | NULL | PGY Physical Memory Health State | |
fujitsu_pgy_host_raidcontroller_health_state | PGY Host Raid Controller Health State | NULL | PGY Host Raid Controller Health State | |
fujitsu_pgy_host_raidcontroller_primary_status | PGY Host Raid Controller Primary Status | NULL | PGY Host Raid Controller Primary Status | |
fujitsu_pgy_fan_health_state | PGY Fan Health State | NULL | PGY Fan Health State | |
fujitsu_pgy_processor_health_state | PGY Processor Health State | NULL | PGY Processor Health State |
Agent G2 - Fujitsu PRIMERGY Health - Windows - DotNet v4
Description
Template for Windows environment (for .NET v4 or later) to monitor Fujitsu PRIMERGY Health Monitoring metrics like: Fujitsu PGY Host Raid Controller HealthState, Fujitsu PGY Host Raid Controller PrimaryStatus, Fujitsu PGY DiskDrive HealthState, Fujitsu PGY DiskDrive PrimaryStatus, Fujitsu PGY Temperature HealthState, Fujitsu PGY Temperature CurrentState, Fujitsu PGY Temperature CurrentReading, Fujitsu PGY Voltage HealthState, Fujitsu PGY Voltage CurrentState, Fujitsu PGY Voltage CurrentReading, Fujitsu PGY PowerSupply HealthState, Fujitsu PGY Power Consumption Sensor HealthState, Fujitsu PGY Power Consumption Sensor CurrentState, Fujitsu PGY Fan HealthState, Fujitsu PGY Fan Sensor HealthState, Fujitsu PGY Fan Sensor CurrentState, Fujitsu PGY Fan Sensor CurrentReading, Fujitsu PGY Management Controller HealthState, Fujitsu PGY Processor HealthState, Fujitsu PGY Memory HealthState, Fujitsu PGY Cache Memory HealthState, Fujitsu PGY Physical Memory HealthState.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Fujitsu PRIMERGY Health - Windows - DotNet v4 | fujitsu.pgy.temperature.currentreading | Fujitsu PGY Temperature CurrentReading | NULL | Current value indicated by the Sensor |
fujitsu.pgy.voltage.currentreading | Fujitsu PGY Voltage CurrentReading | NULL | Current value indicated by the Sensor | |
fujitsu.pgy.fan.sensor.currentreading | Fujitsu PGY Fan Sensor CurrentReading | NULL | Current value indicated by the Sensor |
Agent G2 - HyperV processorratio
Description
HyperV processorratio monitor Template
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - HyperV processorratio | Hyperv_VirtualtoLogicalProcessorRatio | Hyperv_VirtualtoLogicalProcessorRatio | NULL | Shows the count ratio of HyperV Virtual and Logical Processors. |
Agent G2 - K8s ApiServer Requests Advanced Metrics
Description
Monitors Apiserver Request Total and Apiserver Request Duration Seconds Bucket (verb = get, put, post) Metrics.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - K8s ApiServer Requests Advanced Metrics | apiserver.post.requests.duration.seconds | Kube apiserver Post Requests Duration Seconds | Seconds | Time taken for POST API responses. |
apiserver.get.requests.duration.seconds | Kube apiserver Get Requests Duration Seconds | Seconds | Time taken for GET API responses. | |
apiserver.requests.total.success.rate | Kube apiserver Requests Success Rate | Percentage | Percentage of success API requests. | |
apiserver.put.requests.duration.seconds | Kube apiserver Put Requests Duration Seconds | Seconds | Time taken for PUT API responses. |
Agent G2 - Kubernetes Pods Monitor
Description
Monitors pods of different kinds of kubernetes like Deployment, Daemonset, Replicasets and Statefulsets
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - k8spodsmonitor | kubernetes_statefulset_pods_running_percentage | StatefulSet Pods Running Percentage | Percentage | Percentage of pods running by desired pods of Statefulset. |
kubernetes_daemonset_pods_running_percentage | Daemonset Pods Running Percentage | Percentage | Percentage of pods running by desired pods of Daemonset. | |
kubernetes_deployment_pods_running_percentage | Deployment Pods Running Percentage | Percentage | Percentage of pods running by desired pods of Deployment. | |
kubernetes_daemonset_pods_running | Daemonset Pods Running | Count | The number of nodes that should be running the daemon pod and have one or more of the daemon pod running and ready. | |
kubernetes_statefulset_pods_running_percentage | StatefulSet Pods Running Percentage | Percentage | Percentage of pods running by desired pods of Statefulset. | |
kubernetes_daemonset_pods_desired | Daemonset Pods Desired | Count | The number of nodes that should be running the daemon pod. | |
kubernetes_statefulset_pods_desired | StatefulSet Pods Desired | Count | Number of desired pods for a StatefulSet. | |
kubernetes_deployment_pods_running | Deployment Pods Running | Count | The number of ready replicas per deployment. | |
kubernetes_replicaset_pods_desired | Replicaset Pods Desired | Count | Number of desired pods for a ReplicaSet. | |
kubernetes_statefulset_pods_desired | StatefulSet Pods Desired | Count | Number of desired pods for a StatefulSet. | |
kubernetes_deployment_pods_running_percentage | Deployment Pods Running Percentage | Percentage | Percentage of pods running by desired pods of Deployment. | |
kubernetes_deployment_pods_running | Deployment Pods Running | Count | The number of ready replicas per deployment. | |
kubernetes_statefulset_pods_running | StatefulSet Pods Running | Count | The number of ready replicas per StatefulSet. | |
kubernetes_daemonset_pods_desired | Daemonset Pods Desired | Count | The number of nodes that should be running the daemon pod. | |
kubernetes_replicaset_pods_running | Replicaset Pods Running | Count | The number of ready replicas per ReplicaSet. | |
kubernetes_deployment_pods_desired | Deployment Pods Desired | Count | Number of desired pods for a deployment. | |
kubernetes_replicaset_pods_desired | Replicaset Pods Desired | Count | Number of desired pods for a ReplicaSet. | |
kubernetes_daemonset_pods_running | Daemonset Pods Running | Count | The number of nodes that should be running the daemon pod and have one or more of the daemon pod running and ready. | |
kubernetes_deployment_pods_desired | Deployment Pods Desired | Count | Number of desired pods for a deployment. | |
kubernetes_replicaset_pods_running | Replicaset Pods Running | Count | The number of ready replicas per ReplicaSet. | |
kubernetes_replicaset_pods_running_percentage | Replicaset Pods Running Percentage | Percentage | Percentage of pods running by desired pods of Replicaset | |
kubernetes_replicaset_pods_running_percentage | Replicaset Pods Running Percentage | Percentage | Percentage of pods running by desired pods of Replicaset. | |
kubernetes_daemonset_pods_running_percentage | Daemonset Pods Running Percentage | Percentage | Percentage of pods running by desired pods of Daemonset. | |
kubernetes_statefulset_pods_running | StatefulSet Pods Running | Count | The number of ready replicas per StatefulSet. |
Agent G2 - Linux - ActiveMQ Monitors
Description
Monitors ActiveMQ application metrics
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - ActiveMQ Monitors | activemq.broker.memory.prct | ActiveMQ-BrokerMemoryPercentUsage | NULL | The percent of memory limit used. |
activemq.broker.temp.prct | ActiveMQ-BrokerTempPercentUsage | NULL | The space used by the store for temporary messages. | |
activemq.broker.store.prct | ActiveMQ-BrokerStorePercentUsage | NULL | The space used by the Message Store. | |
activemq.jvm.gc.collection_count | ActiveMQ-JVM.GC.collection_count | NULL | Number of garbage objects collected | |
activemq.queue.consumer_count | ActiveMQ-ConsumerCount | NULL | Number of consumers subscribed to this destination | |
activemq.queue.inflight_count | ActiveMQ-InFlightCount | NULL | Number of messages sent to a destination and have not received an acknowledgement. | |
activemq.queue.producer_count | ActiveMQ-ProducerCount | NULL | Number of producers | |
activemq.queue.enqueue_count | ActiveMQ-EnqueueCount | NULL | Number of messages that have been sent to the destination since the last restart. | |
activemq.queue.dequeue_count | ActiveMQ-DequeueCount | NULL | Number of messages that have been acknowledged (and removed) from the destination since last restart. | |
activemq.queue.size | ActiveMQ-QueueSize | NULL | The number of messages that currently reside in the queue. Potentially dispatched but unacknowledged. | |
activemq.jvm.threads | ActiveMQ-JVM.Threads | NULL | Number of threads. | |
activemq.jvm.uptime | ActiveMQ-Uptime | NULL | Uptime of the server | |
activemq.jvm.mem.non_heap_committed | ActiveMQ-JVM.Mem.non_heap_committed | NULL | Non-heap memory committed (in MB) for the server | |
activemq.jvm.open.fds | ActiveMQ-JVM.OpenFDs | NULL | Number of Open file descriptors of the server | |
activemq.jvm.mem.heap_used | ActiveMQ-JVM.Mem.heap_used | NULL | Heap memory usage (in MB) of the server | |
activemq.queue.avg_enqueuetime | ActiveMQ-AverageEnqueueTime | NULL | On average, the amount of time (ms) that messages remained enqueued. Or average time it is taking the consumers to successfully process messages. | |
activemq.queue.memory.prct | ActiveMQ-QueueMemoryPercentUsage | NULL | The percentage of the memory limit used by queues. | |
activemq.jvm.mem.non_heap_used | ActiveMQ-JVM.Mem.non_heap_used | NULL | Non-heap memory usage (in MB) of the server | |
activemq.queue.dispatch_count | ActiveMQ-DispatchCount | NULL | Number of messages that have been dispatched (Dequeue + Inflight). | |
activemq.queue.expired_count | ActiveMQ-ExpiredCount | NULL | Number of messages that were expired. | |
activemq.jvm.gc.collection_time | ActiveMQ-JVM.GC.collection_time | NULL | Time taken for collection of the garbage objects. | |
activemq.queue.max_enqueuetime | ActiveMQ-MaxEnqueueTime | NULL | The maximum amount of time that messages remained enqueued. | |
activemq.jvm.mem.heap_committed | ActiveMQ-JVM.Mem.heap_committed | NULL | Heap memory committed (in MB) for the server |
Agent G2 - Linux - ActiveMQ Performance Check
Description
Monitors ActiveMQ application metrics uses MBeans exposed via the JMX console. Please refer to OpsRamp documentation on how to enable JMX on your application.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - ActiveMQ Performance Check | activemq.broker.memory.prct | ActiveMQ-BrokerMemoryPercentUsage | NULL | The percent of memory limit used. |
activemq.broker.temp.prct | ActiveMQ-BrokerTempPercentUsage | NULL | The space used by the store for temporary messages. | |
activemq.broker.store.prct | ActiveMQ-BrokerStorePercentUsage | NULL | The space used by the Message Store. | |
activemq.jvm.gc.collection_count | ActiveMQ-JVM.GC.collection_count | NULL | Number of garbage objects collected | |
activemq.queue.consumer_count | ActiveMQ-ConsumerCount | NULL | Number of consumers subscribed to this destination | |
activemq.queue.inflight_count | ActiveMQ-InFlightCount | NULL | Number of messages sent to a destination and have not received an acknowledgement. | |
activemq.queue.producer_count | ActiveMQ-ProducerCount | NULL | Number of producers | |
activemq.queue.enqueue_count | ActiveMQ-EnqueueCount | NULL | Number of messages that have been sent to the destination since the last restart. | |
activemq.queue.dequeue_count | ActiveMQ-DequeueCount | NULL | Number of messages that have been acknowledged (and removed) from the destination since last restart. | |
activemq.queue.size | ActiveMQ-QueueSize | NULL | The number of messages that currently reside in the queue. Potentially dispatched but unacknowledged. | |
activemq.jvm.threads | ActiveMQ-JVM.Threads | NULL | Number of threads. | |
activemq.jvm.uptime | ActiveMQ-Uptime | NULL | Uptime of the server | |
activemq.jvm.mem.non_heap_committed | ActiveMQ-JVM.Mem.non_heap_committed | NULL | Non-heap memory committed (in MB) for the server | |
activemq.jvm.open.fds | ActiveMQ-JVM.OpenFDs | NULL | Number of Open file descriptors of the server | |
activemq.jvm.mem.heap_used | ActiveMQ-JVM.Mem.heap_used | NULL | Heap memory usage (in MB) of the server | |
activemq.queue.avg_enqueuetime | ActiveMQ-AverageEnqueueTime | NULL | On average, the amount of time (ms) that messages remained enqueued. Or average time it is taking the consumers to successfully process messages. | |
activemq.queue.memory.prct | ActiveMQ-QueueMemoryPercentUsage | NULL | The percentage of the memory limit used by queues. | |
activemq.jvm.mem.non_heap_used | ActiveMQ-JVM.Mem.non_heap_used | NULL | Non-heap memory usage (in MB) of the server | |
activemq.queue.dispatch_count | ActiveMQ-DispatchCount | NULL | Number of messages that have been dispatched (Dequeue + Inflight). | |
activemq.queue.expired_count | ActiveMQ-ExpiredCount | NULL | Number of messages that were expired. | |
activemq.jvm.gc.collection_time | ActiveMQ-JVM.GC.collection_time | NULL | Time taken for collection of the garbage objects. | |
activemq.queue.max_enqueuetime | ActiveMQ-MaxEnqueueTime | NULL | The maximum amount of time that messages remained enqueued. | |
activemq.jvm.mem.heap_committed | ActiveMQ-JVM.Mem.heap_committed | NULL | Heap memory committed (in MB) for the server |
Agent G2 - Linux - Apache CouchDB Monitors
Description
Monitors CouchDB application metrics using the CouchDB’s API which is normally accessible via http://127.0.0.1:5984/
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Apache CouchDB Monitors | couchdb.httpd_status_codes.412 | CouchDB-Httpd412Status | NULL | Number of HTTP 412 Precondition Failed responses |
couchdb.httpd_status_codes.404 | CouchDB-Httpd404Status | NULL | Number of HTTP 404 Not Found responses | |
couchdb.httpd.temporary_view_reads | CouchDB-HttpTemporaryViewReads | NULL | Number of temporary view reads | |
couchdb.request_time | CouchDB-RequestTime | ms | Length of a request inside CouchDB without MochiWeb | |
couchdb.httpd_status_codes.202 | CouchDB-Httpd202Status | Null | Number of HTTP 202 Accepted responses | |
couchdb.disk_size | CouchDB-DiskSize | NULL | Total size Disk for all dbs | |
couchdb.database_reads | CouchDB-DatabaseReads | NULL | Number of times a document was read from a database | |
couchdb.databases_open | CouchDB-OpenDatabases | NULL | Number of open databases | |
couchdb.httpd_request_methods_delete | CouchDB-DELETERequests | NULL | Number of HTTP DELETE requests | |
couchdb.httpd_status_codes.403 | CouchDB-Httpd403Status | NULL | Number of HTTP 403 Forbidden responses | |
couchdb.httpd_status_codes.201 | CouchDB-Httpd201Status | NULL | Number of HTTP 201 Created responses | |
couchdb.httpd.clients_requesting_changes | CouchDB-HttpClientsRequestingChanges | NULL | Number of clients for continuous _changes | |
couchdb.database_writes | CouchDB-DatabaseWrites | NULL | Number of times a database was changed | |
couchdb.open_fds | CouchDB-OpenFDs | NULL | Number of file descriptors CouchDB has open | |
couchdb.httpd_status_codes.405 | CouchDB-Httpd405Status | NULL | Number of HTTP 405 Method Not Allowed responses | |
couchdb.httpd_request_methods_head | CouchDB-HEADRequests | NULL | Number of HTTP HEAD requests | |
couchdb.httpd.requests | CouchDB-HttpRequests | NULL | Number of HTTP requests | |
couchdb.httpd_status_codes.301 | CouchDB-Httpd301Status | NULL | Number of HTTP 301 Moved Permanently responses | |
couchdb.httpd_request_method_post | CouchDB-POSTRequests | NULL | Number of HTTP POST requests | |
couchdb.cache_hits | CouchDB-CacheHits | NULL | Number of authentication cache hits | |
couchdb.httpd_request_methods_get | CouchDB-GETRequests | NULL | Number of HTTP GET requests | |
couchdb.httpd_status_codes.400 | CouchDB-Httpd400Status | NULL | Number of HTTP 400 Bad Request responses | |
couchdb.httpd.bulk_requests | CouchDB-HttpBulkRequests | NULL | Number of bulk requests | |
couchdb.doc_count | CouchDB-DocCount | NULL | Number doc's in all dbs | |
couchdb.cache_misses | CouchDB-CacheMisses | NULL | Number of authentication cache misses | |
couchdb.httpd_status_codes.304 | CouchDB-Httpd304Status | NULL | Number of HTTP 304 Not Modified responses | |
couchdb.httpd.view_reads | CouchDB-HttpViewReads | NULL | Number of view reads | |
couchdb.httpd_status_codes.401 | CouchDB-Httpd401Status | NULL | Number of HTTP 401 Unauthorized responses | |
couchdb.httpd_status_codes.200 | CouchDB-Httpd200Status | NULL | Number of HTTP 200 OK responses | |
couchdb.httpd_status_codes.409 | CouchDB-Httpd409Status | NULL | Number of HTTP 409 Conflict responses | |
couchdb.httpd_request_methods_put | CouchDB-PUTRequests | NULL | Number of HTTP PUT requests | |
couchdb.httpd_request_methods_copy | CouchDB-COPYRequests | NULL | Number of HTTP COPY requests | |
couchdb.httpd_status_codes.500 | CouchDB-Httpd500Status | NULL | Number of HTTP 500 Internal Server Error responses |
Agent G2 - Linux - Apache Httpd Monitors
Description
Monitoring Template for Apache application. Monitors Apache busy workers, bytes per request, bytes per second, etc.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Apache Httpd Monitors | apache.performance.idle_workers | Apache-IdleWorkers | NULL | Provides the number of idle workers |
apache.net.request_per_sec | Apache-RequestsPerSec | NULL | Provides the number of requests made per second | |
apache.net.bytes_per_sec | Apache-BytesPerSec | NULL | Privides the number of bytes transferred per second | |
apache.net.request_per_request | Apache-BytesPerRequest | NULL | Privides the number of bytes transferred per request | |
apache.performance.cpu_load | Apache-CPULoad | NULL | Provides the CPU Load of the apache service | |
apache.performance.scoreboard | Apache-ScoreBoard | NULL | Provides the scoreboard metrics | |
apache.performance.uptime | Apache-Uptime | NULL | Checks the uptime apache service | |
apache.net.total_kbytes | Apache-TotalkBytes | NULL | Provides the number of total kbytes | |
apache.performance.total_accesses | Apache-TotalAccesses | NULL | Provides the total number of accesses made. | |
apache.performance.open_slots | Apache-OpenSlots | NULL | Provides the number of open slots | |
apache.performance.busy_workers | Apache-BusyWorkers | NULL | Provides the number of busy workers |
Agent G2 - Linux - Apache Pulsar Bookkeeper Monitoring
Description
Monitors Apache Pulsar Bookkeeper metrics for components like Server, Journal and Storage metrics
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Pulsar Bookkeeper Monitor | pulsar_bookie_server_status | Pulsar Bookie Server Status | NULL | The server status for bookie server. 1: the bookie is running in writable mode.0: the bookie is running in readonly mode. |
pulsar_bookie_ledgers_count | Pulsar Bookie Ledgers Count | NULL | The total number of ledgers stored in the bookie. | |
pulsar_bookie_write_cache_size | Pulsar Bookie Write Cache Size | bytes | The bookie write cache size. | |
pulsar_bookie_journal_journal_sync_count | Pulsar Bookie Journal Journal Sync Count | NULL | The total number of journal fsync operations happening at the bookie. The success label is used to distinguish successes and failures. | |
pulsar_bookie_journal_journal_cb_queue_size | Pulsar Bookie Journal Journal Cb Queue Size | NULL | The total number of callbacks pending in the callback queue. | |
pulsar_bookie_entries_count | Pulsar Bookie Entries Count | NULL | The total number of entries stored in the bookie. | |
pulsar_bookie_journal_journal_force_write_queue_size | Pulsar Bookie Journal Journal Force Write Queue Size | NULL | The total number of force write (fsync) requests pending in the force-write queue. | |
pulsar_bookie_flush | Pulsar Bookie Flush | NULL | The table flush latency of bookie memory. | |
pulsar_bookie_read_bytes_count | Pulsar Bookie Read Bytes Count | bytes | The total number of bytes read from the bookie. | |
pulsar_bookie_write_bytes_count | Pulsar Bookie Write Bytes Count | bytes | The total number of bytes written to the bookie. | |
pulsar_bookie_throttled_write_requests_count | Pulsar Bookie Throttled Write Requests Count | NULL | The number of write requests to be throttled. | |
pulsar_bookie_ledger_writable_dirs | Pulsar Bookie Ledger Writable Dirs | NULL | The number of writable directories in the bookie. | |
pulsar_bookie_journal_journal_queue_size | Pulsar Bookie Journal Journal Queue Size | NULL | The total number of requests pending in the journal queue. | |
pulsar_bookie_read_cache_size | Pulsar Bookie Read Cache Size | bytes | The bookie read cache size. | |
pulsar_bookkeeper_server_add_entry_count | Pulsar Bookkeeper Server Add Entry Count | NULL | The total number of ADD_ENTRY requests received at the bookie. The success label is used to distinguish successes and failures. | |
pulsar_bookkeeper_server_read_entry_count | Pulsar Bookkeeper Server Read Entry Count | NULL | The total number of READ_ENTRY requests received at the bookie. The success label is used to distinguish successes and failures. | |
pulsar_bookie_deleted_ledger_count | Pulsar Bookie Deleted Ledger Count | NULL | The total number of ledgers deleted since the bookie has started. |
Agent G2 - Linux - Apache Pulsar Broker Monitoring
Description
Monitors Apache Pulsar Broker metrics for components like Namespace, Topic, Lookup, Connection and Jetty.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Pulsar Broker Monitor | pulsar_ml_cache_pool_used | Pulsar Ml Cache Pool Used | NULL | The total used memory of chunk lists in direct arena. |
pulsar_broker_lookup_sum | Pulsar Broker Lookup Sum | NULL | Total latency of all lookup operations. | |
pulsar_connection_created_total_count | Pulsar Connection Created Total Count | NULL | The total number of connections. | |
pulsar_broker_lookup_failures_count | Pulsar Broker Lookup Failures Count | NULL | The number of lookup failures. | |
pulsar_ml_cache_pool_allocated | Pulsar Ml Cache Pool Allocated | NULL | The total allocated memory of chunk lists in direct arena. | |
pulsar_ml_cache_pool_active_allocations_normal | Pulsar Ml Cache Pool Active Allocations Normal | NULL | The number of currently active normal allocations in direct arena. | |
pulsar_broker_lookup_answers_count | Pulsar Broker Lookup Answers Count | NULL | The number of lookup responses (i.e. not redirected requests). | |
pulsar_storage_write_rate | Pulsar Storage Write Rate | NULL | The total message batches (entries) written to the storage for this namespace (message batches / second). | |
pulsar_jetty_async_requests_waiting_max | Pulsar Jetty Async Requests Waiting Max | NULL | Maximum number of waiting async requests. | |
pulsar_jetty_dispatched_active | Pulsar Jetty Dispatched Active | NULL | Number of dispatches currently active. | |
pulsar_jetty_requests_active | Pulsar Jetty Requests Active | NULL | Number of requests currently active. | |
pulsar_connection_create_fail_count | Pulsar Connection Create Fail Count | NULL | The number of failed connections. | |
pulsar_jetty_dispatched_active_max | Pulsar Jetty Dispatched Active Max | NULL | Maximum number of active dispatches being handled. | |
pulsar_ml_cache_pool_active_allocations | Pulsar Ml Cache Pool Active Allocations | NULL | The number of currently active allocations in direct arena. | |
pulsar_throughput_out | Pulsar Throughput Out | NULL | The total throughput of the namespace going out from this broker (bytes/second). | |
pulsar_broker_throttled_connections | Pulsar Broker Throttled Connections | NULL | The number of throttled connections. | |
pulsar_ml_count | Pulsar Ml Count | NULL | The number of currently opened managed ledgers. | |
pulsar_ml_cache_misses_throughput | Pulsar Ml Cache Misses Throughput | NULL | The amount of data is not retrieved from the cache on the broker side (in byte/s). | |
pulsar_ml_cache_pool_active_allocations_huge | Pulsar Ml Cache Pool Active Allocations Huge | NULL | The number of currently active huge allocation in direct arena. | |
pulsar_rate_in | Pulsar Rate In | NULL | The total message rate of the namespace coming into this broker (messages/second). | |
pulsar_throughput_in | Pulsar Throughput In | NULL | The total throughput of the namespace coming into this broker (bytes/second). | |
pulsar_active_connections | Pulsar Active Connections | NULL | The number of active connections. | |
pulsar_storage_logical_size | Pulsar Storage Logical Size | bytes | The storage size of topics in the namespace owned by the broker without replicas. | |
pulsar_ml_cache_pool_active_allocations_small | Pulsar Ml Cache Pool Active Allocations Small | NULL | The number of currently active small allocations in direct arena. | |
pulsar_broker_lookup_pending_requests | Pulsar Broker Lookup Pending Requests | NULL | The number of pending lookups in broker. When it is up to the threshold, new requests are rejected. | |
pulsar_jetty_requests_active_max | Pulsar Jetty Requests Active Max | NULL | Maximum number of requests that have been active at once. | |
pulsar_subscriptions_count | Pulsar Subscriptions Count | NULL | The number of Pulsar subscriptions of the namespace served by this broker. | |
pulsar_producers_count | Pulsar Producers Count | NULL | The number of active producers of the namespace connected to this broker. | |
pulsar_jetty_stats_seconds | Pulsar Jetty Stats Seconds | seconds | Time in seconds stats have been collected for. | |
pulsar_ml_cache_hits_throughput | Pulsar Ml Cache Hits Throughput | NULL | The amount of data is retrieved from the cache on the broker side (in byte/s). | |
pulsar_ml_cache_hits_rate | Pulsar Ml Cache Hits Rate | NULL | The number of cache hits per second on the broker side. | |
pulsar_ml_cache_used_size | Pulsar Ml Cache Used Size | NULL | The size in byte used to store the entries payloads. | |
pulsar_broker_throttled_connections_global_limit | Pulsar Broker Throttled Connections Global Limit | NULL | The number of throttled connections because of per-connection limit. | |
pulsar_consumers_count | Pulsar Consumers Count | NULL | The number of active consumers of the namespace connected to this broker. | |
pulsar_ml_cache_misses_rate | Pulsar Ml Cache Misses Rate | NULL | The number of cache misses per second on the broker side. | |
pulsar_broker_lookup_count | Pulsar Broker Lookup Count | NULL | Number of samples of the latency of all lookup operations. | |
pulsar_broker_lookup_redirects_count | Pulsar Broker Lookup Redirects Count | NULL | The number of lookup redirected requests. | |
pulsar_connection_create_success_count | Pulsar Connection Create Success Count | NULL | The number of successfully created connections. | |
pulsar_jetty_async_requests_waiting | Pulsar Jetty Async Requests Waiting | NULL | Currently waiting async requests. | |
pulsar_storage_read_rate | Pulsar Storage Read Rate | NULL | The total message batches (entries) read from the storage for this namespace (message batches / second). | |
pulsar_jetty_request_time_max_seconds | Pulsar Jetty Request Time Max Seconds | seconds | Maximum time spent handling requests. | |
pulsar_topics_count | Pulsar Topics Count | NULL | The number of Pulsar topics of the namespace owned by this broker. | |
pulsar_connection_closed_total_count | Pulsar Connection Closed Total Count | NULL | The total number of closed connections. | |
pulsar_broker_topic_load_pending_requests | Pulsar Broker Topic Load Pending Requests | NULL | The load of pending topic operations. | |
pulsar_rate_out | Pulsar Rate Out | NULL | The total message rate of the namespace going out from this broker (messages/second). | |
pulsar_jetty_dispatched_time_max | Pulsar Jetty Dispatched Time Max | seconds | Maximum time spent in dispatch handling. | |
pulsar_storage_size | Pulsar Storage Size | bytes | The total storage size of the topics in this namespace owned by this broker. | |
pulsar_ml_cache_evictions | Pulsar Ml Cache Evictions | NULL | The number of cache evictions during the last minute. |
Agent G2 - Linux - Apache Spark Monitors
Description
Monitors spark metrics
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Apache Spark Monitors | spark.cores | Spark Cores | NULL | The number of CPUs available for all workers |
spark.memory | Spark Memory | NULL | Calculates the total memory available on spark master | |
spark.workers | Spark Workers | NULL | The number of workers connected to the master | |
spark.applications.active | Spark Applications Active | NULL | The number of applications waiting or running | |
spark.memory.used | Spark Memory Used | NULL | Calculates the memory used by the applications on spark master | |
spark.drivers.active | Spark Drivers Active | NULL | The number of drivers available | |
spark.applications.completed | Spark Applications Completed | NULL | The number of application completed | |
spark.cores.used | Spark Cores Used | NULL | The number of CPUs used for all applications |
Agent G2 - Linux - Apache Spark Performance Check
Description
Monitors Spark metrics using Spark REST API
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Apache Spark Performance Check | spark.cores | Spark Cores | NULL | The number of CPUs available for all workers |
spark.memory.used | Spark Memory Used | NULL | Calculates the memory used by the applications on spark master | |
spark.cores.used | Spark Cores Used | NULL | The number of CPUs used for all applications | |
spark.memory | Spark Memory | NULL | Calculates the total memory available on spark master | |
spark.drivers.active | Spark Drivers Active | NULL | The number of drivers available | |
spark.applications.completed | Spark Applications Completed | NULL | The number of application completed | |
spark.applications.active | Spark Applications Active | NULL | The number of applications waiting or running | |
spark.workers | Spark Workers | NULL | The number of workers connected to the master |
Agent G2 - Linux - Apache Status Check - Agent
Description
Checks the server-status page and monitors metrics from that page.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Apache Status Check - Agent | apache.net.request_per_request | Apache-BytesPerRequest | NULL | Provides the number of bytes transferred per request |
apache.performance.idle_workers | Apache-IdleWorkers | NULL | Provides the number of idle workers | |
apache.performance.cpu_load | Apache-CPULoad | NULL | Provides the CPU Load of the apache service | |
apache.performance.uptime | Apache-Uptime | NULL | Checks the uptime apache service | |
apache.net.bytes_per_sec | Apache-BytesPerSec | NULL | Provides the number of bytes transferred per second | |
apache.performance.open_slots | Apache-OpenSlots | NULL | Provides the number of open slots | |
apache.net.total_kbytes | Apache-TotalkBytes | NULL | Provides the number of total kbytes | |
apache.performance.total_accesses | Apache-TotalAccesses | NULL | Provides the total number of accesses made | |
apache.net.request_per_sec | Apache-RequestsPerSec | NULL | Provides the number of requests made per second | |
apache.performance.busy_workers | Apache-BusyWorkers | NULL | Provides the number of busy workers | |
apache.performance.scoreboard | Apache-ScoreBoard | NULL | Provides the scoreboard metrics |
Agent G2 - Linux - Apache Tomcat Monitors
Description
Monitors tomcat application metrics
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Apache Tomcat Monitors | tomcat.req.processor.error_count | Tomcat-ReqProcessorErrorCount | NULL | Errors per second on all the request processors running on the Apache Tomcat |
tomcat.active_sessions.count | Tomcat-ActiveSessions | NULL | Number of active sessions to the server | |
tomcat.jvm.mem.non_heap_committed | Tomcat-JVM.Mem.non_heap_committed | NULL | Non-heap memory committed (in MB) for the server | |
tomcat.req.processor.processing_time | Tomcat-ReqProcessorProcessingTime | NULL | The amount of processing time taken per second | |
tomcat.servlet.error_count | Tomcat-ServletErrorCount | NULL | Number of erroneous requests received by the servlet per second | |
tomcat.jvm.uptime | Tomcat-Uptime | NULL | Uptime of the server | |
tomcat.jvm.mem.heap_used | Tomcat-JVM.Mem.heap_used | NULL | Heap memory usage (in MB) of the server | |
tomcat.cache.hits_count | Tomcat-CacheHitsCount | NULL | Number of times the cache was hit per second | |
tomcat.cache.access_count | Tomcat-CacheAccessCount | NULL | Number of times the cache was accessed per second | |
tomcat.servlet.request_count | Tomcat-ServletRequestCount | NULL | Number of requests served by the servlet per second | |
tomcat.jvm.gc.collection_time | Tomcat-JVM.GC.collection_time | NULL | Time taken for collection of the garbage objects | |
tomcat.servlet.processing_time | Tomcat-ServletProcessingTime | NULL | The amount of processing time taken per second | |
tomcat.jvm.mem.heap_committed | Tomcat-JVM.Mem.heap_committed | NULL | Heap memory committed (in MB) for the server | |
tomcat.threads.busy | Tomcat-ThreadsBusy | NULL | Number of busy threads | |
tomcat.jvm.mem.non_heap_used | Tomcat-JVM.Mem.non_heap_used | NULL | Non-heap memory usage (in MB) of the server | |
tomcat.req.processor.request_count | Tomcat-ReqProcessorRequestCount | NULL | Requests per second on all the request processors running on the Apache Tomcat | |
tomcat.threads.count | Tomcat-ThreadCount | NULL | Number of threads created | |
tomcat.jvm.threads.count | Tomcat-JVM.Threads | NULL | Number of threads | |
tomcat.jvm.gc.collection_count | Tomcat-JVM.GC.collection_count | NULL | Number of garbage objects collected | |
tomcat.jvm.open_fds_count | Tomcat-JVM.OpenFDs | NULL | Number of Open file descriptors of the server | |
tomcat.req.processor.mbytes_sent | Tomcat-ReqProcessorDataSent | NULL | MBytes sent per second by all the request processors running on the Apache Tomcat | |
tomcat.req.processor.mbytes_received | Tomcat-ReqProcessorDataReceived | NULL | MBytes received per second by all the request processors running on the Apache Tomcat | |
tomcat.jsp.reload_count | Tomcat-JspReloadCount | NULL | Number of times JSPs were reloaded on all the applications per second | |
tomcat.jsp.count | Tomcat-JspCount | NULL | Number of times JSPs were accessed on all the applications per second |
Agent G2 - Linux - Apache Tomcat Performance Check
Description
Monitors tomcat application metrics. This monitor uses MBeans exposed via the JMX console. Please refer to Visatar documentation on how to enable JMX on your application.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Apache Tomcat Performance Check | tomcat.req.processor.error_count | Tomcat-ReqProcessorErrorCount | NULL | Errors per second on all the request processors running on the Apache Tomcat |
tomcat.cache.hits_count | Tomcat-CacheHitsCount | NULL | Number of times the cache was hit per second | |
tomcat.jvm.gc.collection_time | Tomcat-JVM.GC.collection_time | NULL | Time taken for collection of the garbage objects | |
tomcat.jvm.uptime | Tomcat-Uptime | NULL | Uptime of the server | |
tomcat.jvm.mem.heap_committed | Tomcat-JVM.Mem.heap_committed | NULL | Heap memory committed (in MB) for the server | |
tomcat.req.processor.request_count | Tomcat-ReqProcessorRequestCount | NULL | Requests per second on all the request processors running on the Apache Tomcat | |
tomcat.jsp.reload_count | Tomcat-JspReloadCount | NULL | Number of times JSPs were reloaded on all the applications per second | |
tomcat.req.processor.mbytes_sent | Tomcat-ReqProcessorDataSent | NULL | MBytes sent per second by all the request processors running on the Apache Tomcat | |
tomcat.jvm.open_fds_count | Tomcat-JVM.OpenFDs | NULL | Number of Open file descriptors of the server | |
tomcat.jvm.mem.non_heap_used | Tomcat-JVM.Mem.non_heap_used | NULL | Non-heap memory usage (in MB) of the server | |
tomcat.jvm.mem.non_heap_committed | Tomcat-JVM.Mem.non_heap_committed | NULL | Non-heap memory committed (in MB) for the server | |
tomcat.cache.access_count | Tomcat-CacheAccessCount | NULL | Number of times the cache was accessed per second | |
tomcat.jsp.count | Tomcat-JspCount | NULL | Number of times JSPs were accessed on all the applications per second | |
tomcat.servlet.request_count | Tomcat-ServletRequestCount | NULL | Number of requests served by the servlet per second | |
tomcat.jvm.mem.heap_used | Tomcat-JVM.Mem.heap_used | NULL | Heap memory usage (in MB) of the server | |
tomcat.servlet.processing_time | Tomcat-ServletProcessingTime | NULL | The amount of processing time taken per second | |
tomcat.threads.count | Tomcat-ThreadCount | NULL | Number of threads created | |
tomcat.req.processor.processing_time | Tomcat-ReqProcessorProcessingTime | NULL | The amount of processing time taken per second | |
tomcat.req.processor.mbytes_received | Tomcat-ReqProcessorDataReceived | NULL | MBytes received per second by all the request processors running on the Apache Tomcat | |
tomcat.jvm.threads.count | Tomcat-JVM.Threads | NULL | Number of threads | |
tomcat.jvm.gc.collection_count | Tomcat-JVM.GC.collection_count | NULL | Number of garbage objects collected | |
tomcat.threads.busy | Tomcat-ThreadsBusy | NULL | Number of busy threads | |
tomcat.active_sessions.count | Tomcat-ActiveSessions | NULL | Number of active sessions to the server | |
tomcat.servlet.error_count | Tomcat-ServletErrorCount | NULL | Number of erroneous requests received by the servlet per second |
Agent G2 - Linux - Apache ZooKeeper Monitors
Description
Monitors zoo keeper metrics
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Apache ZooKeeper Monitors | zookeeper.packets_received | ZooKeeper Total Packets Received | NULL | The number of packets received |
zookeeper.connections | ZooKeeper Total Connections | NULL | The total count of client connections | |
zookeeper.nodes | ZooKeeper Total Node Count | NULL | The number of znodes in the ZooKeeper namespace | |
zookeeper.packets_sent | ZooKeeper Total Packets sent | NULL | The number of packets sent | |
zookeeper.node.count | ZooKeeper Total Node Count | NULL | Counts the number of zookeeper nodes | |
zookeeper.outstanding_requests | ZooKeeper OutStanding Requests | NULL | The number of queued requests when the server is under load and is receiving more sustained requests than it can process | |
zookeeper.latency.max | ZooKeeper Max Request Latency | NULL | The amount of time it takes for the server to respond to a client request | |
zookeeper.latency.min | ZooKeeper Min Request Latency | NULL | The amount of time it takes for the server to respond to a client request | |
zookeeper.zxid.count | ZooKeeper Zxid Count | NULL | Zxid Count | |
zookeeper.max.client.cnxns.perhost | ZooKeeper Max client connections perhost | NULL | Counts the number of max client connections perhost on the zookeeper server | |
zookeeper.zxid.epoch | ZooKeeper Zxid Epoch | NULL | Zxid Epoch | |
zookeeper.latency.avg | ZooKeeper Average Request Latency | NULL | The amount of time it takes for the server to respond to a client request | |
zookeeper.packets.sent | ZooKeeper Total Packets sent | NULL | Calculates the number of packets sent from the server | |
zookeeper.bytes_received | ZooKeeper Total Bytes Received | NULL | The number of bytes received | |
zookeeper.outstanding.requests | ZooKeeper Out Standing Requests | NULL | Counts the number of outstanding client requests on the zookeeper server | |
zookeeper.packets.received | ZooKeeper Total Packets Received | NULL | Calculates the number of packets received from the server | |
zookeeper.bytes_sent | ZooKeeper Total Bytes Sent | NULL | The number of bytes sent | |
zookeeper.request.latency | ZooKeeper Request Latency | ms | Average latency time for client requests on the zookeeper server in milliseconds |
Agent G2 - Linux - Cassandra2 Monitors
Description
Monitors cassandra version 2.x metrics
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Cassandra2 Monitor | cassandra.tasks_completed | Cassandra Request Completed Tasks | NULL | Approximate number of tasks thread pool has completed execution per second on path - request |
cassandra.bloom_filter_false_ratio | Cassandra Bloom Filter False Ratio | NULL | The ratio of Bloom filter false positives to total checks | |
cassandra.jvm.open_fds | Cassandra JVM OpenFDs | NULL | Number of Open file descriptors of the server | |
cassandra.compression_ratio | Cassandra Compression Ratio | NULL | The compression ratio for all SSTables in a column family | |
cassandra.write.request_timeouts | Cassandra Write Request Timeouts | NULL | Count of write requests not acknowledged within configurable timeout window | |
cassandra.commitlog.tasks_completed | Cassandra Commitlog Completed Tasks | NULL | Approximate number of completed task per second | |
cassandra.dropped.messages | Cassandra Dropped Messages | NULL | Number of dropped message for each verb per second. | |
cassandra.compaction.tasks_completed | Cassandra Compaction Completed Tasks | NULL | Estimated number of completed compaction tasks per second. | |
cassandra.bloom_filter_false_positives | Cassandra Cassandra Bloom Filter False Positives | NULL | The number of Bloom filter false positives | |
cassandra.compaction.tasks_pending | Cassandra Compaction Pending Tasks | NULL | Estimated number of pending compaction tasks. | |
cassandra.streaming.bytes_outgoing | Cassandra Data Sent | NULL | Outgoing data per second in megabytes | |
cassandra.load_count | Cassandra Disk Space Used | NULL | Disk space used on a node | |
cassandra.internal.pending_tasks | Cassandra Internal Pending Tasks | NULL | Approximate number of pending tasks thread pool has on path - internal | |
cassandra.cache.requests | Cassandra Requests Count | NULL | The number of requests to a cache | |
cassandra.jvm.mem_heap_used | Cassandra JVM Mem heap_used | NULL | Heap memory usage (in MB) of the server | |
cassandra.memtable_live_data_size | Cassandra Memtable Live Data Size | NULL | Size of data stored in memtable | |
cassandra.tasks_active | Cassandra Request Active Tasks | NULL | Approximate number of tasks thread pool is actively executing on path - request | |
cassandra.compaction_completed | Cassandra Compactions Completed | NULL | Number of compactions completed per second | |
cassandra.capacity | Cassandra Capacity | NULL | The capacity of the caches, such as the key cache and row cache | |
cassandra.jvm.threads | Cassandra JVM Threads | NULL | Number of threads | |
cassandra.cache.size | Cassandra Size | NULL | Size of cache | |
cassandra.live_disk_space_used.count | Cassandra Live Disk Space Used Count | NULL | Disk space used by "live" SSTables (only counts non-obsolete files). | |
cassandra.max_row_size | Cassandra Max Row Size | NULL | Size of the largest compacted row | |
cassandra.jvm.gc_collection_count | Cassandra JVM GC collection_count | NULL | Number of garbage objects collected | |
cassandra.read.requests | Cassandra Read Requests | NULL | Number of read requests | |
cassandra.tasks_pending | Cassandra Request Pending Tasks | NULL | Approximate number of pending tasks thread pool has on path - request | |
cassandra.internal.currently_blocked_tasks | Cassandra Internal Current Blocked Tasks | NULL | Number of currently blocked tasks on path - internal | |
cassandra.read.request_latency | Cassandra Read Request Latency | ms | Read latency for all client requests | |
cassandra.jvm.mem_non_heap_committed | Cassandra JVM Mem non_heap_committed | NULL | Non-heap memory committed (in MB) for the server | |
cassandra.live_ss_table_count | Cassandra Live SS Table Count | NULL | Number of "live" (non-obsolete) SSTables | |
cassandra.total_disk_space_used.count | Cassandra Total Disk Space Used Count | NULL | Disk space used by a column family | |
cassandra.cache.hitrate | Cassandra Cache Hit Rate | NULL | Cache hit rate. | |
cassandra.exceptions_count | Cassandra Data Exceptions | NULL | The number of exceptions thrown | |
cassandra.internal.tasks_completed | Cassandra Internal Completed Tasks | NULL | Approximate number of tasks thread pool has completed execution per second on path - internal | |
cassandra.tasks_currently_blocked | Cassandra Request Current Blocked Tasks | NULL | Number of currently blocked tasks on path - request | |
cassandra.memtable_switch_count.count | Cassandra Memtable Switch Count | NULL | Number of times a full memtable has been switched out for an empty one due to flushing | |
cassandra.jvm.uptime | Cassandra Uptime | NULL | Uptime of the server | |
cassandra.memtable_columns_count | Cassandra Memtable Columns Count | NULL | Number of columns in memtable | |
cassandra.write.request_unavailables | Cassandra Write Request Unavailables | NULL | Count write of requests for which the required number of nodes was unavailable | |
cassandra.compaction_bytes_written.count | Cassandra Compacted Bytes | NULL | Compacted bytes size | |
cassandra.write.request_latency | Cassandra Write Request Latency | ms | Write latency for all client requests | |
cassandra.internal.tasks_active | Cassandra Internal Active Tasks | NULL | Approximate number of tasks thread pool is actively executing on path - internal | |
cassandra.tasks_blocked | Cassandra Request Blocked Tasks | NULL | Number of blocked tasks per second on path - request | |
cassandra.read.request_unavailables | Cassandra Read Request Unavailables | NULL | Count read of requests for which the required number of nodes was unavailable | |
cassandra.commitlog.tasks_pending | Cassandra Commitlog Pending Tasks | NULL | Approximate number of pending task | |
cassandra.mean_row_size | Cassandra Mean Row Size | NULL | Average size of compacted rows | |
cassandra.write.requests | Cassandra Write Requests | NULL | Number of writes requests | |
cassandra.internal.tasks_blocked | Cassandra Internal Blocked Tasks | NULL | Number of blocked tasks per second on path - internal | |
cassandra.read.request_timeouts | Cassandra Read Request Timeouts | NULL | Count of read requests not acknowledged within configurable timeout window | |
cassandra.streaming.bytes_incoming | Cassandra Data Received | NULL | Incoming data per second in megabytes | |
cassandra.min_row_size | Cassandra Min Row Size | NULL | Size of the smallest compacted row | |
cassandra.connection.timeouts | Cassandra Cassandra Connection Timeouts | NULL | Number of timeouts occurred for this node per second | |
cassandra.bloom_filter_disk_space_used | Cassandra Bloom Filter Disk Space Used | NULL | Disk space used by the Bloom filters | |
cassandra.commitlog.total_size | Cassandra Commitlog Size | NULL | Current data size of all commit log segments in megabytes | |
cassandra.jvm.mem_heap_committed | Cassandra JVM Mem heap_committed | NULL | Heap memory committed (in MB) for the server | |
cassandra.streaming.active_outbounds | Cassandra Active Outbound Streams | NULL | Currently active outbound streams. | |
cassandra.cache.hits_count | Cassandra Hits Count | NULL | The number of hits to a cache | |
cassandra.jvm.gc_collection_time | Cassandra JVM GC collection_time | NULL | Time taken for collection of the garbage objects. | |
cassandra.jvm.mem_non_heap_used | Cassandra JVM Mem non_heap_used | NULL | Non-heap memory usage (in MB) of the server |
Agent G2 - Linux - ClickHouse
Description
Monitors both ClickHouse application server and cluster devices
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - ClickHouse | clickhouse_jemalloc_retained | Jemalloc Retained | byte | The amount of memory in virtual memory mappings that were retained rather than being returned to the operating system. |
clickhouse_ReplicasMaxInsertsInQueue | Replicas Max Inserts In Queue | NULL | This metric will be renamed in a future minor release. | |
clickhouse_S3WriteRequestsCount_per_sec | S3 Write Requests Count | requests/sec | Number of POST, DELETE, PUT and PATCH requests to S3 storage. | |
clickhouse_MergeTreeDataWriterUncompressedBytes_per_sec | Merge Tree Data Writer Uncompressed Bytes | bytes/sec | Uncompressed bytes (for columns as they stored in memory) INSERTed to MergeTree tables. | |
clickhouse_MergeTreeDataWriterCompressedBytes_per_sec | Merge Tree Data Writer Compressed Bytes | bytes/sec | Compressed bytes (for columns as they stored on disk) INSERTed to MergeTree tables. | |
clickhouse_MergeTreeDataWriterRows_per_sec | Merge Tree Data Writer Rows | rows/sec | Number of rows INSERTed to MergeTree tables. | |
clickhouse_MergeTreeDataWriterBlocks_per_sec | Merge Tree Data Writer Blocks | blocks/sec | Number of blocks INSERTed to MergeTree tables. | |
clickhouse_MergeTreeDataWriterBlocksAlreadySorted_per_sec | Merge Tree Data Writer Blocks Already Sorted | blocks/sec | Number of blocks that were already sorted by primary key or index of MergeTree tables. | |
clickhouse_MergeTreeDataWriterBlocksAverageSize_bytes | Merge Tree Data Writer Blocks Average Size | bytes | Average size of blocks INSERTed to MergeTree tables. | |
clickhouse_MergeTreeDataWriterBlocksCompressed_bytes | Merge Tree Data Writer Blocks Compressed | bytes | Compressed size of blocks INSERTed to MergeTree tables. | |
clickhouse_MergeTreeDataWriterBlocksUncompressed_bytes | Merge Tree Data Writer Blocks Uncompressed | bytes | Uncompressed size of blocks INSERTed to MergeTree tables. | |
clickhouse_MergeTreeDataWriterRows_compressed_bytes | Merge Tree Data Writer Rows Compressed | bytes | Size of rows INSERTed to MergeTree tables, compressed. | |
clickhouse_MergeTreeDataWriterMarks | Merge Tree Data Writer Marks | marks | Number of marks INSERTed to MergeTree tables. | |
clickhouse_MergeTreeDataWriterMaxMarkSize_bytes | Merge Tree Data Writer Max Mark Size | bytes | Maximum size of a mark INSERTed to MergeTree tables. | |
clickhouse_MergeTreeDataWriterRowsPerMark | Merge Tree Data Writer Rows Per Mark | rows | Number of rows INSERTed per mark to MergeTree tables. | |
clickhouse_MergeTreeDataWriterCompressedBytesPerMark_bytes | Merge Tree Data Writer Compressed Bytes Per Mark | bytes | Compressed size of marks INSERTed to MergeTree tables. | |
clickhouse_MergeTreeDataWriterUncompressedBytesPerMark_bytes | Merge Tree Data Writer Uncompressed Bytes Per Mark | bytes | Uncompressed size of marks INSERTed to MergeTree tables. | |
clickhouse_MergeTreeDataWriterRowsPerByte | Merge Tree Data Writer Rows Per Byte | rows/byte | Number of rows INSERTed per byte to MergeTree tables. | |
clickhouse_MergeTreeDataWriterCompressionRatio | Merge Tree Data Writer Compression Ratio | ratio | Compression ratio for rows INSERTed to MergeTree tables. | |
clickhouse_MergeTreeDataWriterAverageCompressedBytesPerByte | Merge Tree Data Writer Average Compressed Bytes Per Byte | bytes/byte | Average compressed size of a byte INSERTed to MergeTree tables. | |
clickhouse_MergeTreeDataWriterCompressedRows_bytes | Merge Tree Data Writer Compressed Rows | rows | Number of rows INSERTed to MergeTree tables, compressed. | |
clickhouse_MergeTreeDataWriterUncompressedRows_bytes | Merge Tree Data Writer Uncompressed Rows | rows | Number of rows INSERTed to MergeTree tables, uncompressed. | |
clickhouse_MergeTreeDataWriterFinalCompressedBytes_bytes | Merge Tree Data Writer Final Compressed Bytes | bytes | Compressed size of final data INSERTed to MergeTree tables. | |
clickhouse_MergeTreeDataWriterFinalUncompressedBytes_bytes | Merge Tree Data Writer Final Uncompressed Bytes | bytes | Uncompressed size of final data INSERTed to MergeTree tables. | |
clickhouse_MergeTreeDataWriterDiskWritePerformance_bytes_per_sec | Merge Tree Data Writer Disk Write Performance | bytes/sec | Disk write speed for rows INSERTed to MergeTree tables. | |
clickhouse_MergeTreeDataWriterRowsPerDiskWrite | Merge Tree Data Writer Rows Per Disk Write | rows | Number of rows INSERTed per disk write to MergeTree tables. | |
clickhouse_MergeTreeDataWriterWritePerformance_rows_per_sec | Merge Tree Data Writer Write Performance | rows/sec | Write performance of rows INSERTed to MergeTree tables. | |
clickhouse_MergeTreeDataWriterElapsed | Merge Tree Data Writer Elapsed | seconds | Time taken to write data to MergeTree tables. | |
clickhouse_MergeTreeDataWriterRows_before_mark | Merge Tree Data Writer Rows Before Mark | rows | Number of rows INSERTed to MergeTree tables before a mark. | |
clickhouse_MergeTreeDataWriterRows_before_sample | Merge Tree Data Writer Rows Before Sample | rows | Number of rows INSERTed to MergeTree tables before a sample. | |
clickhouse_MergeTreeDataWriterMergeElapsed | Merge Tree Data Writer Merge Elapsed | seconds | Time taken to perform merge operations on MergeTree tables. | |
clickhouse_MergeTreeDataWriterFinalMergeElapsed | Merge Tree Data Writer Final Merge Elapsed | seconds | Time taken to perform final merge operations on MergeTree tables. | |
clickhouse_MergeTreeDataWriterPartsMerged | Merge Tree Data Writer Parts Merged | parts | Number of parts merged during MergeTree operations. | |
clickhouse_MergeTreeDataWriterFinalParts | Merge Tree Data Writer Final Parts | parts | Number of final parts after merge operations on MergeTree tables. | |
clickhouse_MergeTreeDataWriterPartMutation | Merge Tree Data Writer Part Mutation | NULL | Information about part mutations during MergeTree operations. | |
clickhouse_MergeTreeDataWriterPartMutationAdd | Merge Tree Data Writer Part Mutation Add | NULL | Information about added part mutations during MergeTree operations. | |
clickhouse_MergeTreeDataWriterPartMutationRemove | Merge Tree Data Writer Part Mutation Remove | NULL | Information about removed part mutations during MergeTree operations. | |
clickhouse_MergeTreeDataWriterPartMutationClear | Merge Tree Data Writer Part Mutation Clear | NULL | Information about cleared part mutations during MergeTree operations. | |
clickhouse_MergeTreeDataWriterPartMutationAlter | Merge Tree Data Writer Part Mutation Alter | NULL | Information about altered part mutations during MergeTree operations. | |
clickhouse_MergeTreeDataWriterTableRowsInserted | Merge Tree Data Writer Table Rows Inserted | rows | Number of rows INSERTed to MergeTree tables. | |
clickhouse_MergeTreeDataWriterTableRowsDeleted | Merge Tree Data Writer Table Rows Deleted | rows | Number of rows DELETed from MergeTree tables. | |
clickhouse_MergeTreeDataWriterTableRowsUpdated | Merge Tree Data Writer Table Rows Updated | rows | Number of rows UPDATed in MergeTree tables. | |
clickhouse_MergeTreeDataWriterTableTTLRows | Merge Tree Data Writer Table TTL Rows | rows | Number of rows with expired TTL settings in MergeTree tables. | |
clickhouse_MergeTreeDataWriterPartMutations | Merge Tree Data Writer Part Mutations | mutations | Number of part mutations during MergeTree operations. | |
clickhouse_MergeTreeDataWriterZooKeeperCheck | Merge Tree Data Writer ZooKeeper Check | NULL | Information about ZooKeeper checks during MergeTree operations. | |
clickhouse_MergeTreeDataWriterTableTTLCheck | Merge Tree Data Writer Table TTL Check | NULL | Information about TTL checks during MergeTree operations. | |
clickhouse_MergeTreeDataWriterSettings | Merge Tree Data Writer Settings | NULL | Information about settings used during MergeTree operations. |
Agent G2 - Linux - ContainerD_v2
Description
Agent G2 - Linux - ContainerD_v2
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - ContainerD_v2 | containerd_cpu_throttling_throttledTime | CPU Throttled Time | Percentage | cpu throttled time |
containerd_memory_usage_limit | Memory Usage Limit | Megabytes | limit of memory usage | |
containerd_memory_swap_limit | Swap Usage Limit | Megabytes | limit of swap usage | |
containerd_memory_usage_failcnt | Memory Usage fail Rate | NULL | rate of number of times that the cgroup limit was exceeded | |
containerd_blkio_sectors_recursive | BlkIO Sectors | Bytes | number of sectors transferred to/from disk by the group | |
containerd_memory_kernel_tcp_usage | ContinaerD Kernel TCP Usage | Megabytes | current tcp buf memory allocation | |
containerd_hugetlb_failcnt | HugeTLB fail Rate | NULL | Rate of allocation failure due to HugeTLB limit | |
containerd_cpu_usage_total | CPU Total Usage | Percentage | total Cpu usage of container with respect to host system | |
containerd_memory_dirty | Memory Dirty | Megabytes | bytes that are waiting to get written back to the disk | |
containerd_cpu_usage_total_over_limit | CPU Total Usage Over Limit | Percentage | container total cpu usage with respect to limit. (if limit is not set the metric is not sent) | |
containerd_memory_kernel_usage | ContinaerD Kernel Usage | Megabytes | current kernel memory allocation | |
containerd_memory_rss | Memory RSS | Megabytes | bytes of anonymous and swap cache memory (includes transparent hugepages) | |
containerd_containers_stopped | Stopped Containers | Count | Total number of Stopped Containers | |
containerd_blkio_wait_time_recursive | BlkIO Wait Time | Bytes | Total amount of time the IOs for this cgroup spent waiting in the scheduler queues for service | |
containerd_memory_cache | Memory Cache | Megabytes | bytes of page cache memory | |
containerd_containers_running | Running Containers | Count | Total number of running containers | |
containerd_memory_rss_huge | Memory RSS Huge | Megabytes | bytes of anonymous transparent hugepages | |
containerd_cpu_usage_user | CPU User Usage | Percentage | user Cpu usage of container with respect to host system | |
containerd_memory_usage_over_limit | Memory Usage Over Limit | Percentage | Memory Usage percentage with respect to limit(if limit is not set then node total memory is used) | |
containerd_memory_kernel_max | Kernel Max | Megabytes | max kernel memory usage recorded | |
containerd_container_uptime | Container Uptime | Seconds | Uptime of the Current Container | |
containerd_cpu_usage_system | CPU System Usage | Percentage | system Cpu usage of container with respect to host system | |
containerd_proc_open_fds | number of open fd | Count | Number of open file descriptors | |
containerd_memory_usage_max | Memory Usage Max | Megabytes | show max memory usage recorded | |
containerd_blkio_queued_recursive | BlkIO Queued | Bytes | Total number of requests queued up at any given instant for the cgroup | |
containerd_memory_swap_max | Swap Usage Max | Megabytes | show max swap usage recorded | |
containerd_memory_kernel_tcp_failcnt | Kernel TCP fail rate | NULL | rate of number of tcp buf memory usage hits limits | |
containerd_hugetlb_usage | ContianerD HugeTLB usage | Bytes | current usage for "hugepagesize" hugetlb | |
containerd_blkio_merged_recursive | BlkIO Merged | Bytes | Total number of bios/requests merged into requests belonging to this cgroup | |
containerd_blkio_service_time_recursive | BlkIO Service Time | Bytes | Total amount of time between request dispatch and request completion for the IOs | |
containerd_blkio_serviced_recursive | BlkIO Serviced | Bytes | Number of IOs (bio) issued to the disk by the group | |
containerd_blkio_service_bytes_recursive | BlkIO Service Bytes | Bytes | Number of bytes transferred to/from the disk | |
containerd_memory_swap_failcnt | Swap Usage fail Rate | NULL | rate of number of times that the cgroup swap limit was exceeded | |
containerd_memory_kernel_failcnt | Kernel fail count | NULL | rate of the number of kernel memory usage hits limits | |
containerd_image_size | Image Size | Bytes | Image sizes of different container images | |
containerd_memory_swap_usage | ContinaerD Swap Usage | Megabytes | swap Usage in Bytes | |
containerd_hugetlb_max | HugeTLB max usage | Bytes | max "hugepagesize" hugetlb usage recorded | |
containerd_memory_usage | Memory Usage | Megabytes | memory Usage in Bytes | |
containerd_blkio_time_recursive | BlkIO Time | Bytes | disk time allocated to cgroup per device in milliseconds | |
containerd_memory_kernel_tcp_limit | Kernel TCP Limit | Megabytes | show hard limit for tcp buf memory | |
containerd_memory_kernel_limit | Kernel Limit | Megabytes | hard limit for kernel memory | |
containerd_memory_kernel_tcp_max | Kernel TCP Max | Megabytes | max tcp buf memory usage recorded |
Agent G2 - Linux - Couchbase Monitors
Description
Applicable on Couchbase servers
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - ContainerD_v2 | containerd_memory_usage_failcnt | Memory Usage fail Rate | NULL | rate of number of times that the cgroup limit was exceeded |
containerd_blkio_sectors_recursive | BlkIO Sectors | Bytes | number of sectors transferred to/from disk by the group | |
containerd_memory_kernel_tcp_usage | ContinaerD Kernel TCP Usage | Megabytes | current tcp buf memory allocation | |
containerd_hugetlb_failcnt | HugeTLB fail Rate | NULL | Rate of allocation failure due to HugeTLB limit | |
containerd_cpu_usage_total | CPU Total Usage | Percentage | total Cpu usage of container with repect to host system | |
containerd_memory_dirty | Memory Dirty | Megabytes | bytes that are waiting to get written back to the disk | |
containerd_cpu_usage_total_over_limit | CPU Total Usage Over Limit | Percentage | container total cpu usage with respect to limit. (if limit is not set the metric is not sent) | |
containerd_memory_kernel_usage | ContinaerD Kernel Usage | Megabytes | current kernel memory allocation | |
containerd_memory_rss | Memory RSS | Megabytes | bytes of anonymous and swap cache memory (includes transparent hugepages) | |
containerd_containers_stopped | Stopped Containers | Count | Total number of Stopped Containers | |
containerd_blkio_wait_time_recursive | BlkIO Wait Time | Bytes | Total amount of time the IOs for this cgroup spent waiting in the scheduler queues for service | |
containerd_memory_cache | Memory Cache | Megabytes | bytes of page cache memory | |
containerd_containers_running | Running Containers | Count | Total number of running containers | |
containerd_memory_rss_huge | Memory RSS Huge | Megabytes | bytes of anonymous transparent hugepages | |
containerd_cpu_usage_user | CPU User Usage | Percentage | user Cpu usage of container with repect to host system | |
containerd_memory_usage_over_limit | Memory Usage Over Limit | Percentage | Memory Usage percentage with respect to limit(if limit is not set then node total memory is used) | |
containerd_memory_kernel_max | Kernel Max | Megabytes | max kernel memory usage recorded | |
containerd_container_uptime | Container Uptime | Seconds | Uptime of the Current Container | |
containerd_cpu_usage_system | CPU System Usage | Percentage | system Cpu usage of container with repect to host system | |
containerd_proc_open_fds | number of open fd | Count | Number of open file descriptors | |
containerd_memory_usage_max | Memory Usage Max | Megabytes | show max memory usage recorded | |
containerd_blkio_queued_recursive | BlkIO Queued | Bytes | Total number of requests queued up at any given instant for the cgroup | |
containerd_memory_swap_max | Swap Usage Max | Megabytes | show max swap usage recorded | |
containerd_memory_kernel_tcp_failcnt | Kernel TCP fail rate | NULL | rate of number of tcp buf memory usage hits limits | |
containerd_hugetlb_usage | ContianerD HugeTLB usage | Bytes | current usage for "hugepagesize" hugetlb | |
containerd_blkio_merged_recursive | BlkIO Merged | Bytes | Total number of bios/requests merged into requests belonging to this cgroup | |
containerd_blkio_service_time_recursive | BlkIO Service Time | Bytes | Total amount of time between request dispatch and request completion for the IOs | |
containerd_blkio_serviced_recursive | BlkIO Serviced | Bytes | Number of IOs (bio) issued to the disk by the group | |
containerd_blkio_service_bytes_recursive | BlkIO Service Bytes | Bytes | Number of bytes transferred to/from the disk | |
containerd_memory_swap_failcnt | Swap Usage fail Rate | NULL | rate of number of times that the cgroup swap limit was exceeded | |
containerd_memory_kernel_failcnt | Kernel fail count | NULL | rate of the number of kernel memory usage hits limits | |
containerd_image_size | Image Size | Bytes | Image sizes of different container images | |
containerd_memory_swap_usage | ContinaerD Swap Usage | Megabytes | swap Usage in Bytes | |
containerd_hugetlb_max | HugeTLB max usage | Bytes | max "hugepagesize" hugetlb usage recorded | |
containerd_memory_usage | Memory Usage | Megabytes | memory Usage in Bytes | |
containerd_blkio_time_recursive | BlkIO Time | Bytes | disk time allocated to cgroup per device in milliseconds | |
containerd_memory_kernel_tcp_limit | Kernel TCP Limit | Megabytes | show hard limit for tcp buf memory | |
containerd_memory_kernel_limit | Kernel Limit | Megabytes | hard limit for kernel memory | |
containerd_memory_kernel_tcp_max | Kernel TCP Max | Megabytes | max tcp buf memory usage recorded |
Agent G2 - Linux - CRI-O Monitoring
Description
Monitoring of OCI-based implementation of Kubernetes Container Runtime Interface
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - CRI-O Monitoring | crio_operations_latency_microseconds | Operations Latency Microseconds | NULL | Latency in microseconds of CRI-O operations. Broken down by operation type |
crio_memory_working_set | Memory Working Set | Bytes | The amount of working set memory in bytes. | |
crio_image_pulls_failures | Image Pulls Failures | NULL | Failed image pulls by image name and their error category. | |
crio_operations_errors | Operations Errors | NULL | Cumulative number of CRI-O operation errors by operation type. | |
crio_mem_resident | Mem Resident | Bytes | Resident memory size in bytes. | |
crio_inodes_used | Inodes Used | NULL | represents the inodes used by the images. (This may not equal InodesCapacity - InodesAvailable because the underlying filesystem may also be used for purposes other than storing images.) | |
crio_cpu_usage_core | CPU Usage | Nanoseconds | Cumulative CPU usage (sum across all cores) since object creation. | |
crio_cpu_time | Cpu Time | NULL | Total user and system CPU time spent in seconds. | |
crio_operations_latency_microseconds_count | Operations Latency Microseconds Count | Microseconds | Latency in microseconds of CRI-O operations. Broken down by operation type. count value | |
crio_operations_latency_microseconds_sum | Operations Latency Microseconds Sum | Microseconds | Latency in microseconds of CRI-O operations. Broken down by operation type. sum value | |
crio_mem_virtual | Mem Virtual | Bytes | Virtual memory size in bytes. | |
crio_filesystem_used | Filesystem Used | Bytes | represents the bytes used for images on the filesystem. (This may differ from the total bytes used on the filesystem and may not equal CapacityBytes - AvailableBytes.) | |
crio_operations | Operations Count | NULL | Cumulative number of CRI-O operations by operation type. | |
crio_process_open_fds | Process Open Fds | NULL | Number of open file descriptors. |
Agent G2 - Linux - Docker Host Monitoring Template
Description
Docker Host Monitoring Template
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Docker Host Monitoring Template | docker.cpu | Docker Cpu | NULL | CPU usage of the Docker Container |
docker.container.states | Docker Container states | NULL | The state of Container | |
docker.mem.cache.95percentile | Cache Size 95percentile | NULL | 95th percentile value of docker.mem.cache | |
docker.mem.inactive_file | Inactive Cache Memory | NULL | The amount of "inactive" cache memory. Inactive memory may be reclaimed first when the system needs memory | |
docker.cpu.shares | Shares of CPU | NULL | Shares of CPU usage allocated to the container | |
docker.image.virtual_size | Image Virtual Size | NULL | Size of all layers of the image on disk | |
docker.mem.rss.95percentile | RSS Memory 95percentile | NULL | 95th percentile value of docker.mem.rss | |
docker.mem.sw_limit | Swap Memory Limit | NULL | The swap + memory limit for the container, if set | |
kubernetes.network.tx_bytes | Network Tx Bytes | NULL | The amount of bytes per second transmitted | |
docker.cpu.system | CPU System | NULL | The percent of time the CPU is executing system calls on behalf of processes of this container, unnormalized | |
docker.cpu.throttled | CPU Throttled | NULL | Number of times the cgroup has been throttled | |
docker.image.size | Image Size | NULL | Size of all layers of the image on disk | |
docker.mem.in_use | Memory In Use | NULL | The fraction of used memory to available memory, IF THE LIMIT IS SET | |
docker.io.write_bytes | IO Write Bytes | NULL | Bytes written per second to disk by the processes of the container | |
docker.mem.active_file | Active Cache Memory | NULL | The amount of "active" cache memory. Active memory is reclaimed by the system only after "inactive" has been reclaimed | |
docker.containers.running | Containers Running by Image | NULL | The number of containers running on this host tagged by image | |
docker.mem.mapped_file | Memory Mapped by Process | NULL | The amount of memory mapped by the processes in the control group | |
kubernetes.memory.usage | Memory Usage | NULL | The amount of memory used | |
docker.mem.cache | Cache Size | NULL | The amount of memory that is being used to cache data from disk (e.g. memory contents that can be associated precisely with a block on a block device) | |
docker.cpu.usage | CPU Usage | NULL | The percent of CPU time obtained by this container | |
docker.mem.sw_in_use | Swap Memory In Use | NULL | The fraction of used swap + memory to available swap + memory, if the limit is set | |
docker.mem.in_use.95percentile | Memory In Use 95percentile | NULL | 95th percentile of docker.mem.in_use | |
docker.images.intermediate | Images intermediate | NULL | The number of intermediate images, which are intermediate layers that make up other images | |
docker.containers.running_total | Docker Container Running Total | NULL | The total number of containers running on this host | |
docker.mem.active_anon | Active RSS Memory | NULL | The amount of "active" RSS memory. Active memory is not swapped to disk | |
docker.io.read_bytes.95percentile | IO Read Bytes 95percentile | NULL | 95th percentile of docker.io.read_bytes | |
docker.mem.sw_limit.95percentile | Swap Memory Limit 95percentile | NULL | 95th percentile of docker.mem.sw_limit. Ordinarily this value will not change | |
kubernetes.network.rx_bytes | Network Rx Bytes | NULL | The amount of bytes per second received | |
docker.container.size_rootfs | Root Filesystem Size | NULL | Total size of all the files in the container | |
kubernetes.memory.limits | Memory Limits | NULL | The limit of memory set | |
docker.mem.inactive_anon | Inactive RSS Memory | NULL | The amount of "inactive" RSS memory. Inactive memory is swapped to disk when necessary | |
docker.mem.pgpgout | Pages Uncharged Rate | NULL | The rate at which pages are "uncharged" (removed from the accounting) of a cgroup | |
docker.mem.rss | RSS Memory | NULL | The amount of non-cache memory that belongs to the container's processes. Used for stacks, heaps, etc. | |
docker.io.read_bytes | IO Read Bytes | NULL | Bytes read per second from disk by the processes of the container | |
docker.mem.pgpgin | Pages Charged Rate | NULL | The rate at which pages are "charged" (added to the accounting) of a cgroup | |
docker.container.size_rw.95percentile | Total Files Size 95Percentile | NULL | 95th percentile of docker.container.size_rw | |
docker.mem.pgfault | Memory Page Faults | NULL | The rate that processes in the container trigger page faults by accessing a nonexistent or protected part of its virtual address space. Usually a page fault of this type results in a segmentation fault | |
docker.container.size_rw | Total Files Size | NULL | Total size of all the files in the container which have been created or changed by processes running in the container | |
docker.images.available | Images Available | NULL | The number of top-level images | |
docker.containers.stopped | Containers Stopped by Image | NULL | The number of containers stopped on this host tagged by image | |
docker.memory | Docker Memory | NULL | Memory usage of the Docker Container | |
docker.cpu.system.95percentile | CPU System 95Percentile | NULL | 95th percentile of docker.cpu.system | |
docker.mem.limit | Memory Limit | NULL | The memory limit for the container, if set | |
docker.io.write_bytes.95percentile | IO Write Bytes 95percentile | NULL | 95th percentile of docker.io.write_bytes | |
docker.mem.pgmajfault | Memory Page Faults Virtual | NULL | The rate that processes in the container trigger page faults by accessing a part virtual address space that was swapped out or corresponded to a mapped file. Usually a page fault of type results in fetching data from disk instead of from memory | |
docker.cpu.user.95percentile | CPU User 95Percentile | NULL | 95th percentile of docker.cpu.user | |
docker.cpu.user | CPU User | NULL | The percent of time the CPU is under direct control of processes of this container, unnormalized | |
docker.mem.limit.95percentile | Memory Limit 95percentile | NULL | 95th percentile of docker.mem.limit. Ordinarily this value will not change | |
docker.disk.io | Docker Disk Iops | NULL | Disk IOPS of the Docker Container | |
docker.container.size_rootfs.95percentile | Root Filesystem Size 95Percentile | NULL | 95th percentile of docker.container.size_rootfs | |
docker.mem.sw_in_use.95percentile | Swap Memory In Use 95percentile | NULL | 95th percentile of docker.mem.sw_in_use |
Agent G2 - Linux - Docker Monitoring Template
Description
Template to monitor docker host metrics and containers.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Docker Monitoring Template | docker.containers.running.total | Docker Total Containers | NULL | |
docker.cpu | Docker Cpu | NULL | CPU usage of the Docker Container | |
docker.memory | Docker Memory | NULL | Memory usage of the Docker Container | |
docker.network | Docker Network | NULL | Network usage of the Docker Container | |
docker.disk.io | Docker Disk Iops | NULL | Disk IOPS of the Docker Container |
Agent G2 - Linux - Docker Monitors
Description
Docker Monitoring Template
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Docker Monitors | docker.containers.running.total | Docker Total Containers | NULL | |
docker.disk.io | Docker Disk Iops | NULL | Disk IOPS of the Docker Container | |
docker.cpu | Docker Cpu | NULL | CPU usage of the Docker Container | |
docker.network | Docker Network | NULL | Network usage of the Docker Container | |
docker.memory | Docker Memory | NULL | Memory usage of the Docker Container |
Agent G2 - Linux - Elasticsearch Monitors
Description
Monitors various performance metrics on the Elasticsearch servers
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Elasticsearch Monitors | es.flush.time | ElasticSearch Flush time | NULL | |
es.jvm.gc.collection_time | ESJVM GC collection_time | NULL | ||
es.merges.ops | ElasticSearch Merges ops | NULL | ||
es.search.fetch.ops | ElasticSearch Search fetch ops | NULL | ||
es.cache.field.size | ElasticSearch Cache field size | NULL | ||
es.jvm.gc.par_new.collection_time | ESJVM GC par_new collection_time | NULL | ||
es.cluster.initializing_shards | ElasticSearch Cluster Initializing shards | NULL | ||
es.search.query.time | ElasticSearch Search query time | NULL | ||
es.process.openfds | ElasticSearch Process OpenFDs | NULL | ||
es.jvm.gc.copy.count | ESJVM GC copy count | NULL | ||
es.cluster.unassigned_shards | ElasticSearch Cluster Unassigned shards | NULL | ||
es.docs.deleted | ElasticSearch Docs deleted | NULL | ||
es.refresh.time | ElasticSearch Refresh time | NULL | ||
es.search.fetch.time | ElasticSearch Search fetch time | NULL | ||
es.failure.domains | ElasticSearch Failure Domains | NULL | ||
es.jvm.gc.par_new.count | ESJVM GC par_new count | NULL | ||
es.merged.docs.size | ElasticSearch Merged docs size | NULL | ||
es.cache.filter.size | ElasticSearch Cache filter size | NULL | ||
es.search.query.ops | ElasticSearch Search query ops | NULL | ||
es.cluster.shards.active.primary | ElasticSearch Cluster Active primary shards | NULL | ||
es.jvm.mem.heap_committed | ESJVM Mem heap_committed | NULL | ||
es.cache.filter.evictions | ElasticSearch Cache filter evictions | NULL | ||
es.cluster.nodes | ElasticSearch Cluster Nodes | NULL | ||
es.docs.total | ElasticSearch Docs total | NULL | ||
es.cache.field.evictions | ElasticSearch Cache field evictions | NULL | ||
es.cluster.data_nodes | ElasticSearch Cluster Data nodes | NULL | ||
es.jvm.mem.non_heap_used | ESJVM Mem non_heap_used | NULL | ||
es.Store.size | ElasticSearch Store size | NULL | ||
es.cluster.relocating_shards | ElasticSearch Cluster Relocating shards | NULL | ||
es.jvm.gc.concurrent_mark_sweep.collection_time | ESJVM GC concurrent_mark_sweep collection_time | NULL | ||
es.cache.filter.count | ElasticSearch Cache filter count | NULL | ||
es.cluster.shards.active | ElasticSearch Cluster Active shards | NULL | ||
es.jvm.mem.non_heap_committed | ESJVM Mem non_heap_committed | NULL | ||
es.jvm.gc.copy.collection_time | ESJVM GC copy collection_time | NULL | ||
es.merges.time | ElasticSearch Merges time | NULL | ||
es.refresh.ops | ElasticSearch Refresh ops | NULL | ||
es.indexing.index.time | ElasticSearch Indexing index time | NULL | ||
es.indexing.index.count | ElasticSearch Indexing index count | NULL | ||
es..master.eligible.nodes | ElasticSearch Master Eligible Nodes | NULL | ||
es.jvm.gc.collection_count | ESJVM GC collection_count | NULL | ||
es.flush.ops | ElasticSearch Flush ops | NULL | ||
es.merged.docs.count | ElasticSearch Merged docs count | NULL | ||
es.jvm.gc.concurrent_mark_sweep.count | ESJVM GC concurrent_mark_sweep count | NULL | ||
es.jvm.mem.heap_used | ESJVM Mem heap_used | NULL | ||
es.jvm.threads | ESJVM Threads | NULL |
Agent G2 - Linux - etcd - v2
Description
Agent G2 - Linux - etcd - v2
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - etcd - v2 | etcd_debugging_snap_save_marshalling_duration_seconds_bucket | Snap Save Marshalling Duration | NULL | The marshalling cost distributions of save called by snapshot. |
etcd_go_threads | Go Threads | NULL | Number of OS threads created. | |
etcd_network_peer_sent_bytes_total_per_sec | Network Peer Sent Bytes Total | NULL | The total number of bytes sent to peers per sec | |
etcd_go_memstats_stack_inuse_bytes | Go Memstats Stack Inuse Bytes | NULL | Number of bytes in use by the stack allocator. | |
etcd_debugging_store_watchers | Store Watchers | NULL | Count of currently active watchers. | |
etcd_debugging_mvcc_keys_total | Mvcc Keys | NULL | Total number of keys. | |
etcd_go_memstats_heap_idle_bytes | Go Memstats Heap Idle Bytes | NULL | Number of heap bytes waiting to be used. | |
etcd_grpc_server_handled_total | Grpc Server Handled Total | NULL | Total number of RPCs completed on the server, regardless of success or failure. | |
etcd_grpc_proxy_events_coalescing_total | Grpc Proxy Events Coalescing | NULL | Total number of events coalescing | |
etcd_disk_wal_fsync_duration_seconds_bucket | Disk Wal Fsync Duration | NULL | The latency distributions of fsync called by wal. | |
etcd_network_peer_round_trip_time_seconds | Network Peer Round Trip Time Seconds | NULL | Round-Trip-Time histogram between peers | |
etcd_server_proposals_failed_total | Server Proposals Failed | NULL | The total number of failed proposals seen. | |
etcd_debugging_store_expires_total_per_sec | Store Expires | NULL | Total number of expired keys per sec | |
etcd_grpc_proxy_cache_hits_total | Grpc Proxy Cache Hits | NULL | Total number of cache hits | |
etcd_debugging_mvcc_db_total_size_in_bytes | Mvcc Db Size In Bytes | NULL | Total size of the underlying database in bytes. | |
etcd_go_memstats_mspan_sys_bytes | Go Memstats Mspan Sys Bytes | NULL | Number of bytes used for mspan structures obtained from system. | |
etcd_debugging_snap_save_total_duration_seconds_bucket | Snap Save Duration | NULL | The total latency distributions of save called by snapshot. | |
etcd_debugging_mvcc_delete_total | Mvcc Delete | NULL | Total number of deletes seen by this member. | |
etcd_go_memstats_gc_cpu_fraction | Go Memstats Gc Cpu Fraction | NULL | The fraction of this program's available CPU time used by the GC since the program started. | |
etcd_debugging_mvcc_db_compaction_total_duration_milliseconds_bucket | Mvcc Db Compaction Duration | NULL | Bucketed histogram of db compaction total duration. | |
etcd_network_client_grpc_sent_bytes_total_per_sec | Network Client Grpc Sent Bytes | NULL | The total number of bytes sent to grpc clients per sec | |
etcd_go_memstats_heap_alloc_bytes | Go Memstats Heap Alloc Bytes | NULL | Number of heap bytes allocated and still in use. | |
etcd_network_client_grpc_sent_bytes_total | Network Client Grpc Sent Bytes | NULL | The total number of bytes sent to grpc clients. | |
etcd_debugging_store_expires_total | Store Expires | NULL | Total number of expired keys. | |
etcd_debugging_mvcc_db_compaction_keys_total | Mvcc Db Compaction Keys | NULL | Total number of db keys compacted. | |
etcd_debugging_server_lease_expired_total | Server Lease Expired | NULL | The total number of expired leases. | |
etcd_network_client_grpc_received_bytes_total_per_sec | Network Client Grpc Received Bytes | NULL | The total number of bytes received from grpc clients per sec | |
etcd_debugging_store_writes_total | Store Writes | NULL | Total number of writes (e.g. set/compareAndDelete) seen by this member. | |
etcd_go_memstats_buck_hash_sys_bytes | Go Memstats Buck Hash Sys Bytes | NULL | Number of bytes used by the profiling bucket hash table. | |
etcd_server_leader_changes_seen_total_per_sec | Server Leader Changes Seen | NULL | The number of leader changes seen per sec | |
etcd_network_peer_received_bytes_total_per_sed | Network Peer Received Bytes Total | Bytes Per Sec | The total number of bytes received from peers per sec | |
etcd_server_proposals_applied_total | Server Proposals Applied | NULL | The total number of consensus proposals applied. | |
etcd_grpc_proxy_watchers_coalescing_total | Grpc Proxy Watchers Coalescing | NULL | Total number of current watchers coalescing | |
etcd_server_is_leader | Server Is Leader | NULL | Whether or not this member is a leader. 1 if is, 0 otherwise. | |
etcd_process_start_time_seconds | Process Start Time Seconds | NULL | Start time of the process since unix epoch in seconds. | |
etcd_debugging_mvcc_events_total | Mvcc Events | NULL | Total number of events sent by this member. | |
etcd_debugging_mvcc_txn_total | Mvcc Txn | NULL | Total number of txns seen by this member. | |
etcd_debugging_store_watch_requests_total | Store Watch Requests | NULL | Total number of incoming watch requests (new or reestablished). | |
etcd_go_memstats_lookups_total_per_sec | Go Memstats Lookups Total | NULL | Total number of pointer lookups per sec | |
etcd_server_version | Server Version | NULL | Which version is running. 1 for 'server_version' label with current version | |
etcd_grpc_server_started_total | Grpc Server Started Total | NULL | Total number of RPCs started on the server. | |
etcd_go_memstats_alloc_bytes | Go Memstats Alloc Bytes | NULL | Number of bytes allocated and still in use. | |
etcd_disk_backend_commit_duration_seconds_bucket | Disk Backend Commit Duration | NULL | The latency distributions of commit called by backend. | |
etcd_debugging_store_reads_total_per_sec | Store Reads | NULL | Total number of reads action by (get/getRecursive), local to this member per sec | |
etcd_process_open_fds | Process Open Fds | NULL | Number of open file descriptors. | |
etcd_disk_wal_fsync_duration_seconds_count | Disk Wal Fsync Duration Seconds Count | NULL | The latency distributions of fsync called by wal. | |
etcd_go_memstats_heap_inuse_bytes | Go Memstats Heap Inuse Bytes | NULL | Number of heap bytes that are in use. | |
etcd_go_memstats_next_gc_bytes | Go Memstats Next Gc Bytes | NULL | Number of heap bytes when next garbage collection will take place. | |
etcd_debugging_mvcc_range_total_per_sec | Mvcc Range | NULL | Total number of ranges seen by this member per sec | |
etcd_process_cpu_seconds_total_per_sec | Process Cpu Seconds Total | NULL | Total user and system CPU time spent per sec | |
etcd_debugging_mvcc_db_compaction_keys_total_per_sec | Mvcc Db Compaction Keys | NULL | Total number of db keys compacted per sec | |
etcd_go_memstats_heap_released_bytes | Go Memstats Heap Released Bytes | NULL | Number of heap bytes released to OS. | |
etcd_server_leader_changes_seen_total | Server Leader Changes Seen | NULL | The number of leader changes seen. | |
etcd_debugging_mvcc_put_total | Mvcc Put | NULL | Total number of puts seen by this member. | |
etcd_process_resident_memory_bytes | Process Resident Memory Bytes | NULL | Resident memory size in bytes. | |
etcd_debugging_mvcc_slow_watcher_total | Mvcc Slow Watcher | NULL | Total number of unsynced slow watchers. | |
etcd_debugging_mvcc_index_compaction_pause_duration_milliseconds_bucket | Mvcc Index Compaction Pause Duration | NULL | Bucketed histogram of index compaction pause duration. | |
etcd_debugging_mvcc_db_compaction_pause_duration_milliseconds_bucket | Mvcc Db Compaction Pause Duration | NULL | Bucketed histogram of db compaction pause duration. | |
etcd_grpc_proxy_cache_keys_total | Grpc Proxy Cache Keys | NULL | Total number of keys/ranges cached | |
etcd_go_memstats_alloc_bytes_total_per_sec | Go Memstats Alloc Bytes Total | NULL | Total number of bytes allocated, even if freed per sec | |
etcd_go_memstats_mcache_inuse_bytes | Go Memstats Mcache Inuse Bytes | NULL | Number of bytes in use by mcache structures. | |
etcd_debugging_store_reads_total | Store Reads | NULL | Total number of reads action by (get/getRecursive), local to this member. | |
etcd_go_memstats_heap_objects | Go Memstats Heap Objects | NULL | Number of allocated objects. | |
etcd_server_has_leader | Server Has Leader | NULL | Whether or not a leader exists. 1 is existence, 0 is not. | |
etcd_debugging_mvcc_watch_stream_total | Mvcc Watch Stream | NULL | Total number of watch streams. | |
etcd_go_memstats_sys_bytes | Go Memstats Sys Bytes | NULL | Number of bytes obtained from system. | |
etcd_network_peer_received_bytes_total_per_sec | Network Peer Received Bytes Total | NULL | The total number of bytes received from peers per sec | |
etcd_debugging_mvcc_put_total_per_sec | Mvcc Put | NULL | Total number of puts seen by this member per sec | |
etcd_go_memstats_mspan_inuse_bytes | Go Memstats Mspan Inuse Bytes | NULL | Number of bytes in use by mspan structures. | |
etcd_debugging_mvcc_range_total | Mvcc Range | NULL | Total number of ranges seen by this member. | |
etcd_go_memstats_other_sys_bytes | Go Memstats Other Sys Bytes | NULL | Number of bytes used for other system allocations. | |
etcd_debugging_store_writes_total_per_sec | Store Writes | NULL | Total number of writes (e.g. set/compareAndDelete) seen by this member per sec | |
etcd_network_client_grpc_received_bytes_total | Network Client Grpc Received Bytes | NULL | The total number of bytes received from grpc clients. | |
etcd_go_info | Go Info | NULL | Information about the Go runtime. | |
etcd_debugging_mvcc_slow_watcher_total_per_sec | Mvcc Slow Watcher | NULL | Total number of unsynced slow watchers per sec | |
etcd_debugging_store_watch_requests_total_per_sec | Store Watch Requests | NULL | Total number of incoming watch requests (new or reestablished) per sec | |
etcd_debugging_snap_save_marshalling_duration_seconds_count | Snap Save Marshalling Duration | NULL | The marshalling cost distributions of save called by snapshot. | |
etcd_debugging_snap_save_total_duration_seconds_count | Snap Save Duration | NULL | The total latency distributions of save called by snapshot. | |
etcd_network_client_grpc_sent_bytes_total_per_sec | Network Client Grpc Sent Bytes | NULL | The total number of bytes sent to grpc clients per sec | |
etcd_disk_backend_commit_duration_seconds_count | Disk Backend Commit Duration | NULL | The latency distributions of commit called by backend. | |
etcd_debugging_store_reads_bytes_total | Store Reads Bytes | NULL | Total number of bytes read out for reads action by (get/getRecursive), local to this member | |
etcd_debugging_mvcc_slow_watcher_total_per_sec | Mvcc Slow Watcher | NULL | Total number of unsynced slow watchers per sec | |
etcd_debugging_snap_save_total_duration_seconds_bucket | Snap Save Duration | NULL | The total latency distributions of save called by snapshot. | |
etcd_disk_wal_fsync_duration_seconds_count | Disk Wal Fsync Duration Seconds Count | NULL | The latency distributions of fsync called by wal. | |
etcd_debugging_store_reads_bytes_total_per_sec | Store Reads Bytes | NULL | Total number of bytes read out for reads action by (get/getRecursive), local to this member per sec | |
etcd_debugging_store_writes_bytes_total | Store Writes Bytes | NULL | Total number of bytes written out for writes action by (set/compareAndDelete), local to this member | |
etcd_disk_backend_commit_duration_seconds_bucket | Disk Backend Commit Duration | NULL | The latency distributions of commit called by backend. | |
etcd_debugging_store_writes_bytes_total_per_sec | Store Writes Bytes | NULL | Total number of bytes written out for writes action by (set/compareAndDelete), local to this member per sec |
Agent G2 - Linux - Hadoop JobTracker Service Monitors
Description
Monitors Hadoop Job tracker metrics
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Hadoop JobTracker Service Monitors | hadoop.jobtracker.reduce.slots.used | Hadoop JobTracker Reduce Slots Used | NULL | The Number of currently occupied/reserved used reduce slots |
hadoop.jobtracker.rpc.latency | Hadoop JobTracker RPC Latency | NULL | Calculates the average time spent by an RPC request in the queue | |
hadoop.jobtracker.nodes.alive | Hadoop JobTracker Alive Nodes | NULL | The Total Number of Alive Nodes in the cluster | |
hadoop.jobtracker.nodes.total | Hadoop JobTracker Total Nodes | NULL | The Total Number of Nodes in the cluster | |
hadoop.jobtracker.map.slots.used | Hadoop JobTracker Map Slots Used | NULL | The Number of Currently occupied/reserved used map slots | |
hadoop.jobtracker.nodes.black.listed | Hadoop JobTracker Black Listed | NULL | The Number of BlackListed trackers in the cluster | |
hadoop.jobtracker.nodes.gray.listed | Hadoop JobTracker Gray Listed | NULL | The Number of Graylisted trackers in the cluster | |
hadoop.jobtracker.map.slots.total | Hadoop JobTracker Map Slots | NULL | The Total Number of Map Slots in the cluster | |
hadoop.jobtracker.jobs | Hadoop JobTracker Total Jobs | NULL | The Total Number of jobs in the cluster | |
hadoop.jobtracker.reduce.slots.total | Hadoop JobTracker Reduce Slots | NULL | The Number of Currently occupied/reserved reduce slots | |
hadoop.jobtracker.failures | Hadoop JobTracker Failure Nodes | NULL | The Number of Decommissioned trackers in the cluster | |
hadoop.jobtracker.dir.failures | Hadoop JobTracker Dir Failures | NULL | The Number of Failure directories in the cluster | |
hadoop.jobtracker.nodes.dead | Hadoop JobTracker Dead Nodes | NULL | The Total Number of Dead Nodes in the Cluster |
Agent G2 - Linux - HAProxy Monitors
Description
Monitors HAProxy application metrics
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - HAProxy Monitors | haproxy.http.errors_3xx | HAProxy 3xx HTTP Error | NULL | Number of http error responses with 3xx code |
haproxy.session_current | HAProxy Sessions Active | NULL | Current number of concurrent connections. | |
haproxy.errors_resp_rate | HAProxy Response Errors | NULL | The rate of response errors. | |
haproxy.requests_queue | HAProxy Queued Requests | NULL | Number of requests in the server queue. | |
haproxy.http.errors_1xx | HAProxy 1xx HTTP Error | NULL | Number of http error responses with 1xx code | |
haproxy.http.errors_5xx | HAProxy 5xx HTTP Error | NULL | Number of http error responses with 5xx code | |
haproxy.denied_resp_rate | HAProxy Denied Responses | NULL | The rate of denied responses. | |
haproxy.warning.retr_rate | HAProxy Warn Retries | NULL | The rate of retries (warning). | |
haproxy.warning.redis_rate | HAProxy Warn Redis patches | NULL | The rate of dispatches (warning). | |
haproxy.servers_backup | HAProxy Backup Servers | NULL | Number of current backup servers (backend). Validates against total backup servers. | |
haproxy.mbytes_in_rate | HAProxy Data Received | NULL | The rate at which the data is received per server in MB. | |
haproxy.http.errors_2xx | HAProxy 2xx HTTP Error | NULL | Number of http error responses with 2xx code | |
haproxy.http.errors_4xx | HAProxy 4xx HTTP Error | NULL | Number of http error responses with 4xx code | |
haproxy.errors_con_rate | HAProxy Connection Errors | NULL | The rate of connection errors. | |
haproxy.requests_rate | HAProxy Requests | NULL | The rate of received HTTP requests. | |
haproxy.denied_req_rate | HAProxy Denied Requests | NULL | The rate of denied requests. | |
haproxy.errors_req_rate | HAProxy Request Errors | NULL | The rate of request errors. | |
haproxy.mbytes_out_rate | HAProxy Data Sent | NULL | The rate at which the data is sent per server in MB. | |
haproxy.session_rate | HAProxy Sessions | NULL | Number of sessions per second. | |
haproxy.lastchk_time | HAProxy Last Health Check Time | NULL | Time in ms took to finish last health check. | |
haproxy.servers_active | HAProxy Active Servers | NULL | Number of current active servers (backend). Validates against total active servers. |
Agent G2 - Linux - HAProxy Performance Statistics
Description
Monitors HAProxy stats module
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - HAProxy Performance Statistics | haproxy.denied_resp_rate | HAProxy Denied Responses | NULL | The rate of denied responses. |
haproxy.servers_backup | HAProxy Backup Servers | NULL | Number of current backup servers (backend). Validates against total backup servers. | |
haproxy.warning.redis_rate | HAProxy Warn Redis patches | NULL | The rate of dispatches (warning). | |
haproxy.warning.retr_rate | HAProxy Warn Retries | NULL | The rate of retries (warning). | |
haproxy.lastchk_time | HAProxy Last Health Check Time | NULL | Time in ms took to finish last health check. | |
haproxy.denied_req_rate | HAProxy Denied Requests | NULL | The rate of denied requests. | |
haproxy.http.errors_4xx | HAProxy 4xx HTTP Error | NULL | Number of http error responses with 4xx code | |
haproxy.requests_rate | HAProxy Requests | NULL | The rate of received HTTP requests. | |
haproxy.errors_resp_rate | HAProxy Response Errors | NULL | The rate of response errors. | |
haproxy.errors_req_rate | HAProxy Request Errors | NULL | The rate of request errors. | |
haproxy.http.errors_1xx | HAProxy 1xx HTTP Error | NULL | Number of http error responses with 1xx code | |
haproxy.servers_active | HAProxy Active Servers | NULL | Number of current active servers (backend). Validates against total active servers. | |
haproxy.errors_con_rate | HAProxy Connection Errors | NULL | The rate of connection errors. | |
haproxy.mbytes_out_rate | HAProxy Data Sent | NULL | The rate at which the data is sent per server in MB. | |
haproxy.http.errors_2xx | HAProxy 2xx HTTP Error | NULL | Number of http error responses with 2xx code | |
haproxy.mbytes_in_rate | HAProxy Data Received | NULL | The rate at which the data is received per server in MB. | |
haproxy.http.errors_3xx | HAProxy 3xx HTTP Error | NULL | Number of http error responses with 3xx code | |
haproxy.http.errors_5xx | HAProxy 5xx HTTP Error | NULL | Number of http error responses with 5xx code | |
haproxy.session_rate | HAProxy Sessions | NULL | Number of sessions per second. | |
haproxy.requests_queue | HAProxy Queued Requests | NULL | Number of requests in the server queue. | |
haproxy.session_current | HAProxy Sessions Active | NULL | Current number of concurrent connections. |
Agent G2 - Linux - HBase Monitors
Description
Monitors HBase application metrics
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - HBase Monitors | hbase.dead.region.servers | HBase Dead Region Servers | NULL | The number of dead region servers. |
hbase.live.region.servers | HBase Live Region Servers | NULL | The number of online region servers. | |
hbase.average.load | HBase Average Load | NULL | Average number of regions served by each region server. | |
hbase.cluster.requests | HBase Cluster Requests | NULL | The total number of requests from all region servers to a cluster. |
Agent G2 - Linux - HBase Performance Check
Description
Monitors HBase application metrics via MBeans exposed by the JMX Console
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - HBase Performance Check | hbase.live.region.servers | HBase Live Region Servers | NULL | The number of online region servers. |
hbase.dead.region.servers | HBase Dead Region Servers | NULL | The number of dead region servers. | |
hbase.cluster.requests | HBase Cluster Requests | NULL | The total number of requests from all region servers to a cluster. | |
hbase.average.load | HBase Average Load | NULL | Average number of regions served by each region server. |
Agent G2 - Linux - HDFS Datanode Template
Description
Monitor HDFS Datanodes
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - HDFS Datanode Template | hdfs.datanode.num_blocks_failed_to_uncache | HDFS Datanode NumBlocksFailedToUncache | NULL | The number of failed blocks to remove from cache. |
hdfs.datanode.last_volume_failure_date | HDFS Datanode LastVolumeFailureDate | NULL | The date/time of the last volume failure in milliseconds since epoch. | |
hdfs.datanode.cache_used | HDFS Datanode Cache Used | NULL | Cache used in bytes. | |
hdfs.datanode.num_blocks_cached | HDFS Datanode NumBlocksCached | NULL | The number of blocks cached. | |
hdfs.datanode.cache_capacity | HDFS Datanode Cache Capacity | NULL | Cache capacity in bytes. | |
hdfs.datanode.dfs_remaining_percent | HDFS Datanode Dfs Remaining Percent | NULL | The remaining disk space left in Percent. | |
hdfs.datanode.estimated_capacity_lost_total | HDFS Datanode EstimatedCapacityLostTotal | NULL | The estimated capacity lost in bytes. | |
hdfs.datanode.num_blocks_failed_to_cache | HDFS Datanode NumBlocksFailedToCache | NULL | The number of blocks that failed to cache. | |
hdfs.datanode.process_cpu_load | HDFS Datanode Process CpuLoad | NULL | The CPU Load of the Process. | |
hdfs.datanode.dfs_capacity | HDFS Datanode Dfs Capacity | NULL | Disk capacity in bytes. | |
hdfs.datanode.num_failed_volumes | HDFS Datanode NumFailedVolumes | NULL | Number of failed volumes. | |
hdfs.datanode.dfs_used_percent | HDFS Datanode Dfs Used Percent | NULL | Disk usage in Percent. |
Agent G2 - Linux - HDFS Namenode Template
Description
Monitor HDFS Namenodes
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - HDFS Namenode Template | hdfs.namenode.capacity_total | HDFS Namenode CapacityTotal | NULL | Total disk capacity in bytes. |
hdfs.namenode.num_live_data_nodes | HDFS Namenode NumLiveDataNodes | NULL | Total number of live data nodes. | |
hdfs.namenode.pending_deletion_blocks | HDFS Namenode PendingDeletionBlocks | NULL | Number of pending deletion blocks. | |
hdfs.namenode.files_total | HDFS Namenode FilesTotal | NULL | Total number of files. | |
hdfs.namenode.volume_failures_total | HDFS Namenode VolumeFailuresTotal | NULL | Total volume failures. | |
hdfs.namenode.under_replicated_blocks | HDFS Namenode UnderReplicatedBlocks | NULL | Number of under replicated blocks. | |
hdfs.namenode.scheduled_replication_blocks | HDFS Namenode ScheduledReplicationBlocks | NULL | Number of blocks scheduled for replication. | |
hdfs.namenode.num_stale_data_nodes | HDFS Namenode NumStaleDataNodes | NULL | Number of stale data nodes. | |
hdfs.namenode.nondfs_used_percent | HDFS Namenode NonDfsUsedPercent | NULL | Total space used by NonDfs in Percentage. | |
hdfs.namenode.blocks_total | HDFS Namenode BlocksTotal | NULL | Total number of blocks. | |
hdfs.namenode.num_decom_dead_data_nodes | HDFS Namenode NumDecomDeadDataNodes | NULL | Number of decommissioning dead data nodes. | |
hdfs.namenode.num_failed_data_nodes | HDFS Namenode NumFailedDataNodes | NULL | Total number of failed data nodes. | |
hdfs.namenode.estimated_capacity_lost_total | HDFS Namenode EstimatedCapacityLostTotal | NULL | Estimated capacity lost in bytes. | |
hdfs.namenode.num_stale_storages | HDFS Namenode NumStaleStorages | NULL | Number of stale storages. | |
hdfs.namenode.total_load | HDFS Namenode TotalLoad | NULL | Total load on the file system. | |
hdfs.namenode.missing_blocks | HDFS Namenode MissingBlocks | NULL | Number of missing blocks. | |
hdfs.namenode.corrupt_blocks | HDFS Namenode CorruptBlocks | NULL | Number of corrupt blocks. | |
hdfs.namenode.num_dead_data_nodes | HDFS Namenode NumDeadDataNodes | NULL | Total number of dead data nodes. | |
hdfs.namenode.num_decom_live_data_nodes | HDFS Namenode NumDecomLiveDataNodes | NULL | Number of decommissioning live data nodes. | |
hdfs.namenode.max_objects | HDFS Namenode MaxObjects | NULL | Maximum number of files HDFS supports. | |
hdfs.namenode.pending_replication_blocks | HDFS Namenode PendingReplicationBlocks | NULL | Number of blocks pending replication. | |
hdfs.namenode.num_decommissioning_data_nodes | HDFS Namenode NumDecommissioningDataNodes | NULL | Number of decommissioning data nodes. | |
hdfs.namenode.capacity_used_percent | HDFS Namenode CapacityUsedPercent | NULL | Disk usage in Percent. | |
hdfs.namenode.capacity_remaining_percent | HDFS Namenode CapacityRemainingPercent | NULL | Remaining disk space left in Percent. |
Agent G2 - Linux - IPTables Monitors
Description
Monitoring Template for IP Tables application. Monitors chain bandwidth, close connections, established connections, filter failures, etc.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - IPTables Monitors | iptables.chain_bandwidth | IPTables-ChainBandwidth | NULL | Captures traffic following through the IPTables which matches a given Chain. |
iptables.mangle_rules | IPTables-MangleRules | NULL | Checks a given table for a specific number of rules. If the number of rules in that table is less than what is specified in the argument it throws an alert. | |
iptables.nat_rules | IPTables-NatRules | NULL | Checks a given table for a specific number of rules. If the number of rules in that table is less than what is specified in the argument it throws an alert. | |
iptables.established_connections | IPTables-ESTABLISHEDConnections | NULL | Provides the number of active ESTABLISHED connections. | |
iptables.icmp_connections | IPTables-ICMPConnections | NULL | Provides the number of active ICMP connections. | |
iptables.udp_connections | IPTables-UDPConnections | NULL | Provides the number of active UDP connections. | |
iptables.tcp_connections | IPTables-TCPConnections | NULL | Provides the number of active TCP connections. | |
iptables.syn_connections | IPTables-SYNConnections | NULL | Provides the number of active SYN connections. | |
iptables.close_connections | IPTables-CLOSEConnections | NULL | Provides the number of active CLOSE connections. | |
iptables.filter_rules | IPTables-FilterRules | NULL | Checks a given table for a specific number of rules. If the number of rules in that table is less than what is specified in the argument it throws an alert. | |
iptables.time_wait_connections | IPTables-TIME_WAITConnections | NULL | Provides the number of active TIME_WAIT connections. |
Agent G2 - Linux - K3S ApiServer
Description
Template for monitoring K3S through Kubernetes API server
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - K3S ApiServer | apiserver.dropped.requests.total.count | Kube apiserver Dropped Requests Total Count | NULL | Monotonic count of requests dropped with Try-again-later response. |
apiserver.http.requests.total | Kube apiserver HTTP Requests Total | NULL | Total number of HTTP requests made. | |
apiserver.audit.event.total | Kube apiserver Audit Event Total | NULL | Counter of audit events generated and sent to the audit backend. | |
apiserver.authenticated.user.requests | Kube apiserver Authenticated User Requests | NULL | Counter of authenticated requests broken out by username. | |
apiserver.request.duration.seconds.bucket | Kube apiserver Request Duration Seconds Bucket | histogram | Response latency distribution in seconds for each verb, dry run value, group, version, resource, subresource, scope, and component. | |
apiserver.go.threads.total | Kube apiserver Go Threads Total | NULL | Number of OS threads created. | |
apiserver.inflight.requests | Kube apiserver Inflight Requests | NULL | Maximal number of currently used inflight request limit of this apiserver per request kind in the last second. | |
apiserver.request.count | Kube apiserver Request Count | NULL | Counter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code. | |
apiserver.http.requests.total.count | Kube apiserver HTTP Requests Total Count | NULL | Total number of HTTP requests made. | |
apiserver.go.goroutines | Kube apiserver Goroutines | NULL | Number of goroutines that currently exist. | |
apiserver.request.count.count | Kube apiserver Request Count Count | NULL | Counter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code. | |
apiserver.rest.client.requests.total.count | Kube apiserver Rest Client Requests Total Count | NULL | Number of HTTP requests, partitioned by status code, method, and host. | |
apiserver.APIServiceRegistrationController.depth | Kube apiserver APIService Registration Controller Depth | NULL | Current depth of workqueue: APIServiceRegistrationController. | |
apiserver.etcd.object.counts | Kube apiserver ETCD Object Counts | NULL | Number of stored objects at the time of the last check split by kind. | |
apiserver.authenticated.user.requests.count | Kube apiserver Authenticated User Requests Count | NULL | Counter of authenticated requests broken out by username. |
Agent G2 - Linux - K3S CoreDNS
Description
Kubernetes CoreDNS
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - K3S CoreDNS | coredns.response_size.bytes.sum | Response Size Bytes Sum | NULL | Size of the returned response in bytes. |
coredns.query.count | Query count | NULL | Total query count. | |
coredns.request_duration.seconds.sum | Request Duration Seconds Sum | NULL | Duration to process each query. | |
coredns.panics | Total Panics | NULL | Total number of panics. | |
coredns.request_duration.seconds.count | Request Duration Seconds Count | NULL | Duration per upstream interaction. |
Agent G2 - Linux - K3S Kube State
Description
Template for monitoring K3S using Kube State
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - K3S Kube State | kubernetes_state.container.cpu_limit | Container Cpu Limit | NULL | The limit on cpu cores to be used by a container |
kubernetes_state.resourcequota.pods.used | Resourcequota Pods Used | NULL | Observed number of pods used for a resource quota | |
kubernetes_state.replicaset.replicas_desired | Replicaset Replicas Desired | NULL | Number of desired pods for a ReplicaSet | |
kubernetes_state.node.cpu_capacity | Node Cpu Capacity | NULL | The total CPU resources of the node. | |
kubernetes_state.deployment.replicas_desired | Deployment Replicas Desired | NULL | The number of desired replicas per deployment wrong help in kube-state-metrics.cross check | |
kubernetes_state.resourcequota.services.loadbalancers.limit | Resourcequota Services Loadbalancers Limit | NULL | Hard limit of the number of loadbalancers for a resource quota | |
kubernetes_state.node.memory_capacity | Node Memory Capacity | NULL | The total memory resources of the node. | |
kubernetes_state.daemonset.ready | Daemonset Ready | NULL | The number of nodes that should be running the daemon pod and have one or more of the daemon pod running and ready. | |
kubernetes_state.replicaset.replicas | Replicaset Replicas | NULL | The number of replicas per ReplicaSet. | |
kubernetes_state.resourcequota.services.nodeports.limit | Resourcequota Services Nodeports Limit | NULL | Hard limit of the number of node ports for a resource quota | |
kubernetes_state.container.cpu_requested | Container Cpu Requested | NULL | The number of requested cpu cores by a container | |
kubernetes_state.resourcequota.requests.cpu.limit | Resourcequota Requests Cpu Limit | NULL | Hard limit on the total of CPU core requested for a resource quota | |
kubernetes_state.resourcequota.requests.storage.limit | Resourcequota Requests Storage Limit | NULL | Hard limit on the total of storage bytes requested for a resource quota | |
kubernetes_state.resourcequota.limits.memory.used | Resourcequota Limits Memory Used | NULL | Observed sum of limits for memory bytes for a resource quota | |
kubernetes_state.node.pods_capacity | Node Pods Capacity | NULL | The total pod resources of the node. | |
kubernetes_state.deployment.replicas | Deployment Replicas | NULL | The number of replicas per deployment. | |
kubernetes_state.deployment.replicas_available | Deployment Replicas Available | NULL | The number of available replicas per deployment. | |
kubernetes_state.resourcequota.requests.memory.used | Resourcequota Requests Memory Used | NULL | Observed sum of memory bytes requested for a resource quota | |
kubernetes_state.resourcequota.persistentvolumeclaims.limit | Resourcequota Persistentvolumeclaims Limit | NULL | Hard limit of the number of PVC for a resource quota | |
kubernetes_state.container.memory_requested | Container Memory Requested | NULL | The number of requested memory bytes by a container | |
kubernetes_state.resourcequota.services.used | Resourcequota Services Used | NULL | Observed number of services used for a resource quota | |
kubernetes_state.resourcequota.limits.memory.limit | Resourcequota Limits Memory Limit | NULL | Hard limit on the sum of memory bytes limits for a resource quota | |
kubernetes_state.deployment.replicas_unavailable | Deployment Replicas Unavailable | NULL | The number of unavailable replicas per deployment. | |
kubernetes_state.node.memory_allocatable | Node Memory Allocatable | NULL | The memory resources of a node that are available for scheduling. | |
kubernetes_state.resourcequota.pods.limit | Resourcequota Pods Limit | NULL | Hard limit of the number of pods for a resource quota | |
kubernetes_state.container.memory_limit | Container Memory Limit | NULL | The limit on memory to be used by a container | |
kubernetes_state.deployment.rollingupdate.max_unavailable | Deployment Rollingupdate Max Unavailable | NULL | Maximum number of unavailable replicas during a rolling update of a deployment. | |
kubernetes_state.resourcequota.persistentvolumeclaims.used | Resourcequota Persistentvolumeclaims Used | NULL | Observed number of persistent volume claims used for a resource quota | |
kubernetes_state.daemonset.misscheduled | Daemonset Misscheduled | NULL | The number of nodes running a daemon pod but are not supposed to. | |
kubernetes_state.resourcequota.services.loadbalancers.used | Resourcequota Services Loadbalancers Used | NULL | Observed number of loadbalancers used for a resource quota | |
kubernetes_state.resourcequota.requests.cpu.used | Resourcequota Requests Cpu Used | NULL | Observed sum of CPU cores requested for a resource quota | |
kubernetes_state.resourcequota.services.nodeports.used | Resourcequota Services Nodeports Used | NULL | Observed number of node ports used for a resource quota | |
kubernetes_state.resourcequota.requests.storage.used | Resourcequota Requests Storage Used | NULL | Observed sum of storage bytes requested for a resource quota | |
kubernetes_state.node.cpu_allocatable | Node Cpu Allocatable | NULL | The CPU resources of a node that are available for scheduling. | |
kubernetes_state.resourcequota.requests.memory.limit | Resourcequota Requests Memory Limit | NULL | Hard limit on the total of memory bytes requested for a resource quota | |
kubernetes_state.container.restarts | Container Restarts | NULL | The number of restarts per container | |
kubernetes_state.replicaset.fully_labeled_replicas | Replicaset Fully Labeled Replicas | NULL | The number of fully labeled replicas per ReplicaSet. | |
kubernetes_state.deployment.replicas_updated | Deployment Replicas Updated | NULL | The number of updated replicas per deployment. | |
kubernetes_state.resourcequota.limits.cpu.limit | Resourcequota Limits Cpu Limit | NULL | Hard limit on the sum of CPU core limits for a resource quota | |
kubernetes_state.resourcequota.services.limit | Resourcequota Services Limit | NULL | Hard limit of the number of services for a resource quota | |
kubernetes_state.resourcequota.limits.cpu.used | Resourcequota Limits Cpu Used | NULL | Observed sum of limits for CPU cores for a resource quota | |
kubernetes_state.replicaset.replicas_ready | Replicaset Replicas Ready | NULL | The number of ready replicas per ReplicaSet | |
kubernetes_state.node.pods_allocatable | Node Pods Allocatable | NULL | The pod resources of a node that are available for scheduling. | |
kubernetes_state.daemonset.desired | Daemonset Desired | NULL | The number of nodes that should be running the daemon pod. | |
kubernetes_state.daemonset.scheduled | Daemonset Scheduled | NULL | The number of nodes running at least one daemon pod and are supposed to. |
Agent G2 - Linux - K3S Master Agent
Description
Agent G2 - Linux - K3S Master Agent
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - K3S Master Agent - K8scoredns | coredns.request_duration.seconds.sum | Request Duration Seconds Sum | NULL | Duration to process each query. |
coredns.panics | Total Panics | NULL | Total number of panics. | |
coredns.query.count | Query count | NULL | Total query count. | |
coredns.response_size.bytes.sum | Response Size Bytes Sum | NULL | Size of the returns response in bytes. | |
coredns.request_duration.seconds.count | Request Duration Seconds Count | NULL | Duration per upstream interaction | |
Agent G2 - Linux - K3S Master Agent - K8sApiServer | apiserver.http.requests.total | Kube apiserver HTTP Requests Total | NULL | Total number of HTTP requests made. |
apiserver.request.duration.seconds.bucket | Kube apiserver Request Duration Seconds Bucket | histogram | Response latency distribution in seconds for each verb, dry run value, group, version, resource, subresource, scope and component. | |
apiserver.authenticated.user.requests | Kube apiserver Authenticated User Requests | NULL | Counter of authenticated requests broken out by username. | |
apiserver.rest.client.requests.total | Kube apiserver Rest Client Requests Total | NULL | Number of HTTP requests, partitioned by status code, method, and host. | |
apiserver.dropped.requests.total | Kube apiserver Dropped Requests Total | NULL | Accumulated number of requests dropped with Try-again-later response | |
apiserver.request.count.count | Kube apiserver Request Count Count | NULL | Counter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code. | |
apiserver.go.goroutines | Kube apiserver Goroutines | NULL | Number of goroutines that currently exist. | |
apiserver.request.count | Kube apiserver Request Count | NULL | Counter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code. | |
apiserver.etcd.object.counts | Kube apiserver ETCD Object Counts | NULL | Number of stored objects at the time of last check split by kind. | |
apiserver.APIServiceRegistrationController.depth | Kube apiserver APIService Registration Controller Depth | NULL | Current depth of workqueue: APIServiceRegistrationController | |
apiserver.audit.event.total | Kube apiserver Audit Event Total | NULL | Counter of audit events generated and sent to the audit backend. | |
apiserver.go.threads.total | Kube apiserver Go Threads Total | NULL | Number of OS threads created. | |
apiserver.inflight.requests | Kube apiserver Inflight Requests | NULL | Maximal number of currently used inflight request limit of this apiserver per request kind in last second. | |
apiserver.rest.client.requests.total.count | Kube apiserver Rest Client Requests Total Count | NULL | Number of HTTP requests, partitioned by status code, method, and host. | |
apiserver.dropped.requests.total.count | Kube apiserver Dropped Requests Total Count | NULL | Monotonic count of requests dropped with Try-again-later response | |
apiserver.authenticated.user.requests.count | Kube apiserver Authenticated User Requests Count | NULL | Counter of authenticated requests broken out by username. | |
apiserver.http.requests.total.count | Kube apiserver HTTP Requests Total Count | NULL | Total number of HTTP requests made. |
Agent G2 - Linux - Kafka Consumer Monitors
Description
Agent G2 - Linux - Kafka Consumer Monitors
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Kafka Consumer Monitors | kafka.broker.offsets | Kafka Broker Offsets | NULL | Broker offsets |
kafka.consumer.offsets | Kafka Consumer Offsets | NULL | Consumer offsets | |
kafka.consumer.lag | Kafka Consumer Lag | % | Lag in the consumer data |
Agent G2 - Linux - Kafka Monitors
Description
Monitors performance related metrics via MBeans exposed by the JMX console
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Kafka Monitors | kafka.jvm.threads | Kafka JVM Threads | NULL | Number of threads. |
kafka.metrics.fetch.requests | Kafka Fetch Requests | NULL | Request rate | |
kafka.jvm.uptime | Kafka Uptime | NULL | Uptime of the server | |
kafka.metrics.produce_remote_time | Kafka Producer Remote Time | ms | Time the request waits for the follower | |
kafka.metrics.offset_commit_resp_queue_time | Kafka Offset Commit Response Queue Time | ms | Time the request waiting in the response queue | |
kafka.metrics.controlled_shutdown.requests | Kafka Controlled Shutdown Requests | NULL | Request rate | |
kafka.producer.requests_delayed | Kafka Producer Delayed Requests | NULL | Requests delayed in the producer purgatory | |
kafka.fetch.requests_waiting | Kafka Fetch Purgatory Size | NULL | Requests waiting in the fetch purgatory | |
kafka.metrics.fetch_follower.resp_queue_time | Kafka Fetch Follower Response Queue Time | ms | Time the request waiting in the response queue | |
kafka.metrics.controlled_shutdown.total_time | Kafka Controlled Shutdown Total Time | ms | Request total time | |
kafka.channel.queue_size_request | Kafka Request Queue Size | NULL | ||
kafka.metrics.controlled_shutdown.resp_send_time | Kafka Controlled Shutdown Response Send Time | ms | Time to send the response | |
kafka.controller.active_controller_count | Kafka Active Controller Count | NULL | Is controller active on broker | |
kafka.metrics.leader_isr.local_time | Kafka Leader And Isr Local Time | ms | Time the request being processed at the leader | |
kafka.metrics.offset_commit.remote_time | Kafka Offset Commit Remote Time | ms | Time the request waits for the follower | |
kafka.metrics.fetch_follower.local_time | Kafka Fetch Follower Local Time | ms | Time the request being processed at the leader | |
kafka.channel.queue_size_response | Kafka Response Queue Size | NULL | ||
kafka.metrics.metadata.req_queue_time | Kafka Metadata Request Queue Time | ms | Time the request waiting in the request queue | |
kafka.metrics.controlled_shutdown.req_queue_time | Kafka Controlled Shutdown Request Queue Time | ms | Time the request waiting in the request queue | |
kafka.metrics.leader_isr.remote_time | Kafka Leader And Isr Remote Time | ms | Time the request waits for the follower | |
kafka.metrics.update_metadata.requests | Kafka Update Metadata Requests | NULL | Request rate | |
kafka.metrics.fetch_consumer.resp_send_time | Kafka Fetch Consumer Response Send Time | ms | Time to send the response | |
kafka.metrics.update_metadata.remote_time | Kafka Update Metadata Remote Time | ms | Time the request waits for the follower | |
kafka.producer.requests_waiting | Kafka Producer Purgatory Size | NULL | Requests waiting in the producer purgatory | |
kafka.metrics.fetch_consumer.total_time | Kafka Fetch Consumer Total Time | ms | Request total time | |
kafka.metrics.offsets.req_queue_time | Kafka Offsets Request Queue Time | ms | Time the request waiting in the request queue | |
kafka.metrics.fetch_consumer.req_queue_time | Kafka Fetch Consumer Request Queue Time | ms | Time the request waiting in the request queue | |
kafka.metrics.offsets.resp_queue_time | Kafka Offsets Response Queue Time | ms | Time the request waiting in the response queue | |
kafka.fetch.requests_delayed | Kafka Fetch Delayed Requests | NULL | Requests delayed in the fetch purgatory | |
kafka.metrics.stop_replica_total_time | Kafka Stop Replica Total Time | ms | Request total time | |
kafka.log.flush_rate | Kafka LogFlush Rate And Time | NULL | Log flush rate and time |
Agent G2 - Linux - Kong Monitoring - V2
Description
Monitors kong application related metrics like Connections Accepted,Connections Active,Connections Handled,Connections Reading,Connections Waiting,Connections Writing,Connections Total,Database Reachable,Db Entities Total,Db Entity Count Errors,Enterprise License Errors,Memory Lua Shared Dict Bytes,Memory Lua Shared Dict Total Bytes,Memory Workers Lua Vms Bytes,Nginx Http Current Connections,Nginx Metric Errors Total,Nginx Timers
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Kong Monitoring - V2 | kong_nginx_timers | Kong Nginx Timers | NULL | Number of nginx timers |
kong_db_entity_count_errors | Kong Db Entity Count Errors | NULL | Errors during entity count collection | |
kong_memory_lua_shared_dict_bytes | Kong Memory Lua Shared Dict Bytes | bytes | Allocated slabs in bytes in a shared_dict | |
kong_enterprise_license_errors | Kong Enterprise License Errors | NULL | Errors when collecting license info | |
kong_memory_workers_lua_vms_bytes | Kong Memory Workers Lua Vms Bytes | bytes | Allocated bytes in worker Lua VM | |
kong_connections_total | Kong Connections Total | Requests | Total number of client requests. | |
kong_connections_handled | Connections Handled | NULL | Total number of handled connections. (Same as accepts unless resource limits were reached). | |
kong_memory_lua_shared_dict_total_bytes | Kong Memory Lua Shared Dict Total Bytes | bytes | Total capacity in bytes of a shared_dict | |
kong_nginx_http_current_connections | Kong Nginx Http Current Connections | Connections | Number of HTTP connections | |
kong_db_entities_total | Kong Db Entities Total | NULL | Total number of Kong db entities | |
kong_nginx_metric_errors_total | Kong Nginx Metric Errors Total | NULL | Number of nginx-lua-prometheus errors | |
kong_connections_reading | Connections Reading | NULL | Current number of connections where Kong is reading the request header. | |
kong_connections_active | Connections Active | NULL | Current number of active client connections including Waiting connections. | |
kong_connections_accepted | Connections Accepted | NULL | Total number of accepted client connections. | |
kong_connections_waiting | Connections Waiting | NULL | Current number of idle client connections waiting for a request. | |
kong_connections_writing | Connections Writing | NULL | Current number of connections where nginx is writing the response back to the client. |
Agent G2 - Linux - Kubernetes ApiServer
Description
Template for monitoring Kubernetes through Kubernetes API server
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Kubernetes ApiServer | apiserver.authenticated.user.requests.count | Kube apiserver Authenticated User Requests Count | NULL | Counter of authenticated requests broken out by username. |
apiserver.go.goroutines | Kube apiserver Goroutines | NULL | Number of goroutines that currently exist. | |
apiserver.audit.event.total | Kube apiserver Audit Event Total | NULL | Counter of audit events generated and sent to the audit backend. | |
apiserver.inflight.requests | Kube apiserver Inflight Requests | NULL | Maximal number of currently used inflight request limit of this apiserver per request kind in last second. | |
apiserver.http.requests.total | Kube apiserver HTTP Requests Total | NULL | Total number of HTTP requests made. | |
apiserver.rest.client.requests.total.count | Kube apiserver Rest Client Requests Total Count | NULL | Number of HTTP requests, partitioned by status code, method, and host. | |
apiserver.dropped.requests.total.count | Kube apiserver Dropped Requests Total Count | NULL | Monotonic count of requests dropped with Try-again-later response | |
apiserver.rest.client.requests.total | Kube apiserver Rest Client Requests Total | NULL | Number of HTTP requests, partitioned by status code, method, and host. | |
apiserver.etcd.object.counts | Kube apiserver ETCD Object Counts | NULL | Number of stored objects at the time of last check split by kind. | |
apiserver.request.duration.seconds.bucket | Kube apiserver Request Duration Seconds Bucket | histogram | Response latency distribution in seconds for each verb, dry run value, group, version, resource, subresource, scope and component. | |
apiserver.request.count.count | Kube apiserver Request Count Count | NULL | Counter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code. | |
apiserver.APIServiceRegistrationController.depth | Kube apiserver APIService Registration Controller Depth | NULL | Current depth of workqueue: APIServiceRegistrationController | |
apiserver.authenticated.user.requests | Kube apiserver Authenticated User Requests | NULL | Counter of authenticated requests broken out by username. | |
apiserver.request.count | Kube apiserver Request Count | NULL | Counter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code. | |
apiserver.dropped.requests.total | Kube apiserver Dropped Requests Total | NULL | Accumulated number of requests dropped with Try-again-later response | |
apiserver.go.threads.total | Kube apiserver Go Threads Total | NULL | Number of OS threads created. | |
apiserver.http.requests.total.count | Kube apiserver HTTP Requests Total Count | NULL | Total number of HTTP requests made. |
Agent G2 - Linux - Kubernetes Controller
Description
Template for monitoring default Kubenetes Controller
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Kubernetes Controller | controller.go.goroutines | Kube Controller Go Goroutines | NULL | Number of goroutines that currently exist. |
controller.workqueue.nodes.evictions | Kube Controller Node Collector Evictions Number | NULL | Number of Node evictions that happened since current instance of NodeController started. | |
controller.workqueue.work_longest_duration | Kube Controller Workqueue Longest Running Processor Seconds | NULL | How many seconds has the longest running processor for workqueue been running. | |
controller.workqueue.work_duration.sum | Kube Controller Workqueue Work Duration Seconds Sum | NULL | How long in seconds processing an item from workqueue takes. | |
controller.rate_limiter.use | Kube Controller Node Lifecycle Controller Rate Limiter Use | NULL | A metric measuring the saturation of the rate limiter for node_lifecycle_controller. | |
controller.workqueue.depth | Kube Controller Workqueue Depth | NULL | Current depth of workqueue. | |
controller.process.open_fds | Kube Controller Process Open Fds | NULL | Number of open file descriptors. | |
controller.workqueue.nodes.count | Kube Controller Registered Nodes | NULL | Number of registered Nodes per zones. | |
controller.workqueue.queue_duration.count | Kube Controller Workqueue Queue Duration Seconds Count | NULL | Total how long in seconds an item stays in workqueue before being requested. | |
controller.workqueue.nodes.unhealthy | Kube Controller Node Collector Unhealthy Nodes in Zone | NULL | Number of not Ready Nodes per zones. | |
controller.process.max_fds | Kube Controller Process Max Fds | NULL | Maximum number of open file descriptors. | |
controller.workqueue.retries | Kube Controller Workqueue Retries Total | NULL | Total number of retries handled by workqueue. | |
controller.workqueue.work_duration.count | Kube Controller Workqueue Work Duration Seconds Count | NULL | Total time in seconds processing an item from workqueue takes. | |
controller.workqueue.adds | Kube Controller Workqueue Adds Total | NULL | Total number of adds handled by workqueue. | |
controller.workqueue.queue_duration.sum | Kube Controller Workqueue Queue Duration Seconds Sum | NULL | How long in seconds an item stays in workqueue before being requested. | |
controller.workqueue.work_unfinished_duration | Kube Controller Workqueue Unfinished Work Seconds | NULL | How many seconds of work has done that is in progress and hasn't been observed by work_duration. Large values indicate stuck threads. | |
controller.threads | Kube Controller Os Threads | NULL | Number of OS threads created. |
Agent G2 - Linux - Kubernetes CoreDNS Monitoring
Description
Kubernetes CoreDNS
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Kubernetes CoreDNS Monitoring | coredns.request_duration.seconds.count | Request Duration Seconds Count | NULL | Duration per upstream interaction |
coredns.request_duration.seconds.sum | Request Duration Seconds Sum | NULL | Duration to process each query. | |
coredns.response_size.bytes.sum | Response Size Bytes Sum | NULL | Size of the returns response in bytes. | |
coredns.panics | Total Panics | NULL | Total number of panics. | |
coredns.query.count | Query count | NULL | Total query count. |
Agent G2 - Linux - Kubernetes DNS Monitoring
Description
Agent G2 - Linux - Kubernetes DNS Monitoring
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Kubernetes DNS Monitoring | kubedns.request_duration.seconds.sum | Request Duration Seconds Sum | NULL | Time (in seconds) each request took to resolve. |
kubedns.response_size.bytes.count | Response Size Bytes Count | NULL | Number of responses on which the kubedns.response_size.bytes.sum metric is evaluated. | |
kubedns.request_duration.seconds.count | Request Duration Seconds Count | NULL | Number of requests on which the kubedns.request_duration.seconds.sum metric is evaluated. | |
kubedns.error_count | Error Count | NULL | Number of DNS requests resulting in an error. | |
kubedns.request_count | Request Count | NULL | Total number of DNS requests made. | |
kubedns.response_size.bytes.sum | Response Size Bytes Sum | NULL | Size of the returned response in bytes. | |
kubedns.cachemiss_count | Cachemiss Count | NULL | Number of DNS cache misses (from the start of the process). |
Agent G2 - Linux - Kubernetes Kubelet
Description
Template for monitoring Kubernetes metrics from Kubelet for each Node
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Kubernetes Kubelet | kube_memory_requests | Memory Requests | NULL | The requested memory |
kube_cpu_cfs_periods | Cpu Cfs Periods | NULL | Number of elapsed enforcement period intervals | |
kube_kubelet_volume_stats_inodes_used | Kubelet Volume Stats Inodes Used | NULL | The number of used inodes in the volume | |
kube_node_memory_usage | Node Memory Usage | NULL | Memory usage of node (Plotted in Megabytes) | |
kube_memory_sw_limit | Memory Sw Limit | NULL | Memory swap limit for the container. | |
kube_io_write_bytes | Io Write Bytes | NULL | The amount of bytes written to the disk | |
kube_node_cpu_allocatable | Node Cpu Allocatable | NULL | Cpu allocatable of node | |
kube_cpu_cfs_throttled_periods | Cpu Cfs Throttled Periods | NULL | Number of throttled period intervals | |
kube_memory_swap | Memory Swap | NULL | Container swap usage in bytes. | |
kube_kubelet_container_log_filesystem_used_bytes | Kubelet Container Log Filesystem Used Bytes | NULL | Bytes used by the container's logs on the filesystem (requires kubernetes 1.14+) | |
kube_cpu_usage_total | Cpu Usage Total | NULL | Cpu time consumed in seconds. | |
kube_network_rx_bytes | Network Rx Bytes | NULL | The amount of bytes per second received | |
kube_node_memory_capacity | Node Memory Capacity | NULL | Memory capacity of node (Plotted in Megabytes) | |
kube_containers_restarts | Containers Restarts | NULL | The number of times the container has been restarted | |
kube_node_memory_allocatable | Node Memory Allocatable | NULL | Memory allocatable of node | |
kube_memory_limits | Memory Limits | NULL | Memory limit for the container. | |
kube_node_cpu_usage | Node Cpu Usage | NULL | Cpu usage of node (Plotted in Millicores) | |
kube_filesystem_usage_pct | Filesystem Usage Pct | NULL | Number of megabytes that can be consumed by the container on this filesystem. | |
kube_kubelet_volume_stats_capacity_bytes | Kubelet Volume Stats Capacity Bytes | NULL | The capacity in bytes of the volume | |
kube_network_tx_bytes | Network Tx Bytes | NULL | The amount of bytes per second transmitted | |
kube_runtime_memory_rss | Runtime Memory Rss | NULL | Size of runtime RSS in megabytes | |
kube_kubelet_memory_rss | Kubelet Memory Rss | NULL | Size of kubelet RSS in megabytes | |
kube_cpu_cfs_throttled_seconds | Cpu Cfs Throttled Seconds | NULL | Total time duration the container has been throttled | |
kube_runtime_cpu_usage | Runtime Cpu Usage | NULL | The number of cores used by the runtime | |
kube_kubelet_cpu_usage | Kubelet Cpu Usage | NULL | The number of cores used by kubelet | |
kube_network_rx_dropped | Network Rx Dropped | NULL | The amount of rx packets dropped per second | |
kube_io_read_bytes | Io Read Bytes | NULL | The amount of bytes read from the disk | |
kube_pods_running | Pods Running | NULL | The number of running pods | |
kube_filesystem_usage | Filesystem Usage | NULL | Number of megabytes that are consumed by the container on this filesystem. | |
kube_node_ephemeral_storage_capacity | Node Ephemeral Storage Capacity | MB | Ephemeral storage capacity of node | |
kube_network_rx_errors | Network Rx Errors | NULL | The amount of rx errors per second | |
kube_kubelet_evictions | Kubelet Evictions | NULL | The number of pods that have been evicted from the kubelet (ALPHA in kubernetes v1.16) | |
kube_rest_client_latency | Rest Client Latency | NULL | Avg Request latency in seconds. Broken down by verb and URL since last pool. | |
kube_network_tx_dropped | Network Tx Dropped | NULL | The amount of tx packets dropped per second | |
kube_apiserver_certificate_expiration | Apiserver Certificate Expiration | NULL | Avg Distribution of the remaining lifetime on the certificate used to authenticate a request since last pool. | |
kube_kubelet_volume_stats_available_bytes | Kubelet Volume Stats Available Bytes | NULL | The number of available bytes in the volume | |
kube_cpu_requests | Cpu Requests | NULL | The requested cpu cores | |
kube_cpu_user_total | Cpu User Total | NULL | User cpu time consumed in seconds. | |
kube_kubelet_volume_stats_inodes_free | Kubelet Volume Stats Inodes Free | NULL | The number of free inodes in the volume | |
kube_kubelet_runtime_errors | Kubelet Runtime Errors | NULL | The number of runtime operations errors | |
kube_memory_working_set | Memory Working Set | NULL | Current working set in megabytes - this is what the OOM killer is watching for | |
kube_node_cpu_capacity | Node Cpu Capacity | NULL | Cpu capacity of Node (Plotted in Millicores) | |
kube_memory_usage | Memory Usage | NULL | Current memory usage in bytes including all memory regardless of when it was accessed | |
kube_kubelet_volume_stats_inodes | Kubelet Volume Stats Inodes | NULL | The maximum number of inodes in the volume | |
kube_memory_rss | Memory Rss | NULL | Size of RSS in bytes | |
kube_node_ephemeral_storage_allocatable | Node Ephemeral Storage Allocatable | MB | Ephemeral storage allocatable of node | |
kube_network_tx_errors | Network Tx Errors | NULL | The amount of tx errors per second | |
kube_node_memory_usage_percentage | Node Memory Usage Percentage | NULL | Memory usage percentage of node | |
kube_kubelet_runtime_operations | Kubelet Runtime Operations | NULL | The number of runtime operations | |
kube_cpu_limits | Cpu Limits | NULL | The limit of cpu cores set | |
kube_rest_client_requests | Rest Client Requests | NULL | The number of HTTP requests | |
kube_memory_cache | Memory Cache | NULL | Number of bytes of page cache memory. | |
kube_kubelet_volume_stats_used_bytes | Kubelet Volume Stats Used Bytes | NULL | The number of used bytes in the volume | |
kube_containers_running | Containers Running | NULL | The number of running containers | |
kube_cpu_system_total | Cpu System Total | NULL | System cpu time consumed in seconds. | |
kube_ephemeral_storage_usage | Ephemeral Storage Usage | NULL | Ephemeral storage usage of the POD | |
kube_kubelet_network_plugin_latency | Kubelet Network Plugin Latency | NULL | Avg Latency in seconds of network plugin operations. Broken down by operation type since last pool. | |
kube_cpu_load_10s_avg | Cpu Load 10S Avg | NULL | Container cpu load average over the last 10 seconds | |
kube_node_cpu_usage_percentage | Node Cpu Usage Percentage | NULL | Cpu usage percentage of node |
Agent G2 - Linux - Kubernetes Master Agent
Description
Agent G2 - Linux - Kubernetes Master Agent
Prerequisites
NULL
Supported Metrics
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Kubernetes Master Agent - K8sScheduler | scheduler.binding.duration.seconds | Kube Scheduler Binding Duration Seconds Sum | NULL | Binding duration in seconds sum |
scheduler.threads | Kube Scheduler OS Threads | NULL | Number of OS threads created | |
scheduler.gc_duration_seconds.sum | Kube Scheduler Go GC Duration Seconds Sum | NULL | A summary of the GC invocation durations | |
scheduler.gc_duration_seconds.count | Kube Scheduler Go GC Duration Seconds Count | NULL | A summary of the GC invocation durations | |
scheduler.binding.latency.sum | Kube Scheduler Binding Latency Microseconds Sum | NULL | Binding latency in microseconds sum | |
scheduler.scheduling.algorithm.predicate_duration.count | Kube Scheduler Scheduling Algorithm Predicate Evaluation Count | NULL | Scheduling algorithm predicate evaluation duration | |
scheduler.client.http.requests_duration.count | Kube Scheduler Rest Client Request Latency Seconds Count | NULL | Total request latency in seconds. Broken down by verb and URL | |
scheduler.scheduling.scheduling_latency.count | Kube Scheduler Scheduling Latency Seconds Count | NULL | Scheduling latency in seconds split by sub-parts of the scheduling operation | |
scheduler.volume_scheduling_duration.count | Kube Scheduler Volume Scheduling Duration Seconds Count | NULL | Volume scheduling stage latency count | |
scheduler.scheduling.scheduling_duration.quantile | Kube Scheduler Scheduling Duration Seconds | NULL | Scheduling latency in seconds split by sub-parts of the scheduling operation | |
scheduler.scheduling.algorithm_latency.sum | Kube Scheduler Scheduling Algorithm Latency Microseconds Sum | NULL | Scheduling algorithm latency in microseconds sum | |
scheduler.scheduling.algorithm.preemption_duration.count | Kube Scheduler Scheduling Algorithm Preemption Evaluation Count | NULL | Scheduling algorithm preemption evaluation duration | |
scheduler.pod_preemption.attempts | Kube Scheduler Total Preemption Attempts | NULL | Total preemption attempts in the cluster till now | |
scheduler.schedule_attempts.total | Kube Scheduler Schedule Attempts Total | NULL | Number of attempts to schedule pods, by the result. 'unschedulable' means a pod could not be scheduled, while 'error' means an internal scheduler problem | |
scheduler.scheduling.algorithm.priority_duration.count | Kube Scheduler Scheduling Algorithm Priority Evaluation Count | NULL | Scheduling algorithm priority evaluation duration | |
scheduler.scheduling.algorithm.priority_duration.sum | Kube Scheduler Scheduling Algorithm Priority Evaluation Sum | NULL | Scheduling algorithm priority evaluation duration | |
scheduler.scheduling.scheduling_latency.quantile | Kube Scheduler Scheduling Latency Seconds | NULL | Scheduling latency in seconds split by sub-parts of the scheduling operation | |
scheduler.binding.latency.count | Kube Scheduler Binding Latency Microseconds Count | NULL | Total Binding latency in microseconds count | |
scheduler.e2e.scheduling_duration.sum | Kube Scheduler E2E Scheduling Duration Seconds Sum | NULL | E2e scheduling latency in seconds (scheduling algorithm + binding) | |
scheduler.cache.lookups | Kube Scheduler Equiv Cache Lookups Total | NULL | Total number of equivalence cache lookups, by whether or not a cache entry was found | |
scheduler.go.goroutines | Kube Scheduler Go Goroutines | NULL | Number of goroutines that currently exist | |
scheduler.scheduling.scheduling_duration.count | Kube Scheduler Scheduling Duration Seconds Count | NULL | Scheduling latency in seconds split by sub-parts of the scheduling operation | |
scheduler.scheduling.algorithm_duration.sum | Kube Scheduler Scheduling Algorithm Duration Seconds Sum | NULL | Scheduling algorithm latency in seconds sum | |
scheduler.gc_duration_seconds.quantile | Kube Scheduler Go GC Duration Seconds | NULL | A summary of the GC invocation durations | |
scheduler.e2e.scheduling_duration.count | Kube Scheduler E2E Scheduling Duration Seconds Count | NULL | Total E2e scheduling latency in seconds (scheduling algorithm + binding) | |
scheduler.client.http.requests | Kube Scheduler Rest Client Requests Total | NULL | Number of HTTP requests, partitioned by status code, method, and host | |
scheduler.process.open_fds | Kube Scheduler Process Open Fds | NULL | Number of open file descriptors | |
scheduler.client.http.requests_duration.sum | Kube Scheduler Rest Client Request Latency Seconds Sum | NULL | Request latency in seconds sum. Broken down by verb and URL | |
scheduler.pod_preemption.victims | Kube Scheduler Pod Preemption Victims | NULL | Number of selected preemption victims | |
scheduler.process.max_fds | Kube Scheduler Process Max Fds | NULL | Maximum number of open file descriptors | |
scheduler.scheduling.scheduling_duration.sum | Kube Scheduler Scheduling Duration Seconds Sum | NULL | Scheduling latency in seconds split by sub-parts of the scheduling operation | |
scheduler.volume_scheduling_duration.sum | Kube Scheduler Volume Scheduling Duration Seconds Sum | NULL | Volume scheduling stage latency sum | |
scheduler.e2e.scheduling_latency.count | Kube Scheduler E2E Scheduling Latency Microseconds Count | NULL | Total E2e scheduling latency in microseconds (scheduling algorithm + binding) | |
scheduler.scheduling.algorithm_duration.count | Kube Scheduler Scheduling Algorithm Duration Seconds Count | NULL | Total Scheduling algorithm latency in seconds count | |
scheduler.scheduling.algorithm_latency.count | Kube Scheduler Scheduling Algorithm Latency Microseconds Count | NULL | Total Scheduling algorithm latency in microseconds count | |
scheduler.e2e.scheduling_latency.sum | Kube Scheduler E2E Scheduling Latency Microseconds Sum | NULL | E2e scheduling latency in microseconds (scheduling algorithm + binding) | |
scheduler.scheduling.algorithm.predicate_duration.sum | Kube Scheduler Scheduling Algorithm Predicate Evaluation Sum | NULL | Scheduling algorithm predicate evaluation duration | |
scheduler.scheduling.algorithm.preemption_duration.sum | Kube Scheduler Scheduling Algorithm Preemption Evaluation Sum | NULL | Scheduling algorithm preemption evaluation duration | |
scheduler.scheduling.scheduling_latency.sum | Kube Scheduler Scheduling Latency Seconds Sum | NULL | Scheduling latency in seconds split by sub-parts of the scheduling operation | |
scheduler.binding.duration.count | Kube Scheduler Binding Duration Seconds Count | NULL | Total Binding duration in seconds count | |
Agent G2 - Linux - Kubernetes Master Agent - K8sApiServer | apiserver.go.goroutines | Kube apiserver Goroutines | NULL | Number of goroutines that currently exist. |
apiserver.authenticated.user.requests.count | Kube apiserver Authenticated User Requests Count | NULL | Counter of authenticated requests broken out by username. | |
apiserver.rest.client.requests.total | Kube apiserver Rest Client Requests Total | NULL | Number of HTTP requests, partitioned by status code, method, and host. | |
apiserver.rest.client.requests.total.count | Kube apiserver Rest Client Requests Total Count | NULL | Number of HTTP requests, partitioned by status code, method, and host. | |
apiserver.request.count.count | Kube apiserver Request Count Count | NULL | Counter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code. | |
apiserver.APIServiceRegistrationController.depth | Kube apiserver APIService Registration Controller Depth | NULL | Current depth of workqueue: APIServiceRegistrationController | |
apiserver.audit.event.total | Kube apiserver Audit Event Total | NULL | Counter of audit events generated and sent to the audit backend. | |
apiserver.go.threads.total | Kube apiserver Go Threads Total | NULL | Number of OS threads created. | |
apiserver.http.requests.total.count | Kube apiserver HTTP Requests Total Count | NULL | Total number of HTTP requests made. | |
apiserver.http.requests.total | Kube apiserver HTTP Requests Total | NULL | Total number of HTTP requests made. | |
apiserver.inflight.requests | Kube apiserver Inflight Requests | NULL | Maximal number of currently used inflight request limit of this apiserver per request kind in the last second. | |
apiserver.request.count | Kube apiserver Request Count | NULL | Counter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code. | |
apiserver.etcd.object.counts | Kube apiserver ETCD Object Counts | NULL | Number of stored objects at the time of the last check split by kind. | |
apiserver.dropped.requests.total | Kube apiserver Dropped Requests Total | NULL | Accumulated number of requests dropped with Try-again-later response | |
apiserver.dropped.requests.total.count | Kube apiserver Dropped Requests Total Count | NULL | Monotonic count of requests dropped with Try-again-later response | |
apiserver.authenticated.user.requests | Kube apiserver Authenticated User Requests | NULL | Counter of authenticated requests broken out by username. | |
apiserver.request.duration.seconds.bucket | Kube apiserver Request Duration Seconds Bucket | histogram | Response latency distribution in seconds for each verb, dry run value, group, version, resource, subresource, scope, and component. | |
Agent G2 - Linux - Kubernetes Master Agent - K8scoredns | coredns.request_duration.seconds.count | Request Duration Seconds Count | NULL | Duration per upstream interaction |
coredns.request_duration.seconds.sum | Request Duration Seconds Sum | NULL | Duration to process each query. | |
coredns.panics | Total Panics | NULL | Total number of panics. | |
coredns.query.count | Query count | NULL | Total query count. | |
coredns.response_size.bytes.sum | Response Size Bytes Sum | NULL | Size of the returns response in bytes. | |
Agent G2 - Linux - Kubernetes Master Agent - K8sController | controller.workqueue.work_duration.sum | Kube Controller Workqueue Work Duration Seconds Sum | NULL | How long in seconds processing an item from workqueue takes |
controller.process.open_fds | Kube Controller Process Open Fds | NULL | Number of open file descriptors | |
controller.go.goroutines | Kube Controller Go Goroutines | NULL | Number of goroutines that currently exist | |
controller.workqueue.work_duration.count | Kube Controller Workqueue Work Duration Seconds Count | NULL | Total time in seconds processing an item from workqueue takes | |
controller.workqueue.queue_duration.sum | Kube Controller Workqueue Queue Duration Seconds Sum | NULL | How long in seconds an item stays in workqueue before being requested. | |
controller.workqueue.nodes.count | Kube Controller Registered Nodes | NULL | Number of registered Nodes per zones. | |
controller.threads | Kube Controller Os Threads | NULL | Number of OS threads created. | |
controller.workqueue.retries | Kube Controller Workqueue Retries Total | NULL | Total number of retries handled by workqueue. | |
controller.workqueue.work_unfinished_duration | Kube Controller Workqueue Unfinished Work Seconds | NULL | How many seconds of work has done that is in progress and hasn't been observed by work_duration. Large values indicate stuck threads. | |
controller.workqueue.work_longest_duration | Kube Controller Workqueue Longest Running Processor Seconds | NULL | How many seconds has the longest running processor for workqueue been running. | |
controller.workqueue.depth | Kube Controller Workqueue Depth | NULL | Current depth of workqueue. | |
controller.workqueue.nodes.unhealthy | Kube Controller Node Collector Unhealthy Nodes in Zone | NULL | Number of not Ready Nodes per zones. | |
controller.rate_limiter.use | Kube Controller Node Lifecycle Controller Rate Limiter Use | NULL | A metric measuring the saturation of the rate limiter for node_lifecycle_controller. | |
controller.workqueue.adds | Kube Controller Workqueue Adds Total | NULL | Total number of adds handled by workqueue. | |
controller.process.max_fds | Kube Controller Process Max Fds | NULL | Maximum number of open file descriptors. | |
controller.workqueue.queue_duration.count | Kube Controller Workqueue Queue Duration Seconds Count | NULL | Total how long in seconds an item stays in workqueue before being requested. | |
controller.workqueue.nodes.evictions | Kube Controller Node Collector Evictions Number | NULL | Number of Node evictions that happened since current instance of NodeController started. |
Agent G2 - Linux - Kubernetes Monitoring Template
Description
Agent G2 - Linux - Kubernetes Monitoring Template
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Kubernetes Monitoring Template - K8scoredns | coredns.response_size.bytes.sum | Response Size Bytes Sum | NULL | Size of the returns response in bytes. |
coredns.request_duration.seconds.sum | Request Duration Seconds Sum | NULL | Duration to process each query. | |
coredns.panics | Total Panics | NULL | Total number of panics. | |
coredns.query.count | Query count | NULL | Total query count. | |
coredns.request_duration.seconds.count | Request Duration Seconds Count | NULL | Duration per upstream interaction | |
Agent G2 - Linux - Kubernetes Monitoring Template - K8sstate | kubernetes_state.resourcequota.requests.cpu.limit | Resourcequota Requests Cpu Limit | NULL | Hard limit on the total of CPU core requested for a resource quota |
kubernetes_state.resourcequota.requests.memory.limit | Resourcequota Requests Memory Limit | NULL | Hard limit on the total of memory bytes requested for a resource quota | |
kubernetes_state.daemonset.desired | Daemonset Desired | NULL | The number of nodes that should be running the daemon pod. | |
kubernetes_state.node.cpu_capacity | Node Cpu Capacity | NULL | The total CPU resources of the node. | |
kubernetes_state.container.restarts | Container Restarts | NULL | The number of restarts per container | |
kubernetes_state.deployment.replicas_unavailable | Deployment Replicas Unavailable | NULL | The number of unavailable replicas per deployment. | |
kubernetes_state.daemonset.scheduled | Daemonset Scheduled | NULL | The number of nodes running at least one daemon pod and are supposed to | |
kubernetes_state.replicaset.fully_labeled_replicas | Replicaset Fully Labeled Replicas | NULL | The number of fully labeled replicas per ReplicaSet. | |
kubernetes_state.node.pods_capacity | Node Pods Capacity | NULL | The total pod resources of the node. | |
kubernetes_state.daemonset.ready | Daemonset Ready | NULL | The number of nodes that should be running the daemon pod and have one or more of the daemon pod running and ready. | |
kubernetes_state.resourcequota.persistentvolumeclaims.used | Resourcequota Persistentvolumeclaims Used | NULL | Observed number of persistent volume claims used for a resource quota | |
kubernetes_state.replicaset.replicas | Replicaset Replicas | NULL | The number of replicas per ReplicaSet. | |
kubernetes_state.deployment.replicas_desired | Deployment Replicas Desired | NULL | The number of desired replicas per deployment wrong help in kube-state-metrics.cross check | |
kubernetes_state.container.cpu_limit | Container Cpu Limit | NULL | The limit on cpu cores to be used by a container | |
kubernetes_state.resourcequota.services.loadbalancers.limit | Resourcequota Services Loadbalancers Limit | NULL | Hard limit of the number of loadbalancers for a resource quota | |
kubernetes_state.resourcequota.limits.memory.used | Resourcequota Limits Memory Used | NULL | Observed sum of limits for memory bytes for a resource quota | |
kubernetes_state.container.memory_limit | Container Memory Limit | NULL | The limit on memory to be used by a container | |
kubernetes_state.deployment.replicas | Deployment Replicas | NULL | The number of replicas per deployment. | |
kubernetes_state.resourcequota.services.used | Resourcequota Services Used | NULL | Observed number of services used for a resource quota | |
kubernetes_state.deployment.replicas_available | Deployment Replicas Available | NULL | The number of available replicas per deployment. | |
kubernetes_state.resourcequota.services.loadbalancers.used | Resourcequota Services Loadbalancers Used | NULL | Observed number of loadbalancers used for a resource quota | |
kubernetes_state.resourcequota.limits.cpu.limit | Resourcequota Limits Cpu Limit | NULL | Hard limit on the sum of CPU core limits for a resource quota | |
kubernetes_state.resourcequota.limits.cpu.used | Resourcequota Limits Cpu Used | NULL | Observed sum of limits for CPU cores for a resource quota | |
kubernetes_state.replicaset.replicas_desired | Replicaset Replicas Desired | NULL | Number of desired pods for a ReplicaSet | |
kubernetes_state.resourcequota.services.limit | Resourcequota Services Limit | NULL | Hard limit of the number of services for a resource quota | |
kubernetes_state.daemonset.misscheduled | Daemonset Misscheduled | NULL | The number of nodes running a daemon pod but are not supposed to. | |
kubernetes_state.resourcequota.requests.cpu.used | Resourcequota Requests Cpu Used | NULL | Observed sum of CPU cores requested for a resource quota | |
kubernetes_state.resourcequota.requests.storage.limit | Resourcequota Requests Storage Limit | NULL | Hard limit on the total of storage bytes requested for a resource quota | |
kubernetes_state.resourcequota.pods.used | Resourcequota Pods Used | NULL | Observed number of pods used for a resource quota | |
kubernetes_state.node.memory_allocatable | Node Memory Allocatable | NULL | The memory resources of a node that are available for scheduling. | |
kubernetes_state.resourcequota.services.nodeports.used | Resourcequota Services Nodeports Used | NULL | Observed number of node ports used for a resource quota | |
kubernetes_state.resourcequota.limits.memory.limit | Resourcequota Limits Memory Limit | NULL | Hard limit on the sum of memory bytes limits for a resource quota | |
kubernetes_state.node.pods_allocatable | Node Pods Allocatable | NULL | The pod resources of a node that are available for scheduling. | |
kubernetes_state.deployment.rollingupdate.max_unavailable | Deployment Rollingupdate Max Unavailable | NULL | Maximum number of unavailable replicas during a rolling update of a deployment. | |
kubernetes_state.container.memory_requested | Container Memory Requested | NULL | The number of requested memory bytes by a container | |
kubernetes_state.resourcequota.requests.memory.used | Resourcequota Requests Memory Used | NULL | Observed sum of memory bytes requested for a resource quota | |
kubernetes_state.deployment.replicas_updated | Deployment Replicas Updated | NULL | The number of updated replicas per deployment. | |
kubernetes_state.resourcequota.requests.storage.used | Resourcequota Requests Storage Used | NULL | Observed sum of storage bytes requested for a resource quota | |
kubernetes_state.replicaset.replicas_ready | Replicaset Replicas Ready | NULL | The number of ready replicas per ReplicaSet | |
kubernetes_state.resourcequota.persistentvolumeclaims.limit | Resourcequota Persistentvolumeclaims Limit | NULL | Hard limit of the number of PVC for a resource quota | |
kubernetes_state.container.cpu_requested | Container Cpu Requested | NULL | The number of requested cpu cores by a container | |
kubernetes_state.resourcequota.pods.limit | Resourcequota Pods Limit | NULL | Hard limit of the number of pods for a resource quota | |
kubernetes_state.resourcequota.services.nodeports.limit | Resourcequota Services Nodeports Limit | NULL | Hard limit of the number of node ports for a resource quota | |
kubernetes_state.node.memory_capacity | Node Memory Capacity | NULL | The total memory resources of the node. | |
kubernetes_state.node.cpu_allocatable | Node Cpu Allocatable | NULL | The CPU resources of a node that are available for scheduling. | |
Agent G2 - Linux - Kubernetes Monitoring Template - K8sdns | ||||
kubedns.response_size.bytes.count | Response Size Bytes Count | NULL | Number of responses on which the kubedns.response_size.bytes.sum metric is evaluated. | |
kubedns.request_count | Request Count | NULL | Total number of DNS requests made. | |
kubedns.request_duration.seconds.sum | Request Duration Seconds Sum | NULL | Time (in seconds) each request took to resolve. | |
kubedns.request_duration.seconds.count | Request Duration Seconds Count | NULL | Number of requests on which the kubedns.request_duration.seconds.sum metric is evaluated. | |
kubedns.cachemiss_count | Cachemiss Count | NULL | Number of DNS cache misses (from the start of the process). | |
kubedns.error_count | Error Count | NULL | Number of DNS requests resulting in an error. | |
kubedns.response_size.bytes.sum | Response Size Bytes Sum | NULL | Size of the returned response in bytes. |
Agent G2 - Linux - Kubernetes Metric Server - v2
Description
Template for monitoring Kubernetes using Kubernetes Metric Server. Template “Agent G2 - Linux - Kubernetes Metric Server” does not currently support customizing the namespace. In this v2 template, there is an option to monitor the “kube-system” namespace by default, and customers could specify a different namespace if needed.
Prerequisites
No Prerequisites.
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Kubernetes Metric Server Monitor - v2 | metrics_server.authenticated_user_requests | Authenticated User Requests | null | Counter of authenticated requests broken out by username. |
metrics_server.go_gc_duration_seconds_count | Go GC Duration Seconds Count | null | A summary of the GC invocation durations. | |
metrics_server.go_gc_duration_seconds_quantile | Go GC Duration Seconds Quantile | Seconds | A summary of the GC invocation durations. | |
metrics_server.go_gc_duration_seconds_sum | Go GC Duration Seconds Sum | null | A summary of the GC invocation durations. | |
metrics_server.go_goroutines | Go Goroutines | null | Number of goroutines that currently exist. | |
metrics_server.kubelet_summary_request_duration_count | Kubelet Summary Request Duration Count | null | The Kubelet summary request latencies in seconds. | |
metrics_server.kubelet_summary_request_duration_sum | Kubelet Summary Request Duration Sum | null | The Kubelet summary request latencies in seconds. | |
metrics_server.kubelet_summary_scrapes_total | Kubelet Summary Scrapes Total | null | Total number of attempted Summary API scrapes done by Metrics Server. | |
metrics_server.manager_tick_duration_count | Manager Tick Duration Count | null | The total time spent collecting and storing metrics in seconds. | |
metrics_server.manager_tick_duration_sum | Manager Tick Duration Sum | null | The total time spent collecting and storing metrics in seconds. | |
metrics_server.process_cpu_seconds_total | Process Cpu Seconds Total | null | Total user and system CPU time spent in seconds. | |
metrics_server.process_max_fds | Process Max Fds | null | Maximum number of open file descriptors. | |
metrics_server.process_open_fds | Process Open Fds | null | Number of open file descriptors. | |
metrics_server.scraper_duration_count | Scraper Duration Count | null | Time spent scraping sources in seconds. | |
metrics_server.scraper_duration_sum | Scraper Duration Sum | null | Time spent scraping sources in seconds. | |
metrics_server.scraper_last_time | Scraper Last Time | null | Last time metrics-server performed a scrape since unix epoch in seconds. |
Agent G2 - Linux - Kubernetes Scheduler
Description
Template for monitoring default Kubernetes Scheduler
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Kubernetes Scheduler | scheduler.schedule_attempts.total | Kube Scheduler Schedule Attempts Total | NULL | Number of attempts to schedule pods, by the result. 'unschedulable' means a pod could not be scheduled, while 'error' means an internal scheduler problem |
scheduler.scheduling.algorithm.priority_duration.count | Kube Scheduler Scheduling Algorithm Priority Evaluation Count | NULL | Scheduling algorithm priority evaluation duration | |
scheduler.client.http.requests_duration.count | Kube Scheduler Rest Client Request Latency Seconds Count | NULL | Total request latency in seconds. Broken down by verb and URL | |
scheduler.gc_duration_seconds.count | Kube Scheduler Go GC Duration Seconds Count | NULL | A summary of the GC invocation durations | |
scheduler.scheduling.scheduling_latency.sum | Kube Scheduler Scheduling Latency Seconds Sum | NULL | Scheduling latency in seconds split by sub-parts of the scheduling operation | |
scheduler.pod_preemption.attempts | Kube Scheduler Total Preemption Attempts | NULL | Total preemption attempts in the cluster till now | |
scheduler.client.http.requests_duration.sum | Kube Scheduler Rest Client Request Latency Seconds Sum | NULL | Request latency in seconds sum. Broken down by verb and URL | |
scheduler.e2e.scheduling_latency.sum | Kube Scheduler E2E Scheduling Latency Microseconds Sum | NULL | E2e scheduling latency in microseconds (scheduling algorithm + binding) | |
scheduler.scheduling.algorithm.preemption_duration.count | Kube Scheduler Scheduling Algorithm Preemption Evaluation Count | NULL | Scheduling algorithm preemption evaluation duration | |
scheduler.scheduling.algorithm.predicate_duration.count | Kube Scheduler Scheduling Algorithm Predicate Evaluation Count | NULL | Scheduling algorithm predicate evaluation duration | |
scheduler.scheduling.algorithm.predicate_duration.sum | Kube Scheduler Scheduling Algorithm Predicate Evaluation Sum | NULL | Scheduling algorithm predicate evaluation duration | |
scheduler.pod_preemption.victims | Kube Scheduler Pod Preemption Victims | NULL | Number of selected preemption victims | |
scheduler.binding.duration.count | Kube Scheduler Binding Duration Seconds Count | NULL | Total Binding duration in seconds count | |
scheduler.scheduling.algorithm.priority_duration.sum | Kube Scheduler Scheduling Algorithm Priority Evaluation Sum | NULL | Scheduling algorithm priority evaluation duration | |
scheduler.binding.duration.seconds | Kube Scheduler Binding Duration Seconds Sum | NULL | Binding duration in seconds sum | |
scheduler.volume_scheduling_duration.count | Kube Scheduler Volume Scheduling Duration Seconds Count | NULL | Volume scheduling stage latency count | |
scheduler.e2e.scheduling_latency.count | Kube Scheduler E2E Scheduling Latency Microseconds Count | NULL | Total E2e scheduling latency in microseconds (scheduling algorithm + binding) | |
scheduler.binding.latency.count | Kube Scheduler Binding Latency Microseconds Count | NULL | Total Binding latency in microseconds count | |
scheduler.volume_scheduling_duration.sum | Kube Scheduler Volume Scheduling Duration Seconds Sum | NULL | Volume scheduling stage latency sum | |
scheduler.scheduling.scheduling_duration.quantile | Kube Scheduler Scheduling Duration Seconds | NULL | Scheduling latency in seconds split by sub-parts of the scheduling operation | |
scheduler.threads | Kube Scheduler OS Threads | NULL | Number of OS threads created | |
scheduler.process.open_fds | Kube Scheduler Process Open Fds | NULL | Number of open file descriptors | |
scheduler.go.goroutines | Kube Scheduler Go Goroutines | NULL | Number of goroutines that currently exist | |
scheduler.process.max_fds | Kube Scheduler Process Max Fds | NULL | Maximum number of open file descriptors | |
scheduler.binding.latency.sum | Kube Scheduler Binding Latency Microseconds Sum | NULL | Binding latency in microseconds sum | |
scheduler.scheduling.algorithm_latency.sum | Kube Scheduler Scheduling Algorithm Latency Microseconds Sum | NULL | Scheduling algorithm latency in microseconds sum | |
scheduler.e2e.scheduling_duration.sum | Kube Scheduler E2E Scheduling Duration Seconds Sum | NULL | E2e scheduling latency in seconds (scheduling algorithm + binding) | |
scheduler.scheduling.algorithm_latency.count | Kube Scheduler Scheduling Algorithm Latency Microseconds Count | NULL | Total Scheduling algorithm latency in microseconds count | |
scheduler.scheduling.scheduling_duration.sum | Kube Scheduler Scheduling Duration Seconds Sum | NULL | Scheduling latency in seconds split by sub-parts of the scheduling operation | |
scheduler.client.http.requests | Kube Scheduler Rest Client Requests Total | NULL | Number of HTTP requests, partitioned by status code, method, and host | |
scheduler.e2e.scheduling_duration.count | Kube Scheduler E2E Scheduling Duration Seconds Count | NULL | Total E2e scheduling latency in seconds (scheduling algorithm + binding) | |
scheduler.scheduling.algorithm.preemption_duration.sum | Kube Scheduler Scheduling Algorithm Preemption Evaluation Sum | NULL | Scheduling algorithm preemption evaluation duration | |
scheduler.cache.lookups | Kube Scheduler Equiv Cache Lookups Total | NULL | Total number of equivalence cache lookups, by whether or not a cache entry was found | |
scheduler.gc_duration_seconds.sum | Kube Scheduler Go GC Duration Seconds Sum | NULL | A summary of the GC invocation durations | |
scheduler.scheduling.algorithm_duration.count | Kube Scheduler Scheduling Algorithm Duration Seconds Count | NULL | Total Scheduling algorithm latency in seconds count | |
scheduler.scheduling.scheduling_latency.count | Kube Scheduler Scheduling Latency Seconds Count | NULL | Scheduling latency in seconds split by sub-parts of the scheduling operation | |
scheduler.scheduling.algorithm_duration.sum | Kube Scheduler Scheduling Algorithm Duration Seconds Sum | NULL | Scheduling algorithm latency in seconds sum | |
scheduler.scheduling.scheduling_duration.count | Kube Scheduler Scheduling Duration Seconds Count | NULL | Scheduling latency in seconds split by sub-parts of the scheduling operation | |
scheduler.scheduling.scheduling_latency.quantile | Kube Scheduler Scheduling Latency Seconds | NULL | Scheduling latency in seconds split by sub-parts of the scheduling operation | |
scheduler.gc_duration_seconds.quantile | Kube Scheduler Go GC Duration Seconds | NULL | A summary of the GC invocation durations |
Agent G2 - Linux - Kubernetes State
Description
Template for monitoring Kubernetes using Kube State
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Kubernetes State | kubernetes_state.node.memory_capacity | Node Memory Capacity | NULL | The total memory resources of the node. |
kubernetes_state.resourcequota.limits.memory.limit | Resourcequota Limits Memory Limit | NULL | Hard limit on the sum of memory bytes limits for a resource quota | |
kubernetes_state.node.cpu_capacity | Node Cpu Capacity | NULL | The total CPU resources of the node. | |
kubernetes_state.container.memory_limit | Container Memory Limit | NULL | The limit on memory to be used by a container | |
kubernetes_state.daemonset.scheduled | Daemonset Scheduled | NULL | The number of nodes running at least one daemon pod and are supposed to | |
kubernetes_state.deployment.replicas_available | Deployment Replicas Available | NULL | The number of available replicas per deployment. | |
kubernetes_state.node.memory_allocatable | Node Memory Allocatable | NULL | The memory resources of a node that are available for scheduling. | |
kubernetes_state.resourcequota.persistentvolumeclaims.used | Resourcequota Persistentvolumeclaims Used | NULL | Observed number of persistent volume claims used for a resource quota | |
kubernetes_state.deployment.rollingupdate.max_unavailable | Deployment Rollingupdate Max Unavailable | NULL | Maximum number of unavailable replicas during a rolling update of a deployment. | |
kubernetes_state.deployment.replicas_desired | Deployment Replicas Desired | NULL | The number of desired replicas per deployment wrong help in kube-state-metrics.cross check | |
kubernetes_state.resourcequota.services.loadbalancers.limit | Resourcequota Services Loadbalancers Limit | NULL | Hard limit of the number of loadbalancers for a resource quota | |
kubernetes_state.resourcequota.pods.limit | Resourcequota Pods Limit | NULL | Hard limit of the number of pods for a resource quota | |
kubernetes_state.resourcequota.services.limit | Resourcequota Services Limit | NULL | Hard limit of the number of services for a resource quota | |
kubernetes_state.resourcequota.services.used | Resourcequota Services Used | NULL | Observed number of services used for a resource quota | |
kubernetes_state.resourcequota.requests.cpu.used | Resourcequota Requests Cpu Used | NULL | Observed sum of CPU cores requested for a resource quota | |
kubernetes_state.resourcequota.limits.memory.used | Resourcequota Limits Memory Used | NULL | Observed sum of limits for memory bytes for a resource quota | |
kubernetes_state.resourcequota.persistentvolumeclaims.limit | Resourcequota Persistentvolumeclaims Limit | NULL | Hard limit of the number of PVC for a resource quota | |
kubernetes_state.resourcequota.pods.used | Resourcequota Pods Used | NULL | Observed number of pods used for a resource quota | |
kubernetes_state.resourcequota.requests.memory.limit | Resourcequota Requests Memory Limit | NULL | Hard limit on the total of memory bytes requested for a resource quota | |
kubernetes_state.resourcequota.requests.storage.limit | Resourcequota Requests Storage Limit | NULL | Hard limit on the total of storage bytes requested for a resource quota | |
kubernetes_state.daemonset.misscheduled | Daemonset Misscheduled | NULL | The number of nodes running a daemon pod but are not supposed to. | |
kubernetes_state.deployment.replicas_unavailable | Deployment Replicas Unavailable | NULL | The number of unavailable replicas per deployment. | |
kubernetes_state.resourcequota.limits.cpu.limit | Resourcequota Limits Cpu Limit | NULL | Hard limit on the sum of CPU core limits for a resource quota | |
kubernetes_state.daemonset.ready | Daemonset Ready | NULL | The number of nodes that should be running the daemon pod and have one or more of the daemon pod running and ready. | |
kubernetes_state.node.pods_capacity | Node Pods Capacity | NULL | The total pod resources of the node. | |
kubernetes_state.replicaset.fully_labeled_replicas | Replicaset Fully Labeled Replicas | NULL | The number of fully labeled replicas per ReplicaSet. | |
kubernetes_state.deployment.replicas | Deployment Replicas | NULL | The number of replicas per deployment. | |
kubernetes_state.resourcequota.requests.cpu.limit | Resourcequota Requests Cpu Limit | NULL | Hard limit on the total of CPU core requested for a resource quota | |
kubernetes_state.replicaset.replicas_ready | Replicaset Replicas Ready | NULL | The number of ready replicas per ReplicaSet | |
kubernetes_state.container.memory_requested | Container Memory Requested | NULL | The number of requested memory bytes by a container | |
kubernetes_state.node.cpu_allocatable | Node Cpu Allocatable | NULL | The CPU resources of a node that are available for scheduling. | |
kubernetes_state.replicaset.replicas_desired | Replicaset Replicas Desired | NULL | Number of desired pods for a ReplicaSet | |
kubernetes_state.container.cpu_limit | Container Cpu Limit | NULL | The limit on cpu cores to be used by a container | |
kubernetes_state.replicaset.replicas | Replicaset Replicas | NULL | The number of replicas per ReplicaSet. | |
kubernetes_state.resourcequota.services.nodeports.used | Resourcequota Services Nodeports Used | NULL | Observed number of node ports used for a resource quota | |
kubernetes_state.container.cpu_requested | Container Cpu Requested | NULL | The number of requested cpu cores by a container | |
kubernetes_state.resourcequota.services.loadbalancers.used | Resourcequota Services Loadbalancers Used | NULL | Observed number of loadbalancers used for a resource quota | |
kubernetes_state.resourcequota.limits.cpu.used | Resourcequota Limits Cpu Used | NULL | Observed sum of limits for CPU cores for a resource quota | |
kubernetes_state.resourcequota.requests.memory.used | Resourcequota Requests Memory Used | NULL | Observed sum of memory bytes requested for a resource quota | |
kubernetes_state.daemonset.desired | Daemonset Desired | NULL | The number of nodes that should be running the daemon pod. | |
kubernetes_state.container.restarts | Container Restarts | NULL | The number of restarts per container | |
kubernetes_state.resourcequota.services.nodeports.limit | Resourcequota Services Nodeports Limit | NULL | Hard limit of the number of node ports for a resource quota | |
kubernetes_state.deployment.replicas_updated | Deployment Replicas Updated | NULL | The number of updated replicas per deployment. | |
kubernetes_state.resourcequota.requests.storage.used | Resourcequota Requests Storage Used | NULL | Observed sum of storage bytes requested for a resource quota |
Agent G2 - Linux - Kubernetes State Monitoring
Description
Monitors to collect kube-state-metrics from Kubernetes cluster.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Kubernetes State Monitoring | kubernetes_state.resourcequota.limits.memory.used | Resourcequota Limits Memory Used | NULL | Observed sum of limits for memory bytes for a resource quota |
kubernetes_state.deployment.replicas_available | Deployment Replicas Available | NULL | The number of available replicas per deployment. | |
kubernetes_state.deployment.replicas_unavailable | Deployment Replicas Unavailable | NULL | The number of unavailable replicas per deployment. | |
kubernetes_state.resourcequota.persistentvolumeclaims.limit | Resourcequota Persistentvolumeclaims Limit | NULL | Hard limit of the number of PVC for a resource quota | |
kubernetes_state.resourcequota.limits.cpu.limit | Resourcequota Limits Cpu Limit | NULL | Hard limit on the sum of CPU core limits for a resource quota | |
kubernetes_state.resourcequota.services.limit | Resourcequota Services Limit | NULL | Hard limit of the number of services for a resource quota | |
kubernetes_state.resourcequota.requests.storage.limit | Resourcequota Requests Storage Limit | NULL | Hard limit on the total of storage bytes requested for a resource quota | |
kubernetes_state.daemonset.misscheduled | Daemonset Misscheduled | NULL | The number of nodes running a daemon pod but are not supposed to. | |
kubernetes_state.node.cpu_capacity | Node Cpu Capacity | NULL | The total CPU resources of the node. | |
kubernetes_state.resourcequota.services.loadbalancers.limit | Resourcequota Services Loadbalancers Limit | NULL | Hard limit of the number of loadbalancers for a resource quota | |
kubernetes_state.container.memory_requested | Container Memory Requested | NULL | The number of requested memory bytes by a container | |
kubernetes_state.deployment.replicas | Deployment Replicas | NULL | The number of replicas per deployment. | |
kubernetes_state.resourcequota.requests.memory.limit | Resourcequota Requests Memory Limit | NULL | Hard limit on the total of memory bytes requested for a resource quota | |
kubernetes_state.resourcequota.requests.memory.used | Resourcequota Requests Memory Used | NULL | Observed sum of memory bytes requested for a resource quota | |
kubernetes_state.replicaset.replicas | Replicaset Replicas | NULL | The number of replicas per ReplicaSet. | |
kubernetes_state.deployment.rollingupdate.max_unavailable | Deployment Rollingupdate Max Unavailable | NULL | Maximum number of unavailable replicas during a rolling update of a deployment. | |
kubernetes_state.deployment.replicas_updated | Deployment Replicas Updated | NULL | The number of updated replicas per deployment. | |
kubernetes_state.node.pods_allocatable | Node Pods Allocatable | NULL | The pod resources of a node that are available for scheduling. | |
kubernetes_state.resourcequota.limits.cpu.used | Resourcequota Limits Cpu Used | NULL | Observed sum of limits for CPU cores for a resource quota | |
kubernetes_state.resourcequota.services.used | Resourcequota Services Used | NULL | Observed number of services used for a resource quota | |
kubernetes_state.daemonset.desired | Daemonset Desired | NULL | The number of nodes that should be running the daemon pod. | |
kubernetes_state.daemonset.scheduled | Daemonset Scheduled | NULL | The number of nodes running at least one daemon pod and are supposed to | |
kubernetes_state.resourcequota.services.nodeports.used | Resourcequota Services Nodeports Used | NULL | Observed number of node ports used for a resource quota | |
kubernetes_state.replicaset.fully_labeled_replicas | Replicaset Fully Labeled Replicas | NULL | The number of fully labeled replicas per ReplicaSet. | |
kubernetes_state.resourcequota .requests.cpu.limit | Resourcequota Requests Cpu Limit | NULL | Hard limit on the total of CPU core requested for a resource quota | |
kubernetes_state.resourcequota.services.nodeports.limit | Resourcequota Services Nodeports Limit | NULL | Hard limit of the number of node ports for a resource quota | |
kubernetes_state.node.memory_capacity | Node Memory Capacity | NULL | The total memory resources of the node. | |
kubernetes_state.daemonset.ready | Daemonset Ready | NULL | The number of nodes that should be running the daemon pod and have one or more of the daemon pod running and ready. | |
kubernetes_state.deployment.replicas_desired | Deployment Replicas Desired | NULL | The number of desired replicas per deployment wrong help in kube-state-metrics.cross check | |
kubernetes_state.replicaset.replicas_desired | Replicaset Replicas Desired | NULL | Number of desired pods for a ReplicaSet | |
kubernetes_state.resourcequota.requests.cpu.used | Resourcequota Requests Cpu Used | NULL | Observed sum of CPU cores requested for a resource quota | |
kubernetes_state.resourcequota.services.loadbalancers.used | Resourcequota Services Loadbalancers Used | NULL | Observed number of loadbalancers used for a resource quota | |
kubernetes_state.container.restarts | Container Restarts | NULL | The number of restarts per container | |
kubernetes_state.resourcequota.requests.storage.used | Resourcequota Requests Storage Used | NULL | Observed sum of storage bytes requested for a resource quota | |
kubernetes_state.node.pods_capacity | Node Pods Capacity | NULL | The total pod resources of the node. | |
kubernetes_state.resourcequota.pods.limit | Resourcequota Pods Limit | NULL | Hard limit of the number of pods for a resource quota | |
kubernetes_state.node.memory_allocatable | Node Memory Allocatable | NULL | The memory resources of a node that are available for scheduling. | |
kubernetes_state.node.cpu_allocatable | Node Cpu Allocatable | NULL | The CPU resources of a node that are available for scheduling. | |
kubernetes_state.replicaset.replicas_ready | Replicaset Replicas Ready | NULL | The number of ready replicas per ReplicaSet | |
kubernetes_state.container.memory_limit | Container Memory Limit | NULL | The limit on memory to be used by a container | |
kubernetes_state.resourcequota.pods.used | Resourcequota Pods Used | NULL | Observed number of pods used for a resource quota | |
kubernetes_state.container.cpu_limit | Container Cpu Limit | NULL | The limit on cpu cores to be used by a container | |
kubernetes_state.resourcequota.limits.memory.limit | Resourcequota Limits Memory Limit | NULL | Hard limit on the sum of memory bytes limits for a resource quota | |
kubernetes_state.container.cpu_requested | Container Cpu Requested | NULL | The number of requested cpu cores by a container | |
kubernetes_state.resourcequota.persistentvolumeclaims.used | Resourcequota Persistentvolumeclaims Used | NULL | Observed number of persistent volume claims used for a resource quota |
Agent G2 - Linux - KVM Monitoring Template
Description
Agent G2 - Linux - KVM Monitoring Template
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - KVM Monitoring Template | kvm.disk | Kvm Disk | NULL | |
kvm.disk.iops | Kvm Disk Iops | NULL | ||
kvm.network | Kvm Network | NULL | ||
kvm.domain.states | Kvm Domain States | NULL | ||
kvm.memory | Kvm Memory | NULL | ||
kvm.cpu | Kvm Cpu | NULL | ||
kvm.total.domains | Kvm Total Domains | NULL |
Agent G2 - Linux - KVM Monitors
Description
Linux KVM Monitoring Template
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - KVM Monitors | kvm.disk | Kvm Disk | NULL | |
kvm.disk.iops | Kvm Disk Iops | NULL | ||
kvm.network | Kvm Network | NULL | ||
kvm.domain.states | Kvm Domain States | NULL | ||
kvm.memory | Kvm Memory | NULL | ||
kvm.cpu | Kvm Cpu | NULL | ||
kvm.total.domains | Kvm Total Domains | NULL |
Agent G2 - Linux - Lighttpd Monitors
Description
Monitors Lighttpd application metrics from the server-status module.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Lighttpd Monitors | lighttpd.busy_servers | Lighttpd Busy Servers | NULL | Provides the number of busy servers |
lighttpd.uptime | Lighttpd Uptime | NULL | Checks the uptime lighttpd service | |
lighttpd.bytes_per_request | Lighttpd-BytesPerRequest | NULL | Provides the number of bytes transferred per request | |
lighttpd.bytes_per_request | Lighttpd Bytes per request | NULL | Provides the number of bytes transferred per request | |
lighttpd.idle_servers | Lighttpd Idle Servers | NULL | Provides the number of idle servers | |
lighttpd.open.slots | Lighttpd Open Slots | NULL | Provides the number of open slots | |
lighttpd.total.accesses | Lighttpd Total Accesses | NULL | Provides the total number of accesses made | |
lighttpd.bytes_rate | Lighttpd Bytes Rate | NULL | Provides the number of bytes transferred per second | |
lighttpd.requests_rate | Lighttpd Requests Rate | NULL | Provides the number of requests made per second | |
lighttpd.total.kbytes | Lighttpd Total kBytes | NULL | Provides the number of total kbytes |
Agent G2 - Linux - Memcache Monitors
Description
Monitors Memcache application metrics
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Memcache Monitors | memcache.curr_connections | Memcache-CurrentConnections | NULL | Number of open connections |
memcache.memory_used | Memcache-MemoryUsed | NULL | Total memory used by the server engine | |
memcache.uptime | Memcache-Uptime | NULL | Number of minutes this server has been running | |
memcache.tempoom_rate | Memcache-TempOOMPersec | NULL | Number of temporary out-of-memory errors sent to clients per second. | |
memcache.gets_rate | Memcache-GetsPersec | NULL | Cumulative number of get requests for node per second | |
memcache.sets_rate | Memcache-SetsPersec | NULL | Cumulative number of set requests for node per second | |
memcache.avg_item_size | Memcache-AvgItemSize | NULL | Average size of an item | |
memcache.misses_rate | Memcache-MissesPersec | NULL | Number of items that have been requested but not found per second. | |
memcache.bytes_written_rate | Memcache-WrittenBytesPersec | NULL | Average data written by this server in a second from the network in MB | |
memcache.fill_percent | Memcache-FillPercent | NULL | Percentage of bytes used by this server | |
memcache.ratio.resident.item | Memcache-ResidentItemRatio | NULL | Percentage of items that are resident (in RAM) | |
memcache.cas_hits_rate | Memcache-CASHitsPersec | NULL | Number of successful CAS operations per second. | |
memcache.evictions_rate | Memcache-EvictionsPersec | NULL | Number of valid items removed per second, from cache to free memory for new items. | |
memcache.bytes_read_rate | Memcache-ReadBytesPersec | NULL | Average data read by this server in a second from the network in MB. | |
memcache.get_hit_percent | Memcache-GetHitPercent | NULL | Percentage number of keys that have been requested and found. This value must be more for an optimal performance. | |
memcache.connections_rate | Memcache-ConnectionsPersec | NULL | Average number of connections per second | |
memcache.ops_rate | Memcache-OpsPersec | NULL | Number of total operations for node per second. | |
memcache.hits_rate | Memcache-HitsPersec | NULL | Number of keys that have been requested and found per second. | |
memcache.disk_reads_rate | Memcache-DiskReadsPersec | NULL | Number of items fetched from disk. | |
memcache.cas_badval_rate | Memcache-CASBadvalPersec | NULL | Number of CAS operations per second that failed to modify a value due to a bad CAS id | |
memcache.curr_items | Memcache-CurrentItems | NULL | Current number of items stored by the server. | |
memcache.ratio.cache.miss | Memcache-CacheMissRatio | NULL | Percentage number of items fetched from disk against total requests. | |
memcache.cas_misses_rate | Memcache-CASMissesPersec | NULL | Number of CAS operations per second against missing keys. | |
memcache.delete_hits_rate | Memcache-DeletesPersec | NULL | Number of successful deletions per second |
Agent G2 - Linux - Memcached Performance Check
Description
Monitors Memcache application metrics
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Memcached Performance Check | memcache.curr_connections | Memcache-CurrentConnections | NULL | Number of open connections |
memcache.memory_used | Memcache-MemoryUsed | NULL | Total memory used by the server engine | |
memcache.uptime | Memcache-Uptime | NULL | Number of minutes this server has been running | |
memcache.tempoom_rate | Memcache-TempOOMPersec | NULL | Number of temporary out-of-memory errors sent to clients per second. | |
memcache.gets_rate | Memcache-GetsPersec | NULL | Cumulative number of get requests for node per second | |
memcache.sets_rate | Memcache-SetsPersec | NULL | Cumulative number of set requests for node per second | |
memcache.avg_item_size | Memcache-AvgItemSize | NULL | Average size of an item | |
memcache.misses_rate | Memcache-MissesPersec | NULL | Number of items that have been requested but not found per second. | |
memcache.bytes_written_rate | Memcache-WrittenBytesPersec | NULL | Average data written by this server in a second from the network in MB | |
memcache.fill_percent | Memcache-FillPercent | NULL | Percentage of bytes used by this server | |
memcache.ratio.resident.item | Memcache-ResidentItemRatio | NULL | Percentage of items that are resident (in RAM) | |
memcache.cas_hits_rate | Memcache-CASHitsPersec | NULL | Number of successful CAS operations per second. | |
memcache.evictions_rate | Memcache-EvictionsPersec | NULL | Number of valid items removed per second, from cache to free memory for new items. | |
memcache.bytes_read_rate | Memcache-ReadBytesPersec | NULL | Average data read by this server in a second from the network in MB. | |
memcache.get_hit_percent | Memcache-GetHitPercent | NULL | Percentage number of keys that have been requested and found. This value must be more for an optimal performance. | |
memcache.connections_rate | Memcache-ConnectionsPersec | NULL | Average number of connections per second | |
memcache.ops_rate | Memcache-OpsPersec | NULL | Number of total operations for node per second. | |
memcache.hits_rate | Memcache-HitsPersec | NULL | Number of keys that have been requested and found per second. | |
memcache.disk_reads_rate | Memcache-DiskReadsPersec | NULL | Number of items fetched from disk. | |
memcache.cas_badval_rate | Memcache-CASBadvalPersec | NULL | Number of CAS operations per second that failed to modify a value due to a bad CAS id | |
memcache.curr_items | Memcache-CurrentItems | NULL | Current number of items stored by the server. | |
memcache.ratio.cache.miss | Memcache-CacheMissRatio | NULL | Percentage number of items fetched from disk against total requests. | |
memcache.cas_misses_rate | Memcache-CASMissesPersec | NULL | Number of CAS operations per second against missing keys. | |
memcache.delete_hits_rate | Memcache-DeletesPersec | NULL | Number of successful deletions per second |
Agent G2 - Linux - Memory Statistics
Description
Agent G2 - Linux - Memory Statistics
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Memory Statistics | PhysicalMemory | Physical Memory | MB | Physical memory information |
memory.stats.swapping.rate | Virtual Memory Swapping Rate | KB/s | Rate of swapping the entire process from physical memory to disk and vice-versa. | |
memory.stats.swap.util.rate | Swap Memory Usage Rate | NULL | Rate of swap memory utilization. | |
memory.stats.physical.util.rate | Physical Memory Usage Rate | NULL | Rate of physical memory utilization. | |
SwapMemory | Swap Memory | MB | Swap memory information. | |
memory.stats.paging.rate | Virtual Memory Paging Rate | KB/s | Rate of swapping least used process memory blocks from physical memory to disk and vice-versa. |
Agent G2 - Linux - Mesos Agent Monitoring Template
Description
Agent G2 - Linux - Mesos Agent Monitoring Template
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Mesos Agent Monitoring Template | mesos.slave.tasks_running | Tasks running | NULL | Number of running tasks |
mesos.slave.invalid_framework_messages | Invalid framework messages | NULL | Number of invalid framework messages | |
mesos.slave.frameworks_active | Frameworks active | NULL | Number of active frameworks | |
mesos.slave.disk_percent | Allocated disk space percent | NULL | Percentage of allocated disk space | |
mesos.slave.tasks_lost | Tasks lost | NULL | Number of lost tasks | |
mesos.slave.executors_terminated | Executors terminated | NULL | Number of executors terminated | |
mesos.slave.gpus_percent | Allocated GPUs percent | NULL | Percentage of allocated GPUs | |
mesos.slave.cpus_used | Number of CPUs used | NULL | Number of allocated CPUs | |
mesos.slave.tasks_starting | Tasks starting | NULL | Number of starting tasks | |
mesos.slave.valid_framework_messages | Valid framework messages | NULL | Number of valid framework messages | |
mesos.state.task.cpu | Task cpu | NULL | Task cpu | |
mesos.slave.invalid_status_updates | Invalid status updates | NULL | Number of invalid status updates | |
mesos.slave.valid_status_updates | Mesos slave valid_status_updates | NULL | Number of valid status updates | |
mesos.stats.system.load_15min | Mesos stats system load_15min | NULL | Load average for the past 15 minutes | |
mesos.slave.disk_total | Disk space total | NULL | Disk space | |
mesos.slave.mem_total | Memory total | NULL | Total memory | |
mesos.slave.recovery_errors | Recovery errors | NULL | Number of errors encountered during slave recovery | |
mesos.stats.uptime_secs | Uptime seconds | NULL | Slave uptime | |
mesos.slave.executors_running | Executors running | NULL | Number of executors running | |
mesos.slave.mem_percent | Allocated memory percent | NULL | Percentage of allocated memory | |
mesos.stats.system.mem_total_bytes | Mesos stats system mem_total_bytes | NULL | Total memory | |
mesos.slave.disk_used | Allocated disk space | NULL | Allocated disk space | |
mesos.slave.executors_terminating | Executors terminating | NULL | Number of executors terminating | |
mesos.slave.tasks_finished | Tasks finished | NULL | Number of finished tasks | |
mesos.slave.gpus_used | Number of GPUs used | NULL | Number of allocated GPUs | |
mesos.slave.cpus_total | CPUs total | NULL | Number of CPUs | |
mesos.slave.mem_used | Allocated memory | NULL | Used memory | |
mesos.stats.system.load_1min | Mesos stats system load_1min | NULL | Load average for the past 1 minute | |
mesos.slave.tasks_staging | Tasks staging | NULL | Number of staging tasks | |
mesos.state.task.mem | Task mem | NULL | Task memory | |
mesos.slave.executors_registering | Executors registering | NULL | Number of executors registering | |
mesos.slave.tasks_failed | Tasks failed | NULL | Number of failed tasks | |
mesos.slave.cpus_percent | Alloated CPUs percent | NULL | Percentage of allocated CPUs | |
mesos.stats.system.mem_free_bytes | Mesos stats system mem_free_bytes | NULL | Free memory | |
mesos.stats.system.load_5min | Mesos stats system load_5min | NULL | Load average for the past 5 minutes | |
mesos.state.task.disk | Task disk | NULL | Task disk | |
mesos.stats.system.cpus_total | Number of CPUs | NULL | Number of CPUs | |
mesos.slave.gpus_total | GPUs total | NULL | Number of GPUs | |
mesos.slave.tasks_killed | Tasks killed | NULL | Number of killed tasks |
Agent G2 - Linux - Mesos Master Monitoring Template
Description
Mesos Master Monitoring Template
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Mesos Master Monitoring Template | marathon.queue.offers.reject.last | Marathon Queue Offeres Reject Last | NULL | Summary of unused offers for all last offers |
mesos.role.disk | Mesos Role Disk | NULL | Mesos Role Disk | |
marathon.queue.offers.unused | Marathon Queue Offers Unused | NULL | The number of unused offers for this launch attempt | |
dcos.health.admin.router.agent | Admin router agent service health | NULL | Admin Router Agent | |
mesos.cluster.slaves_unreachable | Agents unreachable | NULL | Number of unreachable agents. Unreachable agents are periodically garbage collected from the registry, which will cause this value to decrease. | |
dcos.health.telegraf.socket | Telegraf socket health | NULL | Telegraf Socket | |
marathon.instances | Marathon Instances | NULL | Number of instances of a given application | |
marathon.queue.size | Marathon Queue Size | NULL | Number of app offer queues | |
dcos.health.component.package.manager | Component Package Manager (Pkgpanda) service health | NULL | DC/OS Component Package Manager (Pkgpanda) | |
mesos.stats.elected | Elected as master | NULL | Whether this is the elected master | |
mesos.cluster.mem_percent | Allocated memory percent | NULL | Percentage of allocated memory | |
mesos.registrar.state_store_ms.p9999 | Registrar state_store_ms.p9999 | NULL | 99.99th percentile registry write latency in ms | |
marathon.backoffSeconds | Marathon Backoff Seconds | NULL | Task backoff period | |
dcos.health.telegraf | Telegraf service health | NULL | Telegraf | |
mesos.cluster.mem_used | Allocated memory | NULL | Allocated memory in MB | |
mesos.cluster.frameworks_active | Frameworks active | NULL | Number of active frameworks | |
dcos.health.signal.timer | Signal Timer service health | NULL | DC/OS Signal Timer | |
marathon.queue.count | Marathon Queue Count | NULL | Number of instances left to launch | |
dcos.health.signal | Signal service health | NULL | DC/OS Signal | |
dcos.health.marathon | Marathon service health | NULL | Marathon | |
mesos.cluster.invalid_status_updates | Invalid status updates | NULL | Number of invalid status updates | |
mesos.registrar.state_store_ms.p999 | Registrar state_store_ms.p999 | NULL | 99.9th percentile registry write latency in ms | |
dcos.health.log.agent | Agent Log service health | NULL | DC/OS Log Agent | |
dcos.health.diagnostics.agent.socket | Diagnostics Agent socket health | NULL | DC/OS Diagnostics Agent Socket | |
mesos.cluster.disk_used | Allocated disk space | NULL | Allocated disk space in MB | |
mesos.cluster.gpus_used | Number of GPUs used | NULL | Number of allocated GPUs | |
mesos.cluster.tasks_error | Invalid tasks | NULL | Number of tasks that were invalid | |
mesos.registrar.state_store_ms.max | Max registry write latency | NULL | Maximum registry write latency in ms | |
dcos.health.authentication | Authentication service health | NULL | DC/OS Authentication (OAuth) | |
mesos.cluster.event_queue_http_requests | Event queue HTTP requests | NULL | Number of HTTP requests in the event queue | |
marathon.queue.offers.reject.launch | Marathon Queue Offeres Reject Launch | NULL | Summary of unused offers for the launch attempt | |
mesos.cluster.event_queue_messages | Event queue messages | NULL | Number of messages in the event queue | |
mesos.registrar.state_store_ms.p50 | Registrar state_store_ms.p50 | NULL | Median registry write latency in ms | |
mesos.cluster.frameworks_disconnected | Frameworks disconnected | NULL | Number of disconnected frameworks | |
mesos.cluster.slaves_inactive | Agents inactive | NULL | Number of inactive agents | |
mesos.stats.system.load_5min | Mesos stats system load_5min | NULL | Load average for the past 5 minutes | |
mesos.cluster.tasks_failed | Failed tasks | NULL | Number of failed tasks | |
mesos.cluster.cpus_total | CPUs total | NULL | Number of CPUs | |
dcos.health.mesos.master | Mesos Master service health | NULL | Mesos Master | |
dcos.health.mesos.agent.public | Mesos Public Agent service health | NULL | Mesos Agent Public | |
mesos.stats.system.load_15min | Mesos stats system load_15min | NULL | Load average for the past 15 minutes | |
mesos.cluster.valid_status_update_acknowledgements | Valid status update acknowledgement messages | NULL | Number of valid status update acknowledgement messages | |
mesos.registrar.state_store_ms.count | Registry write count | NULL | Registry write count | |
mesos.cluster.slaves_connected | Agents connected | NULL | Number of connected agents | |
mesos.cluster.tasks_finished | Tasks finished | NULL | Number of finished tasks | |
dcos.health.mesos.agent | Mesos Agent service health | NULL | Mesos Agent | |
dcos.health.mesos.dns | Mesos DNS service health | NULL | Mesos DNS | |
mesos.cluster.frameworks_inactive | Frameworks inactive | NULL | Number of inactive frameworks | |
mesos.cluster.gpus_total | GPUs total | NULL | Number of GPUs | |
mesos.registrar.state_store_ms | Registry write latency | NULL | Registry write latency in ms | |
mesos.framework.mem | Mesos Framework Memory | NULL | Mesos Framework Memory | |
marathon.tasksHealthy | Marathon Tasks Healthy | NULL | Number of healthy tasks for a given application | |
dcos.health.logrotate.agent.timer | Logrotate Timer | NULL | Logrotate Timer | |
dcos.health.net.watchdog | Net Watchdog service health | NULL | DC/OS Net Watchdog | |
dcos.health.admin.router.master | Admin router master service health | NULL | Admin Router Master | |
dcos.health.jobs | Jobs (Metronome) service health | NULL | DC/OS Jobs (Metronome) | |
dcos.health.resolv.timer | Generate resolv.conf Timer service health | NULL | Generate resolv.conf Timer | |
mesos.cluster.tasks_starting | Tasks starting | NULL | Number of starting tasks | |
marathon.deployments | Marathon Deployments | NULL | Number of running or pending deployments | |
dcos.health.package.manager | Package Manager (Cosmos) service health | NULL | DC/OS Package Manager (Cosmos) | |
dcos.health.log.agent.socket | Agent Log socket health | NULL | DC/OS Log Socket | |
dcos.health.checks.timer | Checks Timer service health | NULL | DC/OS Checks Timer | |
mesos.cluster.cpus_used | Number of CPUs used | NULL | Number of allocated CPUs | |
dcos.health.diagnostics.agent | Diagnostics Agent service health | NULL | DC/OS Diagnostics Agent | |
marathon.taskRateLimit | Marathon Task Rate Limit | NULL | The task rate limit for a given application | |
mesos.cluster.recovery_slave_removals | Agents removals | NULL | Number of agent removed for various reasons, including maintenance. | |
mesos.stats.system.cpus_total | Number of CPUs | NULL | Number of CPUs | |
marathon.tasksStaged | Marathon Tasks Staged | NULL | Number of tasks staged for a given application | |
mesos.role.cpu | Mesos Role CPU | NULL | Mesos Framework Disk | |
marathon.queue.offers.processed | Marathon Queue Offers Processed | NULL | The number of processed offers for this launch attempt | |
mesos.cluster.tasks_running | Tasks running | NULL | Number of running tasks | |
mesos.cluster.event_queue_dispatches | Dispatches in the event queue | NULL | Number of dispatches in the event queue | |
dcos.health.log.master | Master Log service health | NULL | DC/OS Log Master | |
dcos.health.poststart.checks | Poststart Checks service health | NULL | DC/OS Poststart Checks | |
mesos.stats.system.mem_free_bytes | Mesos stats system mem_free_bytes | NULL | Free memory | |
mesos.stats.uptime_secs | Uptime seconds | NULL | Slave uptime | |
marathon.disk | Marathon DISK | NULL | Configured DISKs for each instance of a given application | |
mesos.registrar.log.recovered | Registrar log recovered | NULL | Whether the replicated log for the registrar has caught up with the other masters in the cluster. A cluster is operational as long as a quorum of "recovered" masters is available in the cluster. | |
mesos.registrar.queued_operations | Mesos Registrar queued_operations | NULL | Mesos Registrar queued_operations | |
dcos.health.logrotate.master | Master Logrotate service health | NULL | Logrotate Mesos Master | |
mesos.cluster.mem_total | Memory total | NULL | Memory in MB | |
marathon.cpus | Marathon CPUs | NULL | Configured CPUs for each instance of a given application | |
mesos.cluster.cpus_percent | Alloated CPUs percent | NULL | Percentage of allocated CPUs | |
mesos.cluster.outstanding_offers | Outstanding resource offers | NULL | Number of outstanding resource offers | |
mesos.cluster.slaves_disconnected | Agents disconnected | NULL | Number of disconnected agents | |
mesos.stats.system.mem_total_bytes | Mesos stats system mem_total_bytes | NULL | Total memory | |
mesos.cluster.slave_shutdowns_scheduled | Slave shutdowns scheduled | NULL | Number of slaves which have failed their health check and are scheduled to be removed | |
mesos.registrar.state_store_ms.p99 | Registrar state_store_ms.p99 | NULL | 99th percentile registry write latency in ms | |
mesos.cluster.gpus_percent | Allocated GPUs percent | NULL | Percentage of allocated GPUs | |
marathon.backoffFactor | Marathon Backoff Factor | NULL | Backoff time multiplication factor for each consecutive failed task launch. | |
dcos.health.history | History service health | NULL | DC/OS History | |
dcos.health.checks.api | Checks API service health | NULL | DC/OS Checks API | |
mesos.cluster.tasks_lost | Tasks lost | NULL | Number of lost tasks | |
dcos.health.resolv | Generate resolv.conf service health | NULL | Generate resolv.conf | |
mesos.cluster.valid_framework_to_executor_messages | Valid framework to executor messages | NULL | Number of valid framework to executor messages | |
dcos.health.net | Net service health | NULL | DC/OS Net | |
mesos.framework.disk | Mesos Framework Disk | NULL | Mesos Framework Disk | |
marathon.tasksUnhealthy | Marathon Tasks Unhealthy | NULL | Number of unhealthy tasks for a given application | |
mesos.cluster.valid_status_updates | Valid status update messages | NULL | Number of valid status update messages | |
mesos.framework.cpu | Mesos Framework CPU | NULL | Mesos Framework CPU | |
mesos.cluster.frameworks_connected | Frameworks connected | NULL | Number of connected frameworks | |
dcos.health.log.master.socket | Master Log socket health | NULL | DC/OS Log Socket | |
mesos.cluster.invalid_status_update_acknowledgements | Number of invalid operation status update acknowledgements | NULL | Number of invalid operation status update acknowledgements | |
mesos.cluster.invalid_framework_to_executor_messages | Invalid framework to executor messages | NULL | Number of invalid framework to executor messages | |
dcos.health.gc.timer | Docker GC Timer | NULL | Docker GC Timer | |
mesos.cluster.tasks_killed | Tasks killed | NULL | Number of killed tasks | |
mesos.stats.system.load_1min | Mesos stats system load_1min | NULL | Load average for the past 1 minute | |
dcos.health.exhibitor | Exhibitor service health | NULL | Exhibitor | |
mesos.cluster.slave_shutdowns_canceled | Slave shutdowns canceled | NULL | Number of cancelled slave shutdowns | |
mesos.role.mem | Mesos Role Memory | NULL | Mesos Role Memory | |
mesos.cluster.slave_registrations | Slave registrations | NULL | Number of agent registrations | |
mesos.cluster.disk_percent | Allocated disk space percent | NULL | Percentage of allocated disk space | |
mesos.registrar.state_store_ms.p95 | Registrar state_store_ms.p95 | NULL | 95th percentile registry write latency in ms | |
dcos.health.logrotate.master.timer | Logrotate Timer | NULL | Logrotate Timer | |
dcos.health.rexray | REX-Ray service health | NULL | REX-Ray | |
mesos.cluster.dropped_messages | Dropped messages | NULL | Number of dropped messages | |
marathon.queue.delay | Marathon Queue Delay | NULL | Wait before the next launch attempt | |
marathon.mem | Marathon Memory | NULL | Configured memory for each instance of a given application | |
dcos.health.checks.api.socket | Checks API socket health | NULL | DC/OS Checks API Socket | |
mesos.cluster.slave_removals | Slave removals | NULL | Number of agent removed for various reasons, including maintenance | |
marathon.tasksRunning | Marathon Tasks Running | NULL | Number of tasks running for a given application | |
mesos.registrar.state_fetch_ms | Registry read latency | NULL | Registry read latency in ms | |
mesos.registrar.state_store_ms.p90 | Registrar state_store_ms.p90 | NULL | 90th percentile registry write latency in ms | |
mesos.cluster.tasks_staging | Tasks staging | NULL | Number of staging tasks | |
mesos.registrar.registry_size_bytes | Mesos Registrar registry_size_bytes | NULL | Mesos Registrar registry_size_bytes | |
mesos.cluster.disk_total | Disk space total | NULL | Disk space in MB | |
mesos.cluster.slaves_active | Agents active | NULL | Number of active agents | |
dcos.health.logrotate.agent | Agent Logrotate service health | NULL | Logrotate Mesos Agent | |
mesos.cluster.slave_reregistrations | Slave reregistrations | NULL | Number of agent re-registrations | |
mesos.registrar.state_store_ms.min | Min registry write latency | NULL | Minimum registry write latency in ms | |
dcos.health.gc | Docker GC | NULL | Docker GC | |
marathon.apps | Marathon Applications Count | NULL | Number of applications |
Agent G2 - Linux - MongoDB API Based Status and Performance Check
Description
Monitors the MongoDB using the MongoDB API.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - MongoDB API Based Status and Performance Check | mongo.network.requests | MongoDB Network Requests | NULL | Network Requests per second |
mongo.btree | MongoDB BTree | NULL | Provides Index Counters details as Accesses, Hits, Misses, Resets, Miss_ratio | |
mongo.heap.usage | MongoDB Heap Usage | NULL | Heap Usage | |
mongo.asserts | MongoDB Asserts | NULL | Provides the Mongo DB asserts | |
mongo.size | MongoDB Size | NULL | Provides the size of individual databases in Gigabytes | |
mongo.journaled.status | MongoDB Journaled Status | NULL | Provides the Mongo DB journaledstatus | |
mongo.memory | MongoDB Memory | NULL | Mongo DB resident, virtual, mapped memory | |
mongo.index.miss.ratio | MongoDB Index Miss Ratio | NULL | Provides the Mongo DB index miss ratio | |
mongo.pagefaults | MongoDB Page Faults | NULL | Page Faults | |
mongo.current.queue | MongoDB Current Queue | NULL | Current readers and writers in Queue | |
mongo.lock.percent | MongoDB Lock Percent | NULL | Lock percentage | |
mongo.index.size | MongoDB Index Size | NULL | Provides the Index size of databases in KBs | |
mongo.journal.commits | MongoDB Journal Commits In WL | NULL | Provides the number of journal commits that occurred in the write lock | |
mongo.replication.lag | MongoDB Replication Lag | NULL | Checks the replication lag | |
mongo.op.counters | MongoDB OP Counters | NULL | Provides opcounts for operations like insert, update, query, delete, getmore, command | |
mongo.cursors | MongoDB Cursors | NULL | Provides the Mongo DB cursors | |
mongo.traffic | MongoDB Traffic | NULL | MongoDB current available traffic | |
mongo.active.clients | MongoDB Active Clients | NULL | Current active, read and write clients | |
mongo.connections | MongoDB Connections | NULL | Mongo DB current and available connections | |
mongo.avg.flush | MongoDB Background Avg Flush | NULL | Provides the background average flush of dbs | |
mongo.uptime | MongoDB Uptime | NULL | Uptime in minutes |
Agent G2 - Linux - MongoDB Shard (mongos) Performance Check
Description
Monitors the mongos metric. This requires the python-pymongo package to be install. Check OpsRamp documents for more information
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - MongoDB Shard Mongos Performance Check | mongos.chunks | Mongos Chunks | NULL | Total number of mongos chunks |
mongos.chunks.balance | Mongos Chunks Balance | NULL | Provides Mongos chunks balance b/w database and collections | |
mongos.shards | Mongos Shards | NULL | Total number of mongos shards | |
mongos.collections | Mongos Collections | NULL | Total number of mongos sharded collections |
Agent G2 - Linux - MongoDB Shard(Mongos) Monitors
Description
Monitors the mongos metric. This requires the python-pymongo package to be install. Check OpsRamp documents for more information.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - MongoDB Shard Mongos Monitors | mongos.chunks.balance | Mongos Chunks Balance | NULL | Provides Mongos chunks balance b/w database and collections |
mongos.shards | Mongos Shards | NULL | Total number of mongos shards | |
mongos.chunks | Mongos Chunks | NULL | Total number of mongos chunks | |
mongos.collections | Mongos Collections | NULL | Total number of mongos sharded collections |
Agent G2 - Linux - MongoDB Status and Performance Check
Description
Monitoring Template for Mongo database application. This monitor requires the python-pymongo package to be installed on the server. Check OpsRamp documentation for more information.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - MongoDB Status and Performance Check | mongodb.networkrequests | MongoDB-NetworkRequests | NULL | Network Requests per second |
mongodb.size | MongoDB-Size | NULL | Provides the size of individual databases in Gigabytes | |
mongodb.activeclients | MongoDB-ActiveClients | NULL | Current active, read and write clients | |
mongodb.pagefaults | MongoDB-PageFaults | NULL | Page Faults | |
mongodb.replicaprimary | MongoDB-ReplicaPrimary | NULL | Check if the primary server of a replica set has changed | |
mongodb.btree | MongoDB-BTree | NULL | Provides Index Counters details as Accesses, Hits, Resets, Miss_ratio | |
mongodb.replicationlag | MongoDB-ReplicationLag | NULL | Checks the replication lag | |
mongodb.opcounters | MongoDB-OPCounters | NULL | Provides opcounts for operations like insert, update, query, delete, getmore, command | |
mongodb.lock | MongoDB-Lock | NULL | Lock percentage | |
mongodb.asserts | MongoDB-Asserts | NULL | The number of regular, warning, msg, user and rollovers asserts raised since this process started | |
mongodb.cursors | MongoDB-Cursors | NULL | The number of cursors that the server is maintaining for clients and that have timed out since this server was started | |
mongodb.uptime | MongoDB-Uptime | NULL | Uptime in minutes | |
mongodb.heapusage | MongoDB-HeapUsage | NULL | Heap Usage | |
mongodb.connections | MongoDB-Connections | NULL | MongoDB current and available connections | |
mongodb.replicationstate | MongoDB-ReplicationState | NULL | Replication states will be the following cases: StartingPhase1(0), Primary(1), Secondary(2), Recoverying(3), Fatal Error(4), StartingPhase2(5), Arbiter(7), Down(8), Not_Running_with_Replication(-1) | |
mongodb.indexsize | MongoDB-IndexSize | NULL | Provides the Index size of databases in KBs | |
mongodb.background_avg_flush | MongoDB-BackgroundAvgFlush | NULL | mongod writes to and flushes (fsyncs) the journal immediately. Data files are flushes only occasionally and in the background. By default these flushes occur every 60 seconds. | |
mongodb.memory | MongoDB-Memory | NULL | MongoDB resident, virtual, mapped memory | |
mongodb.currentqueue | MongoDB-CurrentQueue | NULL | Current readers and writers in Queue | |
mongodb.journaled_status | MongoDB-JournaledStatus | NULL | The average amount of data in megabytes written to the recovery log in the last four seconds is the JournaledMB and the data written to the databases datafiles in the last four seconds is writeToDataFilesMB. |
Agent G2 - Linux - MongoDB(API) Monitors
Description
Monitors the mongodb by getting the metrics using mongo API
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - MongoDB API Monitors | mongo.index.size | MongoDB Index Size | KB | Provides the Index size of databases in KBs |
mongo.memory | MongoDB Memory | NULL | Mongo DB resident, virtual, mapped memory | |
mongo.size | MongoDB Size | GB | Provides the size of individual databases in Gigabytes | |
mongo.current.queue | MongoDB Current Queue | NULL | Current readers and writers in Queue | |
mongo.heap.usage | MongoDB Heap Usage | NULL | Heap Usage | |
mongo.index.miss.ratio | MongoDB Index Miss Ratio | NULL | Provides the Mongo DB index miss ratio | |
mongo.asserts | MongoDB Asserts | NULL | Provides the Mongo DB asserts | |
mongo.network.requests | MongoDB Network Requests | NULL | Network Requests per second | |
mongo.connections | MongoDB Connections | NULL | Mongo DB current and available connections | |
mongo.traffic | MongoDB Traffic | NULL | MongoDB current available traffic | |
mongo.uptime | MongoDB Uptime | NULL | Uptime in minutes | |
mongo.active.clients | MongoDB Active Clients | NULL | Current active, read and write clients | |
mongo.journaled.status | MongoDB Journaled Status | NULL | Provides the Mongo DB journaled status | |
mongo.lock.percent | MongoDB Lock Percent | NULL | Lock percentage | |
mongo.avg.flush | MongoDB Background Avg Flush | NULL | Provides the background average flush of databases | |
mongo.pagefaults | MongoDB Page Faults | NULL | Page Faults | |
mongo.replication.lag | MongoDB Replication Lag | NULL | Checks the replication lag | |
mongo.journal.commits | MongoDB Journal Commits In WL | NULL | Provides the number of journal commits that occurred in the write lock | |
mongo.btree | MongoDB BTree | NULL | Provides Index Counters details as Accesses, Hits, Misses, Resets, Miss_ratio | |
mongo.cursors | MongoDB Cursors | NULL | Provides the Mongo DB cursors | |
mongo.op.counters | MongoDB OP Counters | NULL | Provides opcounts for operations like insert, update, query, delete, getmore, command |
Agent G2 - Linux - Mongos Monitors
Description
Monitors the mongos metric
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Mongos Monitors | mongos.chunks | Mongos Chunks | NULL | Total number of mongos chunks |
mongos.chunks.balance | Mongos Chunks Balance | NULL | Provides Mongos chunks balance between database and collections | |
mongos.shards | Mongos Shards | NULL | Total number of mongos shards | |
mongos.collections | Mongos Collections | NULL | Total number of mongos sharded collections |
Agent G2 - Linux - MySQL Advanced Monitors
Description
It monitors the mysql.Innodb_log_waits, mysql.long_running_procs, and mysqlindex_usage metrics.
Prerequisites
Provide database names as input arguments separated by commas when applying the template at the device level.
Syntax: DBName1,DBName2
Parameters
Name | Description | Default value |
---|---|---|
MySQL DBname | Dbnames to collect the metric data | NA |
MySQL IPAddress | IPAddress of the server where mysql is running | 127.0.0.1 |
MySQL Password | Password of the given username | NA |
MySQL Port | Port on which mysql is running | 3306 |
MySQL Username | Username to connect to mysql | NA |
Note: All field attributes are mandatory. Please use default values wherever applicable.
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - MySQL Advanced Monitors | mysql.Innodb_log_waits | Mysql Innodb log waits | NULL | The number of times that the log buffer was too small, and a wait was required for it to be flushed before continuing. |
mysql.long_running_procs | Mysql long running process | NULL | Total running process longer than 1 minute in a specific db. | |
mysqlindex_usage | Mysql Index Usage | NULL | The sum of index lengths for the tables within the specified database. |
Agent G2 - Linux - MySQL Global Performance Statistics
Description
Monitoring template for MySQL application. Monitors command statistics, select statistics, slave status, threads, etc.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - MySQL Global Performance Statistics | mysql.traffic | MySQL-Traffic | NULL | |
mysql.threads_connected | MySQL-ThreadsConnected | NULL | The number of currently open connections. | |
mysql.command_statistics | MySQL-CommandStatistics | NULL | ||
mysql.select_statistics | MySQL-SelectStatistics | NULL | ||
mysql.slave_status | MySQL-SlaveStatus | NULL | If a database has replication configured, then this monitor checks the status of the slave instance which is returned by the command SHOW SLAVE STATUS. It also checks the number of seconds the slave is behind the master and if IO and SQL is running on the slave | |
mysql.threads | MySQL-Threads | NULL | Cached threads - The number of threads in the thread cache. Connected threads - The number of currently open connections. Running threads- The number of threads that are not sleeping. |
Agent G2 - Linux - MySQL InnoDB Statistics
Description
Monitoring template for MySQL InnoDB application statistics. Monitors InnoDB buffer pool hit rate, buffer pool pages data, buffer pool pages dirty, etc.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - MySQL InnoDB Statistics | mysql.inno.db_buffer_pool_pages_free | MySQL-InnoDBBufferPoolPagesFree | NULL | The number of free pages. |
mysql.inno.db_data_read | MySQL-InnoDBDataRead | NULL | The amount of data read since the server was started. | |
mysql.inno.db_buffer_pool_pages_flushed | MySQL-InnoDBBufferPoolPagesFlushed | NULL | The number of buffer pool page-flush requests. | |
mysql.inno.db_log_flushed_upto | MySQL-InnoDBLogFlushedUpto | NULL | Log flushed up to. | |
mysql.mutex_spin_waits | MySQL-MutexSpinWaits | NULL | Semaphores-Mutex Spin Waits. | |
mysql.inno.db_data_writes | MySQL-InnoDBDataWrites | NULL | The total number of data writes. | |
mysql.inno.db_buffer_pool_hit_rate | MySQL-InnoDBBufferPoolHitRate | NULL | Buffer Pool hit rate. | |
mysql.inno.db_data_written | MySQL-InnoDBDataWritten | NULL | The amount of data written so far, in bytes. | |
mysql.inno.db_row_lock_time | MySQL-InnoDBRowLockTime | NULL | The total time spent in acquiring row locks, in milliseconds. | |
mysql.log_sequence_number | MySQL-LogSequenceNumber | NULL | Log Sequence number. | |
mysql.queries_inside_innodb_core | MySQL-QueriesInsideInnoDBCore | NULL | Row Operations - Queries inside InnoDB core. | |
mysql.inno.dbio_capacity | MySQL-InnoDBIOCapacity | NULL | ||
mysql.inno.db_transaction_history_length | MySQL-InnoDBTransactionHistoryLength | NULL | ||
mysql.inno.db_flush_log_at_trx_commit | MySQL-InnoDBFlushLogAtTRXCommit | NULL | ||
mysql.inno.db_buffer_pool_pages_misc | MySQL-InnoDBBufferPoolPagesMisc | NULL | The number of pages that are busy because they have been allocated for administrative overhead such as row locks or the adaptive hash index. | |
mysql.inno.db_queries_queue | MySQL-InnoDBQueriesQueue | NULL | Row Operations - Queries in queue. | |
mysql.inno.db_row_lock_waits | MySQL-InnoDBRowLockWaits | NULL | The number of times a row lock had to be waited for. | |
mysql.inno.db_data_reads | MySQL-InnoDBDataReads | NULL | The total number of data reads. | |
mysql.open_read_views_inside_innodb_core | MySQL-OpenReadViewsInsideInnoDBCore | NULL | Row Operations - Read views open inside InnoDB. | |
mysql.mutex_spin_rounds | MySQL-MutexSpinRounds | NULL | Semaphores-Mutex Spin Rounds. | |
mysql.inno.db_buffer_pool_pages_data | MySQL-InnoDBBufferPoolPagesData | NULL | The number of pages containing data (dirty or clean). | |
mysql.inno.db_buffer_pool_pages_dirty | MySQL-InnoDBBufferPoolPagesDirty | NULL | The number of pages currently dirty. | |
mysql.inno.db_log_last_check_point | MySQL-InnoDBLogLastCheckPoint | NULL | Log Last check point. | |
mysql.inno.db_buffer_pool_read_requests | MySQL-InnoDBBufferPoolReadRequests | NULL | The number of logical read requests InnoDB has done. | |
mysql.inno.db_non_hash_searches | MySQL-InnoDBNonHashSearches | NULL | Non-Hash Searches. | |
mysql.inno.db_hash_searches | MySQL-InnoDBHashSearches | NULL | Hash Searches. | |
mysql.inno.db_force_recovery | MySQL-InnoDBForceRecovery | NULL | Crash recovery mode, the possible values are 0-6: Normal Startup(0), Server Starts even if it detects a corrupt page(1), Prevent Master Thread from running(2), No Transaction Rollbacks after crash recovery(3), Prevent insert buffer merge Operations(4), Ignores undo logs-InnoDB treats incomplete Transactions as Committed(5), Ignores Redo log roll-forward in connection with recovery.(6). | |
mysql.inno.db_buffer_pool_size | MySQL-InnoDBBufferPoolSize | NULL | The size in bytes of the memory buffer InnoDB uses to cache data and indexes of its tables. The default value is 8MB. The larger you set this value, the less disk I/O is needed to access data in tables. | |
mysql.inno.db_data_fsyncs | MySQL-InnoDBDataFSyncs | NULL | The number of fsync operations so far. | |
mysql.inno.db_buffer_pool_reads | MySQL-InnoDBBufferPoolReads | NULL | The number of logical reads that InnoDB could not satisfy from the buffer pool, and had to read directly from the disk. | |
mysql.inno.db_log_file_size | MySQL-InnoDBLogFileSize | NULL | The size in bytes of each log file in a log group. The combined size of log files must be less than 4GB. The default value is 5MB. Sensible values range from 1MB to 1/N-th of the size of the buffer pool, where N is the number of log files in the group. | |
mysql.mutex_os_waits | MySQL-MutexOSWaits | NULL | Semaphores-Mutex OS Waits. | |
mysql.inno.db_buffer_pool_pages_total | MySQL-InnoDBBufferPoolPagesTotal | NULL | The total size of the buffer pool, in pages. | |
mysql.inno.db_log_buffer_size | MySQL-InnoDBLogBufferSize | NULL | The size in bytes of the buffer that InnoDB uses to write to the log files on disk. The default value is 1MB for the built-in InnoDB, 8MB for InnoDB Plugin. Sensible values range. |
Agent G2 - Linux - MySQL Monitors
Description
Monitoring template for MySQL application. Monitors command statistics, select statistics, slave status, threads, etc.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - MySQL Monitors | mysql.traffic | MySQL-Traffic | NULL | |
mysql.threads_connected | MySQL-ThreadsConnected | NULL | The number of currently open connections. | |
mysql.command_statistics | MySQL-CommandStatistics | NULL | ||
mysql.select_statistics | MySQL-SelectStatistics | NULL | ||
mysql.slave_status | MySQL-SlaveStatus | NULL | If a database has replication configured, then this monitor checks the status of the slave instance which is returned by the command SHOW SLAVE STATUS. It also checks the number of seconds the slave is behind the master and if IO and SQL is running on the slave | |
mysql.threads | MySQL-Threads | NULL | Cached threads - The number of threads in the thread cache. Connected threads - The number of currently open connections. Running threads- The number of threads that are not sleeping. |
Agent G2 - Linux - MySQL Variable Statistics
Description
Monitoring template for MySQL application statistics. Monitors aborted clients, aborted connects, command statistics, foreign key checks, log warnings, etc.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - MySQL Variable Statistics | mysql.threads_created | MySQL-ThreadsCreated | NULL | The number of threads created to handle connections. |
mysql.query_cache_free_memory | MySQL-QueryCacheFreeMemory | NULL | The amount of free memory for the query cache. | |
mysql.max_heap_table_size | MySQL-MaxHeapTableSize | NULL | This variable sets the maximum size to which user-created MEMORY tables are permitted to grow. The value of the variable is used to calculate MEMORY table MAX_ROWS values. | |
mysql.threads_cached | MySQL-ThreadsCached | NULL | The number of threads in the thread cache. | |
mysql.traffic | MySQL-Traffic | NULL | ||
mysql.long_query_time | MySQL-LongQueryTime | NULL | If a query takes longer than this many seconds, the server increments the Slow_queries status variable. | |
mysql.tmp_table_size | MySQL-TMPTableSize | NULL | The maximum size of internal in-memory temporary tables. If an in-memory temporary table exceeds the limit, MySQL automatically converts it to an on-disk MyISAM table. | |
mysql.queries | MySQL-Queries | NULL | ||
mysql.aborted_connects | MySQL-AbortedConnects | NULL | The number of failed attempts to connect to the MySQL server. | |
mysql.query_cache_size | MySQL-QueryCacheSize | NULL | Max amount of data that can be stored in the cache. | |
mysql.select_statistics | MySQL-SelectStatistics | NULL | ||
mysql.threads_running | MySQL-ThreadsRunning | NULL | The number of threads that are not sleeping. | |
mysql.query_cache_total_blocks | MySQL-QueryCacheTotalBlocks | NULL | The total number of blocks in the query cache. | |
mysql.query_cache_free_blocks | MySQL-QueryCacheFreeBlocks | NULL | The number of free memory blocks in the query cache. | |
mysql.max_used_connections | MySQL-MaxUsedConnections | NULL | Max number of connections that have been in use simultaneously since the server started. | |
mysql.query_cache_low_memory_purnes | MySQL-QueryCacheLowMemoryPurnes | NULL | The number of queries that were deleted from the query cache because of low memory. | |
mysql.slave_status | MySQL-SlaveStatus | NULL | If a database has replication configured, then this monitor checks the status of the slave instance which is returned by the command SHOW SLAVE STATUS. It also checks the number of seconds the slave is behind the master and if IO and SQL is running on the slave. | |
mysql.thread_cache_size | MySQL-ThreadCacheSize | NULL | How many threads the server should cache for reuse. When a client disconnects, the client's threads are put in the cache if there are fewer than this monitor value. | |
mysql.query_cache_not_cached | MySQL-QueryCacheNotCached | NULL | The number of noncached queries. | |
mysql.query_cache_inserts | MySQL-QueryCacheInserts | NULL | The number of queries added to the query cache. | |
mysql.table_definition_cache | MySQL-TableDefinitionCache | NULL | The number of table definitions (from .frm files) that can be stored in the definition cache. | |
mysql.log_warnings | MySQL-LogWarnings | NULL | Whether to produce additional warning messages. It is enabled (1) by default and can be disabled by setting it to 0. | |
mysql.aborted_clients | MySQL-AbortedClients | NULL | The number of connections that were aborted because the client died without closing the connection properly. | |
mysql.query_cache_queries_in_cache | MySQL-QueryCacheQueriesInCache | NULL | The number of queries registered in the query cache. | |
mysql.threads_connected | MySQL-ThreadsConnected | NULL | The number of currently open connections. | |
mysql.foreign_key_checks | MySQL-ForeignKeyChecks | NULL | By default foreign key constraints for InnoDB tables are checked and the value will be 1. If set to 0, they are ignored. Disabling foreign key checking can be useful for reloading InnoDB tables in an order different from that required by their parent/child relationships. | |
mysql.query_cache_hits | MySQL-QueryCacheHits | NULL | The number of query cache hits. | |
mysql.open_table_cache | MySQL-OpenTableCache | NULL | The number of open tables for all threads. | |
mysql.max_user_connections | MySQL-MaxUserConnections | NULL | Max number of simultaneous connections permitted to any given MySQL user. Default value is 0 (no limit). | |
mysql.command_statistics | MySQL-CommandStatistics | NULL | ||
mysql.thread_stack_size | MySQL-ThreadStackSize | NULL | The stack size for each thread. Many of the limits detected by the crash-me test are dependent on this value. |
Agent G2 - Linux - NFS Stats Monitors
Description
Agent G2 - Linux - NFS Stats Monitors
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - NFS Stats Monitors | nfs.lookup.ops | NFS Lookup Operations | NULL | The number of lookup operations per second issued to the filesystem. |
nfs.lookup.exe | NFS Lookup Op Request Execution Time | NULL | The duration from the time that NFS client does the RPC lookup request to its kernel until the RPC request is completed, this includes the RTT time above. | |
nfs.server.write.mbytes | NFS Server Written Data | NULL | The number megabytes written to the NFS server by the NFS client via an NFS WRITE request. | |
nfs.read.retransmissions | NFS Read Retransmissions | NULL | The number of retransmissions. | |
nfs.write.rtt | NFS Write Op Request RTT Time | NULL | The duration from the time that client kernel sends the RPC write request until the time it receives the reply. | |
nfs.read.ops | NFS Read Operations | NULL | The number of read operations per second issued to the filesystem. | |
nfs.lookup.retransmissions | NFS Lookup Retransmissions | NULL | The number of retransmissions. | |
nfs.lookup.kbytes | NFS Lookup Op Data | NULL | The amount of lookup data, in kB, per second. | |
nfs.write.ops | NFS Write Operations | NULL | The number of write operations per second issued to the filesystem. | |
nfs.directory.lookups | NFS Directory Lookups | NULL | The number of name lookups in directories there have been. | |
nfs.write.kbytes | NFS Write Op Data | NULL | The amount of data, in kB, written per second. | |
nfs.app.write.mbytes | NFS Application Written Data | NULL | The number of megabytes written by applications using the NFS mounted filesystem with the write(2) system call. | |
nfs.app.write.direct.mbytes | NFS Application Written (O_DIRECT) Data | NULL | The number of megabytes written to files opened with the O_DIRECT flag. | |
nfs.backlog | NFS Backlog | NULL | The length of the backlog queue. | |
nfs.read.kbytes | NFS Read Op Data | NULL | The amount of data, in kB, read per second. | |
nfs.server.read.mbytes | NFS Server Read Data | NULL | The number megabytes read from the NFS server by the NFS client via an NFS READ request. | |
nfs.lookup.rtt | NFS Lookup Op Request RTT Time | NULL | The duration from the time that client kernel sends the RPC lookup request until the time it receives the reply. | |
nfs.open.operations | NFS Open Operations | NULL | The number of times files or directories have been opened. | |
nfs.write.size | NFS Write Op Data Size | NULL | The average number of kB written per each operation. | |
nfs.write.exe | NFS Write Op Request Execution Time | NULL | The duration from the time that NFS client does the RPC write request to its kernel until the RPC request is completed, this includes the RTT time above. | |
nfs.read.size | NFS Read Op Data Size | NULL | The average number of kB read per each operation. | |
nfs.app.read.mbytes | NFS Application Read Data | NULL | The number of megabytes read by applications using the NFS mounted filesystem with the read(2) system call. | |
nfs.ops | NFS Operations | NULL | The number of operations per second issued to the filesystem. | |
nfs.age | NFS Mount Age | NULL | The duration of time that the share was mounted in minutes. | |
nfs.lookup.size | NFS Lookup Op Data Size | NULL | The average amount of data size in kB for each lookup operation. | |
nfs.rpc.badxids | NFS RPC Bad XIDs | NULL | The number of unmatchable XIDs that have been received. | |
nfs.read.rtt | NFS Read Op Request RTT Time | NULL | The duration from the time that client kernel sends the RPC read request until the time it receives the reply. | |
nfs.app.read.direct.mbytes | NFS Application Read(O_DIRECT) Data | NULL | The number of megabytes read from files opened with the O_DIRECT flag. | |
nfs.read.exe | NFS Read Op Request Execution Time | NULL | The duration from the time that NFS client does the RPC read request to its kernel until the RPC request is completed, this includes the RTT time above. | |
nfs.write.retransmissions | NFS Write Retransmissions | NULL | The number of retransmissions. |
Agent G2 - Linux Network Interface
Description
Monitors the performance and status of network interfaces on Linux systems. It captures and analyzes ten essential metrics, including traffic, packet rates, errors, discards, operational states, and output queue lengths.
Prerequisites
NA
Supported Metrics
Metric Name | Metric Display Name | Metric Description | Unit | |
---|---|---|---|---|
Agent G2 - Linux Network Interface | system_linux_network_interface_trafficIn | System Linux Network Interface TrafficIn | Monitors the incoming network traffic on a Linux network interface. It indicates the amount of data received by the interface. | Kbps |
system_linux_network_interface_trafficOut | System Linux Network Interface TrafficOut | Monitors the outgoing network traffic on a Linux network interface. It indicates the amount of data transmitted by the interface. | Kbps | |
system_linux_network_interface_packetsIn | System Linux Network Interface PacketsIn | Monitors the incoming network packets (number of packets) on a Linux network interface. It counts the number of packets received by the interface. | packets/sec | |
system_linux_network_interface_packetsOut | System Linux Network Interface PacketsOut | Monitors the outgoing network packets (number of packets) on a Linux network interface. It counts the number of packets transmitted by the interface. | packets/sec | |
system_linux_network_interface_errorsIn | System Linux Network Interface ErrorsIn | Monitors the incoming network errors (number of errors) on a Linux network interface. It counts the number of errors encountered while receiving data. | Errors per Sec | |
system_linux_network_interface_errorsOut | System Linux Network Interface ErrorsOut | Monitors the outgoing network errors (number of errors) on a Linux network interface. It counts the number of errors encountered while transmitting data. | Errors per Sec | |
system_linux_network_interface_discardsIn | System Linux Network Interface DiscardsIn | Monitors the incoming discarded network packets (number of discarded packets) on a Linux network interface. It counts the number of packets that were discarded upon receipt. | psec | |
system_linux_network_interface_discardsOut | System Linux Network Interface DiscardsOut | Monitors the outgoing discarded network packets (number of discarded packets) on a Linux network interface. It counts the number of packets that were discarded upon transmission. | psec | |
system_linux_network_interface_operationalState | System Linux Network Interface OperationalState | Monitors operational state of a Linux network interface. It typically provides information such as whether the interface is up or down. | NA | |
system_linux_network_interface_outputQueueLength | System Linux Network Interface OutputQueueLength | Monitors the length of the output queue on a Linux network interface. It indicates the number of packets queued up for transmission. | count |
Agent G2 - Linux NFS Mount Point Monitoring - v5
Description
Monitors the Linux NFS mount points availability, accessibility, and utilization. In Version 5 of the template, we introduced enhanced functionality to exclude specific users given in input while checking mount point writability. Additionally, users now have the flexibility to choose between monitoring all mounts or exclusively focusing on permanent mount points(i.e. only nfs mounts present in /etc/fstab).Pre-Requisites: NFS needs to be installed and 1 or more NFS Mount points should be available on target device. Agent must be installed as root.
Template Usage Guidelines
While applying this template on the device, users need to provide specific input parameters in below two formats only(Case-Insensitive) -
Format 1:
MountType:All;ExcludedUsers:nobody,nfsnobody,anon
(This format monitors both temporary and permanent mount points, this is the default value)
Format 2:
MountType:Permanent;ExcludedUsers:nobody,nfsnobody,anon
(This format monitors only Permanent Mounts which are available in \etc\fstab file)
Prerequisites
NFS needs to be installed and NFS Mount points should be available on target device. Agent must be installed as root.
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux NFS Mount Point Custom Monitor - v5 | system_linux_nfs_mountpoint_utilization | System Linux NFS Mountpoint Utilization | Percentage | Monitors the NFS Mount points utilization. |
system_linux_nfs_mountpoint_accessibility | System Linux NFS Mountpoint Accessibility | - | Monitors the accessibility of NFS Mount points. An accessible NFS mount point typically means both read and write access(marked as "0" in output), and inaccessibility (marked as "1" in output) often means it's not writable. | |
system_linux_nfs_mountpoint_availability | System Linux NFS Mountpoint Availability | - | Monitors the availability of NFS Mount points. The availability status of mount points is often determined by checking whether the NFS mount point is present in the current list of NFS shares compared to the list stored in the previous state. If a mount point was present in the previous state but is not in the current state, it is marked as "1" (not available). If it is present in both states, it is marked as "0" (available). |
Agent G2 - Linux NFS Mount Point Monitoring - v4
Description
Monitors the Linux NFS mount points availability, accessibility, and utilization.In Version 4 of the template, we introduced enhanced functionality to monitor user-specific mounts accessibility. Additionally, users now have the flexibility to choose between monitoring all mounts or exclusively focusing on permanent mount points (i.e. only nfs mounts present in /etc/fstab).
Template Usage Guidelines:
While applying this template on the device, users need to provide specific input parameters in below two formats only (Case-Insensitive).
Format 1: All (This format monitors both temporary and permanent mount points)
Format 2: Permanent (This format monitors only Permanent Mounts which are available in \etc\fstab file)
Prerequisites
NFS needs to be installed and NFS Mount points should be available on target device.
Supported Metric
Monitor Name | Monitor Description | Metric Name | Metric Display Name | Unit | Metric Description |
---|---|---|---|---|---|
Agent G2 - Linux NFS Mount Point Custom Monitor - v4 | Monitors the Linux NFS mount points availability, accessibility, and utilization | system_linux_nfs_mountpoint_utilization | System Linux NFS Mountpoint Utilization | Percentage | Monitors the NFS Mount points utilization. |
system_linux_nfs_mountpoint_accessibility | System Linux NFS Mountpoint Accessibility | Monitors the accessibility of NFS Mount points. An accessible NFS mount point typically means both read and write access(marked as “0” in output), and inaccessibility (marked as “1” in output) often means it’s not writable. | |||
system_linux_nfs_mountpoint_availability | System Linux NFS Mountpoint Availability | Monitors the availability of NFS Mount points.
The availability status of mount points is often determined by checking whether the NFS mount point is present in the current list of NFS shares compared to the list stored in the previous state. If a mount point was present in the previous state but is not in the current state, it is marked as “1” (not available). If it is present in both states, it is marked as “0” (available). |
Agent G2 - Linux - Nginx Monitors
Description
Monitors Nginx application metrics
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Nginx Monitors | nginx.connections.writing | Nginx Writing Connections | NULL | Nginx reads the request body, processes the request or writes response to a client. |
nginx.perconnections_requests | Nginx Requests per connection | Requests / conn | Average number of requests per connection. | |
nginx.connections.reading | Nginx Reading Connections | NULL | Nginx reads the request header. | |
nginx.connections.active | Nginx Connections | Active Connections | Number of all open connections including connections to backends. | |
nginx.requests.handled | Nginx Handled Requests | requests | The number of requests served by connections handled by Nginx. | |
nginx.requests_rate | Nginx Requests Rate | requests / sec | Average number of requests per second. | |
nginx.connections_rate | Nginx Connections Rate | Connections / sec | Average number of connections per second. | |
nginx.response.time | Nginx Response Time | Seconds | Time taken for Nginx request-response. | |
nginx.connections.waiting | Nginx Waiting Connections | NULL | Keep-alive connections, actually it will be active - (reading + writing). |
Agent G2 - Linux - NGINX Perfomance Check
Description
Monitors Nginx application metrics. This requires Nginx to be compiled with the Stub Status module.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - NGINX Perfomance Check | nginx.requests.handled | Nginx Handled Requests | NULL | The number of requests served by connections handled by Nginx. |
nginx.connections.reading | Nginx Reading Connections | NULL | Nginx reads the request header. | |
nginx.response.time | Nginx Response Time | NULL | Time taken for Nginx request-response. | |
nginx.requests_rate | Nginx Requests Rate | NULL | Average number of requests per second. | |
nginx.connections.writing | Nginx Writing Connections | NULL | Nginx reads the request body, processes the request or writes response to a client. | |
nginx.perconnections_requests | Nginx Requests per connection | NULL | Average number of requests per connection. | |
nginx.connections.waiting | Nginx Waiting Connections | NULL | Keep-alive connections, actually it will be active - (reading + writing). | |
nginx.connections_rate | Nginx Connections Rate | NULL | Average number of connections per second. | |
nginx.connections.active | Nginx Connections | NULL | Number of all open connections including connections to backends. |
Agent G2 - Linux - NTP Monitoring Offset
Description
Monitoring delay in seconds
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - NTP Monitoring Offset | ntp.offset | Ntp Offset | NULL | The time difference between the local clock and the NTP reference clock |
Agent G2 - Linux NTP Statistics
Description
Monitors the NTP offset and jitter values of only system peer (the remote NTP server marked with a * at the beginning) for Linux systems.
Prerequisites
NTP needs to be installed and need Root privileges to execute the script.
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux NTP Statistics | system_linux_NTP_offset | System Linux NTP Offset | milliseconds | Monitors the time offset between the system clock and the Network Time Protocol (NTP) reference, indicating how much the system clock is ahead or behind the NTP reference. |
system_linux_NTP_jitter | System Linux NTP Jitter | milliseconds | Monitors the jitter in the Network Time Protocol (NTP) synchronization, which reflects the variability in timekeeping accuracy. It measures the short-term fluctuations in the time offset between the system clock and the NTP reference. |
Agent G2 - Linux - Nvidia GPU Monitoring
Description
Monitors Nvidia GPU Metrics like Gpu utilization, Power usage, Memory usage and Temperature per Gpu instance
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Nvidia Gpu Monitor | nvidia_dcgm_fb_mem_used | Nvidia Dcgm Framebuffer Memory Used | Percentage | Framebuffer Memory Used |
nvidia_dcgm_memory_temp | Nvidia Dcgm Memory Temp | Celsius | Memory temperature | |
nvidia_dcgm_gpu_util | Nvidia Dcgm Gpu Util | Percentage | GPU utilization | |
nvidia_dcgm_mem_copy_util | Nvidia Dcgm Mem Util | Percentage | Memory utilization | |
nvidia_dcgm_gpu_temp | Nvidia Dcgm Gpu Temp | Celsius | GPU temperature | |
nvidia_dcgm_power_usage | Nvidia Dcgm Power Usage | Watts | Power draw | |
nvidia_dcgm_mem_clock | Nvidia Dcgm Mem Clock Freq | Megahertz | Memory clock frequency |
Agent G2 - Linux - OKD ApiServer
Description
Template for monitoring OKD through Kubernetes API server
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - OKD ApiServer | apiserver.rest.client.requests.total.count | Kube apiserver Rest Client Requests Total Count | NULL | Number of HTTP requests, partitioned by status code, method, and host. |
apiserver.dropped.requests.total.count | Kube apiserver Dropped Requests Total Count | NULL | Monotonic count of requests dropped with Try-again-later response. | |
apiserver.request.duration.seconds.bucket | Kube apiserver Request Duration Seconds Bucket | histogram | Response latency distribution in seconds for each verb, dry run value, group, version, resource, subresource, scope, and component. | |
apiserver.request.count | Kube apiserver Request Count | NULL | Counter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code. | |
apiserver.APIServiceRegistrationController.depth | Kube apiserver APIService Registration Controller Depth | NULL | Current depth of workqueue: APIServiceRegistrationController. | |
apiserver.rest.client.requests.total | Kube apiserver Rest Client Requests Total | NULL | Number of HTTP requests, partitioned by status code, method, and host. | |
apiserver.audit.event.total | Kube apiserver Audit Event Total | NULL | Counter of audit events generated and sent to the audit backend. | |
apiserver.go.goroutines | Kube apiserver Goroutines | NULL | Number of goroutines that currently exist. | |
apiserver.inflight.requests | Kube apiserver Inflight Requests | NULL | Maximal number of currently used inflight request limit of this apiserver per request kind in the last second. | |
apiserver.go.threads.total | Kube apiserver Go Threads Total | NULL | Number of OS threads created. | |
apiserver.authenticated.user.requests | Kube apiserver Authenticated User Requests | NULL | Counter of authenticated requests broken out by username. | |
apiserver.etcd.object.counts | Kube apiserver ETCD Object Counts | NULL | Number of stored objects at the time of the last check split by kind. | |
apiserver.http.requests.total.count | Kube apiserver HTTP Requests Total Count | NULL | Total number of HTTP requests made. | |
apiserver.request.count.count | Kube apiserver Request Count Count | NULL | Counter of apiserver requests broken out for each verb, group, version, resource, scope, component, client, and HTTP response contentType and code. | |
apiserver.authenticated.user.requests.count | Kube apiserver Authenticated User Requests Count | NULL | Counter of authenticated requests broken out by username. | |
apiserver.http.requests.total | Kube apiserver HTTP Requests Total | NULL | Total number of HTTP requests made. |
Agent G2 - Linux - OKD CoreDNS
Description
Kubernetes CoreDNS
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - OKD CoreDNS | coredns.request_duration.seconds.sum | Request Duration Seconds Sum | NULL | Duration to process each query. |
coredns.query.count | Query count | NULL | Total query count. | |
coredns.response_size.bytes.sum | Response Size Bytes Sum | NULL | Size of the returned response in bytes. | |
coredns.request_duration.seconds.count | Request Duration Seconds Count | NULL | Duration per upstream interaction. | |
coredns.panics | Total Panics | NULL | Total number of panics. |
Agent G2 - Linux - OKD Kube Controller
Description
Template for monitoring OKD using Kube Controller
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - OKD Kube Controller | controller.workqueue.work_unfinished_duration | Kube Controller Workqueue Unfinished Work Seconds | NULL | How many seconds of work has done that is in progress and hasn't been observed by work_duration. Large values indicate stuck threads. |
controller.go.goroutines | Kube Controller Go Goroutines | NULL | Number of goroutines that currently exist. | |
controller.workqueue.adds | Kube Controller Workqueue Adds Total | NULL | Total number of adds handled by workqueue. | |
controller.process.open_fds | Kube Controller Process Open Fds | NULL | Number of open file descriptors. | |
controller.workqueue.retries | Kube Controller Workqueue Retries Total | NULL | Total number of retries handled by workqueue. | |
controller.workqueue.nodes.count | Kube Controller Registered Nodes | NULL | Number of registered Nodes per zones. | |
controller.workqueue.work_duration.sum | Kube Controller Workqueue Work Duration Seconds Sum | NULL | How long in seconds processing an item from workqueue takes. | |
controller.workqueue.depth | Kube Controller Workqueue Depth | NULL | Current depth of workqueue. | |
controller.workqueue.work_duration.count | Kube Controller Workqueue Work Duration Seconds Count | NULL | Total time in seconds processing an item from workqueue takes. | |
controller.workqueue.nodes.evictions | Kube Controller Node Collector Evictions Number | NULL | Number of Node evictions that happened since the current instance of NodeController started. | |
controller.threads | Kube Controller OS Threads | NULL | Number of OS threads created. | |
controller.workqueue.work_longest_duration | Kube Controller Workqueue Longest Running Processor Seconds | NULL | How many seconds has the longest running processor for workqueue been running. | |
controller.rate_limiter.use | Kube Controller Node Lifecycle Controller Rate Limiter Use | NULL | A metric measuring the saturation of the rate limiter for node_lifecycle_controller. | |
controller.process.max_fds | Kube Controller Process Max Fds | NULL | Maximum number of open file descriptors. | |
controller.workqueue.queue_duration.sum | Kube Controller Workqueue Queue Duration Seconds Sum | NULL | How long in seconds an item stays in workqueue before being requested. | |
controller.workqueue.queue_duration.count | Kube Controller Workqueue Queue Duration Seconds Count | NULL | Total how long in seconds an item stays in workqueue before being requested. | |
controller.workqueue.nodes.unhealthy | Kube Controller Node Collector Unhealthy Nodes in Zone | NULL | Number of not Ready Nodes per zones. |
Agent G2 - Linux - Postfix Monitors
Description
Monitors postfix queues
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Postfix Monitors | postfix.queues.deferred | Postfix Queues Deferred | NULL | Postfix deferred mails queues count |
postfix.queues.bounce | Postfix Queues Bounce | NULL | Postfix bounce mails queues count | |
postfix.queues.maildrop | Postfix Queues Maildrop | NULL | Postfix maildrop mails queues count | |
postfix.queues.incoming | Postfix Queues Incoming | NULL | Postfix incoming mail queues count | |
postfix.queues.active | Postfix Queues Active | NULL | Postfix active mails queues count |
Agent G2 - Linux - OKD Kube Scheduler
Description
Template for monitoring OKD using Kube Scheduler
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - OKD Kube Scheduler | scheduler.client.http.requests_duration.count | Kube Scheduler Rest Client Request Latency Seconds Count | NULL | Total request latency in seconds. Broken down by verb and URL. |
scheduler.binding.latency.count | Kube Scheduler Binding Latency Microseconds Count | NULL | Total Binding latency in microseconds count. | |
scheduler.scheduling.scheduling_latency.count | Kube Scheduler Scheduling Latency Seconds Count | NULL | Scheduling latency in seconds split by sub-parts of the scheduling operation. | |
scheduler.scheduling.scheduling_duration.quantile | Kube Scheduler Scheduling Duration Seconds | NULL | Scheduling latency in seconds split by sub-parts of the scheduling operation. | |
scheduler.volume_scheduling_duration.sum | Kube Scheduler Volume Scheduling Duration Seconds Sum | NULL | Volume scheduling stage latency sum. | |
scheduler.volume_scheduling_duration.count | Kube Scheduler Volume Scheduling Duration Seconds Count | NULL | Volume scheduling stage latency count. | |
scheduler.go.goroutines | Kube Scheduler Go Goroutines | NULL | Number of goroutines that currently exist. | |
scheduler.pod_preemption.victims | Kube Scheduler Pod Preemption Victims | NULL | Number of selected preemption victims. | |
scheduler.scheduling.algorithm.preemption_duration.sum | Kube Scheduler Scheduling Algorithm Preemption Evaluation Sum | NULL | Scheduling algorithm preemption evaluation duration. | |
scheduler.gc_duration_seconds.sum | Kube Scheduler Go GC Duration Seconds Sum | NULL | A summary of the GC invocation durations. | |
scheduler.client.http.requests | Kube Scheduler Rest Client Requests Total | NULL | Number of HTTP requests, partitioned by status code, method, and host. | |
scheduler.e2e.scheduling_latency.sum | Kube Scheduler E2E Scheduling Latency Microseconds Sum | NULL | E2e scheduling latency in microseconds (scheduling algorithm + binding). | |
scheduler.scheduling.scheduling_duration.sum | Kube Scheduler Scheduling Duration Seconds Sum | NULL | Scheduling latency in seconds split by sub-parts of the scheduling operation. | |
scheduler.e2e.scheduling_duration.sum | Kube Scheduler E2E Scheduling Duration Seconds Sum | NULL | E2e scheduling latency in seconds (scheduling algorithm + binding). | |
scheduler.binding.latency.sum | Kube Scheduler Binding Latency Microseconds Sum | NULL | Binding latency in microseconds sum. | |
scheduler.scheduling.scheduling_duration.count | Kube Scheduler Scheduling Duration Seconds Count | NULL | Scheduling latency in seconds split by sub-parts of the scheduling operation. | |
scheduler.scheduling.scheduling_latency.quantile | Kube Scheduler Scheduling Latency Seconds | NULL | Scheduling latency in seconds split by sub-parts of the scheduling operation. | |
scheduler.e2e.scheduling_duration.count | Kube Scheduler E2E Scheduling Duration Seconds Count | NULL | Total E2e scheduling latency in seconds (scheduling algorithm + binding). | |
scheduler.binding.duration.count | Kube Scheduler Binding Duration Seconds Count | NULL | Total Binding duration in seconds count. | |
scheduler.threads | Kube Scheduler OS Threads | NULL | Number of OS threads created. | |
scheduler.binding.duration.seconds | Kube Scheduler Binding Duration Seconds Sum | NULL | Binding duration in seconds sum. | |
scheduler.pod_preemption.attempts | Kube Scheduler Total Preemption Attempts | NULL | Total preemption attempts in the cluster till now. | |
scheduler.process.open_fds | Kube Scheduler Process Open Fds | NULL | Number of open file descriptors. | |
scheduler.scheduling.scheduling_latency.sum | Kube Scheduler Scheduling Latency Seconds Sum | NULL | Scheduling latency in seconds split by sub-parts of the scheduling operation. | |
scheduler.client.http.requests_duration.sum | Kube Scheduler Rest Client Request Latency Seconds Sum | NULL | Request latency in seconds sum. Broken down by verb and URL. | |
scheduler.schedule_attempts.total | Kube Scheduler Schedule Attempts Total | NULL | Number of attempts to schedule pods, by the result. 'unschedulable' means a pod could not be scheduled, while 'error' means an internal scheduler problem. | |
scheduler.gc_duration_seconds.quantile | Kube Scheduler Go GC Duration Seconds | NULL | A summary of the GC invocation durations. | |
scheduler.scheduling.algorithm.preemption_duration.count | Kube Scheduler Scheduling Algorithm Preemption Evaluation Count | NULL | Scheduling algorithm preemption evaluation duration. | |
scheduler.scheduling.algorithm.predicate_duration.sum | Kube Scheduler Scheduling Algorithm Predicate Evaluation Sum | NULL | Scheduling algorithm predicate evaluation duration. | |
scheduler.scheduling.algorithm_duration.sum | Kube Scheduler Scheduling Algorithm Duration Seconds Sum | NULL | Scheduling algorithm latency in seconds sum. | |
scheduler.scheduling.algorithm.preemption_duration.count | Kube Scheduler Scheduling Algorithm Preemption Evaluation Count | NULL | Scheduling algorithm preemption evaluation duration. | |
scheduler.e2e.scheduling_latency.count | Kube Scheduler E2E Scheduling Latency Microseconds Count | NULL | Total E2e scheduling latency in microseconds (scheduling algorithm + binding). | |
scheduler.scheduling.algorithm.predicate_duration.count | Kube Scheduler Scheduling Algorithm Predicate Evaluation Count | NULL | Scheduling algorithm predicate evaluation duration. | |
scheduler.scheduling.algorithm_duration.count | Kube Scheduler Scheduling Algorithm Duration Seconds Count | NULL | Total Scheduling algorithm latency in seconds count. | |
scheduler.scheduling.algorithm.priority_duration.sum | Kube Scheduler Scheduling Algorithm Priority Evaluation Sum | NULL | Scheduling algorithm priority evaluation duration. | |
scheduler.scheduling.algorithm_latency.sum | Kube Scheduler Scheduling Algorithm Latency Microseconds Sum | NULL | Scheduling algorithm latency in microseconds sum. | |
scheduler.gc_duration_seconds.count | Kube Scheduler Go GC Duration Seconds Count | NULL | A summary of the GC invocation durations. | |
scheduler.scheduling.algorithm_latency.count | Kube Scheduler Scheduling Algorithm Latency Microseconds Count | NULL | Total Scheduling algorithm latency in microseconds count. | |
scheduler.cache.lookups | Kube Scheduler Equiv Cache Lookups Total | NULL | Total number of equivalence cache lookups, by whether or not a cache entry was found. | |
scheduler.process.max_fds | Kube Scheduler Process Max Fds | NULL | Maximum number of open file descriptors. |
Agent G2 - Linux - OKD Kube State
Description
Template for monitoring OKD using Kube State
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - OKD Kube State | kubernetes_state.node.cpu_allocatable | Node Cpu Allocatable | NULL | The CPU resources of a node that are available for scheduling. |
kubernetes_state.resourcequota.requests.storage.used | Resourcequota Requests Storage Used | NULL | Observed sum of storage bytes requested for a resource quota. | |
kubernetes_state.resourcequota.persistentvolumeclaims.limit | Resourcequota Persistentvolumeclaims Limit | NULL | Hard limit of the number of PVC for a resource quota. | |
kubernetes_state.node.cpu_capacity | Node Cpu Capacity | NULL | The total CPU resources of the node. | |
kubernetes_state.replicaset.replicas_desired | Replicaset Replicas Desired | NULL | Number of desired pods for a ReplicaSet. | |
kubernetes_state.resourcequota.pods.limit | Resourcequota Pods Limit | NULL | Hard limit of the number of pods for a resource quota. | |
kubernetes_state.replicaset.replicas_ready | Replicaset Replicas Ready | NULL | The number of ready replicas per ReplicaSet. | |
kubernetes_state.deployment.replicas_unavailable | Deployment Replicas Unavailable | NULL | The number of unavailable replicas per deployment. | |
kubernetes_state.resourcequota.requests.memory.limit | Resourcequota Requests Memory Limit | NULL | Hard limit on the total of memory bytes requested for a resource quota. | |
kubernetes_state.resourcequota.limits.cpu.limit | Resourcequota Limits Cpu Limit | NULL | Hard limit on the sum of CPU core limits for a resource quota. | |
kubernetes_state.resourcequota.requests.memory.used | Resourcequota Requests Memory Used | NULL | Observed sum of memory bytes requested for a resource quota. | |
kubernetes_state.replicaset.replicas | Replicaset Replicas | NULL | The number of replicas per ReplicaSet. | |
kubernetes_state.resourcequota.limits.memory.limit | Resourcequota Limits Memory Limit | NULL | Hard limit on the sum of memory bytes limits for a resource quota. | |
kubernetes_state.resourcequota.requests.cpu.used | Resourcequota Requests Cpu Used | NULL | Observed sum of CPU cores requested for a resource quota. | |
kubernetes_state.resourcequota.services.nodeports.limit | Resourcequota Services Nodeports Limit | NULL | Hard limit of the number of node ports for a resource quota. | |
kubernetes_state.daemonset.scheduled | Daemonset Scheduled | NULL | The number of nodes running at least one daemon pod and are supposed to. | |
kubernetes_state.deployment.replicas_updated | Deployment Replicas Updated | NULL | The number of updated replicas per deployment. | |
kubernetes_state.container.memory_requested | Container Memory Requested | NULL | The number of requested memory bytes by a container. | |
kubernetes_state.container.cpu_limit | Container Cpu Limit | NULL | The limit on cpu cores to be used by a container. | |
kubernetes_state.container.restarts | Container Restarts | NULL | The number of restarts per container. | |
kubernetes_state.resourcequota.pods.used | Resourcequota Pods Used | NULL | Observed number of pods used for a resource quota. | |
kubernetes_state.resourcequota.persistentvolumeclaims.used | Resourcequota Persistentvolumeclaims Used | NULL | Observed number of persistent volume claims used for a resource quota. | |
kubernetes_state.resourcequota.limits.memory.used | Resourcequota Limits Memory Used | NULL | Observed sum of limits for memory bytes for a resource quota. | |
kubernetes_state.deployment.replicas_available | Deployment Replicas Available | NULL | The number of available replicas per deployment. | |
kubernetes_state.daemonset.misscheduled | Daemonset Misscheduled | NULL | The number of nodes running a daemon pod but are not supposed to. | |
kubernetes_state.resourcequota.requests.cpu.limit | Resourcequota Requests Cpu Limit | NULL | Hard limit on the total of CPU core requested for a resource quota. | |
kubernetes_state.resourcequota.limits.cpu.used | Resourcequota Limits Cpu Used | NULL | Observed sum of limits for CPU cores for a resource quota. | |
kubernetes_state.resourcequota.services.loadbalancers.limit | Resourcequota Services Loadbalancers Limit | NULL | Hard limit of the number of loadbalancers for a resource quota. | |
kubernetes_state.container.memory_limit | Container Memory Limit | NULL | The limit on memory to be used by a container. | |
kubernetes_state.resourcequota.services.nodeports.used | Resourcequota Services Nodeports Used | NULL | Observed number of node ports used for a resource quota. | |
kubernetes_state.resourcequota.services.limit | Resourcequota Services Limit | NULL | Hard limit of the number of services for a resource quota. | |
kubernetes_state.node.pods_capacity | Node Pods Capacity | NULL | The total pod resources of the node. | |
kubernetes_state.node.memory_allocatable | Node Memory Allocatable | NULL | The memory resources of a node that are available for scheduling. | |
kubernetes_state.container.cpu_requested | Container Cpu Requested | NULL | The number of requested cpu cores by a container. | |
kubernetes_state.deployment.rollingupdate.max_unavailable | Deployment Rollingupdate Max Unavailable | NULL | Maximum number of unavailable replicas during a rolling update of a deployment. | |
kubernetes_state.daemonset.ready | Daemonset Ready | NULL | The number of nodes that should be running the daemon pod and have one or more of the daemon pod running and ready. | |
kubernetes_state.deployment.replicas_desired | Deployment Replicas Desired | NULL | The number of desired replicas per deployment wrong help in kube-state-metrics.cross check | |
kubernetes_state.resourcequota.services.used | Resourcequota Services Used | NULL | Observed number of services used for a resource quota. | |
kubernetes_state.daemonset.desired | Daemonset Desired | NULL | The number of nodes that should be running the daemon pod. | |
kubernetes_state.resourcequota.services.loadbalancers.used | Resourcequota Services Loadbalancers Used | NULL | Observed number of loadbalancers used for a resource quota. | |
kubernetes_state.node.pods_allocatable | Node Pods Allocatable | NULL | The pod resources of a node that are available for scheduling. | |
kubernetes_state.resourcequota.requests.storage.limit | Resourcequota Requests Storage Limit | NULL | Hard limit on the total of storage bytes requested for a resource quota. | |
kubernetes_state.replicaset.fully_labeled_replicas | Replicaset Fully Labeled Replicas | NULL | The number of fully labeled replicas per ReplicaSet. | |
kubernetes_state.node.memory_capacity | Node Memory Capacity | NULL | The total memory resources of the node. | |
kubernetes_state.deployment.replicas | Deployment Replicas | NULL | The number of replicas per deployment. |
Agent G2 - Linux - OS DISKIOPS Template
Description
Agent G2 - Linux - OS DISKIOPS Template
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - OS DISKIOPS Template | DISKIOPS | DISKIOPS | NULL | Monitors the Disk I/O Read in MBps, Disk Write MBps and Disk busy percentage. |
Agent G2 - Linux Postfix Statistics
Description
Monitors the counts and sizes of specific mail queue types - namely, incoming, active, maildrop, deferred, and bounce queues.
Prerequisites:
Postfix needs to be installed and need Root privileges to execute the script.
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux Postfix Statistics | system_linux_postfix_active_mailqueue_size | System Linux Postfix Active Mailqueue Size | Bytes | Monitors the size (in bytes) of active emails in the Postfix mail queue. |
system_linux_postfix_deferred_mailqueue_size | System Linux Postfix Deferred Mailqueue Size | Bytes | **Monitors the size (in bytes) of deferred emails in the Postfix mail queue. | |
system_linux_postfix_maildrop_mailqueue_size | System Linux Postfix Maildrop Mailqueue Size | Bytes | Monitors the size (in bytes) of emails in the Postfix mail queue marked for maildrop. | |
system_linux_postfix_bounce_mailqueue_size | System Linux Postfix Bounce Mailqueue Size | Bytes | Monitors the size (in bytes) of bounced emails in the Postfix mail queue. | |
system_linux_postfix_incoming_mailqueue_size | System Linux Postfix Incoming Mailqueue Size | Bytes | Monitors the size (in bytes) of incoming emails in the Postfix mail queue. | |
system_linux_postfix_deferred_mailqueue_count | System Linux Postfix Deferred Mailqueue Count | Count | Monitors the count of deferred emails in the Postfix mail queue. | |
system_linux_postfix_bounce_mailqueue_count | System Linux Postfix Bounce Mailqueue Count | Count | Monitors the count of bounced emails in the Postfix mail queue. | |
system_linux_postfix_maildrop_mailqueue_count | System Linux Postfix Maildrop Mailqueue Count | Count | Monitors the count of emails in the Postfix mail queue marked for maildrop. | |
system_linux_postfix_incoming_mailqueue_count | System Linux Postfix Incoming Mailqueue Count | Count | Monitors the count of incoming emails in the Postfix mail queue. | |
system_linux_postfix_active_mailqueue_count | system_linux_postfix_active_mailqueue_count | Count | Monitors the count of active emails in the Postfix mail queue. |
Agent G2 - Linux - PostgresDB Aliveness Check Status
Description
To monitor Postgresql db_aliveness, Supported versions of PostgreSQL 11 or later (Validated this template on version PostgreSQL-11 and 12 versions).
Prerequisites
- Agent installed on the target machine.
- Create a Postgres environment file and provide the file path as input parameter while applying the template.
- We need to provide env path for single instance - <env path> & for multiple instances - <env path1>, <env path2>,<env path3> Along with setting up postgres environment this env file should contain other parameters like PGDATABASE,PGARCHIVEDIR,PGDATADIR,PGWALDIR,PGPORT,SERVICENAME.
- Agent must have permission to access the provided .env file
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - PostgresDB Aliveness Check Status Monitor | postgres_check_dbalive | Postgres Check DBAlive | NULL | To monitor the aliveness of the postgres database |
Agent G2 - Linux - PostgresDB Aliveness Check Status - v2
Description
To monitor the aliveness of the postgres database
Template Usage Guidelines:
- Create a Postgres environment file and provide the file path as input parameter while applying the template.
- We need to provide env path for single instance -
<env path> & for multiple instances - <env path1>,<env path2>,<env path3>
- Along with setting up postgres environment, this env file should contain other parameters like PGDATABASE,PGARCHIVEDIR,PGDATADIR,PGWALDIR,PGPORT,SERVICENAME.
- Agent must have permission to access the provided .env file.
- Sample pg_new_5432.env file data:
export PATH=$PATH:/usr/pgsql-12/bin/ PGDATABASE=postgres PGPORT=5432 PGARCHIVEDIR=/var/lib/pgsql/backups_main/archive/ PGDATADIR=/var/lib/pgsql/12/data PGWALDIR=/var/lib/pgsql/12/data/pg_wal SERVICENAME=postgresql-12
Prerequisite
Agent must be installed on the target machine.
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - PostgresDB Aliveness Check Status Monitor - v2 | postgres_check_dbalive | Postgres Check DBAlive | To monitor the aliveness of the postgres database |
Agent G2 - Linux - PostgreSQL Monitors
Description
Agent G2 - Linux - PostgreSQL Monitors
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - PostgreSQL Monitors | postgres.dbstat.commit | PostgreSQL Commits | NULL | The total number of commits for this database since it was created or reset. |
postgres.dbstat.reads | PostgreSQL Reads | NULL | Reports information from the pg_stat_database view, The total number of disk blocks read. | |
postgres.conn.idle | PostgreSQL Connections Idle | NULL | Checks number of connections idle in given state | |
postgres.db.size | PostgreSQL Database Size | MB | Total Size of all dbs | |
postgres.locks.not.granted | PostgreSQL Locks Not Granted | NULL | Checks the total number of locks are not granted on one or more databases | |
postgres.conn.waiting | PostgreSQL Connections Waiting | NULL | Checks number of connections waiting in given state | |
postgres.dbstat.tup.inserted | PostgreSQL Tup Inserted | NULL | Reports information from the pg_stat_database view, The total number of tuples inserted. | |
postgres.autovac.freeze | PostgreSQL Autovac Freeze | % | Provides the percentage of current transactions to the max freeze number. | |
postgres.backends | PostgreSQL Backends | % | Check the number of connections to the database. Compares it with the Max_connection provided in the postgres conf file | |
postgres.dbstat.tup.returned | PostgreSQL Tup Returned | NULL | Reports information from the pg_stat_database view, The total number of tuples returned. | |
postgres.dbstat.hits | PostgreSQL Hits | NULL | Reports information from the pg_stat_database view , The total number of buffer hits. | |
postgres.tnxage.running | PostgreSQL Tnxage Running | NULL | Checks the number and duration of "running transaction" queries on one or more databases | |
postgres.locks.granted | PostgreSQL Locks Granted | NULL | Checks the total number of locks are granted on one or more databases | |
postgres.bloat | PostgreSQL Bloat | NULL | Checks the amount of bloat in tables and indexes | |
postgres.dbstat.rollback | PostgreSQL Rollbacks | NULL | The total number of rollbacks for this database since it was created or reset. | |
postgres.dbstat.tup.deleted | PostgreSQL Tup Deleted | NULL | Reports information from the pg_stat_database view, The total number of tuples deleted. | |
postgres.conn.total | PostgreSQL Connections Total | NULL | Checks total number of connections in given state | |
postgres.conn.running | PostgreSQL Connections Running | NULL | Checks number of connections running in given state | |
postgres.ping | PostgreSQL Ping | NULL | Provides the Ping response time of PostgreSQL database. | |
postgres.dbstat.tup.fetched | PostgreSQL Tup Fetched | NULL | Reports information from the pg_stat_database view, The total number of tuples fetched. | |
postgres.conn.idle.tnx | PostgreSQL Connections Idle Tnx | NULL | Checks the number connections "idle in transaction" state | |
postgres.locks.all | PostgreSQL Locks | NULL | Checks the total number of locks on one or more databases | |
postgres.tnxage.idle.tnx | PostgreSQL Tnxage IdleTnx | NULL | Checks the number and duration of "idle in transaction" queries on one or more databases | |
postgres.dbstat.tup.updated | PostgreSQL Tup Updated | NULL | Reports information from the pg_stat_database view, The total number of tuples updated. | |
postgres.wal | PostgreSQL Wal | NULL | Checks how many WAL files exist in the pg_xlog directory |
Agent G2 - Linux PostgresDB Aliveness Check Status - v3
Description:
Monitors the PostgreSQL database aliveness status. The metric value is 1 if the PostgreSQL database is alive; otherwise, the value is 0.
Prerequisites:
- The Agent must be installed as Root on the target machine, with Agent version 14.0 or later.
- Create a PostgreSQL environment file and provide the file path as an input parameter when applying the template.
- For a single instance, provide the environment path as <env path>. For multiple instances, provide the environment paths as <env path1>, <env path2>, <env path3>.
- This env file should contain other parameters such as PGDATADIR, PGPORT, SERVICENAME, SERVICELEVEL, POSTGRESUSER, and POSTGRESPATH. The last three parameters (SERVICELEVEL, POSTGRESUSER, POSTGRESPATH) are necessary only for checking the status of PostgreSQL using pg_ctl; otherwise, they are optional.
- The agent must have execute permissions on the provided .env file.
The enhancements introduced in Version 3 (V3) of the script provide users with the flexibility to check the PostgreSQL aliveness status using either systemctl, pg_isready commands, or a combination of pg_isready and pg_ctl commands by giving respective parameters in env file as input.
Template Usage Guidelines: While assigning template on device, users need to pass specific input parameters -
For a single instance, provide the environment path as <env path>. For multiple instances, provide the environment paths as <env path1>, <env path2>, <env path3>.
Example:
Single Input Parameter:/root/pg_new_5432.env
Multiple Input Parameters:
/var/lib/pgsql/pg_new_5432.env,/var/lib/pgsql/pg_new_5433.env
- Create a Postgres environment file and provide the file path as input parameter while applying the template.
- We need to provide env path for single instance - <env path> & for multiple instances - <env path1>,<env path2>,<env path3>
- Along with setting up postgres environment, this env file should contain other parameters like PGDATABASE,PGARCHIVEDIR,PGDATADIR,PGWALDIR,PGPORT,SERVICENAME.
- Agent must have permission to access the provided .env file Example content of the .env file -
SampleContent while checking postgres status with pg_ctl command:
export PATH=$PATH:/usr/pgsql-12/bin/
PGPORT=5432
PGDATADIR=/var/lib/pgsql/12/data
SERVICENAME=postgresql-12
SERVICELEVEL=pgctl
POSTGRESUSER=postgres
POSTGRESPATH=/usr/pgsql-12/bin/
SampleContent while checking postgres status without pg_ctl:
export PATH=$PATH:/usr/pgsql-12/bin/
PGPORT=5432
PGDATADIR=/var/lib/pgsql/12/data
SERVICENAME=postgresql-12
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux PostgresDB Aliveness Check Status Monitor - v3 | postgres_check_dbalive | Postgres Check DBAlive | - | To monitor the aliveness of the postgres database |
Agent G2 - Linux OS Performance Monitoring - Advanced
Description
Template to monitor Linux OS advanced performance metrics related to OS Resource parameters, Real memory (page outs & scan rate), Total and individual swap utilization. It has been validated on following Linux flavors: RHEL 7.9, Centos 7 ,Ubuntu 20.04, Open SUSE Linux.
Prerequisites
Applicable on devices which is running Opsramp Agent ( v7.0.0 or above)
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux OS Resource Parameters | system_linux_openFileDescriptors_UsedCount | System Linux OpenFileDescriptors Used Count | count | Current number of Open File Descriptors |
system_linux_messageQueueIDs_Utilization | System Linux MessageQueueIDs Utilization | % | Used percentage of current message queue ID's | |
system_linux_Semaphores_Utilization | System Linux Semaphores Utilization | % | Semaphore arrays or sets used percentage | |
system_linux_loggedInUsers_Pct | System Linux Logged In Users Pct | % | Current number of logged in users percentage | |
system_linux_sharedMemoryIDs_Utilization | System Linux SharedMemoryIDs Utilization | % | Used percentage of shared memory ID's | |
system_linux_messageQueueIDs_UsedCount | System Linux MessageQueueIDs Used Count | count | Current number of message queue ID’s in use | |
system_linux_openFileDescriptors_Utilization | System Linux OpenFileDescriptors Utilization | % | Linux Open File Descriptors Used Percentage | |
system_linux_runningProcesses_Pct | System Linux Running Processes Pct | % | Current running processes percentage | |
system_linux_Semaphores_UsedCount | System Linux Semaphores Used Count | count | Current number of semaphore arrays in use | |
system_linux_sharedMemoryIDs_UsedCount | System Linux SharedMemoryIDs Used Count | count | Current number of shared memory ID’s in use | |
system_linux_loggedInUsers_Count | System Linux LoggedInUsers Count | count | Current number of logged in users | |
system_linux_runningProcesses_Count | System Linux RunningProcesses Count | count | Current number of running processes | |
Agent G2 - Linux Real Memory Stats | system_linux_RealMemory_PageOuts_KiloBytesPerSec | System Linux Real Memory PageOuts KiloBytesPerSec | KBps | Memory pages page out rate in Kilo Bytes per second. |
system_linux_RealMemory_PageOuts_PagesPerSec | System Linux Real Memory PageOuts PagesPerSec | psec | Memory page out rate in pages per second. | |
system_linux_RealMemory_Scan_Rate | System Linux Real Memory Scan Rate | psec | Number of pages scanned (directly) per second. It will collect data for last 10 min (i.e time configured in /etc/cron.d/sysstat file).\n\nPrerequisite: sysstat package should be installed and sar -B command should respond | |
Agent G2 - Linux Swap Memory Utilization | system_linux_swapMemory_Utilization | System Linux Swap Memory Utilization | % | Swap memory utilization in percent. |
system_linux_individual_SwapArea_Utilization | System Linux Individual Swap Area Utilization | % | Individual swap area utilization in percent. |
Agent G2 - Linux OS Performance Monitoring - Advanced - V2
Description
Template to monitor Linux OS advanced performance metrics related to OS Resource parameters, Real memory (page ins, page outs, swap in, swap out & scan rate), Total and individual swap utilization. It has been validated on following Linux flavors: RHEL 7.9, Centos 7 ,Ubuntu 20.04, Open SUSE Linux.
Prerequisites
Applicable on devices which is running Opsramp Agent ( v7.0.0 or above)
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux OS Resource Parameters | system_linux_openFileDescriptors_UsedCount | System Linux OpenFileDescriptors Used Count | count | Current number of Open File Descriptors |
system_linux_messageQueueIDs_Utilization | System Linux MessageQueueIDs Utilization | % | Used percentage of current message queue ID's | |
system_linux_Semaphores_Utilization | System Linux Semaphores Utilization | % | Semaphore arrays or sets used percentage | |
system_linux_loggedInUsers_Pct | System Linux Logged In Users Pct | % | Current number of logged in users percentage | |
system_linux_sharedMemoryIDs_Utilization | System Linux SharedMemoryIDs Utilization | % | Used percentage of shared memory ID's | |
system_linux_messageQueueIDs_UsedCount | System Linux MessageQueueIDs Used Count | count | Current number of message queue ID’s in use | |
system_linux_openFileDescriptors_Utilization | System Linux OpenFileDescriptors Utilization | % | Linux Open File Descriptors Used Percentage | |
system_linux_runningProcesses_Pct | System Linux Running Processes Pct | % | Current running processes percentage | |
system_linux_Semaphores_UsedCount | System Linux Semaphores Used Count | count | Current number of semaphore arrays in use | |
system_linux_sharedMemoryIDs_UsedCount | System Linux SharedMemoryIDs Used Count | count | Current number of shared memory ID’s in use | |
system_linux_loggedInUsers_Count | System Linux LoggedInUsers Count | count | Current number of logged in users | |
system_linux_runningProcesses_Count | System Linux RunningProcesses Count | count | Current number of running processes | |
Agent G2 - Linux Real Memory Stats - V2 | system_linux_RealMemory_PageOuts_KiloBytesPerSec | System Linux Real Memory PageOuts KiloBytesPerSec | KBps | Memory pages page out rate in Kilo Bytes per second. |
system_linux_RealMemory_PageOuts_PagesPerSec | System Linux Real Memory PageOuts PagesPerSec | psec | Memory page out rate in pages per second. | |
system_linux_RealMemory_Scan_Rate | System Linux Real Memory Scan Rate | psec | Number of pages scanned (directly) per second. It will collect data for last 10 min (i.e time configured in /etc/cron.d/sysstat file).\n\nPrerequisite: sysstat package should be installed and sar -B command should respond | |
system_linux_RealMemory_SwapOuts_KiloBytesPerSec | System Linux Real Memory SwapOuts KiloBytesPerSec | KBps | Swap out rate in Kilo Bytes per second. | |
system_linux_RealMemory_PageIns_PagesPerSec | System Linux Real Memory PageIns PagesPerSec | psec | Memory pages page in rate in Pages per second. | |
system_linux_RealMemory_PageIns_KiloBytesPerSec | System Linux Real Memory PageIns KiloBytesPerSec | KBps | Memory pages page in rate in Kilo Bytes per second. | |
system_linux_RealMemory_SwapIns_KiloBytesPerSec | System Linux Real Memory SwapIns KiloBytesPerSec | KBps | Swap in rate in Kilo Bytes per second. | |
Agent G2 - Linux Swap Memory Utilization | system_linux_swapMemory_Utilization | System Linux Swap Memory Utilization | % | Swap memory utilization in percent. |
system_linux_individual_SwapArea_Utilization | System Linux Individual Swap Area Utilization | % | Individual swap area utilization in percent. |
Agent G2 - Linux OS Performance Monitoring - Advanced - v3
Description:
Monitors the data related to OS resource parameters like Open FDs,Logge In users, Context Switches, Processes created, Running processes, Page Faults, Semaphores, Shared Memory IDs,Message queue IDs, TCP Connection states, Swap memory, Real memory Statistics and disk statistics. Ensure that the “ss” command is available in system to get TCP Connections data and util-linux package version 2.19 or higher, Kernel version above 2.6 be available on the system to get Disk Stats data.
Note: When the server is rebooted or when the disk counter’s valid range is exceeded, negative values for disk metrics will occur, as we are calculating rates.
Prerequisites
Ensure that the “ss” command is available in system to get TCP Connections data and util-linux package version 2.19 or higher, Kernel version above 2.6 be available on the system to get Disk Stats data.
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux Swap Memory Utilization | system_linux_swapMemory_Utilization | System Linux Swap Memory Utilization | % | Swap memory utilization in percent. |
system_linux_individual_SwapArea_Utilization | System Linux Individual Swap Area Utilization | % | Individual swap area utilization in percent. | |
Agent G2 - Linux Real Memory Stats - V2 | system_linux_RealMemory_Scan_Rate | System Linux Real Memory Scan Rate | psec | Number of pages scanned (directly) per second. It will collect data for last 10 min (i.e time configured in /etc/cron.d/sysstat file). Prerequisite: sysstat package should be installed and sar -B command should respond |
system_linux_RealMemory_PageOuts_PagesPerSec | System Linux Real Memory PageOuts PagesPerSec | psec | Memory page out rate in pages per second. | |
system_linux_RealMemory_PageOuts_KiloBytesPerSec | System Linux Real Memory PageOuts KiloBytesPerSec | KBps | Memory pages page out rate in Kilo Bytes per second. | |
system_linux_RealMemory_PageIns_KiloBytesPerSec | System Linux Real Memory PageIns KiloBytesPerSec | KBps | Memory pages page in rate in Kilo Bytes per second. | |
system_linux_RealMemory_PageIns_PagesPerSec | System Linux Real Memory PageIns PagesPerSec | psec | Memory pages page in rate in Pages per second. | |
system_linux_RealMemory_SwapIns_KiloBytesPerSec | System Linux Real Memory SwapIns KiloBytesPerSec | KBps | Swap in rate in Kilo Bytes per second. | |
system_linux_RealMemory_SwapOuts_KiloBytesPerSec | System Linux Real Memory SwapOuts KiloBytesPerSec | KBps | Swap out rate in Kilo Bytes per second. | |
Agent G2 - Linux TCP Connection States Monitor | system_linux_tcp_connection_states | System Linux TCP Connection States | count | Monitors the count of TCP connections in various states on the Linux system. |
Agent G2 - Linux Disk Statistics | system_linux_disk_IOQueueLength | System Linux Disk IOQueueLength | null | Monitors the length of the I/O queue for disk operations. |
system_linux_disk_averageLatency | System Linux Disk AverageLatency | microsec | Monitors the average response time of the disk subsystem to process I/O requests. | |
system_linux_disk_averageReadRequestSize | System Linux Disk AverageReadRequestSize | KB | Monitors the average size of read requests issued to the disk. | |
system_linux_disk_averageRequestSize | System Linux Disk AverageRequestSize | KB | Monitors the average size of all read and write requests issued to the disk. | |
system_linux_disk_averageWriteRequestSize | System Linux Disk AverageWriteRequestSize | KB | Monitors the average size of write requests issued to the disk. | |
system_linux_disk_timeUtilization | System Linux Disk TimeUtilization | % | Monitors the utilization of the disk, measured as the percentage of time the disk is actively processing I/O operations. | |
system_linux_disk_readOperationsRate | System Linux Disk ReadOperationsRate | Monitors the rate of read operations issued to the disk per second. | psec | |
system_linux_disk_readThroughput | System Linux Disk ReadThroughput | KBps | Monitors the rate of data read from the disk per second. | |
system_linux_disk_averageReadWaitTime | System Linux Disk AverageReadWaitTime | microsec | Monitors the average wait time for read requests. | |
system_linux_disk_averageRequestWaitTime | System Linux Disk AverageRequestWaitTime | microsec | Monitors the time a disk operation (read and write) waits in the I/O queue before being processed. | |
system_linux_disk_writeOperationsRate | System Linux Disk WriteOperationsRate | psec | Monitors the rate of write operations issued to the disk per second. | |
system_linux_disk_writeThroughput | System Linux Disk WriteThroughput | KBps | Monitors the rate of data written to the disk per second. | |
system_linux_disk_averageWriteWaitTime | System Linux Disk AverageWriteWaitTime | microsec | Monitors the average wait time for write requests. | |
Agent G2 - Linux OS Resource Parameters - v2 | system_linux_openFileDescriptors_Utilization | System Linux OpenFileDescriptors Utilization | % | Linux Open File Descriptors Used Percentage |
system_linux_openFileDescriptors_UsedCount | System Linux OpenFileDescriptors Used Count | count | Current number of Open File Descriptors | |
system_linux_loggedInUsers_Pct | System Linux Logged In Users Pct | % | Current number of logged in users percentage | |
system_linux_loggedInUsers_Count | System Linux LoggedInUsers Count | count | Current number of logged in users | |
system_linux_runningProcesses_Pct | System Linux Running Processes Pct | % | Current running processes percentage | |
system_linux_runningProcesses_Count | System Linux RunningProcesses Count | count | Current number of running processes | |
system_linux_Semaphores_Utilization | System Linux Semaphores Utilization | % | Semaphore arrays or sets used percentage | |
system_linux_Semaphores_UsedCount | System Linux Semaphores Used Count | count | Current number of semaphore arrays in use | |
system_linux_messageQueueIDs_Utilization | System Linux MessageQueueIDs Utilization | % | Used percentage of current message queue ID's | |
system_linux_messageQueueIDs_UsedCount | System Linux MessageQueueIDs Used Count | count | Current number of message queue ID?s in use | |
system_linux_sharedMemoryIDs_Utilization | System Linux SharedMemoryIDs Utilization | % | Used percentage of shared memory ID's | |
system_linux_sharedMemoryIDs_UsedCount | System Linux SharedMemoryIDs Used Count | count | Current number of shared memory ID?s in use | |
system_linux_createdProcessesPerSec | System Linux CreatedProcesses PerSec | Count per sec | Provides the count of processes created or managed per second on the Linux system since the system was rebooted. | |
system_linux_pageFaultsPerSec | System Linux PageFaults PerSec | Count per sec | Monitors the count of page faults per second on the Linux system. | |
system_linux_kernelContextSwitchesPerSec | System Linux KernelContextSwitches PerSec | Count per sec | Monitors the count of context switches per second on the Linux system. |
Agent G2 - Linux Process Statistics Monitoring
Description
Monitors the process statistics of given process list. Should give process list in configuration parameter - custom arguments param
Prerequisites
NULL
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux Process Statistics Monitor | system.process.stats.cpu | System Process Stats Cpu Usage | % | Monitors Cpu Usage of each process of Linux Device. |
system.process.stats.count | System Process Stats Instance Count | count | Monitors Instance count of each process of Linux Device. | |
system.process.stats.threads | System Process Stats Thread Count | count | Monitors thread count of each process of Linux Device. | |
system.process.stats.open.fds | System Process Stats Open Fd count | count | Monitors open fd count of each process of Linux Device. | |
system.process.stats.memory | System Process Stats Memory Usage | % | Monitors Memory Usage of each process of Linux Device. |
Agent G2 - Linux Process Statistics Monitoring - v2
Description
Agent G2 - Linux Process Statistics Monitoring - v2
Prerequisites
NULL
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux Process Statistics Monitor -v2 | system.process.stats.cpu | System Process Stats Cpu Usage | % | Monitors Cpu Usage of each process of Linux Device. |
system.process.stats.count | System Process Stats Instance Count | count | Monitors Instance count of each process of Linux Device. | |
system.process.stats.threads | System Process Stats Thread Count | count | Monitors thread count of each process of Linux Device. | |
system.process.stats.open.fds | System Process Stats Open Fd count | count | Monitors open fd count of each process of Linux Device. | |
system.process.stats.memory | System Process Stats Memory Usage | % | Monitors Memory Usage of each process of Linux Device. |
Agent G2 - Linux Veritas Cluster Monitoring
Description
Template to monitor veritas Linux cluster parameters like cluster node state, service group state, resource state, service group failover status and this template need to assigned on all cluster nodes. This template tested with two node veritas Linux cluster (version - “Veritas_InfoScale_7.3.1”) running on CentOS7.
Prerequisites
NULL
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
G2 - Linux Veritas Cluster Monitor | system_linux_veritas_cluster_resource_state | System Linux Veritas Cluster Resource State | System Linux Veritas Cluster Resource State - Below are the possible states of the resource: OFFLINE:0 ONLINE:1 FAULTED:2 PARTIAL:3 STARTING:4 STOPPING:5 MIGRATING:6 OFFLINE|FAULTED:7 OFFLINE|STARTING:8 PARTIAL|FAULTED:9 PARTIAL|STARTING:10 PARTIAL|STOPPING:11 ONLINE|STOPPING:12 | |
system_linux_veritas_cluster_group_online_status | System Linux Veritas Cluster Group Online Status | System Linux Veritas Cluster Group Online Status - Below are the possible values : 0 - Service group online on cluster node. 1 - Service group not online on any cluster node. | ||
system_linux_veritas_cluster_resource_online_status | System Linux Veritas Cluster Resource Online Status | System Linux Veritas Cluster Resource Online Status . Below are the possible values: 0 - Resource state in online on any cluster node, 1 - Resource state is not online on any cluster node | ||
system_linux_veritas_cluster_node_state | System Linux Veritas Cluster Node State | System Linux Veritas Cluster Node State - Below are the possible states: RUNNING : 0 ADMIN_WAIT : 1 CURRENT_DISCOVER_WAIT : 2 CURRENT_PEER_WAIT : 3 EXITING : 4 EXITED : 5 EXITING_FORCIBLY : 6 FAULTED : 7 INITING : 8 LEAVING : 9 LOCAL_BUILD : 10 REMOTE_BUILD : 11 STALE_ADMIN_WAIT : 12 STALE_DISCOVER_WAIT : 13 STALE_PEER_WAIT : 14 UNKNOWN : 15 | ||
system_linux_veritas_cluster_group_state | System Linux Veritas Cluster Group State | System Linux Veritas Cluster Group State - Below are the possible values : OFFLINE\t: 0 ONLINE\t: 1 FAULTED : 2 PARTIAL : 3 STARTING : 4 STOPPING : 5 MIGRATING : 6 OFFLINE|FAULTED : 7 OFFLINE|STARTING : 8 PARTIAL|FAULTED : 9 PARTIAL|STARTING : 10 PARTIAL|STOPPING : 11 ONLINE|STOPPING : 12 | ||
G2 - Linux Veritas Cluster Group Failover Monitor | system_linux_veritas_cluster_group_failover_status | System Linux Veritas Cluster Group Failover Status | System Linux Veritas Cluster Group Failover Status - Below are the possible values: 0 - No change. 1 - Cluster group change from one node to another due to failover. 2 - The specific cluster group is not online on any cluster node. |
Agent G2 - Logfile Monitoring
Description
The Logfile monitor is used to validate whether the given input string is found or not in the specified input logfile. It sends an alert based on the check type (yes or no), and regex patterns are also allowed in the input string. We have added support for alert tokens in G2 logfile monitoring. In the global template, we have included all tokens, and customers can choose the required tokens based on their requirements.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 Logfile Monitor | system_logfile_search_monitor | system_logfile_search_monitor | Using logfile monitor to validate given input string is found or not by from given input logfile and sends the alerts |
Agent G2 - Microsoft Active Directory 2003 - Performance Counters
Description
Template for AD 2003 Servers.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Active Directory 2003 - Performance Counters | DSServerBindsPersec | DSServerBindsPersec | Shows the number of DC-to-DC binds per second that are serviced by this DC. | |
LDAPClientSessions | LDAPClientSessions | The number of sessions of connected LDAP clients. Lack of activity points to network problems. | ||
DRAOutboundObjectsPersec | DRAOutboundObjectsPersec | The number of objects sent (per second) through outbound replication to replication partners. | ||
ABClientSessions | ABClientSessions | AB Client Sessions is the number of connected Address Book client sessions. | ||
DRAInboundObjectsPersec | DRAInboundObjectsPersec | The number of objects received (per second) through inbound replication from replication partners. | ||
DRAInboundObjectsAppliedPersec | DRAInboundObjectsAppliedPersec | This counter excludes changes that are received but not applied (for example, when the update is already made) and also how many replication updates are occurring on the server as a result of changes generated on other servers. | ||
KerberosAuthentications | KerberosAuthentications | The number of times per second that clients use a client ticket to this domain controller to authenticate to this domain controller. A lack of activity can indicate network problems that are preventing authentication requests from succeeding. | ||
DRAInboundObjectUpdatesRemaininginPacket | DRAInboundObjectUpdatesRemaininginPacket | This counter tells you whether the monitored server is receiving changes, but is taking a long time applying them to the database. The value should be low, with a higher value indicating that the hardware is incapable of adequately servicing replication (warranting a server upgrade). | ||
NTLMAuthentications | NTLMAuthentications | The number of NTLM authentications (per second) serviced by this domain controller | ||
LDAPSearchesPersec | LDAPSearchesPersec | The number of search operations per second performed by LDAP clients. A lack of activity points to network problems. | ||
DSDirectoryReadsPersec | DSDirectoryReadsPersec | Shows the number of directory reads per second. | ||
DSDirectoryWritesPersec | DSDirectoryWritesPersec | Shows the number of directory writes per second. | ||
LDAPActiveThreads | LDAPActiveThreads | LDAP Active Threads is the current number of threads in use by the LDAP subsystem of the local direcotry service. | ||
LDAPWritesPersec | LDAPWritesPersec | Shows the rate at which LDAP clients perform write operations. | ||
DRAOutboundBytesTotalPersec | DRAOutboundBytesTotalPersec | Bytes per second | It is the sum of the number of bytes of uncompressed data (never compressed) and compressed data (after compression) sent per second. Lack of activity indicates that the hardware or network is slowing down replication. | |
DSNotifyQueueSize | DSNotifyQueueSize | The number of pending update notifications that have been queued, but not yet transmitted to clients. | ||
DRAInboundBytesTotalPersec | DRAInboundBytesTotalPersec | Bytes per second | It is the sum of the number of bytes (per second) of uncompressed data (never compressed) and compressed data (after compression) received through replication. Lack of activity indicates that the network is slowing down replication. | |
LDAPUDPoperationsPersec | LDAPUDPoperationsPersec | Shows the number of UDP operations that the LDAP server is processing per second. | ||
LDAPBindTime | LDAPBindTime | Milliseconds | This counter shows the time required for completion of the last LDAP binding, with a higher value pointing to either hardware or network performance problems. | |
DSClientBindsPersec | DSClientBindsPersec | Shows the number of Ntdsapi.dll binds per second serviced by this DC. | ||
DRAPendingReplicationSynchronizations | DRAPendingReplicationSynchronizations | The number of directory synchronizations that are queued for this server that are not yet processed. This counter helps in determining replication backlog - the larger the number, the larger the backlog. This value should be low, with a higher value indicating that the hardware is not adequately servicing replication. |
Agent G2 - Microsoft Active Directory 2008 - Performance Counters
Description
Template for AD 2008 Servers.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Active Directory 2008 - Performance Counters | LDAPWritesPersec | LDAPWritesPersec | Shows the rate at which LDAP clients perform write operations. | |
NTLMAuthentications | NTLMAuthentications | The number of NTLM authentications (per second) serviced by this domain controller | ||
LDAPClientSessions | LDAPClientSessions | The number of sessions of connected LDAP clients. Lack of activity points to network problems. | ||
ABClientSessions | ABClientSessions | AB Client Sessions is the number of connected Address Book client sessions. | ||
DSClientBindsPersec | DSClientBindsPersec | Shows the number of Ntdsapi.dll binds per second serviced by this DC. | ||
LDAPActiveThreads | LDAPActiveThreads | LDAP Active Threads is the current number of threads in use by the LDAP subsystem of the local directory service. | ||
LDAPUDPoperationsPersec | LDAPUDPoperationsPersec | Shows the number of UDP operations that the LDAP server is processing per second. | ||
LDAPBindTime | LDAPBindTime | Milliseconds | This counter shows the time required for completion of the last LDAP binding, with a higher value pointing to either hardware or network performance problems. | |
DRAOutboundObjectsPersec | DRAOutboundObjectsPersec | The number of objects sent (per second) through outbound replication to replication partners. | ||
DRAInboundObjectsPersec | DRAInboundObjectsPersec | The number of objects received (per second) through inbound replication from replication partners. | ||
DRAPendingReplicationSynchronizations | DRAPendingReplicationSynchronizations | The number of directory synchronizations that are queued for this server that are not yet processed. This counter helps in determining replication backlog - the larger the number, the larger the backlog. This value should be low, with a higher value indicating that the hardware is not adequately servicing replication. | ||
DSNotifyQueueSize | DSNotifyQueueSize | The number of pending update notifications that have been queued, but not yet transmitted to clients. | ||
DSServerBindsPersec | DSServerBindsPersec | Shows the number of DC-to-DC binds per second that are serviced by this DC. | ||
DSDirectoryReadsPersec | DSDirectoryReadsPersec | Shows the number of directory reads per second. | ||
DRAInboundObjectUpdatesRemaininginPacket | DRAInboundObjectUpdatesRemaininginPacket | This counter tells you whether the monitored server is receiving changes, but is taking a long time applying them to the database. The value should be low, with a higher value indicating that the hardware is incapable of adequately servicing replication (warranting a server upgrade). | ||
KerberosAuthentications | KerberosAuthentications | The number of times per second that clients use a client ticket to this domain controller to authenticate to this domain controller. A lack of activity can indicate network problems that are preventing authentication requests from succeeding. | ||
DRAInboundObjectsAppliedPersec | DRAInboundObjectsAppliedPersec | This counter excludes changes that are received but not applied (for example, when the update is already made) and also how many replication updates are occurring on the server as a result of changes generated on other servers. | ||
LDAPSearchesPersec | LDAPSearchesPersec | The number of search operations per second performed by LDAP clients. A lack of activity points to network problems. | ||
DSDirectoryWritesPersec | DSDirectoryWritesPersec | Shows the number of directory writes per second. | ||
DRAOutboundBytesTotalPersec | DRAOutboundBytesTotalPersec | Bytes per second | It is the sum of the number of bytes of uncompressed data (never compressed) and compressed data (after compression) sent per second. Lack of activity indicates that the hardware or network is slowing down replication. | |
DRAInboundBytesTotalPersec | DRAInboundBytesTotalPersec | Bytes per second | It is the sum of the number of bytes (per second) of uncompressed data (never compressed) and compressed data (after compression) received through replication. Lack of activity indicates that the network is slowing down replication. |
Agent G2 - Microsoft Active Directory 2012 DotNet v4 - Performance Counters
Description
Monitors Microsoft Active Directory Performance Counters
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Active Directory 2012 DotNet v4 - Performance Counters | LDAPWritesPersec | LDAPWritesPersec | Shows the rate at which LDAP clients perform write operations. | |
DRAOutboundBytesTotalPersec | DRAOutboundBytesTotalPersec | Bytes per second | It is the sum of the number of bytes of uncompressed data (never compressed) and compressed data (after compression) sent per second. Lack of activity indicates that the hardware or network is slowing down replication. | |
DSClientBindsPersec | DSClientBindsPersec | Shows the number of Ntdsapi.dll binds per second serviced by this DC. | ||
LDAPClientSessions | LDAPClientSessions | The number of sessions of connected LDAP clients. Lack of activity points to network problems. | ||
DRAInboundObjectsPersec | DRAInboundObjectsPersec | The number of objects received (per second) through inbound replication from replication partners. | ||
DSDirectoryReadsPersec | DSDirectoryReadsPersec | Shows the number of directory reads per second. | ||
LDAPUDPoperationsPersec | LDAPUDPoperationsPersec | Shows the number of UDP operations that the LDAP server is processing per second. | ||
LDAPActiveThreads | LDAPActiveThreads | LDAP Active Threads is the current number of threads in use by the LDAP subsystem of the local directory service. | ||
LDAPSearchesPersec | LDAPSearchesPersec | The number of search operations per second performed by LDAP clients. A lack of activity points to network problems. | ||
LDAPBindTime | LDAPBindTime | Milliseconds | This counter shows the time required for completion of the last LDAP binding, with a higher value pointing to either hardware or network performance problems. | |
DSDirectoryWritesPersec | DSDirectoryWritesPersec | Shows the number of directory writes per second. | ||
DRAPendingReplicationSynchronizations | DRAPendingReplicationSynchronizations | The number of directory synchronizations that are queued for this server that are not yet processed. This counter helps in determining replication backlog - the larger the number, the larger the backlog. This value should be low, with a higher value indicating that the hardware is not adequately servicing replication. | ||
ABClientSessions | ABClientSessions | AB Client Sessions is the number of connected Address Book client sessions. | ||
DSServerBindsPersec | DSServerBindsPersec | Shows the number of DC-to-DC binds per second that are serviced by this DC. | ||
DRAInboundBytesTotalPersec | DRAInboundBytesTotalPersec | Bytes per second | It is the sum of the number of bytes (per second) of uncompressed data (never compressed) and compressed data (after compression) received through replication. Lack of activity indicates that the network is slowing down replication. | |
DSNotifyQueueSize | DSNotifyQueueSize | The number of pending update notifications that have been queued, but not yet transmitted to clients. | ||
DRAInboundObjectsAppliedPersec | DRAInboundObjectsAppliedPersec | This counter excludes changes that are received but not applied (for example, when the update is already made) and also how many replication updates are occurring on the server as a result of changes generated on other servers. | ||
DRAInboundObjectUpdatesRemaininginPacket | DRAInboundObjectUpdatesRemaininginPacket | This counter tells you whether the monitored server is receiving changes, but is taking a long time applying them to the database. The value should be low, with a higher value indicating that the hardware is incapable of adequately servicing replication (warranting a server upgrade). | ||
DRAOutboundObjectsPersec | DRAOutboundObjectsPersec | The number of objects sent (per second) through outbound replication to replication partners. |
Agent G2 - Microsoft Active Directory 2019 DotNet v4- Performance Counters
Description
Monitors Microsoft Active Directory Performance Counters
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Active Directory 2019 DotNet v4- Performance Counters | DSNotifyQueueSize | DSNotifyQueueSize | The number of pending update notifications that have been queued, but not yet transmitted to clients. | |
DSServerBindsPersec | DSServerBindsPersec | Shows the number of DC-to-DC binds per second that are serviced by this DC. | ||
DRAOutboundBytesTotalPersec | DRAOutboundBytesTotalPersec | Bytes per second | It is the sum of the number of bytes of uncompressed data (never compressed) and compressed data (after compression) sent per second. Lack of activity indicates that the hardware or network is slowing down replication. | |
DRAPendingReplicationSynchronizations | DRAPendingReplicationSynchronizations | The number of directory synchronizations that are queued for this server that are not yet processed. This counter helps in determining replication backlog - the larger the number, the larger the backlog. This value should be low, with a higher value indicating that the hardware is not adequately servicing replication. | ||
DRAInboundObjectsAppliedPersec | DRAInboundObjectsAppliedPersec | This counter excludes changes that are received but not applied (for example, when the update is already made) and also how many replication updates are occurring on the server as a result of changes generated on other servers. | ||
DSDirectoryWritesPersec | DSDirectoryWritesPersec | Shows the number of directory writes per second. | ||
DRAInboundBytesTotalPersec | DRAInboundBytesTotalPersec | Bytes per second | It is the sum of the number of bytes (per second) of uncompressed data (never compressed) and compressed data (after compression) received through replication. Lack of activity indicates that the network is slowing down replication. | |
DSDirectoryReadsPersec | DSDirectoryReadsPersec | Shows the number of directory reads per second. | ||
LDAPBindTime | LDAPBindTime | Milliseconds | This counter shows the time required for completion of the last LDAP binding, with a higher value pointing to either hardware or network performance problems. | |
LDAPClientSessions | LDAPClientSessions | The number of sessions of connected LDAP clients. Lack of activity points to network problems. | ||
DRAOutboundObjectsPersec | DRAOutboundObjectsPersec | The number of objects sent (per second) through outbound replication to replication partners. | ||
LDAPWritesPersec | LDAPWritesPersec | Shows the rate at which LDAP clients perform write operations. | ||
DRAInboundObjectUpdatesRemaininginPacket | DRAInboundObjectUpdatesRemaininginPacket | This counter tells you whether the monitored server is receiving changes, but is taking a long time applying them to the database. The value should be low, with a higher value indicating that the hardware is incapable of adequately servicing replication (warranting a server upgrade). | ||
LDAPSearchesPersec | LDAPSearchesPersec | The number of search operations per second performed by LDAP clients. A lack of activity points to network problems. | ||
ABClientSessions | ABClientSessions | AB Client Sessions is the number of connected Address Book client sessions. | ||
DSClientBindsPersec | DSClientBindsPersec | Shows the number of Ntdsapi.dll binds per second serviced by this DC. | ||
LDAPActiveThreads | LDAPActiveThreads | LDAP Active Threads is the current number of threads in use by the LDAP subsystem of the local directory service. | ||
DRAInboundObjectsPersec | DRAInboundObjectsPersec | The number of objects received (per second) through inbound replication from replication partners. | ||
LDAPUDPoperationsPersec | LDAPUDPoperationsPersec | Shows the number of UDP operations that the LDAP server is processing per second. |
Agent G2 - Microsoft DotNet Performance Counters DotNet v4
Description
Monitors Microsoft DotNet Performance Counters
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft DotNet Performance Counters DotNet v4 | netclr.numberof.current.physicalthreads | NETCLR Number of Current Physical Threads | Shows the number of native OS threads created and owned by the CLR to act as underlying threads for .NET thread objects. This counter's value does not include the threads used by the CLR in its internal operations; it is a subset of the threads in the OS process. | |
aspnet.requests.current | ASPNET Requests Current | Shows the current number of requests, including those that are queued, currently executing, or waiting to be written to the client. Under the ASP.NET process model, when this counter exceeds the requestQueueLimit defined in the processModel configuration section, ASP.NET will begin rejecting requests. | ||
aspnet.request.wait.time | ASPNET Request Wait Time | MS | Shows the number of milliseconds the most recent request was waiting in the queue. | |
netclr.current.queue.length | NETCLR Current Queue Length | Shows the total number of threads currently waiting to acquire some managed lock in the application. This counter is not an average over time; it displays the last observed value. | ||
netclr.numberofexceps.thrown.persec | NETCLR Number of Exceps Thrown Per sec | Shows the number of exceptions thrown per second. These include both .NET exceptions and unmanaged exceptions that get converted into .NET exceptions e.g. null pointer reference exception in unmanaged code would get re-thrown in managed code as a .NET System.NullReferenceException; this counter includes both handled and unhandled exceptions. | ||
webservice.get.requests.persec | Web Service Get Requests Per sec | Shows the rate HTTP requests using the GET method are made. Get requests are the most common HTTP request. | ||
netclr.numberof.current.logicalthreads | NETCLR Number of Current Logical Threads | Shows the number of current .NET thread objects in the application. A .NET thread object is created either by new System.Threading.Thread or when an unmanaged thread enters the managed environment. This counter maintains the count of both running and stopped threads. This counter is not an average over time; it just displays the last observed value. | ||
webservice.current.connections | Web Service Current Connections | Shows the current number of connections established with the Web service. | ||
netclr.contention.rate.persec | NETCLR Contention Rate Per sec | Rate at which threads in the runtime attempt to acquire a managed lock unsuccessfully. Managed locks can be acquired in many ways; by the "lock" statement in C# or by calling System.Monitor. Enter or by using MethodImplOptions.Synchronized custom attribute. | ||
webservice.post.requests.persec | Web Service Post Requests Per sec | Shows the rate HTTP requests using the POST method are made. |
Agent G2 - Microsoft Exchange 2007 - Server Role - Client Access Servers (CAS) DotNet v4
Description
Applicable on Exchange Servers with the CAS role
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Exchange 2007 - Server Role - Client Access Servers DotNet v4 | SyncCommandsPersec | Sync Commands Per sec | Sync Commands/sec is the number of Sync commands that are processed per second. Clients use this command to synchronize items within a folder. | |
DownloadTasksCompleted | Download Tasks Completed | Download Tasks Completed is the number of OAB download tasks completed. | ||
OWARequestsPersec | OWA Requests Per sec | Requests/sec is the number of requests handled by Outlook Web App per second. | ||
DownloadTaskQueued | Download Task Queued | Download Task Queued is one(1) if task is queued for execution, otherwise zero(0). | ||
AverageSearchTime | Average Search Time | Average Search Time is the average time that elapsed while waiting for a search to complete. | ||
CurrentConnections | Current Connections | The number of active connections to the WWW service. | ||
AverageTimetoProcessaFreeBusyRequest | Average Time to Process a Free Busy Request | Average Time to Process a Free Busy Request is the average time to process a free busy request in seconds. One request may contain multiple mailboxes. Free busy responses do not have meeting suggestions. | ||
AverageResponseTime | Average Response Time | Average Response Time is the average time (in milliseconds) that elapsed between the beginning and end of an OEH or ASPX request. | ||
ActiveSyncRequestsPersec | ActiveSync Requests Per sec | Shows the number of HTTP requests that are received from the client via ASP.NET per second. | ||
AvailabilityRequestssec | Availability Requests per sec | Availability Requests per second is the number of requests serviced per second. The request can be only for free busy or include suggestions. One request may contain multiple mailboxes. |
Agent G2 - Microsoft Exchange 2007 - Server Role - Client Access Servers ASPNET (CAS ASPNET) DotNet v4
Description
Applicable on Exchange Servers with CAS ASPNET role
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Exchange 2007 - Server Role - Client Access Servers ASPNET DotNet v4 | RequestWaitTime | Request Wait Time | RPC Requests outstanding is the current number of outstanding RPC requests. | |
WorkerProcessRestarts | Worker Process Restarts | Number of times a worker process has restarted on the machine. | ||
RequestsCurrent | Requests Current | The current number of requests both executing and queued. | ||
ApplicationRestarts | Application Restarts | Number of times the application has been restarted during the web server lifetime. | ||
RequestsInApplicationQueue | Requests In Application Queue | The current number of requests, including those that are queued, currently executing, or waiting to be written to the client. Under the ASP.NET process model, when this counter exceeds the requestQueueLimit defined in the processModel configuration section, ASP.NET will begin rejecting requests. |
Agent G2 - Microsoft Exchange 2007 - Server Role - Edge Transport Server (ETS) DotNet v4
Description
Applicable on Exchange Servers with the ETS role
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Exchange 2007 - Server Role - Edge Transport Server DotNet v4 | LogThreadsWaiting | Log Threads Waiting | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of threads waiting for their data to be written to the log to complete an update of the database. If this number is too high, the log may be a bottleneck. This value should be less than 10 on average. Regular spikes concurrent with log record stall spikes indicate that the transaction log disks are a bottleneck. If the value for log threads waiting is more than the spindles available for the logs, there is a bottleneck on the log disks. | |
IODatabaseReadsPersec | IO Database Reads Per sec | I/O Database Reads/sec is the rate of database read operations completed. | ||
RetryRemoteDeliveryQueueLength | Retry Remote Delivery Queue Length | Retry Remote Delivery Queue Length is the number of messages in retry in the remote delivery queues. | ||
SubmissionQueueLength | Submission Queue Length | The number of items in the submission queue when sample was taken. | ||
LargestDeliveryQueueLength | Largest Delivery Queue Length | The number of items in the largest delivery queue. | ||
AggregateDeliveryQueueLengthAllQueues | Aggregate Delivery Queue Length All Queues | The number of messages queued for aggregate delivery. | ||
Versionbucketsallocated | Version Buckets Allocated | It provides the total number of version buckets allocated. | ||
ActiveRemoteDeliveryQueueLength | Active Remote Delivery Queue Length | The number of messages queued for remote delivery. | ||
LogRecordStallsPersec | Log Record Stalls Per sec | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of log records that cannot be added to the log buffers per second because the log buffers are full. If this counter is non-zero most of the time, the log buffer size may be a bottleneck. If I/O log write latencies are high, check for RAID5 or sync replication on log devices. The average value should be below 10 per second. Spikes (maximum values) should not be higher than 100 per second. | ||
PoisonQueueLength | Poison Queue Length | The number of messages in the poison message queue. | ||
IODatabaseWritesPersec | IO Database Writes Per sec | I/O Database Writes/sec is the rate of database write operations completed. | ||
DatabaseCachePercentHit | Database Cache Percent Hit | % | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the percentage of database file page requests that were fulfilled by the database cache without causing a file operation. If this percentage is too low, the database cache size may be too small. This value should be over 90% for companies with majority online mode clients. It should be over 99% for companies with majority cached mode clients. If the hit ratio is less than these numbers, the database cache may be insufficient. | |
UnreachableQueueLength | Unreachable Queue Length | The number of messages in Unreachable Queue when sample was taken. |
Agent G2 - Microsoft Exchange 2007 - Server Role - HUB Transport Servers (HTS) DotNet v4
Description
Applicable on Exchange Servers with the HTS role
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Exchange 2007 - Server Role - HUB Transport Servers DotNet v4 | ActiveNonSmtpDeliveryQueueLength | Active Non-SMTP Delivery Queue Length | The number of messages queued for delivery to a Non-SMTP transport. The value format is a 4-byte integer. | |
RetryNonSmtpDeliveryQueueLength | Retry Non-SMTP Delivery Queue Length | Assures the number of messages currently in the retry non-SMTP delivery queue. Messages in this queue are in a retry state because an issue prevented their delivery. If the issue is transient, a subsequent reattempt to send the message may be successful. A queue length < 200 can be considered as Ok. | ||
ActiveMailboxDeliveryQueueLength | Active Mailbox Delivery Queue Length | The number of messages queued for delivery to an active mailbox. The value format is a 4-byte integer. | ||
MessagesCompletedDeliveryPerSecond | Messages Completed Delivery Per Second | This monitor is useful in assessing the efficiency and efficacy of the current design. They also provide insight into the interaction between different transport components, including the information store interface. It shows the number of messages that are delivered per second. | ||
RetryMailboxDeliveryQueueLength | Retry Mailbox Delivery Queue Length | Measures the number of messages currently in the retry mailbox delivery queue. Messages in this queue are in a retry state because an issue prevented their delivery. If the issue is transient, a subsequent attempt to send the message may be successful. A queue length < 200 can be considered as OK. | ||
MessagesQueuedforDeliveryPerSecond | Messages Queued for Delivery Per Second | This monitor is useful in assessing the efficiency and efficacy of the current design. They also provide insight into the interaction between different transport components, including the information store interface. It shows the number of messages that have been queued for delivery per second. | ||
MessagesSubmittedPerSecond | Messages Submitted Per Second | This monitor is useful in assessing the efficiency and efficacy of the current design. They also provide insight into the interaction between different transport components, including the information store interface. It shows the number of messages that have been queued in the Submission queue per second. | ||
SubmissionQueueLength | Submission Queue Length | The number of items in the submission queue when the sample was taken. | ||
IODatabaseWritesPersec | IO Database Writes Per sec | I/O Database Writes/sec is the rate of database write operations completed. | ||
MessagesQueuedforDeliveryPerSecond | Messages Queued for Delivery Per Second | This monitor is useful in assessing the efficiency and efficacy of the current design. They also provide insight into the interaction between different transport components, including the information store interface. It shows the number of messages that have been queued for delivery per second. | ||
LogRecordStallsPersec | Log Record Stalls Per sec | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of log records that cannot be added to the log buffers per second because the log buffers are full. If this counter is non-zero most of the time, the log buffer size may be a bottleneck. If I/O log write latencies are high, check for RAID5 or sync replication on log devices. The average value should be below 10 per second. Spikes (maximum values) should not be higher than 100 per second. | ||
AggregateDeliveryQueueLengthAllQueues | Aggregate Delivery Queue Length All Queues | The number of messages queued for aggregate delivery. | ||
IODatabaseReadsPersec | IO Database Reads Per sec | I/O Database Reads/sec is the rate of database read operations completed. | ||
PoisonQueueLength | Poison Queue Length | The number of messages in the poison message queue. | ||
LargestDeliveryQueueLength | Largest Delivery Queue Length | The number of items in the largest delivery queue. | ||
LogThreadsWaiting | Log Threads Waiting | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of threads waiting for their data to be written to the log to complete an update of the database. If this number is too high, the log may be a bottleneck. This value should be less than 10 on average. Regular spikes concurrent with log record stall spikes indicate that the transaction log disks are a bottleneck. If the value for log threads waiting is more than the spindles available for the logs, there is a bottleneck on the log disks. | ||
ActiveRemoteDeliveryQueueLength | Active Remote Delivery Queue Length | The number of messages queued for remote delivery. | ||
UnreachableQueueLength | Unreachable Queue Length | The number of messages in Unreachable Queue when the sample was taken. |
Agent G2 - Microsoft Exchange 2007 - Server Role - Mailbox Servers (MBS) DotNet v4
Description
Applicable on Exchange Servers with the MBS role
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Exchange 2007 - Server Role - Mailbox Servers DotNet v4 | IODatabaseWritesAverageLatency | IO Database Writes Average Latency | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the average length of time, in milliseconds, per database write operation. | |
DatabaseCachePercentHit | Database Cache Percent Hit | % | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the percentage of database file page requests that were fulfilled by the database cache without causing a file operation. If this percentage is too low, the database cache size may be too small. This value should be over 90% for companies with a majority online mode clients. It should be over 99% for companies with majority cached mode clients. If the hit ratio is less than these numbers, the database cache may be insufficient. | |
RPCLatencyaveragemsec | RPC Latency Average msec | Exchange communicates with Hub Transport servers via RPC. This counter is useful in isolating and determining issues involving the interface between the Microsoft Exchange Information Store service on the Mailbox server and Hub Transport servers. This monitor shows the average latency, in milliseconds, of RPC requests. The average is calculated over all RPCs since exrpc32 was loaded. The value should be less than 100 ms at all times. | ||
MessagesDeliveredPersec | Messages Delivered Per sec | Exchange responds to client requests and attempts to fulfill them as quickly and efficiently as possible. This monitor shows the rate that messages are delivered to all recipients. It indicates the current message delivery rate to the store. | ||
MessagesQueuedForSubmissionISMailbox | Messages Queued For Submission IS Mailbox | Mailbox servers depend on Hub Transport servers for message delivery. This monitor shows the current number of submitted messages that are not yet processed by the transport layer. This value should be below 50 at all times. A higher value for more than 15 minutes may indicate that there are connectivity issues to the transport servers or that backpressure is occurring. | ||
DirectoryAccessLDAPReadsPersec | Directory Access LDAP Reads Per sec | Exchange responds to client requests and attempts to fulfill them as quickly and efficiently as possible. This monitor shows the current rate that the Lightweight Directory Access Protocol (LDAP) reads occur while processing requests for the client. | ||
IODatabaseReadsAverageLatency | IO Database Reads Average Latency | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the average length of time, in milliseconds, per database read operation. | ||
RPCClientBackoffPersec | RPC Client Backoff Per sec | Exchange throttles RPC clients to prevent individual clients from overusing server resources. This monitor shows the rate that the server notifies the client to back off. It indicates the rate at which client backoffs are occurring. Higher values may indicate that the server may be incurring a higher load resulting in an increase in overall averaged RPC latencies, causing client throttling to occur. This can also occur when certain client user actions are being performed. Depending on what the client is doing and the rate at which RPC operations are occurring, it may be normal to see backoffs occurring. | ||
RPCNumofSlowPackets | RPC Number of Slow Packets | When you use Microsoft Office Outlook in MAPI mode, Outlook executes client operations as RPCs between the client and the server. This monitor indicates the number of RPC packets in the past 1,024 packets that have latencies longer than 2 seconds. This value should be less than 1 on average, and should be less than 3 at all times. | ||
ClientRPCsFailedServerTooBusyPersec | Client RPCs Failed Server Too Busy Per sec | Exchange throttles RPC clients to prevent individual clients from overusing server resources. This monitor shows the client-reported rate of failed RPCs (since the store was started) due to the Server Too Busy ROC error. This value should be 0 at all times. Higher values may indicate RPC threads are exhausted or client throttling is occurring for clients running versions of Outlook earlier than Microsoft Office Outlook 2007. | ||
DirectoryAccessLDAPSearchesPersec | Directory Access LDAP Searches Per sec | Exchange responds to client requests and attempts to fulfill them as quickly and efficiently as possible. This monitor shows the current rate that the LDAP searches occur while processing requests for the client. | ||
LogRecordStallsPersec | Log Record Stalls Per sec | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of log records that cannot be added to the log buffers per second because the log buffers are full. If this counter is non-zero most of the time, the log buffer size may be a bottleneck. If I/O log write latencies are high, check for RAID5 or sync replication on log devices. The average value should be below 10 per second. Spikes (maximum values) should not be higher than 100 per second. | ||
ReplicationReceiveQueueSize | Replication Receive Queue Size | This monitor shows the number of replication messages waiting to be processed. This value should be less than 100 at all times. This value should return to a minimum value between replication intervals. | ||
MessagesQueuedForSubmissionISPublic | Messages Queued For Submission IS Public | Mailbox servers depend on Hub Transport servers for message delivery. This monitor shows the current number of submitted messages that are not yet processed by the transport layer. This value should be below 20 at all times. | ||
UserCount | User Count | This monitor shows the number of users connected to the information store. It can be used to determine the current user load. | ||
DatabaseCacheSizeMB | Database Cache Size MB | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the amount of system memory, in megabytes, used by the database cache manager to hold commonly used information from the database files to prevent file operations. If the database cache size seems too small for optimal performance and there is little available memory on the system (check the value of Memory/Available Bytes), adding more memory to the system may increase performance. If there is ample memory on the system and the database cache size is not growing beyond a certain point, the database cache size may be capped at an artificially low limit. Increasing this limit may increase performance. | ||
RPCRequestsoutstanding | RPC Requests outstanding | RPC Requests outstanding is the current number of outstanding RPC requests. | ||
RPCAveragedLatency | RPC Averaged Latency | Milliseconds | When you use Microsoft Office Outlook in MAPI mode, Outlook executes client operations as RPCs between the client and the server. This monitor indicates the RPC latency, in milliseconds, averaged for all operations in the last 1,024 packets. This value should not be higher than 25 ms on average. To determine if certain protocols are causing overall RPC latencies, monitor MSExchangeIS Client (*)RPC Average Latency to separate latencies based on client protocol. Cross-reference MSExchangeISRPC Client Backoff/sec to ensure higher latencies are not causing client throttling. | |
RPCRequests | RPC Requests | RPC Operations/sec is the rate at which RPC operations occur. | ||
RPCOperationsPersec | RPC Operations Per sec | When you use Microsoft Office Outlook in MAPI mode, Outlook executes client operations as RPCs between the client and the server. This monitor indicates the current number of RPC operations that are occurring per second. Should closely correspond to historical baselines. Values much higher than expected indicate that the workload has changed, while values much lower than expected indicate a bottleneck preventing client requests from reaching the server. | ||
LogThreadsWaiting | Log Threads Waiting | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of threads waiting for their data to be written to the log to complete an update of the database. If this number is too high, the log may be a bottleneck. This value should be less than 10 on average. Regular spikes concurrent with log record stall spikes indicate that the transaction log disks are a bottleneck. If the value for log threads waiting is more than the spindles available for the logs, there is a bottleneck on the log disks. | ||
DatabasePageFaultStallsPersec | Database Page Fault Stalls Per sec | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the rate that database file page requests require of the database cache manager to allocate a new page from the database cache. This should be 0 at all times. If this value is non-zero, this indicates that the database is not able to flush dirty pages to the database file fast enough to make pages free for new page allocations. |
Agent G2 - Microsoft Exchange 2007 - Server Role - Mailbox Servers replication (MBSR) DotNet v4
Description
Applicable on Exchange Servers with the MBSR role
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Exchange 2007 - Server Role - Mailbox Servers replication DotNet v4 | ReplayQueueLength | Replay Queue Length | This monitor indicates issues involving the replication engine and replication partners. These issues can be local or remote. This monitor shows the number of transaction log files waiting to be replayed into the passive copy. It indicates the current replay queue length. Higher values cause longer store mount times when a handoff, failover, or activation is performed. | |
CopyQueueLength | Copy Queue Length | This monitor indicates issues involving the replication engine and replication partners. These issues can be local or remote. This monitor shows the number of transaction log files waiting to be copied to the passive copy log file folder. A copy is not considered complete until it has been checked for corruption. This value should be less than 10 at all times for Continuous Cluster Replication(CCR). It should be less than 1 at all times for local continuous replication (LCR). |
Agent G2 - Microsoft Exchange 2007 - Server Role - Unified Messaging servers (UMS) DotNet v4
Description
Applicable on Exchange Servers with the UMS role
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Exchange 2007 - Server Role - Unified Messaging servers DotNet v4 | OperationsoverSixSeconds | OperationsoverSixSeconds | NULL | Shows the number of all Unified Messaging operations that took more than six seconds to complete. This is the time during which a caller was waiting for Unified Messaging to respond. |
MailboxServerAccessFailures | MailboxServerAccessFailures | NULL | Shows the number of times the system did not access a Mailbox server. This value should be 0 at all times. A non-zero value indicates that Unified Messaging is having problems with MAPI connectivity to mbx servers. | |
CallsDisconnectedbyCallersDuringUMAudioHourglass | CallsDisconnectedbyCallersDuringUMAudioHourglass | NULL | Shows the number of calls during which the caller disconnected while Unified Messaging was playing the audio hourglass tones. This value should be 0 at all times. A non-zero value suggests excessive latency between a Unified Messaging server and targeted domain controller. |
Agent G2 - Microsoft Exchange 2007 General DotNet v4
Description
Monitors Microsoft Exchange 2007 General Performance Counters
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Exchange 2007 General DotNet v4 | ConnectionAttemptsPersec | Connection Attempts Per Second | The rate, in seconds, at which connections to the WWW service have been attempted since the service started. | |
NumberBytesinallHeaps | Number Bytes in all Heaps | Shows the sum of four other counters: Gen 0 Heap Size, Gen 1 Heap Size, Gen 2 Heap Size, and the Large Object Heap Size. This counter indicates the current memory allocated in bytes on the GC Heaps. | ||
NumberofExcepsThrownPersec | Number of Exceptions Thrown Per Second | Displays the number of exceptions thrown per second. These include both .NET exceptions and unmanaged exceptions that get converted into .NET exceptions. | ||
PercentTimeinGC | Percent Time in GC | % | % Time in GC is the percentage of elapsed time that was spent in performing a garbage collection (GC) since the last GC cycle. This counter is usually an indicator of the work done by the Garbage Collector on behalf of the application to collect and compact memory. This counter is updated only at the end of every GC and the counter value reflects the last observed value; its not an average. |
Agent G2 - Microsoft Exchange 2010 AD Access DC DotNet v4
Description
Monitor Microsoft Exchange 2010 AD Access DC performance counters
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Exchange 2010 AD Access DC DotNet v4 | LDAPReadTime | LDAP Read Time | Shows the time (in ms) to send an LDAP read request and receive a response. | |
LDAPSearchTime | LDAP Search Time | Milliseconds | Shows the time (in ms) to send an LDAP search request and receive a response. Should be below 50 ms on average. Spikes (maximum values) shouldn't be higher than 100 ms. |
Agent G2 - Microsoft Exchange 2010 DotNet v4
Description
Monitor Microsoft Exchange 2010 performance counters
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Exchange 2010 DotNet v4 | ReceiveQueueSizeMailBox | Receive Queue Size Mailbox | Maximum allowed number of messages in the mailbox receive queue. | |
LDAPSearchesTimedOut | LDAP Searches Timed Out | LDAP Searches timed out per minute is the number of LDAP searches returned LDAP_TIMEOUT during the last minute. | ||
RPCAveragedLatency | RPC Averaged Latency | Milliseconds | When you use Microsoft Office Outlook in MAPI mode, Outlook executes client operations as RPCs between the client and the server. This monitor indicates the RPC latency, in milliseconds, averaged for all operations in the last 1,024 packets. This value should not be higher than 25 ms on average. To determine if certain protocols are causing overall RPC latencies, monitor MSExchangeIS Client (*)RPC Average Latency to separate latencies based on client protocol. Cross-reference MSExchangeISRPC Client Backoff/sec to ensure higher latencies are not causing client throttling. | |
ActiveNonSmtpDeliveryQueueLength | Active Non-SMTP Delivery Queue Length | The number of messages queued for delivery to a Non-SMTP transport. The value format is a 4-byte integer. | ||
MessagesQueuedForSubmissionMailBox | Messages Queued For Submission Mailbox | Messages Queued For Submission is the current number of submitted messages which are not yet processed by transport. | ||
PollingDelay | Polling Delay | Seconds | Polling delay is the latency between when the most recent Mapi Event was polled and when the event was created in seconds. | |
LDAPSearchTime | LDAP Search Time | Milliseconds | Shows the time (in ms) to send an LDAP search request and receive a response. Should be below 50 ms on average. Spikes (maximum values) shouldn't be higher than 100 ms. | |
ClientLogonsMailBox | Client Logons Mailbox | The number of client logons (including system processes). The average number of logons per client depends on the type and version of the client. | ||
MessagesSentPersecMailBox | Messages Sent Persec Mailbox | Messages Sent/sec is the rate that messages are sent to the transport. | ||
Versionbucketsallocated | Version Buckets Allocated | It provides the total number of version buckets allocated. | ||
SubmissionQueueLength | Submission Queue Length | The number of items in the submission queue when sample was taken. | ||
AverageDeliveryTimePublicFolder | Average Delivery Time Public Folder | Milliseconds | Average Delivery Time is the average time in milliseconds between the submission of a message to the public store and the delivery to all local recipients (recipients on the same server) for the last 10 messages. | |
UnreachableQueueLength | Unreachable Queue Length | The number of messages in Unreachable Queue when sample was taken. | ||
MessagesDeliveredPersecMailBox | Messages Delivered Persec Mailbox | Messages Delivered/sec is the rate that messages are delivered to all recipients. | ||
AverageDeliveryTimeMailBox | Average Delivery Time Mailbox | Milliseconds | Average Delivery Time is the average time in milliseconds between the submission of a message to the mailbox store and the delivery to all local recipients (recipients on the same server) for the last 10 messages. | |
RPCRequests | RPC Requests | RPC Operations/sec is the rate at which RPC operations occur. | ||
MessagesDeliveredPersecPublicFolder | Messages Delivered Persec Public Folder | Messages Delivered Persec Public Folder | ||
ClientLogonsPublicFolder | Client Logons Public Folder | Client Logons is the number of clients (including system processes) currently logged on. | ||
RetryRemoteDeliveryQueueLength | Retry Remote Delivery Queue Length | Retry Remote Delivery Queue Length is the number of messages in retry in the remote delivery queues. | ||
LogGenerationCheckpointDepth | Log Generation Checkpoint Depth | Log Generation Checkpoint Depth represents the amount of work, in count of log files, that will need to be redone or undone to the database file(s) if the process crashes. | ||
LargestDeliveryQueueLength | Largest Delivery Queue Length | The number of items in the largest delivery queue. | ||
IODatabaseReadsPersec | I/O Database Reads Persec | I/O Database Reads/sec is the rate of database read operations completed. | ||
ActiveMailboxDeliveryQueueLength | Active Mailbox Delivery Queue Length | The number of messages queued for delivery to an active mailbox. The value format is a 4-byte integer. | ||
SendQueueSizeMailbox | Send Queue Size Mailbox | Indicates the number of messages in the mailbox store's send queue. | ||
PercentageofFailedEventDispatchers | Percentage of Failed Event Dispatchers | % | It is the percentage of Event Dispatchers that are in failure mode. | |
PoisonQueueLength | Poison Queue Length | The number of messages in the poison message queue. | ||
LogBytesWritePersec | Log Bytes Write Persec | Bytes per second | Log Bytes Write per second is the rate bytes are written to the log. | |
RetryMailboxDeliveryQueueLength | Retry Mailbox Delivery Queue Length | Measures the number of messages currently in the retry mailbox delivery queue. Messages in this queue are in a retry state because an issue prevented their delivery. If the issue is transient, a subsequent attempt to send the message may be successful. A queue length < 200 can be considered as OK. | ||
SendQueueSizePublicFolder | Send Queue Size Public Folder | Number of messages in a public folders send queue. The value should be less than 500. | ||
RetryNonSmtpDeliveryQueueLength | Retry Non-SMTP Delivery Queue Length | Assures the number of messages currently in the retry non-SMTP delivery queue. Messages in this queue are in a retry state because an issue prevented their delivery. If the issue is transient, a subsequent reattempt to send the message may be successful. A queue length < 200 can be considered as Ok. | ||
ReceiveQueueSizePublicFolder | Receive Queue Size Public Folder | Indicates the number of public folder replication messages waiting to be processed. | ||
MessagesSentPersecPublicFolder | Messages Sent Persec Public Folder | Messages Sent/sec is the rate that messages are sent to the transport. | ||
MessagesQueuedForSubmissionPublicFolder | Messages Queued For Submission Public Folder | Messages Queued For Submission is the current number of submitted messages which are not yet processed by transport. | ||
MessageRecipientsDeliveredPersecMailBox | Message Recipients Delivered Persec Mailbox | Shows the rate at which messages are delivered to all recipients. Indicates current message delivery rate to the store. | ||
AggregateDeliveryQueueLengthAllQueues | Aggregate Delivery Queue Length All Queues | The number of messages queued for aggregate delivery. | ||
ActiveRemoteDeliveryQueueLength | Active Remote Delivery Queue Length | The number of messages queued for remote delivery. | ||
ClientRPCsSucceded | Client RPCs Succeeded | The client-reported total number of successful RPCs (since the store was started). | ||
ClientLatency10secRPC | Client Latency 10 sec RPC | Milliseconds | The client-reported number of successful RPCs with latencies > 10 seconds. | |
ActiveClientLogonsPublicFolder | Active Client Logons Public Folder | Active Client Logons is the number of clients that performed any action within the last ten minute time interval. | ||
IODatabaseWritesPersec | I/O Database Writes Persec | I/O Database Writes/sec is the rate of database write operations completed. | ||
ActiveClientLogonsMailBox | Active Client Logons Mailbox | The number of logons that have been active within the last ten minute time interval. | ||
MessageRecipientsDeliveredPersecPublicFolder | Message Recipients Delivered Persec Public Folder | Message Recipients Delivered/sec is the rate that recipients receive messages. | ||
LongRunningLDAPOperations | Long Running LDAP Operations | Shows the number of LDAP operations on this domain controller that took longer than the specified threshold per minute. (Default threshold is 15 seconds.). Should be less than 50 at all times. |
Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Client Access Servers (CAS)
Description
Applicable on Exchange Servers with the Client Access servers (CAS) role
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Client Access Servers | DownloadTasksCompleted | Download Tasks Completed | NULL | Download Tasks Completed is the number of OAB download tasks completed |
ActiveSyncRequestsPersec | ActiveSync Requests Per Second | NULL | Shows the number of HTTP requests that are received from the client via ASP.NET per second | |
AverageSearchTime | Average Search Time | NULL | Average Search Time is the average time that elapsed while waiting for a search to complete | |
CurrentConnections | Current Connections | NULL | The number of active connections to the WWW service. | |
DownloadTaskQueued | Download Task Queued | NULL | Download Task Queued is one(1) if task is queued for execution, otherwise zero(0) | |
SyncCommandsPersec | Sync Commands Per Second | NULL | Sync Commands/sec is the number of Sync commands that are processed per second. Clients use this command to synchronize items within a folder | |
AvailabilityRequestssec | Availability Requests Per Second | NULL | Availability Requests per second is the number of requests serviced per second. The request can be only for free busy or include suggestions. One request may contain multiple mailboxes | |
OWARequestsPersec | OWA Requests Per Second | NULL | Requests/sec is the number of requests handled by Outlook Web App per second | |
AverageTimetoProcessaFreeBusyRequest | Average Time to Process a Free Busy Request | NULL | Average Time to Process a Free Busy Request is the average time to process a free busy request in seconds. One request may contain multiple mailboxes. Free busy responses do not have meeting suggestions | |
AverageResponseTime | Average Response Time | NULL | Average Response Time is the average time (in milliseconds) that elapsed between the beginning and end of an OEH or ASPX request |
Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Client Access Servers ASPNET (CAS ASPNET)
Description
Applicable on Exchange Servers with the Client Access servers ASPNET (CAS) role
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Client Access Servers ASPNET | ApplicationRestarts | Application Restarts | NULL | Number of times the application has been restarted during the web server lifetime |
RequestWaitTime | Request Wait Time | NULL | RPC Requests outstanding is the current number of outstanding RPC requests. | |
RequestsInApplicationQueue | Requests In Application Queue | NULL | The current number of requests, including those that are queued, currently executing, or waiting to be written to the client. Under the ASP.NET process model, when this counter exceeds the requestQueueLimit defined in the processModel configuration section, ASP.NET will begin rejecting requests. | |
WorkerProcessRestarts | Worker Process Restarts | NULL | Number of times a worker process has restarted on the machine. | |
RequestsCurrent | Requests Current | NULL | The current number of requests both executing and queued |
Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Edge Transport Server (ETS)
Description
Applicable on Exchange Servers with the Edge Transport servers (ETS) role
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Edge Transport Server | LargestDeliveryQueueLength | Largest Delivery Queue Length | NULL | The number of items in the largest delivery queue. |
IODatabaseReadsPersec | I/O Database Reads Per Second | NULL | I/O Database Reads/sec is the rate of database read operations completed. | |
UnreachableQueueLength | Unreachable Queue Length | NULL | The number of messages in Unreachable Queue when sample was taken. | |
SubmissionQueueLength | Submission Queue Length | NULL | The number of items in the submission queue when sample was taken. | |
IODatabaseWritesPersec | I/O Database Writes Per Second | NULL | I/O Database Writes/sec is the rate of database write operations completed. | |
LogThreadsWaiting | Log Threads Waiting | NULL | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of threads waiting for their data to be written to the log to complete an update of the database. If this number is too high, the log may be a bottleneck. This value should be less than 10 on average. Regular spikes concurrent with log record stall spikes indicate that the transaction log disks are a bottleneck. If the value for log threads waiting is more than the spindles available for the logs, there is a bottleneck on the log disks. | |
DatabaseCachePercentHit | Database Cache Percent Hit | % | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the percentage of database file page requests that were fulfilled by the database cache without causing a file operation. If this percentage is too low, the database cache size may be too small. This value should be over 90% for companies with majority online mode clients. It should be over 99% for companies with majority cached mode clients. If the hit ratio is less than these numbers, the database cache may be insufficient. | |
LogRecordStallsPersec | Log Record Stalls Per Second | NULL | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of log records that cannot be added to the log buffers per second because the log buffers are full. If this counter is non-zero most of the time, the log buffer size may be a bottleneck. If I/O log write latencies are high, check for RAID5 or sync replication on log devices. The average value should be below 10 per second. Spikes (maximum values) should not be higher than 100 per second. | |
ActiveRemoteDeliveryQueueLength | Active Remote Delivery Queue Length | NULL | The number of messages queued for remote delivery. | |
Versionbucketsallocated | Version Buckets Allocated | NULL | It provides the total number of version buckets allocated. | |
AggregateDeliveryQueueLengthAllQueues | Aggregate Delivery Queue Length All Queues | NULL | The number of messages queued for aggregate delivery. | |
RetryRemoteDeliveryQueueLength | Retry Remote Delivery Queue Length | NULL | Retry Remote Delivery Queue Length is the number of messages in retry in the remote delivery queues. | |
PoisonQueueLength | Poison Queue Length | NULL | The number of messages in the poison message queue. |
Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - HUB Transport Servers (HTS)
Description
Applicable on Exchange Servers with the HUB Transport servers (HTS) role
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - HUB Transport Servers | IODatabaseReadsPersec | I/O Database Reads Per Second | NULL | I/O Database Reads/sec is the rate of database read operations completed. |
MessagesSubmittedPerSecond | Messages Submitted Per Second | NULL | This monitor is useful in assessing the efficiency and efficacy of the current design. They also provide insight into the interaction between different transport components, including the information store interface. It shows the number of messages that have been queued in the Submission queue per second. | |
SubmissionQueueLength | Submission Queue Length | NULL | The number of items in the submission queue when sample was taken. | |
UnreachableQueueLength | Unreachable Queue Length | NULL | The number of messages in Unreachable Queue when sample was taken. | |
ActiveMailboxDeliveryQueueLength | Active Mailbox Delivery Queue Length | NULL | The number of messages queued for delivery to an active mailbox. The value format is a 4-byte integer. | |
LogThreadsWaiting | Log Threads Waiting | NULL | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of threads waiting for their data to be written to the log to complete an update of the database. If this number is too high, the log may be a bottleneck. This value should be less than 10 on average. Regular spikes concurrent with log record stall spikes indicate that the transaction log disks are a bottleneck. If the value for log threads waiting is more than the spindles available for the logs, there is a bottleneck on the log disks. | |
MessagesCompletedDeliveryPerSecond | Messages Completed Delivery Per Second | NULL | This monitor is useful in assessing the efficiency and efficacy of the current design. They also provide insight into the interaction between different transport components, including the information store interface. It shows the number of messages that are delivered per second. | |
AggregateDeliveryQueueLengthAllQueues | Aggregate Delivery Queue Length All Queues | NULL | The number of messages queued for aggregate delivery. | |
RetryMailboxDeliveryQueueLength | Retry Mailbox Delivery Queue Length | NULL | Measures the number of messages currently in the retry mailbox delivery queue. Messages in this queue are in a retry state because an issue prevented their delivery. If the issue is transient, a subsequent attempt to send the message may be successful. A queue length < 200 can be considered as OK. | |
RetryNonSmtpDeliveryQueueLength | Retry Non-SMTP Delivery Queue Length | NULL | Assures the number of messages currently in the retry non-SMTP delivery queue. Messages in this queue are in a retry state because an issue prevented their delivery. If the issue is transient, a subsequent reattempt to send the message may be successful. A queue length < 200 can be considered as Ok. | |
Versionbucketsallocated | Version Buckets Allocated | NULL | It provides the total number of version buckets allocated. | |
ActiveNonSmtpDeliveryQueueLength | Active Non-SMTP Delivery Queue Length | NULL | The number of messages queued for delivery to a Non-SMTP transport. The value format is a 4-byte integer. | |
IODatabaseWritesPersec | I/O Database Writes Per Second | NULL | I/O Database Writes/sec is the rate of database write operations completed. | |
PoisonQueueLength | Poison Queue Length | NULL | The number of messages in the poison message queue. | |
MessagesQueuedforDeliveryPerSecond | Messages Queued for Delivery Per Second | NULL | This monitor is useful in assessing the efficiency and efficacy of the current design. They also provide insight into the interaction between different transport components, including the information store interface. It shows the number of messages that have been queued for delivery per second. | |
LargestDeliveryQueueLength | Largest Delivery Queue Length | NULL | The number of items in the largest delivery queue. | |
ActiveRemoteDeliveryQueueLength | Active Remote Delivery Queue Length | NULL | The number of messages queued for remote delivery. | |
LogRecordStallsPersec | Log Record Stalls Per Second | NULL | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of log records that cannot be added to the log buffers per second because the log buffers are full. If this counter is non-zero most of the time, the log buffer size may be a bottleneck. If I/O log write latencies are high, check for RAID5 or sync replication on log devices. The average value should be below 10 per second. Spikes (maximum values) should not be higher than 100 per second. |
Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Mailbox Servers (MBS)
Description
Applicable on Exchange Servers with the Mailbox Servers (MBS) role
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Mailbox Servers | RPCLatencyaveragemsec | RPC Latency Average (ms) | NULL | Exchange communicates with Hub Transport servers via RPC. This counter is useful in isolating and determining issues involving the interface between the Microsoft Exchange Information Store service on the Mailbox server and Hub Transport servers. This monitor shows the average latency, in milliseconds, of RPC requests. The average is calculated over all RPCs since exrpc32 was loaded. The value should be less than 100 ms at all times. |
UserCount | User Count | NULL | This monitor shows the number of users connected to the information store. It can be used to determine current user load. | |
ReplicationReceiveQueueSize | Replication Receive Queue Size | NULL | This monitor shows the number of replication messages waiting to be processed. This value should be less than 100 at all times. This value should return to a minimum value between replication intervals. | |
LogThreadsWaiting | Log Threads Waiting | NULL | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of threads waiting for their data to be written to the log to complete an update of the database. If this number is too high, the log may be a bottleneck. This value should be less than 10 on average. Regular spikes concurrent with log record stall spikes indicate that the transaction log disks are a bottleneck. If the value for log threads waiting is more than the spindles available for the logs, there is a bottleneck on the log disks. | |
RPCAveragedLatency | RPC Averaged Latency | Milliseconds | When you use Microsoft Office Outlook in MAPI mode, Outlook executes client operations as RPCs between the client and the server. This monitor indicates the RPC latency, in milliseconds, averaged for all operations in the last 1,024 packets. This value should not be higher than 25 ms on average. To determine if certain protocols are causing overall RPC latencies, monitor MSExchangeIS Client (*)RPC Average Latency to separate latencies based on client protocol. Cross-reference MSExchangeISRPC Client Backoff/sec to ensure higher latencies are not causing client throttling. | |
RPCRequestsoutstanding | RPC Requests Outstanding | NULL | RPC Requests outstanding is the current number of outstanding RPC requests | |
MessagesQueuedForSubmissionISMailbox | Messages Queued For Submission (IS Mailbox) | NULL | Mailbox servers depend on Hub Transport servers for message delivery. This monitor shows the current number of submitted messages that are not yet processed by the transport layer. This value should be below 50 at all times. A higher value for more than 15 minutes may indicate that there are connectivity issues to the transport servers or that backpressure is occurring. | |
RPCNumofSlowPackets | RPC Number of Slow Packets | NULL | When you use Microsoft Office Outlook in MAPI mode, Outlook executes client operations as RPCs between the client and the server. This monitor indicates the number of RPC packets in the past 1,024 packets that have latencies longer than 2 seconds. This value should be less than 1 on average, and should be less than 3 at all times. | |
RPCRequests | RPC Requests | NULL | RPC Operations/sec is the rate at which RPC operations occur. | |
DatabaseCacheSizeMB | Database Cache Size (MB) | NULL | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the amount of system memory, in megabytes, used by the database cache manager to hold commonly used information from the database files to prevent file operations. If the database cache size seems too small for optimal performance and there is little available memory on the system (check the value of Memory/Available Bytes), adding more memory to the system may increase performance. If there is ample memory on the system and the database cache size is not growing beyond a certain point, the database cache size may be capped at an artificially low limit. Increasing this limit may increase performance. | |
DirectoryAccessLDAPSearchesPersec | Directory Access LDAP Searches Persec | NULL | Exchange responds to client requests and attempts to fulfill them as quickly and efficiently as possible. This monitor shows the current rate that the LDAP searches occur while processing requests for the client. | |
RPCOperationsPersec | RPC Operations Persec | NULL | When you use Microsoft Office Outlook in MAPI mode, Outlook executes client operations as RPCs between the client and the server. This monitor indicates the current number of RPC operations that are occurring per second. Should closely correspond to historical baselines. Values much higher than expected indicate that the workload has changed, while values much lower than expected indicate a bottleneck preventing client requests from reaching the server. | |
RPCClientBackoffPersec | RPC Client Backoff Persec | NULL | Exchange throttles RPC clients to prevent individual clients from overusing server resources. This monitor shows the rate that the server notifies the client to back off. It indicates the rate at which client backoffs are occurring. Higher values may indicate that the server may be incurring a higher load resulting in an increase in overall averaged RPC latencies, causing client throttling to occur. This can also occur when certain client user actions are being performed. Depending on what the client is doing and the rate at which RPC operations are occurring, it may be normal to see backoffs occurring. | |
LogRecordStallsPersec | Log Record Stalls Persec | NULL | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the number of log records that cannot be added to the log buffers per second because the log buffers are full. If this counter is non-zero most of the time, the log buffer size may be a bottleneck. If I/O log write latencies are high, check for RAID5 or sync replication on log devices. The average value should be below 10 per second. Spikes (maximum values) should not be higher than 100 per second. | |
DatabasePageFaultStallsPersec | Database Page Fault Stalls Persec | NULL | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the rate that database file page requests require of the database cache manager to allocate a new page from the database cache. This should be 0 at all times. If this value is non-zero, this indicates that the database is not able to flush dirty pages to the database file fast enough to make pages free for new page allocations. | |
DatabaseCachePercentHit | Database Cache Percent Hit | % | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the percentage of database file page requests that were fulfilled by the database cache without causing a file operation. If this percentage is too low, the database cache size may be too small. This value should be over 90% for companies with majority online mode clients. It should be over 99% for companies with majority cached mode clients. If the hit ratio is less than these numbers, the database cache may be insufficient. | |
MessagesDeliveredPersec | Messages Delivered Persec | NULL | Exchange responds to client requests and attempts to fulfill them as quickly and efficiently as possible. This monitor shows the rate that messages are delivered to all recipients. It indicates current message delivery rate to the store. | |
IODatabaseReadsAverageLatency | IO Database Reads Average Latency | NULL | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the average length of time, in milliseconds, per database read operation. | |
MessagesQueuedForSubmissionISPublic | Messages Queued For Submission (IS Public) | NULL | Mailbox servers depend on Hub Transport servers for message delivery. This monitor shows the current number of submitted messages that are not yet processed by the transport layer. This value should be below 20 at all times. | |
DirectoryAccessLDAPReadsPersec | Directory Access LDAP Reads Persec | NULL | Exchange responds to client requests and attempts to fulfill them as quickly and efficiently as possible. This monitor shows the current rate that the Lightweight Directory Access Protocol (LDAP) reads occur while processing requests for the client. | |
ClientRPCsFailedServerTooBusyPersec | Client RPCs Failed Server Too Busy Persec | NULL | Exchange throttles RPC clients to prevent individual clients from overusing server resources. This monitor shows the client-reported rate of failed RPCs (since the store was started) due to the Server Too Busy ROC error. This value should be 0 at all times. Higher values may indicate RPC threads are exhausted or client throttling is occurring for clients running versions of Outlook earlier than Microsoft Office Outlook 2007. | |
IODatabaseWritesAverageLatency | IO Database Writes Average Latency | NULL | Exchange is essentially a database application, relying upon transaction logs and database files for data integrity and storage. This monitor shows the average length of time, in milliseconds, per database write operation. |
Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Mailbox Servers Replication (MBSR)
Description
Applicable on Exchange Servers with the Mailbox Servers Replication (MBSR) role
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description | |
---|---|---|---|---|---|
Agent G2 - Microsoft Exchange 2010 DotNet v4 - Server Role - Mailbox Servers Replication | ReplayQueueLength | Replay Queue Length | NULL | This monitor indicate issues involving the replication engine and replication partners. These issues can be local or remote. This monitor shows the number of transaction log files waiting to be replayed into the passive copy. It indicates the current replay queue length. Higher values cause longer store mount times when a handoff, failover, or activation is performed. | |
CopyQueueLength | Copy Queue Length | NULL | This monitor indicate issues involving the replication engine and replication partners. These issues can be local or remote. This monitor shows the number of transaction log files waiting to be copied to the passive copy log file folder. A copy is not considered complete until it has been checked for corruption. This value should be less than 10 at all times for Continuous Cluster Replication(CCR). It should be less than 1 at all times for local continuous replication (LCR). | ||
CallsDisconnectedbyCallersDuringUMAudioHourglass | Calls Disconnected by Callers During UM Audio Hourglass | NULL | Shows the number of calls during which the caller disconnected while Unified Messaging was playing the audio hourglass tones. This value should be 0 at all times. A non-zero value suggests excessive latency between a Unified Messaging server and targeted domain controller. | ||
OperationsoverSixSeconds | Operations over Six Seconds | NULL | Shows the number of all Unified Messaging operations that took more than six seconds to complete. This is the time during which a caller was waiting for Unified Messaging to respond. | ||
MailboxServerAccessFailures | Mailbox Server Access Failures | NULL | Shows the number of times the system did not access a Mailbox server. This value should be 0 at all times. A non-zero value indicates that Unified Messaging is having problems with MAPI connectivity to mbx servers. |
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - RabbitMQ Monitors | rabbitmq.node.disk.free | RabbitMQ Free Disk | MB | The free disk of the rabbitmq node in MB. |
rabbitmq.queue.consumers.active | RabbitMQ Active Consumers | NULL | Number of active consumers. An active consumer is one which could immediately receive any messages sent to the queue. | |
rabbitmq.node.mem.used | RabbitMQ Node Memory Utilization | MB | The memory used by the rabbitmq node in MB. | |
rabbitmq.node.fd.used | RabbitMQ OpenFDs | NULL | Number of open file descriptors used. | |
rabbitmq.queue.consumers | RabbitMQ Consumers | NULL | Number of consumers. | |
rabbitmq.queue.memory | RabbitMQ Memory | MB | Memory consumed by the Erlang process associated with the queue, including stack, heap, and internal structures. | |
rabbitmq.node.uptime | RabbitMQ Uptime | NULL | Uptime of the RabbitMQ server. | |
rabbitmq.queue.messages | RabbitMQ Messages | NULL | Sum of ready and unacknowledged messages (queue depth). | |
rabbitmq.node.sockets.used | RabbitMQ Sockets Used | NULL | Number of sockets used. | |
rabbitmq.queue.messages.unacknowledged | RabbitMQ Messages Unacknowledged | NULL | Number of messages delivered to clients but not yet acknowledged. | |
rabbitmq.objects.overview | RabbitMQ Overview Objects | NULL | Overview of all objects. | |
rabbitmq.queue.messages.ready | RabbitMQ Messages Ready | NULL | Number of messages ready to be delivered to clients. | |
rabbitmq.node.proc.used | RabbitMQ Erlang Processes Used | NULL | Number of Erlang processes used. |
Agent G2 - Linux - RedisDB Monitors
Description
Monitors RedisDB application metrics
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - RedisDB Monitors | redis.clients.biggest.inputbuf | Redis Clients Biggest InputBuf | NULL | Biggest input buffer among current client connections. |
redis.mem.fragmentation_ratio | Redis-Mem.fragmentation_ratio | NULL | Ratio between rss memory used and total memory used. | |
redis.mem.peak | Redis Peak Memory | MB | Peak memory consumed by Redis (in Mbytes). | |
redis.aof.last.rewritetime | Redis AOF Last Rewrite Op Time | NULL | Duration of the last AOF rewrite operation in seconds. | |
redis.replica.sync_left_bytes | Redis-Replica.sync_left_bytes | NULL | Number of bytes left before SYNCing is complete | |
redis.rdb.last_bgsave_time | Redis-Rdb.last_bgsave_time | NULL | Duration of the last RDB save operation in seconds. | |
redis.key.hits_rate | Redis Key Hit Rate | NULL | The rate of successful lookup of keys in the main dictionary. | |
redis.clients.longest.outputlist | Redis Clients Longest OutputList | NULL | Longest output list among current client connections. | |
redis.rdb.last.bgsavetime | Redis RDB Last Save Op Time | NULL | Duration of the last RDB save operation in seconds. | |
redis.net.rejected | Redis Connections Rejected | NULL | Number of connections rejected because of maxclients limit. | |
redis.clients.biggest_input_buf | Redis-Clients.biggest_input_buf | NULL | Biggest input buffer among current client connections. | |
redis.uptime | Redis Uptime | NULL | Checks the uptime of the Redis service. | |
redis.aof.size | Redis AOF Size | MB | AOF current file size in MB. | |
redis.pubsub.patterns | Redis Pubsub Patterns | NULL | Global number of pub/sub pattern with client subscriptions. | |
redis.stats.keyspace_hits | Redis Keyspace Hits | NULL | Number of successful lookup of keys in the main dictionary per second. | |
redis.clients.longest_output_list | Redis-Clients.longest_output_list | NULL | Longest output list among current client connections. | |
redis.key.misses_rate | Redis Key Miss Rate | NULL | The rate of failed lookup of keys in the main dictionary. | |
redis.net.commands | Redis Commands | NULL | The number of commands processed by the server | |
redis.replica.syncleftbytes | Redis Replication Bytes Lag | NULL | Number of bytes left before SYNCing is complete | |
redis.keyspace.misses | Redis Keyspace Misses | NULL | Number of failed lookup of keys in the main dictionary per second. | |
redis.keys.evicted | Redis Evicted Keys | NULL | Number of evicted keys due to maxmemory limit. | |
redis.perf.latest_fork_usec | Redis Latest Fork Op Time | NULL | Duration of the latest fork operation in microseconds. | |
redis.net.clients | Redis Client Connections | NULL | Number of client connections (excluding connections from slaves). | |
redis.changes | Redis Changes | NULL | Number of changes since the last save. | |
redis.keys.total | Redis Total Keys | NULL | Provides the total number of keys from all the DBs. | |
redis.mem.rss | Redis RSS Memory | MB | Number of bytes that Redis allocated as seen by the operating system (a.k.a resident set size). This is the number reported by tools such as top and ps. | |
redis.rdb.changes | Redis RDB Changes | NULL | Number of changes since the last dump. | |
redis.pubsub.channels | Redis Pubsub Channels | NULL | Global number of pub/sub channels with client subscriptions. | |
redis.replica.last_io_seconds_ago | Redis-Replica.last_io_seconds_ago | NULL | Number of seconds since the last interaction with master. | |
redis.clients.blocked | Redis Clients Blocked | NULL | Number of clients pending on a blocking call. | |
redis.mem.lua | Redis LUA Memory | MB | Number of Mbytes used by the Lua engine. | |
redis.mem.used | Redis Used Memory | MB | Total number of Mbytes allocated by Redis using its allocator (either standard libc, jemalloc, or an alternative allocator such as tcmalloc). | |
redis.replica.last.io.Secondsago | Redis Replication Lag | NULL | Number of seconds since the last interaction with master. | |
redis.connections | Redis Connections | NULL | The number of connections accepted by the server per second. | |
redis.mem.fragmentation.ratio | Redis Memory Fragmentation Ratio | NULL | Ratio between rss memory used and total memory used. | |
redis.keys.expired | Redis Expired Keys | NULL | Total number of key expiration events. | |
redis.aof.last_rewrite_time | Redis-Aof.last_rewrite_time | NULL | Duration of the last AOF rewrite operation in seconds. | |
redis.net.slaves | Redis Connected Slaves | NULL | Number of connected slaves. |
Agent G2 - Linux - Riak DB Monitors
Description
Agent G2 - Linux - Riak DB Monitors
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Riak DB Monitors | riak.node_get_fsm_objsize_mean | Riak Get Fsm Mean Object Size | NULL | Mean object size encountered by this node within the last minute |
riak.redir_requests | Riak Redirected Requests | NULL | Average number of requests this node has redirected to other nodes for coordination per second | |
riak.pipe_vnodes.running.count | Riak Running Pipe Vnodes | NULL | Number of pipe virtual node queues running in the last minute | |
riak.vnode_index_writes | Riak Vnode Index Writes | NULL | Average number of vnode index write operations performed per second | |
riak.node_get_fsm_objsize_95 | Riak Get Fsm 95 Object Size | NULL | 95th percentile object size encountered by this node within the last minute | |
riak.node_get_fsm_siblings_100 | Riak Get Fsm 100 Siblings | NULL | 100th percentile of siblings encountered during all GET operations by this node within the last minute | |
riak.node_gets | Riak Node GETs | NULL | Average number of GETs coordinated by this node per second, including GETs to non-local vnodes | |
riak.node_get_fsm_objsize_100 | Riak Get Fsm 100 Object Size | NULL | 100th percentile object size encountered by this node within the last minute | |
riak.vnode_index_reads | Riak Vnode Index Reads | NULL | Average number of vnode index read operations performed per second | |
riak.ring_num_partitions | Riak Ring Partitions | NULL | Number of partitions in the ring | |
riak.pipeline.active | Riak Active Pipelines | NULL | The number of pipelines active in the last 60 seconds | |
riak.vnode.puts | Riak Vnode PUTs | NULL | Average number of PUTS coordinated by local vnodes per second | |
riak.node_get_fsm_objsize_median | Riak Get Fsm Median Object Size | NULL | Median object size encountered by this node within the last minute | |
riak.node_put_fsm_time_95 | Riak Put Fsm 95 Time | NULL | 95th percentile time between reception of client PUT request and subsequent response to client | |
riak.sys.process_count | Riak Process Count | NULL | Number of Erlang processes | |
riak.node_get_fsm_time_100 | Riak Get Fsm 100 Time | NULL | 100th percentile time between reception of client GET request and subsequent response to client | |
riak.node_get_fsm_time_mean | Riak Get Fsm Mean Time | NULL | Mean time between reception of client GET request and subsequent response to client | |
riak.node_get_fsm_siblings_mean | Riak Get Fsm Mean Siblings | NULL | Mean number of siblings encountered during all GET operations by this node within the last minute | |
riak.vnode.gets | Riak Vnode GETs | NULL | Average number of GETs coordinated by local vnodes per second | |
riak.node_get_fsm_siblings_median | Riak Get Fsm Median Siblings | NULL | Median number of siblings encountered during all GET operations by this node within the last minute | |
riak.node_get_fsm_siblings_95 | Riak Get Fsm 95 Siblings | NULL | 95th percentile of siblings encountered during all GET operations by this node within the last minute | |
riak.precommit.fails | Riak Pre-commit Failures | NULL | Number of pre commit hook failures | |
riak.node_get_fsm_time_median | Riak Get Fsm Median Time | NULL | Median time between reception of client GET request and subsequent response to client | |
riak.pipeline.errors | Riak Pipeline Errors | NULL | The average number of pipeline creation errors per second | |
riak.kv_vnodes.running.count | Riak Running KV Vnodes | NULL | Number of key/value virtual node queues running in the last minute | |
riak.postcommit.fails | Riak Post-commit Failures | NULL | Number of post commit hook failures | |
riak.pbc_active | Riak Active PB Connections | NULL | Number of currently active protocol buffer connections | |
riak.node_put_fsm_time_median | Riak Put Fsm Median Time | NULL | Median time between reception of client PUT request and subsequent response to client | |
riak.vnode_index_deletes | Riak Vnode Index Deletes | NULL | Average number of vnode index delete operations performed per second | |
riak.node_put_fsm_time_mean | Riak Put Fsm Mean Time | NULL | Mean time between reception of client PUT request and subsequent response to client | |
riak.node_put_fsm_time_100 | Riak Put Fsm 100 Time | NULL | 100th percentile time between reception of client PUT request and subsequent response to client | |
riak.node.puts | Riak Node PUTs | NULL | Average number of PUTs coordinated by this node per second, including PUTs to non-local vnodes | |
riak.pipeline.creates | Riak Pipeline Creations | NULL | The average number of pipelines created per second | |
riak.node_get_fsm_time_95 | Riak Get Fsm 95 Time | NULL | 95th percentile time between reception of client GET request and subsequent response to client | |
riak.pbc_connects | Riak New PB Connections | NULL | Number of new protocol buffer connections established during the last minute | |
riak.memory.processes_used | Riak Memory Used | NULL | Total amount of memory used by Erlang processes | |
riak.read_repairs | Riak Read Repairs | NULL | Average number of read repair operations this node has coordinated per second |
Agent G2 - Linux - RiakDB Monitors
Description
It monitors DB parameters like Post-commit Failures, Pre-commit Failures, Node GETs, Vnode GETs, Memory Used, Get Fsm 100 Object Size, Get Fsm 100 Siblings, Get Fsm 100 Time, Get Fsm 95 Object Size, Get Fsm 95 Siblings, Get Fsm 95 Time, Get Fsm Mean Object Size, Get Fsm Mean Siblings, Get Fsm Mean Time, Get Fsm Median Object Size, Get Fsm Median Siblings, Get Fsm Median Time, Put Fsm 100 Time, Put Fsm 95 Time, Put Fsm Mean Time, Put Fsm Median Time, Active PB Connections, New PB Connections, Active Pipelines, Pipeline Creations, Pipeline Errors, Node PUTs, Vnode PUTs, Read Repairs, Redirected Requests, Ring Partitions, Running KV Vnodes, Running Pipe Vnodes, Process Count, Vnode Index Deletes, Vnode Index Reads, Vnode Index Writes
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - RiakDB Monitors | riak.postcommit.fails | Riak Post-commit Failures | NULL | Number of post commit hook failures |
riak.pbc_active | Riak Active PB Connections | NULL | Number of currently active protocol buffer connections | |
riak.vnode_index_deletes | Riak Vnode Index Deletes | NULL | Average number of vnode index delete operations performed per second | |
riak.node_get_fsm_time_mean | Riak Get Fsm Mean Time | NULL | Mean time between reception of client GET request and subsequent response to client | |
riak.node_get_fsm_objsize_median | Riak Get Fsm Median Object Size | NULL | Median object size encountered by this node within the last minute | |
riak.vnode_index_reads | Riak Vnode Index Reads | NULL | Average number of vnode index read operations performed per second | |
riak.node_get_fsm_time_95 | Riak Get Fsm 95 Time | NULL | 95th percentile time between reception of client GET request and subsequent response to client | |
riak.node_put_fsm_time_95 | Riak Put Fsm 95 Time | NULL | 95th percentile time between reception of client PUT request and subsequent response to client | |
riak.pipeline.active | Riak Active Pipelines | NULL | The number of pipelines active in the last 60 seconds | |
riak.pipe_vnodes.running.count | Riak Running Pipe Vnodes | NULL | Number of pipe virtual node queues running in the last minute | |
riak.ring_num_partitions | Riak Ring Partitions | NULL | Number of partitions in the ring | |
riak.memory.processes_used | Riak Memory Used | NULL | Total amount of memory used by Erlang processes | |
riak.node_get_fsm_siblings_100 | Riak Get Fsm 100 Siblings | NULL | 100th percentile of siblings encountered during all GET operations by this node within the last minute | |
riak.node_get_fsm_objsize_mean | Riak Get Fsm Mean Object Size | NULL | Mean object size encountered by this node within the last minute | |
riak.node_get_fsm_time_median | Riak Get Fsm Median Time | NULL | Median time between reception of client GET request and subsequent response to client | |
riak.pbc_connects | Riak New PB Connections | NULL | Number of new protocol buffer connections established during the last minute | |
riak.vnode.puts | Riak Vnode PUTs | NULL | Average number of PUTS coordinated by local vnodes per second | |
riak.node_put_fsm_time_mean | Riak Put Fsm Mean Time | NULL | Mean time between reception of client PUT request and subsequent response to client | |
riak.redir_requests | Riak Redirected Requests | NULL | Average number of requests this node has redirected to other nodes for coordination per second | |
riak.node_put_fsm_time_median | Riak Put Fsm Median Time | NULL | Median time between reception of client PUT request and subsequent response to client | |
riak.node.gets | Riak Node GETs | NULL | Average number of GETs coordinated by this node per second, including GETs to non-local vnodes | |
riak.node_get_fsm_objsize_95 | Riak Get Fsm 95 Object Size | NULL | 95th percentile object size encountered by this node within the last minute | |
riak.kv_vnodes.running.count | Riak Running KV Vnodes | NULL | Number of key/value virtual node queues running in the last minute | |
riak.node_get_fsm_siblings_median | Riak Get Fsm Median Siblings | NULL | Median number of siblings encountered during all GET operations by this node within the last minute | |
riak.read_repairs | Riak Read Repairs | NULL | Average number of read repair operations this node has coordinated per second | |
riak.node_put_fsm_time_100 | Riak Put Fsm 100 Time | NULL | 100th percentile time between reception of client PUT request and subsequent response to client | |
riak.sys.process_count | Riak Process Count | NULL | Number of Erlang processes | |
riak.vnode.gets | Riak Vnode GETs | NULL | Average number of GETs coordinated by local vnodes per second | |
riak.node_get_fsm_siblings_mean | Riak Get Fsm Mean Siblings | NULL | Mean number of siblings encountered during all GET operations by this node within the last minute | |
riak.pipeline.errors | Riak Pipeline Errors | NULL | The average number of pipeline creation errors per second | |
riak.precommit.fails | Riak Pre-commit Failures | NULL | Number of pre commit hook failures | |
riak.pipeline.creates | Riak Pipeline Creations | NULL | The average number of pipelines created per second | |
riak.node.puts | Riak Node PUTs | NULL | Average number of PUTs coordinated by this node per second, including PUTs to non-local vnodes | |
riak.vnode_index_writes | Riak Vnode Index Writes | NULL | Average number of vnode index write operations performed per second | |
riak.node_get_fsm_siblings_95 | Riak Get Fsm 95 Siblings | NULL | 95th percentile of siblings encountered during all GET operations by this node within the last minute | |
riak.node_get_fsm_objsize_100 | Riak Get Fsm 100 Object Size | NULL | 100th percentile object size encountered by this node within the last minute | |
riak.node_get_fsm_time_100 | Riak Get Fsm 100 Time | NULL | 100th percentile time between reception of client GET request and subsequent response to client |
Agent G2 - Linux Services Monitoring
Description
Monitors the status of the overall state the service is in. It can be active, reloading, inactive, failed, activating, or deactivating.
Template Usage Guidelines:
While applying this template on the device, users need to provide specific input parameters in below two formats only.
Format 1: all (case-insensitive) If “all” is specified as custom script arguments, the script monitors all available services along with alert tokens, that provide additional information, including the total count of services and the count of services in each available state.
Format 2: tuned,opsramp.*, systemd (provide service names without .service extension as shown here) Users can specify comma-separated service names or service regex patterns for a more focused monitoring approach. The script filters and retrieves status information about the mentioned services only.
Note: We strongly recommend specifying only particular service names in input parameters,(that is Format 2) as monitoring all services may significantly increase the system load, especially in environments with a large number of services.
Prerequisites
This template only works on systems having Systemd as the default init system.
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux Services Monitor | system_linux_services_status | System Linux Services Status | - | Monitors the status of the overall state the service is in. It can be active, reloading, inactive, failed, activating, or deactivating. |
Agent G2 - Linux - Squid Monitors
Description
Monitoring template for squid proxy application
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Squid Monitors | squid.server.errors | Squid Server Errors | NULL | Rate of server requests that resulted in error |
squid.fqdn.cache.entries | Squid FQDN Cache Entries | NULL | Number of FQDN cache entries | |
squid.icp.packets.received | Squid ICP Packets Received | NULL | Rate of ICP packets received | |
squid.icp.traffic.in | Squid ICP Traffic In | NULL | Amount of ICP data received | |
squid.server.traffic.in | Squid Server Traffic In | NULL | Rate of traffic read by server for all protocols | |
squid.icp.packets.sent | Squid ICP Packets Sent | NULL | Rate of ICP packets sent | |
squid.request.failure.ratio | Squid Request Failure Ratio | % | The average ratio between the number of failed and successful requests | |
squid.service.times | Squid Service Times | NULL | The median of various service time (or response time) distributions | |
squid.server.traffic.out | Squid Server Traffic Out | NULL | Rate of traffic written to origin servers for all protocols | |
squid.page.faults | Squid Page Faults | NULL | Page faults with physical i/o | |
squid.icp.replies.queued | Squid ICP Replies Queued | NULL | This counter shows how many times an ICP message was queued for re-transmission | |
squid.server.requests | Squid Server Requests | NULL | Rate of server-side requests to HTTP servers | |
squid.unlink.requests | Squid Unlink Requests | NULL | Rate of unlink requests | |
squid.client.http.requests | Squid Client Http Requests | NULL | The rate of the number of HTTP requests. | |
squid.client.http.traffic.out | Squid Client Http Traffic Out | NULL | Rate of response data traffic sent to clients | |
squid.aborted.requests | Squid Aborted Requests | NULL | Rate of aborted requests | |
squid.requests.disk.hit.ratio | Squid Requests Disk Hit Ratio | % | These values represent the percentage of plain cache hits served from disk | |
squid.cache.hit.ratio | Squid Cache Hit Ratio | % | The percentage of HTTP requests that result in a cache hit. | |
squid.clients | Squid Clients | NULL | Number of clients accessing cache, client actually means IP address. Squid assumes that each client has a unique IP address | |
squid.client.http.errors | Squid Client Http Errors | NULL | The rate of the number of HTTP errors. | |
squid.client.http.traffic.in | Squid Client Http Traffic In | NULL | Rate of request data traffic received from clients | |
squid.fqdn.cache.statistics | Squid FQDN Cache Statistics | NULL | The FQDN cache is similar to the IP cache, except that it stores address-to-hostname lookups | |
squid.ip.cache.entries | Squid IP Cache Entries | NULL | Number of IP cache entries | |
squid.icp.messages.sent | Squid ICP Messages Sent | NULL | The total number of ICP messages sent since Squid was started. | |
squid.icp.messages.received | Squid ICP Messages Received | NULL | The total number of ICP messages received since Squid was started. | |
squid.cache.byte.hit.ratio | Squid Cache Byte Hit Ratio | % | Squid calculates byte hit ratio by comparing the number of bytes received from origin servers (or neighbors) to the number of bytes sent to clients | |
squid.select.loops | Squid Select Loops | NULL | Rate of select loops called | |
squid.ip.cache.statistics | Squid IP Cache Statistics | NULL | The IP cache contains cached results of hostname-to-address lookups. | |
squid.request.memory.hit.ratio | Squid Request Memory Hit Ratio | % | These values represent the percentage of all cache hits that were served from memory | |
squid.icp.traffic.out | Squid ICP Traffic Out | NULL | Amount of ICP data sent |
Agent G2 - Linux - Varnish Monitors
Description
Varnish application monitoring
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - Varnish Monitors | varnish.uptime | Varnish Uptime | NULL | Uptime in minutes |
varnish.client.conns.accepted | Varnish Client Connections Accepted | NULL | Average number of client connections accepted per second | |
varnish.rate.esi.errors | Varnish ESI Errors | NULL | Rate of ESI parsing errors | |
varnish.rate.hdrbytes | Varnish Header Size | NULL | Rate of total header size in Kbytes | |
varnish.cache.hits | Varnish Cache Hits | NULL | Cache hit rate | |
varnish.struct.objects | Varnish Objects | NULL | Number of objects | |
varnish.backend.conns.success.ratio | Varnish Backend Connections Success Ratio | NULL | Successful backend server connections ratio | |
varnish.client.connections.ratio | Varnish Client Connections Ratio | NULL | Client accepted connections to received requests ratio | |
varnish.client.conns.dropped | Varnish Client Connections Dropped | NULL | Average number of client connections dropped per second | |
varnish.lru.objects.moved | Varnish LRU Moved Objects | NULL | Number of LRU moved objects | |
varnish.work.thread.ratio | Varnish Work Thread Ratio | NULL | Working threads to created threads ratio | |
varnish.rate.sessions | Varnish Sessions | NULL | Rate of total sessions | |
varnish.backend.connections | Varnish Backend Connections | NULL | Rate of backend connections success | |
varnish.rate.bodybytes | Varnish Body Size | NULL | Rate of total body size in Kbytes | |
varnish.abandoned.work.requests | Varnish Dropped Work Requests | NULL | Average number of abandoned work requests per second | |
varnish.client.requests | Varnish Client Requests | NULL | Average number of client requests received per second | |
varnish.cache.hit.ratio | Varnish Cache Hit Ratio | NULL | Cache hit ratio | |
varnish.expired.objects | Varnish Expired Objects | NULL | Number of expired objects | |
varnish.worker.threads | Varnish Worker Threads | NULL | Number of worker threads | |
varnish.rate.requests | Varnish Requests | NULL | Rate of total received requests | |
varnish.rate.fetches | Varnish Fetches | NULL | Rate of total fetches | |
varnish.backend.recycle | Varnish Backend Connection Recycles | NULL | Rate of backend connection recycles | |
varnish.cache.miss.ratio | Varnish Cache Miss Ratio | NULL | Cache miss ratio | |
varnish.backend.req | Varnish Backend Requests | NULL | Rate of backend requests made | |
varnish.lru.objects.nuked | Varnish LRU Nuked Objects | NULL | Number of LRU nuked objects | |
varnish.backend.fail | Varnish Backend Connections Failed | NULL | Rate of backend connection failures | |
varnish.worker.threads.failed | Varnish Failed Worker Threads | NULL | Average number of failures per second, when creating worker threads | |
varnish.backend.busy | Varnish Backend Connections Busy | NULL | Rate of backend connections busy | |
varnish.backend.toolate | Varnish Backend Connections Closed | NULL | Rate of backend connections closed | |
varnish.cache.misses | Varnish Cache Misses | NULL | Cache miss rate |
Agent G2 - Linux - WebLogic Monitors
Description
Agent G2 - Linux - WebLogic Monitors
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - WebLogic Monitors | weblogic_ejp_transactions_committed_count | Transactions Committed Count Rate | Per Second | EJB Transactions Committed Count Rate |
weblogic_ejp_access_count | Access Count Rate | Per Second | EJBPool Access Count Rate | |
weblogic_jdbc_ds_curr_capacity | Curr Capacity | NULL | JDBC DataSource Curr Capacity | |
weblogic_jta_transaction_rolled_back_count | JTARuntime Transaction Rolled Back Count Rate | Per Second | JTARuntime Transaction Rolled Back Count Rate | |
weblogic_thread_count | Thread Count | NULL | Thread Count Of The Server | |
weblogic_gc_collection_time | Collection Time | NULL | Time Taken For Collection Of The Garbage Objects | |
weblogic_ejp_beans_in_use_current_count | Beans In Use Current Count | NULL | EJBPool Beans In Use Current Count | |
weblogic_jdbc_ds_active_connections_current_count | Active Connections Current Count | NULL | JDBC DataSource Active Connections Current Count | |
weblogic_ejp_transactions_rolled_back_count | Transactions Rolled Back Count Rate | Per Second | EJB Transactions Rolled Back Count Rate | |
weblogic_jdbc_ds_leaked_connection_count | Leaked Connection Count | Per Second | JDBC DataSource Leaked Connection Count | |
weblogic_ejp_pooled_beans_current_count | Pooled Beans Current Count | NULL | EJBPool Pooled Beans Current Count | |
weblogic_ejp_destroyed_count | Destroyed Count Rate | Per Second | EJBPool Destroyed Count Rate | |
weblogic_open_file_descriptor_count | Open File Descriptor Count | NULL | Number Of Open File Descriptors Of The Server | |
weblogic_jdbc_ds_connections_count | Connections Count Rate | Per Second | JDBC DataSource Connections Count Rate | |
weblogic_non_heap_memory_usage_used | Non Heap Memory Usage Used | MB | Non Heap Memory Usage Of The Server | |
weblogic_thread_pool_standby_thread_count | Standby Thread Count | NULL | Standby Thread Count | |
weblogic_ejp_transactions_timed_out_count | Transactions Timed Out Count Rate | Per Second | EJB Transactions Timed Out Count Rate | |
weblogic_gc_collection_count | Collection Count | NULL | Number Of Garbage Objects Collected | |
weblogic_heap_memory_usage_committed | Heap Memory Usage Committed | MB | Heap Memory Committed For The Server | |
weblogic_uptime | WebLogic Uptime | Minutes | WebLogic Uptime | |
weblogic_ejp_waiter_current_count | Waiter Current Count | NULL | EJBPool Waiter Current Count | |
weblogic_total_started_thread_count | Total Started Thread Count | NULL | Total Started Thread Count | |
weblogic_thread_pool_hogging_thread_count | Hogging Thread Count | NULL | Hogging Thread Count | |
weblogic_jdbc_ds_waiting_for_connection_success | Waiting For Connection Success Rate | Per Second | JDBC DataSource Waiting For Connection Success Rate | |
weblogic_jdbc_ds_waiting_for_connection | Waiting For Connection Rate | Per Second | JDBC DataSource Waiting For Connection Rate | |
weblogic_jdbc_ds_num_available | Num Available | NULL | JDBC DataSource Num Available | |
weblogic_jdbc_ds_prep_stmt_cache_current_size | Prep Stmt Cache Current Size | NULL | JDBC DataSource Prep Stmt Cache Current Size | |
weblogic_thread_pool_execute_thread_idle_count | Execute Thread Idle Count | NULL | Execute Thread Idle Count | |
weblogic_ejp_miss_count | Miss Count Rate | Per Second | EJBPool Miss Count Rate | |
weblogic_loaded_class_count | Loaded Class Count | NULL | Loaded Class Count | |
weblogic_jta_transaction_count | JTARuntime Transaction Count Rate | Per Second | JTARuntime Transaction Count Rate | |
weblogic_jdbc_ds_waiting_for_connection_failure | Waiting For Connection Failure Rate | Per Second | JDBC DataSource Waiting For Connection Failure Rate | |
weblogic_jta_transaction_abandoned_count | JTARuntime Transaction Abandoned Count Rate | Per Second | JTARuntime Transaction Abandoned Count Rate | |
weblogic_unloaded_class_count | Unloaded Class Count | NULL | Unloaded Class Count | |
weblogic_jdbc_ds_waiting_for_connection_current_count | Waiting For Connection Current Count | NULL | JDBC DataSource Waiting For Connection Current Count | |
weblogic_jta_transaction_committed_count | JTARuntime Transaction Committed Count Rate | Per Second | JTARuntime Transaction Committed Count Rate | |
weblogic_heap_memory_usage_used | Heap Memory Usage Used | MB | Heap Memory Usage Of The Server | |
weblogic_non_heap_memory_usage_committed | Non Heap Memory Usage Committed | MB | Non Heap Memory Committed For The Server |
Agent G2 - Linux - WebSphere Monitors
Description
Agent G2 - Linux - WebSphere Monitors
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux - WebSphere Monitors | ibm_was_jdbc_FaultCount | JDBC Fault Count | NULL | The total number of faults, such as timeouts, in the connection pool. |
ibm_was_servlet_session_LiveCount | Servlet Session Live Count | NULL | The number of local sessions that are currently cached in memory from the time at which this metric is enabled. | |
ibm_was_jdbc_CreateCount | JDBC Create Count | NULL | The total number of managed connections that were created since pool creation. | |
ibm_was_servlet_session_NoRoomForNewSessionCount | Servlet Session No Room For New Session Count | NULL | Applies only to session in memory with AllowOverflow=false. The number of times that a request for a new session cannot be handled because it exceeds the maximum session count. | |
ibm_was_thread_pools_ActiveTime | Thread Pools Active Time | ms | The average time in milliseconds the threads are in active state | |
ibm_was_jvm_ProcessCpuUsage | JVM Process Cpu Usage Gauge | NULL | The CPU Usage (in percent) of the Java virtual machine. | |
ibm_was_thread_pools_ClearedThreadHangCount | Thread Pools Cleared Thread Hang Count | NULL | The number of thread hangs cleared | |
ibm_was_servlet_session_LifeTime | Servlet Session Life Time | ms | The average session life time in milliseconds (time invalidated - time created) | |
ibm_was_servlet_session_ActiveCount | Servlet Session Active Count | NULL | The number of concurrently active sessions. A session is active if the WebSphere Application Server is currently processing a request that uses that session. | |
ibm_was_thread_pools_DeclaredThreadHungCount | Thread Pools Declaredthread Hung Count | NULL | The number of threads declared hung | |
ibm_was_thread_pools_ActiveCount | Thread Pools Active Count | NULL | The number of concurrently active threads | |
ibm_was_servlet_session_CreateCount | Servlet Session Create Count | NULL | The number of sessions that were created | |
ibm_was_servlet_session_CacheDiscardCount | Servlet Session Cache Discard Count | NULL | The number of session objects that have been forced out of the cache. A least recently used (LRU) algorithm removes old entries to make room for new sessions and cache misses. Applicable only for persistent sessions. | |
ibm_was_jdbc_ReturnCount | JDBC Return Count | NULL | The total number of managed connections that were returned since pool creation. | |
ibm_was_jvm_UsedMemory | JVM Used Memory Gauge | KB | The used memory in the JVM run time | |
ibm_was_jdbc_PrepStmtCacheDiscardCount | JDBC Prep Stmt Cache Discard Count | NULL | The total number of statements that are discarded by the least recently used (LRU) algorithm of the statement cache. | |
ibm_was_jdbc_FreePoolSize | JDBC Free Pool Size | NULL | The number of managed connections that are in the free pool. | |
ibm_was_jvm_UpTime | JVM Up Time Gauge | Seconds | The amount of time that the JVM is running | |
ibm_was_jdbc_CloseCount | JDBC Close Count | NULL | The total number of managed connections that were destroyed since pool creation. | |
ibm_was_thread_pools_PoolSize | Thread Pools Pool Size | NULL | The average number of threads in pool | |
ibm_was_servlet_session_InvalidateCount | Servlet Session Invalidate Count | NULL | The number of sessions that were invalidated | |
ibm_was_servlet_session_ExternalWriteTime | Servlet Session External Write Time | ms | The time (milliseconds) taken to write the session data from the persistent store. Applicable only for (serialized) persistent sessions. Similar to external Read Time. | |
ibm_was_jdbc_WaitTime | JDBC Wait Time | NULL | The average waiting time in milliseconds until a connection is granted if a connection is not currently available. | |
ibm_was_thread_pools_CreateCount | Thread Pools Create Count | NULL | The total number of threads created | |
ibm_was_jdbc_AllocateCount | JDBC Allocate Count | NULL | The total number of managed connections that were allocated since pool creation. | |
ibm_was_jdbc_JDBCTime | JDBC Time | NULL | The average time in milliseconds spent running in the JDBC driver that includes time that is spent in the JDBC driver, network, and database. | |
ibm_was_servlet_session_TimeSinceLastActivated | Servlet Session Time Since Last Activated | ms | The time difference in milliseconds between previous and current access time stamps. Does not include session time out. | |
ibm_was_jdbc_PercentUsed | JDBC Percent Used | NULL | The percent of the pool that is in use. | |
ibm_was_thread_pools_DestroyCount | Thread Pools Destroy Count | NULL | The total number of threads destroyed | |
ibm_was_jdbc_WaitingThreadCount | JDBC Waiting Thread Count | NULL | The number of threads that are currently waiting for a connection. | |
ibm_was_thread_pools_PercentMaxed | Thread Pools Percent Maxed | NULL | The average percent of the time that all threads are in use | |
ibm_was_servlet_session_ExternalReadSize | Servlet Session External Read Size | Bytes | Size of the session data read from persistent store. Applicable only for (serialized) persistent sessions | |
ibm_was_thread_pools_ConcurrentHungThreadCount | Thread Pools Concurrent Hung Thread Count | NULL | The number of concurrently hung threads | |
ibm_was_jvm_FreeMemory | JVM Free Memory Gauge | KB | The free memory in the JVM run time | |
ibm_was_jdbc_UseTime | JDBC Use Time | NULL | The average time in milliseconds that a connection is in use. | |
ibm_was_servlet_session_ExternalWriteSize | Servlet Session External Write Size | Bytes | The size of the session data written to persistent store. Applicable only for (serialized) persistent sessions. Similar to external Read Time. | |
ibm_was_servlet_session_SessionObjectSize | Servlet Session Session Object Size | Bytes | The size in bytes of (the serializable attributes of ) in-memory sessions. Only session objects that contain at least one serializable attribute object is counted. A session can contain some attributes that are serializable and some that are not. The size in bytes is at a session level. | |
ibm_was_jdbc_PoolSize | JDBC Pool Size | NULL | The size of the connection pool. | |
ibm_was_jdbc_PercentMaxed | JDBC Percent Maxed | NULL | The percent of the time that all connections are in use. | |
ibm_was_servlet_session_TimeoutInvalidationCount | Servlet Session Timeout Invalidation Count | NULL | The number of sessions that are invalidated by timeout. | |
ibm_was_servlet_session_ExternalReadTime | Servlet Session External Read Time | ms | The time (ms) taken in reading the session data from the persistent store. For multirow sessions, the metrics are for the attribute; for single row sessions, the metrics are for the entire session. Applicable only for persistent sessions. When using a JMS persistent store, only available if replicated data is serialized. | |
ibm_was_jdbc_ConnectionHandleCount | JDBC Connection Handle Count | NULL | The number of connections that are in use. Can include multiple connections that are shared from a single managed connection. | |
ibm_was_jdbc_ManagedConnectionCount | JDBC Managed Connection Count | NULL | The total number of managed connections in the free, shared, and unshared pools. | |
ibm_was_servlet_session_AffinityBreakCount | Servlet Session Affinity Break Count | NULL | The number of requests that are received for sessions that were last accessed from another web application. This value can indicate failover processing or a corrupt plug-in configuration. | |
ibm_was_jvm_HeapSize | JVM Heap Size Gauge | KB | The total memory in the JVM run time | |
ibm_was_servlet_session_ActivateNonExistSessionCount | Servlet Session Activate Non Exist Session Count | NULL | The number of requests for a session that no longer exists, presumably because the session timed out. Use this counter to help determine if the timeout is too short. |
Agent G2 - Linux Directory Monitor - v2
Description
Agent G2 - Linux Directory Monitor
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Linux Directory Monitor - v2 | system_directory_Size | System Directory Size | GB | System Directory Size: To monitor the total size of the given directory/file path |
system_directory_file_ReadOnlyStatus | System Directory File ReadOnlyStatus | NULL | System Directory File ReadOnlyStatus - To monitor the Filtered File(s) Read Accessibility. 0 - Read Only, 1 - Other than Read Only | |
system_directory_FileCount | System Directory File Count | count | System Directory File Count: To monitor the file count in a given directory based on given regex pattern | |
system_directory_FileSize | System Directory File Size | KB | System Directory File Size: To monitor the each file size in a given directory path and filtering files based given regex pattern. | |
system_directory_FileSizeTotal | System Directory FileSizeTotal | MB | System Directory FileSizeTotal: To monitor the total size of filtered files in a given directory based on given regex pattern. | |
system_directory_Availability | System Directory Availability | NULL | System Directory Availability: To monitor the availability of given Directory/File path. Below are the possible states: 1 - Available, 0 - Not Available |
Agent G2 - Linux Network Monitoring
Description
Agent G2 - Linux Network Monitoring
Prerequisites
No prerequisite
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux Network Monitor | system.network.interface.discards.in | System Network In discards | psec | Monitors Network in discards of each interface for Linux Devices |
system.network.interface.discards.out | System Network Out Discards | psec | Monitors network Out Discards of each interface for Linux Devices | |
system.network.interface.errors.in | System Network In Errors | Errors per Sec | Monitors network in errors of each interface for Linux Devices | |
system.network.interface.errors.out | System Network Out Errors | Errors per Sec | Monitors Network Out traffic of each interface for Linux Devices | |
system.network.interface.packets.in | System Network In packets | packets/sec | Monitors in Packets of each interface for Linux Devices | |
system.network.interface.packets.out | System Network out packets | packets/sec | Monitors Out packets of each interface for Linux Devices | |
system.network.interface.traffic.in | System Network In traffic | Kbps | Monitors In traffic of each interface for Linux Devices | |
system.network.interface.traffic.out | System Network Out Traffic | Kbps | Monitors Out traffic of each interface for Linux Devices |
Agent G2 - Linux NFS Mount Point Monitoring
Description
Template to monitor linux nfs mount points availability, accessibility and utilization. This template need to assigned on all linux NFS client machines. This template tested in “CentOS7” + NFS mount points available.
Prerequisites
NFS Mount points(which are related to NFS file system) available on target device.
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux NFS Mount Point Custom Monitor | system_linux_nfs_mountpoint_accessibility | System Linux NFS Mountpoint Accessibility | NULL | System Linux NFS Mountpoint Accessibility - Below are the possible values: 1 - Read/Write access not available 0 - Read/Write access available |
system_linux_nfs_mountpoint_utilization | System Linux NFS Mountpoint Utilization | % | System Linux NFS Mountpoint Utilization | |
system_linux_nfs_mountpoint_availability | System Linux NFS Mountpoint Availability | NULL | System Linux NFS Mountpoint Availability - Below are the possible values: 1 - NFS Mount Point Not Available 0 - NFS Mount Point Available |
Agent G2 - Linux NFS Mount Point Monitoring - V2
Description
Template to monitor linux NFS mount points availability, accessibility and utilization and . In this version-2, mountpoint name is set as the component in both graph and alert. where as, previously in version-1 ,the component is the file system name. This template need to assigned on all linux NFS client machines.
Prerequisites
NFS Mount points(which are related to NFS file system) available on target device.
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux NFS Mount Point Custom Monitor-V2 | system_linux_nfs_mountpoint_accessibility | System Linux NFS Mountpoint Accessibility | System Linux NFS Mountpoint Accessibility - Below are the possible values: 1 - Read/Write access not available 0 - Read/Write access available | |
system_linux_nfs_mountpoint_utilization | System Linux NFS Mountpoint Utilization | % | System Linux NFS Mountpoint Utilization | |
system_linux_nfs_mountpoint_availability | System Linux NFS Mountpoint Availability | System Linux NFS Mountpoint Availability - Below are the possible values: 1 - NFS Mount Point Not Available 0 - NFS Mount Point Available |
Agent G2 - Linux NFS Mount Point Monitoring - v3
Description
Template to monitor Linux NFS mount points availability, accessibility and utilization. This template need to assigned on all linux NFS client machines.
Prerequisites
NFS Mount points(which are related to NFS file system) available on target device.
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux NFS Mount Point Custom Monitor - v3 | system_linux_nfs_mountpoint_accessibility | System Linux NFS Mountpoint Accessibility | System Linux NFS Mountpoint Accessibility - Below are the possible values: 1 - Read/Write access not available 0 - Read/Write access available | |
system_linux_nfs_mountpoint_utilization | System Linux NFS Mountpoint Utilization | % | System Linux NFS Mountpoint Utilization | |
system_linux_nfs_mountpoint_availability | System Linux NFS Mountpoint Availability | System Linux NFS Mountpoint Availability - Below are the possible values: 1 - NFS Mount Point Not Available 0 - NFS Mount Point Available |
Agent G2 - Linux OS File Systems Monitoring
Description
Template to monitor Linux OS advanced performance metrics related to File systems (space, Inodes utilization and change in mount status of a file system). It has been validated on following Linux flavors: RHEL 7.9, Centos 7 ,Ubuntu 20.04, Open SUSE Linux.
Prerequisites
- Applicable on devices which is running Opsramp Agent ( v7.0.0 or above)
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux File Systems | system_linux_fileSystem_Inodes_Usage_Number | System Linux FileSystem Inodes Usage Number | count | File system Inodes usage number. |
system_linux_fileSystem_Inodes_Delta | System Linux FileSystem Inodes Delta | count | File system Inodes usage delta. | |
system_linux_fileSystem_Space_UsedInMB | System Linux FileSystem Space Used In MB | MB | File system space usage in MB | |
system_linux_fileSystem_Space_DeltaInKB | System Linux FileSystem Space Delta In KB | KB | File system space usage(KB) delta | |
system_linux_fileSystem_Inodes_Utilization | System Linux FileSystem Inodes Utilization | % | File system Inodes utilization percent. | |
Agent G2 - Linux File Systems Mount Change Detection | system_linux_fileSystem_Mount_ChangeDetection | System Linux Filesystem Mount Change Detection | File system mount point change detection. It detects if any file system mount point removed and if any new mount point added into the system. Below are the possible values: 0 - Available, 1 - Newly Added, 2 - Removed |
Agent G2 - Linux OS File Systems Monitoring - V2
Description
Template to monitor Linux OS advanced performance metrics related to File systems (Network and Local) (space, Inodes utilization and change in mount status of a file system). It has been validated on following Linux flavors: RHEL 7.9, Centos 7 ,Ubuntu 20.04, Open SUSE Linux.
Prerequisites
- Applicable on devices which is running OpsRamp Agent ( v7.0.0 or above)
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux File Systems - V2 | system_linux_fileSystem_Inodes_Usage_Number | System Linux FileSystem Inodes Usage Number | count | File system Inodes usage number. |
system_linux_fileSystem_Inodes_Delta | System Linux FileSystem Inodes Delta | count | File system Inodes usage delta. | |
system_linux_fileSystem_Space_UsedInMB | System Linux FileSystem Space Used In MB | MB | File system space usage in MB | |
system_linux_fileSystem_Space_DeltaInKB | System Linux FileSystem Space Delta In KB | KB | File system space usage(KB) delta | |
system_linux_fileSystem_Inodes_Utilization | System Linux FileSystem Inodes Utilization | % | File system Inodes utilization percent. | |
Agent G2 - Linux File Systems Mount Change Detection - V2 | system_linux_fileSystem_Mount_ChangeDetection | System Linux Filesystem Mount Change Detection | File system mount point change detection. It detects if any file system mount point removed and if any new mount point added into the system. Below are the possible values: 0 - Available, 1 - Newly Added, 2 - Removed |
Agent G2 - Linux OS File Systems Monitoring - v3
Description
Template to monitor Linux OS advanced performance metrics related to File systems (Network and Local) (space, Inodes utilization and change in mount status of a file system). It has been validated on following Linux flavors: AlmaLinux release 8.6 (Sky Tiger), RHEL 9.0(Plow), RHEL 7.9(Maipo), Centos 8, Ubuntu 20.04, Open SUSE, SUSE SLES 15 sp2.
Prerequisites
- Applicable on devices which is running OpsRamp Agent ( v7.0.0 or above)
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux File Systems - V2 | system_linux_fileSystem_Inodes_Usage_Number | System Linux FileSystem Inodes Usage Number | count | File system Inodes usage number. |
system_linux_fileSystem_Inodes_Delta | System Linux FileSystem Inodes Delta | count | File system Inodes usage delta. | |
system_linux_fileSystem_Space_UsedInMB | System Linux FileSystem Space Used In MB | MB | File system space usage in MB | |
system_linux_fileSystem_Space_DeltaInKB | System Linux FileSystem Space Delta In KB | KB | File system space usage(KB) delta | |
system_linux_fileSystem_Inodes_Utilization | System Linux FileSystem Inodes Utilization | % | File system Inodes utilization percent. | |
Agent G2 - Linux File Systems Mount Change Detection - V3 | system_linux_fileSystem_Mount_ChangeDetection | System Linux Filesystem Mount Change Detection | File system mount point change detection. It detects if any file system mount point removed and if any new mount point added into the system. Below are the possible values: 0 - Available, 1 - Newly Added, 2 - Removed |
Agent G2 - Linux OS File Systems Monitoring - v4
Description
Agent G2 - Linux OS File Systems Monitoring - v4
Prerequisites
- Applicable on devices which is running OpsRamp Agent ( v7.0.0 or above)
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux File Systems - v4 | system_linux_fileSystem_Inodes_Usage_Number | System Linux FileSystem Inodes Usage Number | count | File system Inodes usage number. |
system_linux_fileSystem_Inodes_Delta | System Linux FileSystem Inodes Delta | count | File system Inodes usage delta. | |
system_linux_fileSystem_Space_UsedInMB | System Linux FileSystem Space Used In MB | MB | File system space usage in MB | |
system_linux_fileSystem_Space_DeltaInKB | System Linux FileSystem Space Delta In KB | KB | File system space usage(KB) delta | |
system_linux_fileSystem_Inodes_Utilization | System Linux FileSystem Inodes Utilization | % | File system Inodes utilization percent. | |
Agent G2 - Linux File Systems Mount Change Detection - v4 | system_linux_fileSystem_Mount_ChangeDetection | System Linux Filesystem Mount Change Detection | File system mount point change detection. It detects if any file system mount point removed and if any new mount point added into the system. Below are the possible values: 0 - Available, 1 - Newly Added, 2 - Removed |
Agent G2 - Linux OS Mount Point Monitoring
Description
Template to monitor Linux OS advanced performance metrics related to Mount Points (space,Inodes utilization and Availability status of mount point). It has been validated on following Linux flavors: RHEL 7.9, Centos 7 ,Ubuntu 20.04, Open SUSE Linux.
Prerequisites
- Applicable on devices which is running Opsramp Agent ( v7.0.0 or above)
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux Mount Points | system_linux_mountpoint_Space_DeltaInKB | System Linux Mount Point Space Delta In KB | KB | Mount point space usage(KB) delta |
system_linux_mountpoint_Inodes_Usage_Number | System Linux Mount Point Inodes Usage Number | count | Mount point Inodes usage number. | |
system_linux_mountpoint_Availability_Status | System Linux Mount Point Availability Status | Availability status of mount point. These are possible values: 0 - Not Available, 1 - Available | ||
system_linux_mountpoint_Space_UsedInMB | System Linux Mount Point Space Used In MB | MB | Mount point usage in MB | |
system_linux_mountpoint_Inodes_Utilization | System Linux Mount Point Inodes Utilization | % | Mount point Inodes utilization percent. | |
system_linux_mountpoint_Inodes_Delta | System Linux Mount Point Inodes Delta | count | Mount point Inodes usage delta. |
Agent G2 - Linux OS Mount Point Monitoring - V2
Description
Template to monitor Linux OS advanced performance metrics related to Mount Points (space,Inodes utilization and Availability status of mount point). It has been validated on following Linux flavors: RHEL 7.9, Centos 7 ,Ubuntu 20.04, Open SUSE Linux.
Prerequisites
- Applicable on devices which is running Opsramp Agent ( v7.0.0 or above)
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Linux Mount Points - V2 | system_linux_mountpoint_Space_DeltaInKB | System Linux Mount Point Space Delta In KB | KB | Mount point space usage(KB) delta |
system_linux_mountpoint_Inodes_Usage_Number | System Linux Mount Point Inodes Usage Number | count | Mount point Inodes usage number. | |
system_linux_mountpoint_Availability_Status | System Linux Mount Point Availability Status | Availability status of mount point. These are possible values: 0 - Not Available, 1 - Available | ||
system_linux_mountpoint_Space_UsedInMB | System Linux Mount Point Space Used In MB | MB | Mount point usage in MB | |
system_linux_mountpoint_Inodes_Utilization | System Linux Mount Point Inodes Utilization | % | Mount point Inodes utilization percent. | |
system_linux_mountpoint_Inodes_Delta | System Linux Mount Point Inodes Delta | count | Mount point Inodes usage delta. |
Agent G2 - Linux OS Performance Monitoring
Description
Agent G2 - Linux OS Performance Monitoring
Prerequisites
No prerequisite
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - CPU Monitor | system.cpu.usage.utilization | System CPU Utilization | % | The percentage of elapsed time that the processor spends to execute a non-Idle thread(This doesn't includes CPU steal time) |
Agent G2 - Disk Monitor | system.disk.usage.freespace | System FreeDisk Usage | GB | Monitors the Free Space usage in GB |
system.disk.usage.usedspace | System Disk UsedSpace | GB | Monitors disk used space in GB | |
system.disk.usage.utilization | System Disk Utilization | % | Monitors disk utilization in percentage | |
Agent G2 - Memory Monitor | system.memory.usage.usedspace | System Memory Used Space | GB | Physical and virtual memory usage in GB |
system.memory.usage.utilization | System Memory Utilization | % | Physical and virtual memory usage in percentage. | |
Agent G2 - Uptime Monitor | system.os.uptime | System Uptime | m | Time lapsed since last reboot in minutes |
Agent G2 - Linux CPU Load Monitor | system.cpu.load | System CPU Load | Monitors the system's last 1min, 5min and 15min load. It sends per cpu core load average. | |
Agent G2 - Linux CPU Statistics Monitor | system.cpu.usage.stats | System CPU Usage Statistics | % | Monitors cpu time in percentage spent in various program spaces.\n\nUser - The processor time spent running user space processes \nSystem - The amount of time that the CPU spent running the kernel.\nIOWait - The time the CPU spends idle while waiting for an I/O operation to complete\nIdle - The time the processor spends idle\nSteal - The time virtual CPU has spent waiting for the hypervisor to service another virtual CPU running on a different virtual machine.\nKernal Time\nTotal Time |
Agent G2 - Linux Disk Inode Utilization Monitor | system.disk.inode.utilization | System Disk Inode Utilization | % | This monitor is to collect DISK Inode metrics for all physical disks in a server. |
Agent G2 - Microsoft Windows IIS App Pool State
Description
Monitors IIS App Pool States for the given app pool names by fetching the App Pool Names from Internet Information Services (IIS) Manager > Application Pools > Column ‘Name’. By default, it will monitor all the available App Pools because the default value is ‘all’ Otherwise, it will monitor the given exact app pool names separated by commas (avoid spaces for each app pool name). Status of the application pool (1 - Uninitialized, 2 - Initialized, 3 - Running, 4 - Disabling, 5 - Disabled, 6 - Shutdown Pending, 7 - Delete Pending).
Prerequisites
No prerequisite
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Windows IIS App Pool State | microsoft_windows_iis_appPool_state | Microsoft Windows IIS App Pool State | State | Monitors IIS App Pool States for the given app pool names by fetching the App Pool Names from Internet Information Services (IIS) Manager > Application Pools > Column 'Name'. By default, it will monitor all the available App Pools because the default value is 'all.' Otherwise, it will monitor the given exact app pool names separated by commas (avoid spaces for each app pool name). Status of the application pool (1 - Uninitialized, 2 - Initialized, 3 - Running, 4 - Disabling, 5 - Disabled, 6 - Shutdown Pending, 7 - Delete Pending). |
Agent G2 - Microsoft Active Directory Domain Controller Performance and Availability
Description
Monitors Active Directory metrics like Global Catalog Bind Time, Global Catalog Search Time, DNS Servers Bind Time, Lost Object Count, SYSVol Share availability.
Prerequisites
No prerequisite
Supported Metric
Monitor Name | Monitor Description | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|---|
Agent G2 - Microsoft AD DNS Server Bind Time | Monitors network connectivity test to the specified DNS server IP's. Example: 127.0.0.1 127.0.0.3 | microsoft_AD_DNS_server_bindTime | Microsoft AD DNS Server BindTime | ms | Monitors Microsoft Active Directory network connectivity tests to the specified DNS server IPs. |
Agent G2 - Microsoft AD Global Catalog Bind Time | Monitors Active Directory Global Catalog Bind Time in Milliseconds. User need to provide AD Domain Controller IP along with its port with : separated. Example: 127.0.0.1:3268 | microsoft_AD_global_catalog_bindTime | Microsoft AD Global Catalog BindTime | ms | Monitors Microsoft Active Directory Global Catalog Bind Time in milliseconds. |
Agent G2 - Microsoft AD Global Catalog Search Time | Monitors Active Directory Global catalog search time in milliseconds using given Domain Controller IPS's and its associated filter. It measures the time it takes to perform the search operation. User need to provide Domain Controller IP:port and LDAP Filter Name. Example: DomainControllerIP_and_Port: 127.0.0.1:3268 AD_LDAP_Filter: organization | microsoft_AD_global_catalog_searchTime | Microsoft AD Global Catalog SearchTime | ms | Monitors Microsoft Active Directory Global Catalog Search Time in milliseconds |
Agent G2 - Microsoft AD LostObjectCount_SYSVOLShareAvailability | Monitor the count of deleted Active Directory objects and availability of the Active Directory SYSVOL share. | microsoft_AD_lost_object_count | Microsoft AD Lost Object Count | count | Monitors the count of deleted Active Directory objects. |
Agent G2 - Microsoft AD LostObjectCount_SYSVOLShareAvailability | Monitor the count of deleted Active Directory objects and availability of the Active Directory SYSVOL share. | microsoft_AD_SYSVOL_share_availability | Microsoft AD SYSVOL Share Availability | null | Monitors Microsoft Active Directory SYSVOL share is available or not. |
Agent G2 - Microsoft Active Directory Domain Controller Performance and Availability - v2
Description
Monitors Active Directory metrics like Global Catalog Bind Time, Global Catalog Search Time, DNS Servers Bind Time, Lost Object Count, SYSVol Share availability.
Prerequisites
No Prerequisites
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft AD DNS Server Bind Time | microsoft_AD_DNS_server_bindTime | Microsoft AD DNS Server BindTime | ms | Monitors Microsoft Active Directory network connectivity tests to the specified DNS server IPs. |
Agent G2 - Microsoft AD Global Catalog Bind Time | microsoft_AD_global_catalog_bindTime | Microsoft AD Global Catalog BindTime | ms | Monitors Microsoft Active Directory Global Catalog Bind Time in milliseconds |
Agent G2 - Microsoft AD Global Catalog Search Time | microsoft_AD_global_catalog_searchTime | Microsoft AD Global Catalog SearchTime | ms | Monitors Microsoft Active Directory Global Catalog Search Time in milliseconds |
Agent G2 - Microsoft AD LostObjectCount_SYSVOLShareAvailability | microsoft_AD_lost_object_count | Microsoft AD Lost Object Count | count | Monitors the count of deleted Active Directory objects. |
Agent G2 - Microsoft AD LostObjectCount_SYSVOLShareAvailability | microsoft_AD_SYSVOL_share_availability | Microsoft AD SYSVOL Share Availability | null | Monitors Microsoft Active Directory SYSVOL share is available or not. |
Agent G2 - Microsoft AD BindTime | microsoft_AD_bindTime | Microsoft AD BindTime | ms | Monitors Active Directory BindTime for Domain Controller, Infrastructure Master, Domain Naming Master, Primary Domain Controller Emulator (PDC), Relative ID master (RID), Schema Master (SCH) in milliseconds. |
Agent G2 - Microsoft AD LastSuccessful Synchronization Time | microsoft_AD_lastSuccessfulSyncTime | Microsoft AD LastSuccessfulSync Time | m | Monitors Microsoft Active Directory last successful synchronization time in minutes. |
Agent G2 - Microsoft Windows OS Counters - RSE - v3
Description
To Monitor OS related counters like disk, memory. In this version, added these additonal metrics support: system_windows_Memory_TotalInstalledRAM,system_windows_Memory_UsedRAM,system_windows_Memory_CommitCharge,system_windows_Memory_PageFileInstalled
Prerequisites
No Prerequisites
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
System Windows LogicalDisk Monitor | system_windows_logicaldisk_UsedSpaceInMegaBytes | System Windows LogicalDisk UsedSpace In MegaBytes | MB | Total usable space on the selected logical disk drive that was free in MB |
system_windows_logicaldisk_DiskReadPerSecond | System Windows LogicalDisk DiskRead PerSecond | rops | Disk Reads/sec is the rate of read operations on the disk. | |
system_windows_logicaldisk_AvgDiskSecPerRead | System Windows LogicalDisk AvgDiskSec PerRead | null | Avg. Disk sec/Read is the average time, in seconds, of a read of data from the disk. | |
system_windows_logicaldisk_AvgDiskQueueLength | System Windows LogicalDisk AvgDiskQueueLength | null | Avg. Disk Queue Length is the average number of both read and write requests that were queued for the selected disk during the sample interval. | |
system_windows_logicaldisk_FreeSpaceInPercent | System Windows LogicalDisk FreeSpace In Percent | % | % Free Space is the percentage of total usable space on the selected logical disk drive that was free. | |
system_windows_logicaldisk_FreeSpaceInMB | System Windows LogicalDisk FreeSpace InMB | MB | Free Megabytes displays the unallocated space, in megabytes, on the disk drive in megabytes. One megabyte is equal to 1,048,576 bytes. | |
system_windows_logicaldisk_DiskWriteBytesPerSecond | System Windows Logicaldisk DiskWriteBytes PerSecond | Bps | Disk Write Bytes/sec is rate at which bytes are transferred to the disk during write operations. | |
system_windows_logicaldisk_DiskWritePerSecond | System windows LogicalDisk DiskWrite PerSecond | wops | Disk Writes/sec is the rate of write operations on the disk. | |
system_windows_logicaldisk_DiskReadBytesPerSecond | System Windows LogicalDisk DiskReadBytes PerSecond | Bps | Disk Read Bytes/sec is the rate at which bytes are transferred from the disk during read operations. | |
system_windows_logicaldisk_AvgDiskSecPerWrite | System Windows LogicalDisk AvgDiskSec PerWrite | null | Avg. Disk sec/Write is the average time, in seconds, of a write of data to the disk. | |
System Windows Processor Monitor | microsoft_windows_Processor_PercentPrivilegedTime | Microsoft Windows Processor PercentPrivilegedTime | % | % Privileged Time is the percentage of elapsed time that the process threads spent executing code in privileged mode. When a Windows system service in called, the service will often run in privileged mode to gain access to system-private data. Such data is protected from access by threads executing in user mode. Calls to the system can be explicit or implicit, such as page faults or interrupts. Unlike some early operating systems, Windows uses process boundaries for subsystem protection in addition to the traditional protection of user and privileged modes. Some work done by Windows on behalf of the application might appear in other subsystem processes in addition to the privileged time in the process. |
system_windows_processorDPCrate | System Windows Processor DPCrate | null | DPC Rate is the rate at which deferred procedure calls (DPCs) were added to the processors DPC queues between the timer ticks of the processor clock. DPCs are interrupts that run at alower priority than standard interrupts. Each processor has its own DPC queue. This counter measures the rate that DPCs were added to the queue, not the number of DPCs in the queue. This counter displays the last observed value only; it is not an average. | |
system_windows_processor_percentprocessortime | System Windows Processor PercentProcessorTime | % | % Processor Time is the percentage of elapsed time that the processor spends to execute a non-Idle thread. It is calculated by measuring the percentage of time that the processor spends executing the idle thread and then subtracting that value from 100%. (Each processor has an idle thread that consumes cycles when no other threads are ready to run). This counter is the primary indicator of processor activity, and displays the average percentage of busy time observed during the sample interval. It should be noted that the accounting calculation of whether the processor is idle is performed at an internal sampling interval of the system clock (10ms). On todays fast processors, % Processor Time can therefore underestimate the processor utilization as the processor may be spending a lot of time servicing threads between the system clock sampling interval. Workload based timer applications are one example of applications which are more likely to be measured inaccurately as timers are signaled just after the sample is taken. | |
system_windows_processorInterruptsPerSec | System Windows ProcessorInterrupts PerSec | psec | Interrupts/sec is the average rate, in incidents per second, at which the processor received and serviced hardware interrupts. It does not include deferred procedure calls (DPCs), which are counted separately. This value is an indirect indicator of the activity of devices that generate interrupts, such as the system clock, the mouse, disk drivers, data communication lines, network interface cards, and other peripheral devices. These devices normally interrupt the processor when they have completed a task or require attention. Normal thread execution is suspended. The system clock typically interrupts the processor every 10 milliseconds, creating a background of interrupt activity. This counter displays the difference between the values observed in the last two samples, divided by the duration of the sample interval. | |
microsoft_windows_ProcessorInformation_PercentC1Time | Microsoft Windows ProcessorInformation PercentC1Time | % | % C1 Time is the percentage of time the processor spends in the C1 low-power idle state. % C1 Time is a subset of the total processor idle time. C1 low-power idle state enables the processor to maintain its entire context and quickly return to the running state. Not all systems support the % C1 state | |
microsoft_windows_ASP_InMemoryTemplatesCached | Microsoft Windows ASP InMemoryTemplatesCached | count | The number of compiled templates cached in memory. | |
System Windows PhysicalDisk Monitor | system_windows_PhysicalDisk_writesPerSec | System Windows PhysicalDisk Writes PerSec | psec | Disk Writes/sec is the rate of write operations on the disk. |
system_windows_PhysicalDisk_avgDiskSecPerRead | System Windows PhysicalDisk AvgDiskSec PerRead | null | Avg. Disk sec/Read is the average time, in seconds, of a read of data from the disk. | |
system_windows_PhysicalDisk_writeBytesPerSec | System Windows PhysicalDisk WriteBytes PerSec | Bps | Disk Write Bytes/sec is rate at which bytes are transferred to the disk during write operations. | |
System_Windows_PhysicalDisk_readBytesPerSec | System Windows PhysicalDisk ReadBytes PerSec | Bps | Disk Read Bytes/sec is the rate at which bytes are transferred from the disk during read operations. | |
System_Windows_PhysicalDisk_avgDiskSecPerWrite | System Windows PhysicalDisk AvgDiskSec PerWrite | null | Avg. Disk sec/Write is the average time, in seconds, of a write of data to the disk. | |
system_windows_PhysicalDisk_readsPerSec | System Windows PhysicalDisk ReadsPerSec | psec | Disk Reads/sec is the rate of read operations on the disk. | |
System_Windows_PhysicalDisk_avgDiskQueueLength | System Windows PhysicalDisk avgDiskQueueLength | null | Avg. Disk Queue Length is the average number of both read and write requests that were queued for the selected disk during the sample interval. | |
microsoft_windows_PhysicalDisk_AvgDiskBytesPerRead | Microsoft Windows PhysicalDisk AvgDiskBytesPerRead | null | Avg. Disk Bytes/Read is the average number of bytes transferred from the disk during read operations. | |
microsoft_windows_PagingFile_PercentUsage | Microsoft Windows PagingFile PercentUsage | % | The amount of the Page File instance in use in percent | |
System Windows Server Monitor | system_windows_server_logonsPerSec | System Windows Server LogonsPerSec | psec | Logon/sec is the rate of all server logons. |
system_windows_contextSwitchesPerSec | System Windows ContextSwitchesPerSec | psec | "Context Switches/sec is the combined rate at which all processors on the computer are switched from one thread to another. Context switches occur when a running thread voluntarily relinquishes the processor, is preempted by a higher priority ready thread, or switches between user-mode and privileged (kernel) mode to use an Executive or subsystem service. It is the sum of Thread Context Switches/sec for all threads running on all processors in the computer and is measured in numbers of switches. There are context switch counters on the System and Thread objects. This counter displays the difference between the values observed in the last two samples, divided by the duration of the sample interval." | |
system_windows_bytesTransmittedPerSec | System Windows BytesTransmitted PerSec | psec | The number of bytes the server has sent on the network. Indicates how busy the server is. | |
microsoft_windows_System_ProcessorQueueLength | Microsoft Windows System ProcessorQueueLength | null | Processor Queue Length is the number of threads in the processor queue. Unlike the disk counters, this counter counters, this counter shows ready threads only, not threads that are running. There is a single queue for processor time even on computers with multiple processors. Therefore, if a computer has multiple processors, you need to divide this value by the number of processors servicing the workload. A sustained processor queue of less than 10 threads per processor is normally acceptable, dependent of the workload. | |
system_windows_bytesReceivedPerSec | System Windows BytesReceived PerSec | psec | The number of bytes the server has received from the network. Indicates how busy the server is. | |
Agent G2 - Windows Memory Custom Monitor - v3 | System_Windows_Pages_Output_PerSec | System_Windows_Pages_Output_PerSec | psec | |
System_Windows_Memory_AvailableMbytes | System_Windows_Memory_AvailableMbytes | MB | Calculates the Memory Available in Mega Bytes of the System | |
System_Windows_Memory_CommitLimit | System_Windows_Memory_CommitLimit | Bytes | Calculates the physical memory space reserved on the disk paging files | |
System_Windows_Memory_CommittedBytes | System_Windows_Memory_CommittedBytes | Bytes | Calculates the Total Number of Committed Bytes of the System | |
System_Windows_Memory_CommittedBytes_Inuse | System_Windows_Memory_CommittedBytes_Inuse | Bytes | Calculates the Percentage of the Committed Bytes In use of the System | |
System_Windows_Memory_PageFaults_PerSec | System_Windows_Memory_PageFaults_PerSec | psec | Calculates the Total Number of Memory Page Faults Per Second of the System | |
System_Windows_Memory_Pages_PerSec | System_Windows_Memory_Pages_PerSec | psec | Calculates the Total Number of Memory pages per second of the System | |
system_windows_rawdata_pagesIn_PerSecond | system_windows_rawdata_pagesIn_PerSecond | null | ||
system_windows_Memory_TotalInstalledRAM | System Windows Memory TotalInstalledRAM | GB | Total PhysicalMemory(RAM) in GB | |
system_windows_Memory_UsedRAM | System Windows Memory UsedRAM | GB | Used PhysicalMemory(RAM) in GB | |
system_windows_Memory_CommitCharge | System Windows Memory CommitCharge | GB | Commit Charge is the amount of committed virtual memory. Committed memory is the physical memory which has space reserved on the disk paging file(s). There can be one or more paging files on each physical drive. This counter displays the last observed value only; it is not an average. | |
system_windows_Memory_PageFileInstalled | System Windows Memory PageFileInstalled | GB | Page File Installed(Commit Limit) is the amount of virtual memory that can be committed without having to extend the paging file(s). Committed memory is the physical memory which has space reserved on the disk paging files. There can be one paging file on each logical drive). If the paging file(s) are be expanded, this limit increases accordingly. This counter displays the last observed value only; it is not an average. |
Agent G2 - Microsoft Windows OS Counters - RSE - v2
Description
To Monitor OS related counters like disk,memory
Prerequisites
No Prerequisites
Agent G2 - MSSQL Database Availability Status - v2
Description
Monitors MSSQL Instance Connection Time in milliseconds and Database Instance status along with Database Status. If the Instance is in a running state and the Databases associated with the instance is offline, it considers the Instance as Down. Conversely, if the Instance is in a running state and the associated Database is up, it considers the Instance as Up. Default value is ‘all’ i.e., it will monitor all databases. If user need to monitor any particular databases, then provide the database names with comma separated. Example: master,msdb,temp.
Prerequisites
Agent G2 - MSSQL Database Availability Status - v2
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - MSSQL Database Availability Status - v2 | mssql_database_availability_status | MSSQL Database Availability Status | Status | MSSQL Database Availability Status |
mssql_database_connection_time | MSSQL Database Connection Time | ms | Monitors MSSQL Database Instance Connection Time in milliseconds |
Agent G2 - MSSQL Database Data and Log Files Freespace and Utilization
Description
Monitors MSSQL Database Log & Data file Freespace and utilization.
Prerequisites
Requires Agent version 14.0.0 or later. MSSQL database credentials must be added to the device. When assigning the template, user need to provide the MSSQL Server name (if they have multiple instances, provide it in this format: ServerName InstanceName) and MSSQL instance name (instance names will be available in services.msc) and mention that the authentication type is Windows or SQL.
Template Usage Guidelines:
- Requires Agent version 14.0.0 or later.
- MSSQL database credentials must be added to the device.
- When assigning the template, user need to provide the MSSQL Server name (if they have multiple instances, provide it in this format: ServerName InstanceName) and MSSQL instance name (instance names will be available in services.msc) and mention that the authentication type is Windows or SQL.
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - MSSQL Database Data and Log Files Freespace and Utilization | mssql_database_logFilesFreeSpace | MSSQL Database LogFileFreeSpace | % | Monitors MSSQL database LogFiles free space |
mssql_database_dataFilesFreeSpace | MSSQL Database DataFilesFreeSpace | % | Monitors MSSQL datafiles free space regardless of auto-growth | |
mssql_database_ldf_file_utilization_percentage | MSSQL Database ldf File Utilization in Percentage | % | Monitors MSSQL .ldf files utilization in percentage | |
mssql_database_ndf_file_utilization_percentage | MSSQL Database ndf File Utilization Percentage | % | Monitors MSSQL .ndf files utilization in percentage | |
mssql_database_mdf_file_utilization_percentage | MSSQL Database mdf File Utilization Percentage | % | Monitors MSSQL .mdf files utilization in percentage |
Agent G2 - MSSQL Database Agent Job Status
Description
Monitors MSSQL Agent Job Status latest information (in last 15mins only): Below are the possible status values - 0 : Failed 1 : Successful 2 : Retry 3 : Cancelled 4 : InProgress.
Prerequisites
Requires Agent version 14.0.0 or later. MSSQL database credentials must be added to the device. When assigning the template, user need to provide the MSSQL Server name (if they have multiple instances, provide it in this format: ServerName InstanceName) and MSSQL instance name (instance names will be available in services.msc) and mention that the authentication type is Windows or SQL.
Template Usage Guidelines:
- Requires Agent version 14.0.0 or later.
- MSSQL database credentials must be added to the device.
- When assigning the template, user need to provide the MSSQL Server name (if they have multiple instances, provide it in this format: ServerName InstanceName) and MSSQL instance name (instance names will be available in services.msc) and mention that the authentication type is Windows or SQL.
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - MSSQL Database Agent Job Status | mssql_database_agentJobStatus | MSSQL Database AgentJobStatus | MSSQL DB Agent Jobs Status : This metric collects mssql agent job status latest information (in last 15mins only): Below are the possible status values - 0 : Failed 1 : Successful 2 : Retry 3 : Cancelled 4 : InProgress |
Agent G2 - Microsoft Windows MSSQL Performance and Statistics
Description:
Monitors the various performance statistics of MSSQL from versions 2000 to 2022. Ensure that required WMI classes be available in system.
Template Usage Guidelines:
While applying this template on the device, users need to provide specific input parameters specifying version of MSSQL. The default value is set to “MSSQL2022.”
Example 1: MSSQL2022
This monitors the MSSQL 2022 statistics for given metrics.
Example 2: MSSQL2019
This monitors the MSSQL 2019 statistics for given metrics.
Below are the Namespaces associated to MSSQL version.
“MSSQL2000” namespace = “root\Microsoft\SqlServer\ComputerManagement” “MSSQL2005” namespace = “root\Microsoft\SqlServer\ComputerManagement” “MSSQL2008” namespace = “root\Microsoft\SqlServer\ComputerManagement10” “MSSQL2012” namespace = “root\Microsoft\SqlServer\ComputerManagement11” “MSSQL2014” namespace = “root\Microsoft\SqlServer\ComputerManagement12” “MSSQL2016” namespace = “root\Microsoft\SqlServer\ComputerManagement13” “MSSQL2017” namespace = “root\Microsoft\SqlServer\ComputerManagement14” “MSSQL2019” namespace = “root\Microsoft\SqlServer\ComputerManagement15” “MSSQL2022” namespace = “root\Microsoft\SqlServer\ComputerManagement16”
To validate which namespaces exist on the device and provide the correct MSSQL version:
- Click Start and search for computer management.
- Under Services and Application, select WMI Control.
- Right click services and select security tab Namespace: root\Microsoft\SqlServer?
Please find the attached screenshot for your reference.
Prerequisites
Ensure that required WMI classes be available in system.
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Microsoft Windows MSSQL Performance and Statistics | microsoft_windows_MSSQL_latchWaitsPerSec | Microsoft Windows MSSQL LatchWaits PerSec | Count per sec | Monitors the number of latch requests that could not be granted immediately and had to wait before being granted. |
microsoft_windows_MSSQL_averageLatchWaitTimems | Microsoft Windows MSSQL AverageLatchWaitTimems | ms | Monitors the average latch wait time (milliseconds) for latch requests that had to wait. | |
microsoft_windows_MSSQL_dataFilesSizeKB | Microsoft Windows MSSQL DataFilesSizeKB | KB | Monitors the cumulative size of all the data files in the database. | |
microsoft_windows_MSSQL_logFilesUsedSizeKB | Microsoft Windows MSSQL LogFilesUsedSizeKB | KB | Monitors the cumulative used size of all the log files in the database. | |
microsoft_windows_MSSQL_logFlushesPersec | Microsoft Windows MSSQL LogFlushesPersec | Count per sec | Monitors the total number of log flushes. | |
microsoft_windows_MSSQL_transactionsPersec | Microsoft Windows MSSQL TransactionsPersec | Count per sec | Monitors the total number of transactions started for the database | |
microsoft_windows_MSSQL_lockTimeoutsPersec | Microsoft Windows MSSQL LockTimeoutsPersec | Count per sec | Monitors the number of lock requests that timed out. This includes requests for NOWAIT locks. | |
microsoft_windows_MSSQL_numberofDeadlocksPersec | Microsoft Windows MSSQL NumberofDeadlocksPersec | Count per sec | Monitors the number of lock requests that resulted in a deadlock. | |
microsoft_windows_MSSQL_averageWaitTimems | Microsoft Windows MSSQL AverageWaitTimems | ms | Monitors the total average amount of wait time (milliseconds) for each lock request that resulted in a wait. | |
microsoft_windows_MSSQL_lockWaitsPersec | Microsoft Windows MSSQL LockWaitsPersec | Count per sec | Monitors the total number of lock requests that could not be satisfied immediately and required the caller to wait before being granted the lock. | |
microsoft_windows_MSSQL_pagelifeexpectancy | Microsoft Windows MSSQL Pagelifeexpectancy | s | Monitors the number of seconds a page will stay in the buffer pool without references | |
microsoft_windows_MSSQL_lazywritesPersec | Microsoft Windows MSSQL LazywritesPersec | Count per sec | Monitors the number of buffers written by buffer manager's lazy writer. | |
microsoft_windows_MSSQL_pagereadsPersec | Microsoft Windows MSSQL PagereadsPersec | Count per sec | Monitors the number of physical database page reads issued. | |
microsoft_windows_MSSQL_pagewritesPersec | Microsoft Windows MSSQL PagewritesPersec | Count per sec | Monitors the number of physical database page writes issued. | |
microsoft_windows_MSSQL_checkpointpagesPersec | Microsoft Windows MSSQL CheckpointpagesPersec | Count per sec | Monitors the number of pages flushed by checkpoint or other operations that require all dirty pages to be flushed. | |
microsoft_windows_MSSQL_connectionMemoryKB | Microsoft Windows MSSQL ConnectionMemoryKB | KB | Monitors the total amount of dynamic memory the server is using for maintaining connections. | |
microsoft_windows_MSSQL_databaseCacheMemoryKB | Microsoft Windows MSSQL DatabaseCacheMemoryKB | KB | Monitors the amount of memory the server is currently using for the database cache. | |
microsoft_windows_MSSQL_freeMemoryKB | Microsoft Windows MSSQL FreeMemoryKB | KB | Monitors the amount of memory the server is currently not using. | |
microsoft_windows_MSSQL_lockMemoryKB | Microsoft Windows MSSQL LockMemoryKB | KB | Monitors the total amount of dynamic memory the server is using for locks. | |
microsoft_windows_MSSQL_memoryGrantsPending | Microsoft Windows MSSQL MemoryGrantsPending | count | Monitors the current number of processes waiting for a workspace memory grant. | |
microsoft_windows_MSSQL_optimizerMemoryKB | Microsoft Windows MSSQL OptimizerMemoryKB | KB | Monitors the total amount of dynamic memory the server is using for query optimization. | |
microsoft_windows_MSSQL_totalServerMemoryKB | Microsoft Windows MSSQL TotalServerMemoryKB | KB | Monitors the total amount of dynamic memory the server is currently consuming. | |
microsoft_windows_MSSQL_SQLCacheMemoryKB | Microsoft Windows MSSQL SQLCacheMemoryKB | KB | Monitors the total amount of dynamic memory the server is using for the dynamic SQL cache. | |
microsoft_windows_MSSQL_forwardedRecordsPersec | Microsoft Windows MSSQL ForwardedRecordsPersec | Count per sec | Monitors the number of records fetched through forwarded record pointers. | |
microsoft_windows_MSSQL_pageSplitsPersec | Microsoft Windows MSSQL PageSplitsPersec | Count per sec | Monitors the number of page splits per second that occur as a result of overflowing index pages. | |
microsoft_windows_MSSQL_fullScansPersec | Microsoft Windows MSSQL FullScansPersec | Count per sec | Monitors the number of unrestricted full scans. These can either be base table or full index scans. | |
microsoft_windows_MSSQL_processesBlocked | Microsoft Windows MSSQL ProcessesBlocked | count | Monitors the Number of currently blocked processes. | |
microsoft_windows_MSSQL_userConnections | Microsoft Windows MSSQL UserConnections | count | Monitors the number of users connected to the system. | |
microsoft_windows_MSSQL_loginsPersec | Microsoft Windows MSSQL LoginsPersec | Count per sec | Monitors the total number of logins started per second. | |
microsoft_windows_MSSQL_logoutsPersec | Microsoft Windows MSSQL LogoutsPersec | Count per sec | Monitors the total number of logouts started per second. | |
microsoft_windows_MSSQL_longestTransactionRunningTime | Microsoft Windows MSSQL LongestTransactionRunningTime | s | Monitors the longest running time of any transaction in seconds. | |
microsoft_windows_MSSQL_receiveIPerOsPersec | Microsoft Windows MSSQL ReceiveIPerOsPersec | Count per sec | Monitors the number of transport receives I/O per second. Note that a transport receive I/O may contain more than one message fragment | |
microsoft_windows_MSSQL_sendIPerOsPersec | Microsoft Windows MSSQL SendIPerOsPersec | Count per sec | Monitors the number of transport send I/Os per second. Note that a transport send I/O may contain more than one message fragment. | |
microsoft_windows_MSSQL_openConnectionCount | Microsoft Windows MSSQL OpenConnectionCount | Count | Monitors the total number of transport connections currently open. | |
microsoft_windows_MSSQL_SQLCompilationsPersec | Microsoft Windows MSSQL SQLCompilationsPersec | Count per sec | Monitors the number of SQL compilations. | |
microsoft_windows_MSSQL_SQLReCompilationsPersec | Microsoft Windows MSSQL SQLReCompilationsPersec | Count per sec | Monitors the number of SQL re-compiles. | |
microsoft_windows_MSSQL_batchRequestsPersec | Microsoft Windows MSSQL BatchRequestsPersec | Count per sec | Monitors the number of SQL batch requests received by server. | |
microsoft_windows_MSSQL_taskLimitReached | Microsoft Windows MSSQL TaskLimitReached | count | Monitors the total number of times the activated task limit on a queue has been reached. | |
microsoft_windows_MSSQL_tasksAbortedPersec | Microsoft Windows MSSQL TasksAbortedPersec | Count per sec | Monitors the number of activated tasks that are being aborted per second. | |
microsoft_windows_MSSQL_SQLRECEIVEsPersec | Microsoft Windows MSSQL SQLRECEIVEsPersec | Count per sec | Monitors the number of SQL RECEIVE commands processed by the Broker per second. | |
microsoft_windows_MSSQL_BrokerTransactionRollbacks | Microsoft Windows MSSQL BrokerTransactionRollbacks | count | Monitors the number of Service Broker related transactions that have rolled back. | |
microsoft_windows_MSSQL_SQLSENDsPersec | Microsoft Windows MSSQL SQLSENDsPersec | Count per sec | Monitors the number of SQL SEND commands processed by the Broker per second. | |
microsoft_windows_MSSQL_logBytesFlushedPersec | Microsoft Windows MSSQL LogBytesFlushedPersec | Bps | Monitors the total number of log bytes flushed. | |
microsoft_windows_MSSQL_percentLogUsed | Microsoft Windows MSSQL PercentLogUsed | % | Monitors the percent of space in the log that is in use. | |
microsoft_windows_MSSQL_logFilesSizeKB | Microsoft Windows MSSQL LogFilesSizeKB | KB | Monitors the cumulative size of all the log files in the database. | |
microsoft_windows_MSSQL_logFlushWaitsPersec | Microsoft Windows MSSQL LogFlushWaitsPersec | Count per sec | Monitors the number of commits waiting on log flush. | |
microsoft_windows_MSSQL_logGrowths | Microsoft Windows MSSQL LogGrowths | count | Monitors the total number of log growths for this database. | |
microsoft_windows_MSSQL_logShrinks | Microsoft Windows MSSQL LogShrinks | count | Monitors the total number of log shrinks for this database. | |
microsoft_windows_MSSQL_cacheHitRatioPercentage | Microsoft Windows MSSQL CacheHitRatio Percentage | % | Monitors the percentage of cache hits relative to cache lookups. | |
microsoft_windows_MSSQL_bufferCacheHitRatioPercentage | Microsoft Windows MSSQL Buffercachehitratio Percentage | % | Monitors the percentage of pages that were found in the buffer pool without having to incur a read from disk. |
Agent G2 - Windows CertStore Certificates Expiry - IssuedTo_SN as component
Description
To monitor certificate(s) expiry (in days) which are available in Certificate Manager and not yet expired. This template contains “Issued To and Serial Number” in alert subject.
Template Usage Guidelines:
The script receives its parameters from custom monitor during Runtime.
Params field in the custom monitor holds the certificate store path, the excluded Thumbprints OR the keyword “all”
Formulate your Path params in the format specified below and encode the string. The string has two parts to it separated by a semicolon ( ; ).
The Certificate Store paths are separated by a comma (,) and are present to the left to the semicolon separator.
The list of Thumbprints that needs to be excluded from the monitor are comma (,) separated and are present to the right of the semicolon separator.
The error or warning alerts contains Certificate information specifying respective certificate’s Issuer, Subject, Serial Number.
Case 1: Giving input as specific paths and excluding few certificates from those paths(using their thumbprints)
Example String Before Encoding:
cert:\LocalMachine\Root, cert:\LocalMachine\AuthRoot;a8985d3a65e5e5c4b2d7d66d40c6dd2fb19c5436,df3c24f9bfd666761b268073fe06d1cc8d4f82a4
Example String After Encoding: (use a tool like notepad++ or online resource like https://www.base64encode.org/ to encode your params string)
Y2VydDpcTG9jYWxNYWNoaW5lXFJvb3QsY2VydDpcTG9jYWxNYWNoaW5lXEF1dGhSb290O+KAjmE4OTg1ZDNhNjVlNWU1YzRiMmQ3ZDY2ZDQwYzZkZDJmYjE5YzU0MzYs4oCOZGYzYzI0ZjliZmQ2NjY3NjFiMjY4MDczZmUwNmQxY2M4ZDRmODJhNA==
Case 2: Monitoring all the certificates from all folders or Current User account and Local Machine account
To monitor all the certificates user needs to give input as “all”
Prerequisite
Powershell Version 3 and above.
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Windows CertStore Certificates Expiry - IssuedTo_SN as component | windows_certStore_certificatesExpiryInDays | Windows CertStore CertificatesExpiry InDays | Days | To monitor certificate(s) expiry (in days) which are available in Certificate Manager. |
Agent G2 - Windows Memory Monitoring
Description
Agent G2 - Windows Service Monitoring v2.0
Prerequisites
No prerequisite
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Windows Memory Custom Monitor | system.windows.memory.available | System Windows Memory Available | MB | Available MBytes is the amount of physical memory, in Megabytes, immediately available for allocation to a process or for system use. It is equal to the sum of memory assigned to the standby (cached), free and zero page lists. |
Agent G2 - Windows Network Interface Monitoring
Description
Agent G2 - Windows Network Interface Monitoring
Prerequisites
No prerequisite
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Windows Network Interface Custom Monitor | system.windows.network.interface.packets.persec | System Windows Network Interface Packets Per Sec | packets/sec | Packets/sec is the rate at which packets are sent and received on the network interface. |
system.windows.network.interface.traffic.in | System Windows Network Interface Traffic In | bps | Bytes Received/sec is the rate at which bytes are received over each network adapter, including framing characters. Network Interface\\Bytes Received/sec is a subset of Network Interface\\Bytes Total/sec. | |
system.windows.network.interface.traffic.out | System Windows Network Interface Traffic Out | bps | Bytes Sent/sec is the rate at which bytes are sent over each network adapter, including framing characters. Network Interface\\Bytes Sent/sec is a subset of Network Interface\\Bytes Total/sec. | |
system.windows.network.interface.traffic.total | System Windows Network Interface Traffic Total | bps | Bytes Total/sec is the rate at which bytes are sent and received over each network adapter, including framing characters. Network Interface\\Bytes Total/sec is a sum of Network Interface\\Bytes Received/sec and Network Interface\\Bytes Sent/sec. |
Agent G2 - Windows Network Interface Monitoring - v3
Description
Monitors Windows Network Interface metrics. In this version of template, we have added support for new metrics, and we have also modified the metric names from the existing dotted notation to underscore format.
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Windows Network Interface Custom Monitor - v3 | system_windows_network_interface_status | System Windows Network Interface Status | null | NetConnectionStatus is a string indicating the state of the network adapter's connection to the network. The value of the property is to be interpreted as follows: 0 - Disconnected 1 - Connecting 2 - Connected 3 - Disconnecting 4 - Hardware not present 5 - Hardware disabled 6 - Hardware malfunction 7 - Media disconnected 8 - Authenticating 9 - Authentication succeeded 10 - Authentication failed 11 - Invalid Address 12 - Credentials Required Other (13?65535) |
system_windows_network_interface_currentBandwidth | System Windows Network Interface CurrentBandwidth | bps | Current Bandwidth is an estimate of the current bandwidth of the network interface in bits per second (BPS). For interfaces that do not vary in bandwidth or for those where no accurate estimation can be made, this value is the nominal bandwidth. | |
system_windows_network_interface_packetsReceivedPersec | System Windows Network Interface PacketsReceivedPersec | psec | Packets Received/sec is the rate at which packets are received on the network interface. | |
system_windows_network_interface_packetsSentPersec | System Windows Network Interface PacketsSentPersec | psec | Packets Sent/sec is the rate at which packets are sent on the network interface. | |
system_windows_network_interface_packetsReceivedNonUnicastPersec | System Windows Network Interface PacketsReceivedNonUnicastPersec | psec | Packets Received Non-Unicast/sec is the rate at which non-unicast (subnet broadcast or subnet multicast) packets are delivered to a higher-layer protocol. | |
system_windows_network_interface_packetsReceivedUnicastPersec | System Windows Network Interface PacketsReceivedUnicastPersec | psec | Packets Received Unicast/sec is the rate at which (subnet) unicast packets are delivered to a higher-layer protocol. | |
system_windows_network_interface_packetsSentNonUnicastPersec | System Windows Network Interface PacketsSentNonUnicastPersec | psec | Packets Sent Non-Unicast/sec is the rate at which packets are requested to be transmitted to non-unicast (subnet broadcast or subnet multicast) addresses by higher-level protocols. The rate includes the packets that were discarded or not sent. | |
system_windows_network_interface_packetsSentUnicastPersec | System Windows Network Interface PacketsSentUnicastPersec | psec | Packets Sent Unicast/sec is the rate at which packets are requested to be transmitted to subnet-unicast addresses by higher-level protocols. The rate includes the packets that were discarded or not sent. | |
system_windows_network_interface_offloadedConnections | System Windows Network Interface OffloadedConnections | count | Offloaded Connections is the number of TCP connections (over both IPv4 and IPv6) that are currently handled by the TCP chimney offload capable network adapter. | |
system_windows_network_interface_packetsOutboundErrors | System Windows Network Interface PacketsOutboundErrors | count | Packets Outbound Errors is the number of outbound packets that could not be transmitted because of errors. | |
system_windows_network_interface_packetsReceivedErrors | System Windows Network Interface PacketsReceivedErrors | count | Packets Received Errors is the number of inbound packets that contained errors preventing them from being deliverable to a higher-layer protocol. | |
system_windows_network_interface_outputQueueLength | System Windows Network Interface OutputQueueLength | count | Output Queue Length is the length of the output packet queue (in packets). If this is longer than two, there are delays and the bottleneck should be found and eliminated, if possible. Since the requests are queued by the Network Driver Interface Specification (NDIS) in this implementation, this will always be 0. | |
system_windows_network_interface_packetsReceivedDiscarded | System Windows Network Interface PacketsReceivedDiscarded | count | Packets Received Discarded is the number of inbound packets that were chosen to be discarded even though no errors had been detected to prevent their delivery to a higher-layer protocol. One possible reason for discarding packets could be to free up buffer space. | |
system_windows_network_interface_packetsOutboundDiscarded | System Windows Network Interface PacketsOutboundDiscarded | count | Packets Outbound Discarded is the number of outbound packets that were chosen to be discarded even though no errors had been detected to prevent transmission. One possible reason for discarding packets could be to free up buffer space. | |
system_windows_network_interface_utilization | System Windows Network Interface Utilization | % | Network Interface utilization in percentage. | |
system_windows_network_interface_trafficIn | System Windows Network Interface TrafficIn | bps | Bytes Received/sec is the rate at which bytes are received over each network adapter, including framing characters. Network Interface\Bytes Received/sec is a subset of Network Interface\Bytes Total/sec. | |
system_windows_network_interface_trafficOut | System Windows Network Interface TrafficOut | bps | Bytes Sent/sec is the rate at which bytes are sent over each network adapter, including framing characters. Network Interface\Bytes Sent/sec is a subset of Network Interface\Bytes Total/sec. | |
system_windows_network_interface_trafficTotal | System Windows Network Interface TrafficTotal | bps | Bytes Total/sec is the rate at which bytes are sent and received over each network adapter, including framing characters. Network Interface\Bytes Total/sec is a sum of Network Interface\Bytes Received/sec and Network Interface\Bytes Sent/sec. | |
system_windows_network_interface_packetsPersec | System Windows Network Interface PacketsPersec | packets/sec | Packets/sec is the rate at which packets are sent and received on the network interface. |
Agent G2 - Windows Network Interface Monitoring - v2
Description
Agent G2 - Windows Network Interface Monitoring - v2
Prerequisites
NULL
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Windows Network Interface Monitoring - v2 | system.windows.network.interface.packets.received.persec | System Windows Network Interface Packets Received Per Sec | packets/sec | PacketsReceived/sec is the rate at which packets are received on the network interface. |
system.windows.network.interface.packets.sent.persec | System Windows Network Interface Packets Sent Per Sec | packets/sec | PacketsSentPerSec is the rate at which packets are sent on the network interface. | |
system.windows.network.interface.traffic.out | System Windows Network Interface Traffic Out | bps | Bytes Sent/sec is the rate at which bytes are sent over each network adapter, including framing characters. Network Interface\\Bytes Sent/sec is a subset of Network Interface\\Bytes Total/sec. | |
system.windows.network.interface.packets.persec | System Windows Network Interface Packets Per Sec | packets/sec | Packets/sec is the rate at which packets are sent and received on the network interface. | |
system.windows.network.interface.traffic.in | System Windows Network Interface Traffic In | bps | Bytes Received/sec is the rate at which bytes are received over each network adapter, including framing characters. Network Interface\\Bytes Received/sec is a subset of Network Interface\\Bytes Total/sec. | |
system.windows.network.interface.traffic.total | System Windows Network Interface Traffic Total | bps | Bytes Total/sec is the rate at which bytes are sent and received over each network adapter, including framing characters. Network Interface\\Bytes Total/sec is a sum of Network Interface\\Bytes Received/sec and Network Interface\\Bytes Sent/sec. |
Agent G2 - Windows Registry Monitoring
Description
Agent G2 - Windows Registry Monitoring
Prerequisites
No prerequisite
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Windows Registry Custom Monitor | system.windows.registry.quota.usage | System Windows Registry Quota Usage | % | % Registry Quota In Use is the percentage of the Total Registry Quota Allowed that is currently being used by the system. This counter displays the current percentage value only; it is not an average. |
Agent G2 - Windows Services Monitoring
Description
Agent G2 - Windows Services Monitoring
Prerequisites
Provide ServiceNames with comma separated as input parameters while applying the template at device level.
Example: opsramp-agent,opsramp-shield,power.
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Windows Services Custom Monitor | system.windows.service.status | System Windows Service Status | NULL | It gives the current status of the given service name(s), In graph 1 - Running & 0 - Stopped |
Agent G2 - Windows Service Monitoring v2.0
Description
It monitors the windows service monitoring with Regex support.
Prerequisites
Provide service names as input arguments separated by commas when applying the template at the device level, along with support for regular expressions.
Syntax: serviceName1,serviceName2,regexPattern1,regexPattern2.
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Windows Services Custom Monitor v2.0 | System_Windows_Service_Status_Ext | System Windows Service Status Ext | NULL | It gives the current status of the services by matching with the given service name(s) or regex patterns(s). Below are the possible values: Stopped - 0, Running - 1, Start Pending - 2, Stop Pending - 3, Continue Pending - 4, Pause Pending - 5, Paused - 6, Unknown - 7 |
Agent G2 - Windows Service Monitoring - v3
Description
To monitor the windows services status
Template Usage Guidelines:
- When assigning this template on the device, users need to pass specific input parameters. These parameters should be provided as one or more service names (not the service display names) or service name regex patterns.
- To provide multiple service names or service name regex patterns, seperate them with commas.
- SAMPLE CUSTOM SCRIPT ARGUMENTS:
Example 1(With regex): ^opsramp,agent$,Power
Example 2(Without Regex): Netlogon,Dnscache,RpcEptMapper
Prerequisites
No prerequisite
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Windows Services Custom Monitor - v3 | System_Windows_Service_Status_Ext | System Windows Service Status Ext | It gives the current status of the services by matching with the given service name(s) or regex patterns(s). Below are the possible values: Stopped - 0, Running - 1, Start Pending - 2, Stop Pending - 3, Continue Pending - 4, Pause Pending - 5, Paused - 6, Unknown - 7 |
Agent G2 - Windows Mountpoint Monitoring
Description
Agent G2 - Windows Mountpoint Monitoring
Prerequisites
No prerequisite
Supported Metric
Monitor Name | Metric Name | Metric Display Name | Unit | Description |
---|---|---|---|---|
Agent G2 - Windows Mountpoint Custom Monitor | system.windows.mountpoint.disk.freespace | System Windows MountPoint Disk FreeSpace | MB | Shows the free space of mounted disks in MB. |
system.windows.mountpoint.disk.usage | System Windows MountPoint Disk Usage | % | Shows the utilization space of mounted disks in percentage. |