
AWS Metrics

Note   In the following tables, the BASE column indicates whether there's a baseline band available for the metric, the CORR column indicates whether there's a contextual band available for the metric, and the UTIL column indicates whether the metric can be used as a utilization metric in the Utilization and Utilization Boxplot Reports.

Collected

ASG

For a description of the collected metrics, visit AWS EC2 dimensions and metrics.

Friendly Name Fully Qualified Name (FQN) AWS Metric Statistic Units Min Max Sparse Data Strategy (SDS) BASE CORR UTIL
CPU Credit Balance aws.ec2.cpucreditbalance CPUCreditBalance average   0 none none
CPU Credit Usage aws.ec2.cpucreditusage CPUCreditUsage sum   0 none none
CPU Utilization Percent aws.ec2.cpuutilization CPUUtilization average percent 0 100 none
Ephemeral Disk Read Bytes aws.ec2.diskreadbytes DiskReadBytes sum bytes 0 none none
Ephemeral Disk Read Ops aws.ec2.diskreadops DiskReadOps sum   0 none none
Ephemeral Disk Write Bytes aws.ec2.diskwritebytes DiskWriteBytes sum bytes 0 none none
Ephemeral Disk Write Ops aws.ec2.diskwriteops DiskWriteOps sum   0 none none
Network Bytes In Counter aws.ec2.networkin NetworkIn sum bytes 0 none none
Network Bytes Out Counter aws.ec2.networkout NetworkOut sum bytes 0 none none
Status Check Failed aws.ec2.statuscheckfailed StatusCheckFailed sum   0 5 none
Status Check Failed - Instance aws.ec2.statuscheckfailed_instance StatusCheckFailed_Instance sum   0 5 none
Status Check Failed - System aws.ec2.statuscheckfailed_system StatusCheckFailed_System sum   0 5 none

DynamoDB

For a description of the collected metrics, visit AWS DynamoDB dimensions and metrics.

Fully Qualified Name (FQN) AWS Metric Statistic Units Min Max Sparse Data Strategy (SDS) BASE CORR UTIL
aws.dynamodb.conditionalcheckfailedrequests ConditionalCheckFailedRequests sum count 0 none zero
aws.dynamodb.consumedreadcapacityunits ConsumedReadCapacityUnits sum count 0 none zero
aws.dynamodb.consumedwritecapacityunits ConsumedWriteCapacityUnits sum count 0 none zero
aws.dynamodb.onlineindexconsumedwritecapacity OnlineIndexConsumedWriteCapacity sum count 0 none zero
aws.dynamodb.onlineindexpercentageprogress OnlineIndexPercentageProgress max percent 0 100 none
aws.dynamodb.onlineindexthrottleevents OnlineIndexThrottleEvents sum count 0 none zero
aws.dynamodb.provisionedreadcapacityunits ProvisionedReadCapacityUnits sum count 0 none zero
aws.dynamodb.provisionedwritecapacityunits ProvisionedWriteCapacityUnits sum count 0 none zero
aws.dynamodb.readthrottleevents ReadThrottleEvents sum count 0 none zero
aws.dynamodb.returnedbytes ReturnedBytes average bytes 0 none zero
aws.dynamodb.returneditemcount ReturnedItemCount average count 0 none zero
aws.dynamodb.returnedrecordscount ReturnedRecordsCount average count 0 none zero
aws.dynamodb.successfulrequestlatency SuccessfulRequestLatency average ms 0 none zero
aws.dynamodb.systemerrors SystemErrors sum count 0 none zero
aws.dynamodb.throttledrequests ThrottledRequests sum count 0 none zero
aws.dynamodb.usererrors UserErrors sum count 0 none zero
aws.dynamodb.writethrottleevents WriteThrottleEvents sum count 0 none zero

EBS

For a description of the collected metrics, visit AWS EBS dimensions and metrics.

Friendly Name Fully Qualified Name (FQN) AWS Metric Statistic Units Min Max Sparse Data Strategy (SDS) BASE CORR UTIL
Read Ops aws.ebs.volumereadops VolumeReadOps sum   0 none none
Write Ops aws.ebs.volumewriteops VolumeWriteOps sum   0 none none
Queue Length aws.ebs.volumequeuelength VolumeQueueLength sum   0 none none
Idle Time aws.ebs.volumeidletime VolumeIdleTime average s 0 300 none
Read Bytes aws.ebs.volumereadbytes VolumeReadBytes sum bytes 0 none zero
Total Read Time aws.ebs.volumetotalreadtime VolumeTotalReadTime sum s 0 none zero
Total Write Time aws.ebs.volumetotalwritetime VolumeTotalWriteTime sum s 0 none zero
Write Bytes aws.ebs.volumewritebytes VolumeWriteBytes sum bytes 0 none zero
Percent IOPS Delivered aws.ebs.volumethroughputpercentage VolumeThroughputPercentage average percent 0 100 none
Consumed Read Write Ops aws.ebs.volumeconsumedreadwriteops VolumeConsumedReadWriteOps sum   0 none none

ECS

For a description of the collected metrics, visit Amazon CloudWatch Developer Guide - ECS Metrics.

Cluster Service Fully Qualified Name (FQN) AWS Metric Statistic Units Min Max Sparse Data Strategy (SDS) BASE CORR UTIL
aws.ecs.cpureservation CPUReservation average percent 0 100 none
aws.ecs.memoryreservation MemoryReservation average percent 0 100 none
aws.ecs.cpuutilization CPUUtilization average percent 0 100 none
aws.ecs.memoryutilization MemoryUtilization average percent 0 100 none

EC2

For a description of the collected metrics, visit AWS EC2 dimensions and metrics.

Fully Qualified Name (FQN) AWS Metric Statistic Units Min Max Sparse Data Strategy (SDS) BASE CORR UTIL
aws.ec2.cpucreditbalance CPUCreditBalance average   0 none none
aws.ec2.cpucreditusage CPUCreditUsage sum   0 none none
aws.ec2.cpuutilization CPUUtilization average percent 0 100 none
aws.ec2.diskreadbytes DiskReadBytes sum bytes 0 none none
aws.ec2.diskreadops DiskReadOps sum   0 none none
aws.ec2.diskwritebytes DiskWriteBytes sum bytes 0 none none
aws.ec2.diskwriteops DiskWriteOps sum   0 none none
aws.ec2.networkin NetworkIn sum bytes 0 none none
aws.ec2.networkout NetworkOut sum bytes 0 none none
aws.ec2.statuscheckfailed StatusCheckFailed sum   0 5 none
aws.ec2.statuscheckfailed_instance StatusCheckFailed_Instance sum   0 5 none
aws.ec2.statuscheckfailed_system StatusCheckFailed_System sum   0 5 none

Elasticache

Note   Elasticache clusters come in several "flavors," and the available metrics depend on the flavor. In the table below, Host-level metrics are available on both Memcached and Redis clusters, Memcached metrics are available only on Memcached clusters, Memcached 1.4.14 metrics are available only on Memcached clusters running version 1.4.14 or later, and Redis metrics are available only on Redis clusters.

For a description of the collected metrics, see the following links:

Metric Type Fully Qualified Name (FQN) AWS Metric Statistic Units Min Max Sparse Data Strategy (SDS) BASE CORR UTIL
Host-level aws.elasticache.cpuutilization CPUUtilization average percent 0 100 none
Host-level aws.elasticache.freeablememory FreeableMemory average bytes 0 none none
Host-level aws.elasticache.networkbytesin NetworkBytesIn average bytes 0 none none
Host-level aws.elasticache.networkbytesout NetworkBytesOut average bytes 0 none none
Host-level aws.elasticache.swapusage SwapUsage average bytes 0 none none
Memcached aws.elasticache.bytesreadintomemcached BytesReadIntoMemcached average bytes 0 none none
Memcached aws.elasticache.bytesusedforcacheitems BytesUsedForCacheItems average bytes 0 none none
Memcached aws.elasticache.byteswrittenoutfrommemcached BytesWrittenOutFromMemcached average bytes 0 none none
Memcached aws.elasticache.casbadval CasBadVal sum count 0 none none
Memcached aws.elasticache.cashits CasHits sum count 0 none none
Memcached aws.elasticache.casmisses CasMisses sum count 0 none none
Memcached aws.elasticache.cmdflush CmdFlush sum count 0 none none
Memcached aws.elasticache.cmdget CmdGet sum count 0 none none
Memcached aws.elasticache.cmdset CmdSet sum count 0 none none
Memcached aws.elasticache.currconnections CurrConnections sum count 0 none none
Memcached aws.elasticache.curritems CurrItems sum count 0 none none
Memcached aws.elasticache.decrhits DecrHits sum count 0 none none
Memcached aws.elasticache.decrmisses DecrMisses sum count 0 none none
Memcached aws.elasticache.deletehits DeleteHits sum count 0 none none
Memcached aws.elasticache.deletemisses DeleteMisses sum count 0 none none
Memcached aws.elasticache.evictions Evictions sum count 0 none none
Memcached aws.elasticache.gethits GetHits sum count 0 none none
Memcached aws.elasticache.getmisses GetMisses sum count 0 none none
Memcached aws.elasticache.incrhits IncrHits sum count 0 none none
Memcached aws.elasticache.incrmisses IncrMisses sum count 0 none none
Memcached aws.elasticache.reclaimed Reclaimed sum count 0 none none
Memcached 1.4.14 aws.elasticache.bytesusedforhash BytesUsedForHash average bytes 0 none none
Memcached 1.4.14 aws.elasticache.cmdconfigget CmdConfigGet sum count 0 none none
Memcached 1.4.14 aws.elasticache.cmdconfigset CmdConfigSet sum count 0 none none
Memcached 1.4.14 aws.elasticache.cmdtouch CmdTouch sum count 0 none none
Memcached 1.4.14 aws.elasticache.currconfig CurrConfig average count 0 none none
Memcached 1.4.14 aws.elasticache.evictedunfetched EvictedUnfetched sum count 0 none none
Memcached 1.4.14 aws.elasticache.expiredunfetched ExpiredUnfetched sum count 0 none none
Memcached 1.4.14 aws.elasticache.slabsmoved SlabsMoved sum count 0 none none
Memcached 1.4.14 aws.elasticache.touchhits TouchHits sum count 0 none none
Memcached 1.4.14 aws.elasticache.touchmisses TouchMisses sum count 0 none none
Redis aws.elasticache.bytesusedforcache BytesUsedForCache average bytes 0 none none
Redis aws.elasticache.cachehits CacheHits sum count 0 none none
Redis aws.elasticache.cachemisses CacheMisses sum count 0 none none
Redis aws.elasticache.currconnections CurrConnections sum count 0 none none
Redis aws.elasticache.evictions Evictions sum count 0 none none
Redis aws.elasticache.hyperloglogbasedcmds HyperLogLogBasedCmds sum count 0 none none
Redis aws.elasticache.newconnections NewConnections sum count 0 none none
Redis aws.elasticache.reclaimed Reclaimed sum count 0 none none
Redis aws.elasticache.replicationbytes ReplicationBytes average bytes 0 none none
Redis aws.elasticache.replicationlag ReplicationLag average seconds 0 none none
Redis aws.elasticache.saveinprogress SaveInProgress max count 0 1 none
Redis aws.elasticache.curritems CurrItems sum count 0 none none
Redis aws.elasticache.gettypecmds GetTypeCmds sum count 0 none none
Redis aws.elasticache.hashbasedcmds HashBasedCmds sum count 0 none none
Redis aws.elasticache.keybasedcmds KeyBasedCmds sum count 0 none none
Redis aws.elasticache.listbasedcmds ListBasedCmds sum count 0 none none
Redis aws.elasticache.setbasedcmds SetBasedCmds sum count 0 none none
Redis aws.elasticache.settypecmds SetTypeCmds sum count 0 none none
Redis aws.elasticache.sortedsetbasedcmds SortedSetBasedCmds sum count 0 none none
Redis aws.elasticache.stringbasedcmds StringBasedCmds sum count 0 none none

ELB

For a description of the collected metrics, visit AWS ELB dimensions and metrics.

Friendly Name Fully Qualified Name (FQN) AWS Metric Statistic Units Min Max Sparse Data Strategy (SDS) BASE CORR UTIL
Healthy Host Count aws.elb.healthyhostcount HealthyHostCount average   0 none last
Unhealthy Host Count aws.elb.unhealthyhostcount UnHealthyHostCount average   0 none last
Request Count aws.elb.requestcount RequestCount sum   0 none zero
Average Latency aws.elb.latency Latency average s 0 none zero
HTTPCode ELB 4XX Response Count aws.elb.httpcode_elb_4xx HTTPCode_ELB_4XX sum   0 none zero
HTTPCode ELB 5XX Response Count aws.elb.httpcode_elb_5xx HTTPCode_ELB_5XX sum   0 none zero
HTTPCode Backend 2XX Response Count aws.elb.httpcode_backend_2xx HTTPCode_Backend_2XX sum   0 none zero
HTTPCode Backend 3XX Response Count aws.elb.httpcode_backend_3xx HTTPCode_Backend_3XX sum   0 none zero
HTTPCode Backend 4XX Response Count aws.elb.httpcode_backend_4xx HTTPCode_Backend_4XX sum   0 none zero
HTTPCode Backend 5XX Response Count aws.elb.httpcode_backend_5xx HTTPCode_Backend_5XX sum   0 none zero
Backend Connection Errors aws.elb.backendconnectionerrors BackendConnectionErrors sum   0 none zero
Average Surge Queue Length aws.elb.surgequeuelength SurgeQueueLength average   0 1024 zero
Spillover Count aws.elb.spillovercount SpilloverCount sum   0 none zero

EMR

Note   AWS groups EMR metrics into different categories (cluster status, node status, IO, etc.), but this has no impact on how Netuitive monitors EMR.

For a description of the collected metrics, visit AWS EMR dimensions and metrics.

Category Fully Qualified Name (FQN) AWS Metric Statistic Units Min Max Sparse Data Strategy (SDS) BASE CORR UTIL
Cluster Status aws.elasticmapreduce.appscompleted AppsCompleted average count 0 none none
Cluster Status aws.elasticmapreduce.appsfailed AppsFailed average count 0 none none
Cluster Status aws.elasticmapreduce.appskilled AppsKilled average count 0 none none
Cluster Status aws.elasticmapreduce.appspending AppsPending average count 0 none none
Cluster Status aws.elasticmapreduce.appsrunning AppsRunning average count 0 none none
Cluster Status aws.elasticmapreduce.appssubmitted AppsSubmitted average count 0 none none
Cluster Status aws.elasticmapreduce.containerallocated ContainerAllocated average count 0 none none
Cluster Status aws.elasticmapreduce.containerreserved ContainerReserved average count 0 none none
Cluster Status aws.elasticmapreduce.containerpending ContainerPending average count 0 none none
Cluster Status aws.elasticmapreduce.isidle IsIdle average count 0 1 none
Node Status aws.elasticmapreduce.corenodesrunning CoreNodesRunning average count 0 none none
Node Status aws.elasticmapreduce.corenodespending CoreNodesPending average count 0 none none
Node Status aws.elasticmapreduce.livedatanodes LiveDataNodes average percent 0 100 none
Node Status aws.elasticmapreduce.mrtotalnodes MRTotalNodes average count 0 none none
Node Status aws.elasticmapreduce.mractivenodes MRActiveNodes average count 0 none none
Node Status aws.elasticmapreduce.mrlostnodes MRLostNodes average count 0 none none
Node Status aws.elasticmapreduce.mrunhealthynodes MRUnhealthyNodes average count 0 none none
Node Status aws.elasticmapreduce.mrdecommissionednodes MRDecommissionedNodes average count 0 none none
Node Status aws.elasticmapreduce.mrrebootednodes MRRebootedNodes average count 0 none none
IO aws.elasticmapreduce.s3byteswritten S3BytesWritten sum bytes 0 none none
IO aws.elasticmapreduce.s3bytesread S3BytesRead sum bytes 0 none none
IO aws.elasticmapreduce.hdfsutilization HDFSUtilization average percent 0 100 none
IO aws.elasticmapreduce.hdfsbytesread HDFSBytesRead sum bytes 0 none none
IO aws.elasticmapreduce.hdfsbyteswritten HDFSBytesWritten sum bytes 0 none none
IO aws.elasticmapreduce.missingblocks MissingBlocks average count 0 none none
IO aws.elasticmapreduce.corruptblocks CorruptBlocks average count 0 none none
IO aws.elasticmapreduce.totalload TotalLoad average count 0 none none
IO aws.elasticmapreduce.memorytotalmb MemoryTotalMB average megabytes 0 none none
IO aws.elasticmapreduce.memoryreservedmb MemoryReservedMB average megabytes 0 none none
IO aws.elasticmapreduce.memoryavailablemb MemoryAvailableMB average megabytes 0 none none
IO aws.elasticmapreduce.memoryallocatedmb MemoryAllocatedMB average megabytes 0 none none
IO aws.elasticmapreduce.pendingdeletionblocks PendingDeletionBlocks average count 0 none none
IO aws.elasticmapreduce.underreplicatedblocks UnderReplicatedBlocks average count 0 none none
IO aws.elasticmapreduce.dfspendingreplicationblocks DfsPendingReplicationBlocks average count 0 none none
IO aws.elasticmapreduce.capacityremaininggb CapacityRemainingGB average gigabytes 0 none none
HBase aws.elasticmapreduce.hbasebackupfailed HbaseBackupFailed average count 0 1 none
HBase aws.elasticmapreduce.mostrecentbackupduration MostRecentBackupDuration average count 0 none none
HBase aws.elasticmapreduce.timesincelastsuccessfulbackup TimeSinceLastSuccessfulBackup average count 0 none none

Kinesis

For a description of the collected metrics, visit Monitoring the Amazon Kinesis Streams Service with Amazon CloudWatch.

Important   Netuitive currently supports only Kinesis Streams; support for Kinesis Firehose may be added in the future.
Fully Qualified Name (FQN) AWS Metric Statistic Units Min Max Sparse Data Strategy (SDS) BASE CORR UTIL
aws.kinesis.getrecords.bytes GetRecords.Bytes average bytes 0 none zero
aws.kinesis.getrecords.iteratoragemilliseconds GetRecords.IteratorAgeMilliseconds average ms 0 none zero
aws.kinesis.getrecords.latency GetRecords.Latency average ms 0 none zero
aws.kinesis.getrecords.records GetRecords.Records sum ops 0 none zero
aws.kinesis.getrecords.success GetRecords.Success sum ops 0 none zero
aws.kinesis.incomingbytes IncomingBytes sum bytes 0 none zero
aws.kinesis.incomingrecords IncomingRecords sum ops 0 none zero
aws.kinesis.putrecord.bytes PutRecord.Bytes sum bytes 0 none zero
aws.kinesis.putrecord.latency PutRecord.Latency average ms 0 none zero
aws.kinesis.putrecord.success PutRecord.Success sum ops 0 none zero
aws.kinesis.putrecords.bytes PutRecords.Bytes sum bytes 0 none zero
aws.kinesis.putrecords.latency PutRecords.Latency average ms 0 none zero
aws.kinesis.putrecords.records PutRecords.Records sum ops 0 none zero
aws.kinesis.putrecords.success PutRecords.Success sum ops 0 none zero
aws.kinesis.readprovisionedthroughputexceeded ReadProvisionedThroughputExceeded sum ops 0 none zero
aws.kinesis.writeprovisionedthroughputexceeded WriteProvisionedThroughputExceeded sum ops 0 none zero

Lambda

For a description of the collected metrics, visit AWS Lambda dimensions and metrics.

Fully Qualified Name (FQN) Type Statistic Units Min Max Sparse Data Strategy (SDS) BASE CORR UTIL
aws.lambda.duration GAUGE average milliseconds 0 none zero
aws.lambda.errors GAUGE sum count 0 none zero
aws.lambda.invocations GAUGE sum count 0 none zero
aws.lambda.throttles GAUGE sum count 0 none zero

RDS

For a description of the collected metrics, visit AWS RDS dimensions and metrics.

Friendly Name Fully Qualified Name (FQN) AWS Metric Statistic Units Min Max Sparse Data Strategy (SDS) BASE CORR UTIL
Bin Log Disk Usage aws.rds.binlogdiskusage BinLogDiskUsage average bytes 0 none none
Replica Lag aws.rds.replicalag ReplicaLag average s 0 none none
CPU Utilization Percent aws.rds.cpuutilization CPUUtilization average percent 0 100 none
Database Connections aws.rds.databaseconnections DatabaseConnections average   0 none none
Free Storage Space aws.rds.freestoragespace FreeStorageSpace average bytes 0 none none
Freeable Memory aws.rds.freeablememory FreeableMemory average bytes 0 none none
Network Receive Throughput aws.rds.networkreceivethroughput NetworkReceiveThroughput average Bps 0 none none
Network Transmit Throughput aws.rds.networktransmitthroughput NetworkTransmitThroughput average Bps 0 none none
Physical Disk Queue Length aws.rds.diskqueuedepth DiskQueueDepth average   0 none none
Read IOPS aws.rds.readiops ReadIOPS average rps 0 none none
Read Latency aws.rds.readlatency ReadLatency average s 0 none none
Read Throughput aws.rds.readthroughput ReadThroughput average Bps 0 none none
Swap Usage aws.rds.swapusage SwapUsage average bytes 0 none none
Write IOPS aws.rds.writeiops WriteIOPS average wps 0 none none
Write Latency aws.rds.writelatency WriteLatency average s 0 none none
Write Throughput aws.rds.writethroughput WriteThroughput average Bps 0 none none

Redshift

Note   For each Redshift cluster, two types of elements are collected:
  • RedshiftCluster: Contains cluster-specific metrics as well as metrics that are averages across all nodes.
  • RedshiftNode: Contains node-specific metrics. There will be one element per Redshift node.

The table below denotes which metrics are cluster- or node-based (or both).

For a description of the collected metrics, visit AWS Redshift dimensions and metrics.

Fully Qualified Name (FQN) Cluster Node AWS Metric Statistic Units Min Max Sparse Data Strategy (SDS) BASE CORR UTIL
aws.redshift.cpuutilization CPUUtilization average percent 0 100 none
aws.redshift.databaseconnections DatabaseConnections average count 0 none none
aws.redshift.healthstatus HealthStatus average   0 1 none
aws.redshift.maintenancemode MaintenanceMode average   0 1 none
aws.redshift.networkreceivethroughput NetworkReceiveThroughput average Bps 0 none none
aws.redshift.networktransmitthroughput NetworkTransmitThroughput average Bps 0 none none
aws.redshift.percentagediskspaceused PercentageDiskSpaceUsed average percent 0 100 none
aws.redshift.readiops ReadIOPS average iops 0 none none
aws.redshift.readlatency ReadLatency average seconds 0 none none
aws.redshift.readthroughput ReadThroughput average Bps 0 none none
aws.redshift.writeiops WriteIOPS average iops 0 none none
aws.redshift.writelatency WriteLatency average seconds 0 none none
aws.redshift.writethroughput WriteThroughput average Bps 0 none none

SQS

For a description of the collected metrics, visit AWS SQS dimensions and metrics.

Friendly Name Fully Qualified Name (FQN) AWS Metric Statistic Units Min Max Sparse Data Strategy (SDS) BASE CORR UTIL
Approximate Age of Oldest Message aws.sqs.approximateageofoldestmessage ApproximateAgeOfOldestMessage average seconds 0 none none
Approximate Number of Messages Delayed aws.sqs.approximatenumberofmessagesdelayed ApproximateNumberOfMessagesDelayed sum Count 0 none none
Approximate Number of Messages Not Visible aws.sqs.approximatenumberofmessagesnotvisible ApproximateNumberOfMessagesNotVisible sum Count 0 none none
Approximate Number of Messages Visible aws.sqs.approximatenumberofmessagesvisible ApproximateNumberOfMessagesVisible sum Count 0 none none
Number of Empty Receives aws.sqs.numberofemptyreceives NumberOfEmptyReceives sum Count 0 none none
Number of Messages Deleted aws.sqs.numberofmessagesdeleted NumberOfMessagesDeleted sum Count 0 none none
Number of Messages Received aws.sqs.numberofmessagesreceived NumberOfMessagesReceived sum Count 0 none none
Number of Messages Sent aws.sqs.numberofmessagessent NumberOfMessagesSent sum Count 0 none none
Sent Message Size aws.sqs.sentmessagesize SentMessageSize average Bytes 0 none none

Computed

ASG

Note   ASGs use the same computed metrics as EC2s.
Fully Qualified Name (FQN) Description Units Min Max BASE CORR UTIL Related Global Policies
netuitive.aws.autoscaling.grouptotalinstances

The total instance count is already available as an attribute, but exposing it as a metric allows Netuitive to graph it over time. Customers with detailed monitoring enabled already get this information in a collected metric (aws.autoscaling.grouptotalinstances), so this computed metric primarily benefits customers without detailed monitoring.

Computation:
attribute['totalinstances'].value

Count 0 none  
netuitive.aws.ec2.diskreadbytespersec

This metric expresses the number of bytes read per second from the ephemeral disk of an EC2 instance. This metric is useful for monitoring ephemeral disk read activity.

Computation:
aws.ec2.diskreadbytes / 300

bytes/second 0 none  
netuitive.aws.ec2.diskwritebytespersec

This metric expresses the number of bytes written per second to the ephemeral disk of an EC2 instance. This metric is useful for monitoring ephemeral disk write activity.

Computation:
aws.ec2.diskwritebytes / 300

bytes/second 0 none  
netuitive.aws.ec2.diskreadopspersec

This metric expresses the number of read operations per second from the ephemeral disk of an EC2 instance. This metric is useful for monitoring ephemeral disk read activity.

Computation:
aws.ec2.diskreadops / 300

operations/second 0 none Elevated ASG Ephemeral Disk Activity
netuitive.aws.ec2.diskwriteopspersec

This metric expresses the number of write operations per second to the ephemeral disk of an EC2 instance. This metric is useful for monitoring ephemeral disk write activity.

Computation:
aws.ec2.diskwriteops / 300

operations/second 0 none Elevated ASG Ephemeral Disk Activity
netuitive.aws.ec2.disktotalops

This metric expresses the total number of read and write operations against the ephemeral disk of an EC2 instance. This metric is useful for monitoring ephemeral disk I/O activity.

Computation:
aws.ec2.diskreadops + aws.ec2.diskwriteops

operations 0 none  
netuitive.aws.ec2.diskiops

This metric expresses the total IOPS performed against the ephemeral disk of an EC2 instance. This metric is useful for monitoring ephemeral disk I/O activity.

Computation:
(netuitive.aws.ec2.disktotalops) / 300

operations/second 0 none  
netuitive.aws.ec2.bytesinpersec

This metric expresses the number of network bytes received per second by an EC2 instance. This metric is useful for monitoring network receive activity.

Computation:
aws.ec2.networkin / 300

bytes/second 0 none
  • Elevated ASG CPU Activity (Normal Network Activity)
  • Elevated ASG Network Activity
netuitive.aws.ec2.bytesoutpersec

This metric expresses the number of network bytes sent per second by an EC2 instance. This metric is useful for monitoring network transmit activity.

Computation:
aws.ec2.networkout / 300

bytes/second 0 none
  • Elevated ASG CPU Activity (Normal Network Activity)
  • Elevated ASG Network Activity
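
The "/ 300" in the computations above appears to convert a value summed over one collection interval into a per-second rate. The following is a minimal sketch of that conversion, assuming a 300-second (5-minute) interval; the function name and the sample value are hypothetical and are not part of Netuitive.

```python
# Assumption: each collected value is a sum over one 300-second
# (5-minute) collection interval, as the "/ 300" computations imply.
INTERVAL_SECONDS = 300


def per_second(interval_sum: float, interval_seconds: int = INTERVAL_SECONDS) -> float:
    """Convert a per-interval sum (bytes or operations) to a per-second rate."""
    return interval_sum / interval_seconds


# Hypothetical sample: 600,000 bytes received (aws.ec2.networkin) in one interval.
bytes_in_per_sec = per_second(600_000)
print(bytes_in_per_sec)
```

The same helper would apply to any of the summed disk or network metrics in this table, since they all divide by the same interval length.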

DynamoDB

Fully Qualified Name (FQN) Description Units Min Max BASE CORR UTIL Related Global Policies
netuitive.aws.dynamodb.readcapacityutilization

This metric represents the percentage of the provisioned read capacity being used.

Computation:
((aws.dynamodb.consumedreadcapacityunits / 300) / aws.dynamodb.provisionedreadcapacityunits) * 100

percent 0 100 AWS DynamoDB - Elevated Read Capacity Utilization
netuitive.aws.dynamodb.writecapacityutilization

This metric represents the percentage of the provisioned write capacity being used.

Computation:
((aws.dynamodb.consumedwritecapacityunits / 300) / aws.dynamodb.provisionedwritecapacityunits) * 100

percent 0 100 AWS DynamoDB - Elevated Write Capacity Utilization
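
As an illustration of the two utilization computations above, the sketch below divides the consumed units (a sum over the interval) by 300 to get a per-second figure, then compares it to the provisioned units. The function name is hypothetical, and returning 0 when no capacity is provisioned is an assumption; the source does not say how that case is handled.

```python
INTERVAL_SECONDS = 300  # assumed 5-minute collection interval


def capacity_utilization(consumed_units_sum: float, provisioned_units: float) -> float:
    """Percent of provisioned capacity consumed during one interval."""
    if provisioned_units == 0:
        # Assumption: zero provisioned capacity yields 0 percent
        # (not specified in the source).
        return 0.0
    consumed_per_second = consumed_units_sum / INTERVAL_SECONDS
    return (consumed_per_second / provisioned_units) * 100


# Hypothetical sample: 15,000 read units consumed in one interval
# against 100 provisioned read capacity units.
print(capacity_utilization(15_000, 100))
```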

EBS

Fully Qualified Name (FQN) Description Units Min Max BASE CORR UTIL Related Global Policies
netuitive.aws.ebs.totalops

This metric expresses the total number of read and write operations against this EBS instance. This metric is useful for monitoring EBS I/O activity.

Computation:
Read Ops + Write Ops

operations 0 none  
netuitive.aws.ebs.totalbytes

This metric expresses the total number of bytes read and written from/to this EBS instance. This metric is useful for monitoring EBS I/O activity.

Computation:
Read Bytes + Write Bytes

bytes 0 none  
netuitive.aws.ebs.averagelatency

This metric expresses the average latency per operation for this EBS instance. This metric is useful for monitoring EBS performance and quality of service.

Computation:
(Total Read Time + Total Write Time) / (Read Ops + Write Ops)

seconds 0 none Elevated Queue Length Differential with Elevated Latency
netuitive.aws.ebs.readbytespersec

This metric expresses the bytes read per second from this EBS instance. This metric is useful for monitoring EBS read activity.

Computation:
Read Bytes / 300

bytes/second 0 none  
netuitive.aws.ebs.writebytespersec

This metric expresses the bytes written per second to this EBS instance. This metric is useful for monitoring EBS write activity.

Computation:
Write Bytes / 300

bytes/second 0 none  
netuitive.aws.ebs.totalbytespersec

This metric expresses the total number of bytes either read from or written to this EBS instance. This metric is useful for monitoring overall EBS I/O activity.

Computation:
Read Bytes Per Second + Write Bytes Per Second

bytes/second 0 none  
netuitive.aws.ebs.averagereadlatency

This metric expresses the average latency per read operation for this EBS instance. This metric is useful for monitoring EBS performance and quality of service.

Computation:
Total Read Time / Read Ops

seconds 0 none  
netuitive.aws.ebs.averagewritelatency

This metric expresses the average latency per write operation for this EBS instance. This metric is useful for monitoring EBS performance and quality of service.

Computation:
Total Write Time / Write Ops

seconds 0 none  
netuitive.aws.ebs.readopspersec

This metric expresses the number of read operations per second for this EBS instance. This metric is useful for monitoring EBS read activity.

Computation:
Read Ops / 300

operations/second 0 none  
netuitive.aws.ebs.writeopspersec

This metric expresses the number of write operations per second for this EBS instance. This metric is useful for monitoring EBS write activity.

Computation:
Write Ops / 300

operations/second 0 none  
netuitive.aws.ebs.iops

This metric expresses the total number of either read or write operations per second for this EBS instance. This metric is useful for monitoring EBS I/O activity.

Computation:
Total Ops / 300

operations/second 0 none  
netuitive.aws.ebs.busytimeiops

This metric expresses the number of operations per second measured only over the time that the disk is actually busy. This can be useful in determining the IOPS being achieved during bursts.

Computation:
Total Ops / (300 - floor(Idle Time))

operations/second 0 none  
netuitive.aws.ebs.busytimebytespersecond

This metric expresses the number of bytes per second read and written, measured only over the time that the disk is actually busy. This can be useful in determining the maximum throughput being achieved during bursts.

Computation:
Total Bytes / (300 - floor(Idle Time))

bytes/second 0 none  
netuitive.aws.ebs.busypercent

This metric expresses the percent of time during each 5-minute interval that this EBS volume was actually busy performing an I/O operation. This metric is useful for monitoring utilization of EBS capacity.

Computation:
100 - ((Idle Time / 300) * 100)

percent 0 100  
netuitive.aws.ebs.queuelengthdifferential

This metric measures the difference between the actual queue length and the "ideal" queue length. The ideal queue length is based on Amazon's rule of thumb that a volume should have a queue length of 1 for every 200 IOPS. In theory, a well-optimized volume should have a queue length differential that hovers around 0. In practice, volumes with extremely low latency (< 0.0001) have been observed with queue length differentials higher than 0, presumably because their latency is much lower than Amazon's rule of thumb assumes. Even in these cases the differential stays fairly steady, so an upward deviation in the differential tends to indicate that the disk is not keeping up.

Computation:
Queue Length - (IOPS / 200)

difference 0 none Elevated Queue Length Differential with Elevated Latency
netuitive.aws.ebs.iopsutilization

This metric compares the current IOPS to the provisioned IOPS for the volume in order to determine how much of the provisioned capacity is being used.

Computation:
min(100, (attribute[IOPS] == NULL ? data[IOPS] / 300 : data[IOPS] / attribute[IOPS]) * 100)

percent 0 100  
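
Several of the EBS computations above build on one another. The sketch below works through a few of them for a single interval, assuming a 300-second interval and the collected values named in the table (Read Ops, Write Ops, Total Read Time, Total Write Time, Idle Time, Queue Length). All function names are hypothetical, and returning 0 where a divisor would be zero is an assumption; the source does not describe that case.

```python
import math

INTERVAL_SECONDS = 300  # assumed 5-minute collection interval


def iops(read_ops: float, write_ops: float) -> float:
    """Total Ops / 300."""
    return (read_ops + write_ops) / INTERVAL_SECONDS


def average_latency(total_read_time: float, total_write_time: float,
                    read_ops: float, write_ops: float) -> float:
    """(Total Read Time + Total Write Time) / (Read Ops + Write Ops)."""
    total_ops = read_ops + write_ops
    if total_ops == 0:
        return 0.0  # assumption: no operations means no measurable latency
    return (total_read_time + total_write_time) / total_ops


def busy_percent(idle_time: float) -> float:
    """100 - ((Idle Time / 300) * 100)."""
    return 100 - ((idle_time / INTERVAL_SECONDS) * 100)


def busy_time_iops(read_ops: float, write_ops: float, idle_time: float) -> float:
    """Total Ops / (300 - floor(Idle Time))."""
    busy_seconds = INTERVAL_SECONDS - math.floor(idle_time)
    if busy_seconds == 0:
        return 0.0  # assumption: a fully idle interval yields 0
    return (read_ops + write_ops) / busy_seconds


def queue_length_differential(queue_length: float, read_ops: float,
                              write_ops: float) -> float:
    """Queue Length - (IOPS / 200)."""
    return queue_length - (iops(read_ops, write_ops) / 200)


# Hypothetical sample interval: 1,200 reads, 1,800 writes, 150 s idle.
print(iops(1200, 1800))
print(busy_percent(150.0))
print(busy_time_iops(1200, 1800, 150.0))
```

In this sample the volume was idle for half the interval, so the busy-time IOPS comes out at twice the plain IOPS figure, which is the burst behavior the busy-time metrics are meant to expose.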

ECS

None.

EC2

Fully Qualified Name (FQN) Description Units Min Max BASE CORR UTIL Related Global Policies
netuitive.aws.ec2.diskreadbytespersec

This metric expresses the number of bytes read per second from the ephemeral disk of an EC2 instance. This metric is useful for monitoring ephemeral disk read activity.

Computation:
aws.ec2.diskreadbytes / 300

bytes/second 0 none  
netuitive.aws.ec2.diskwritebytespersec

This metric expresses the number of bytes written per second to the ephemeral disk of an EC2 instance. This metric is useful for monitoring ephemeral disk write activity.

Computation:
aws.ec2.diskwritebytes / 300

bytes/second 0 none  
netuitive.aws.ec2.diskreadopspersec

This metric expresses the number of read operations per second from the ephemeral disk of an EC2 instance. This metric is useful for monitoring ephemeral disk read activity.

Computation:
aws.ec2.diskreadops / 300

operations/second 0 none Elevated EC2 Ephemeral Disk Activity
netuitive.aws.ec2.diskwriteopspersec

This metric expresses the number of write operations per second to the ephemeral disk of an EC2 instance. This metric is useful for monitoring ephemeral disk write activity.

Computation:
aws.ec2.diskwriteops / 300

operations/second 0 none Elevated EC2 Ephemeral Disk Activity
netuitive.aws.ec2.disktotalops

This metric expresses the total number of read and write operations against the ephemeral disk of an EC2 instance. This metric is useful for monitoring ephemeral disk I/O activity.

Computation:
aws.ec2.diskreadops + aws.ec2.diskwriteops

operations 0 none  
netuitive.aws.ec2.diskiops

This metric expresses the total IOPS performed against the ephemeral disk of an EC2 instance. This metric is useful for monitoring ephemeral disk I/O activity.

Computation:
(netuitive.aws.ec2.disktotalops) / 300

operations/second 0 none  
netuitive.aws.ec2.bytesinpersec

This metric expresses the number of network bytes received per second by an EC2 instance. This metric is useful for monitoring network receive activity.

Computation:
aws.ec2.networkin / 300

bytes/second 0 none
  • Elevated EC2 CPU Activity (Normal Network Activity)
  • Elevated EC2 Network Activity
netuitive.aws.ec2.bytesoutpersec

This metric expresses the number of network bytes sent per second by an EC2 instance. This metric is useful for monitoring network transmit activity.

Computation:
aws.ec2.networkout / 300

bytes/second 0 none
  • Elevated EC2 CPU Activity (Normal Network Activity)
  • Elevated EC2 Network Activity
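
The per-second EC2 metrics above all follow one pattern: a value summed over a collection interval is divided by the interval length. The sketch below assumes the divisor 300 is the number of seconds in a five-minute collection interval; the function names are hypothetical.

```python
INTERVAL_SECONDS = 300  # assumed: five-minute collection interval


def per_second(interval_sum: float, interval_seconds: int = INTERVAL_SECONDS) -> float:
    """Convert a sum collected over one interval into a per-second rate."""
    return interval_sum / interval_seconds


def disk_iops(read_ops: float, write_ops: float) -> float:
    """Total ephemeral disk operations per second (reads plus writes)."""
    return per_second(read_ops + write_ops)
```

With these definitions, 1,500,000 bytes received in one interval corresponds to 5,000 bytes/second, and 600 reads plus 300 writes correspond to 3 operations/second.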

Elasticache

Friendly Name Fully Qualified Name (FQN) Description Units Min Max BASE CORR UTIL Related Global Policies
Cache Hit Rate netuitive.aws.elasticache.cachehitrate

This metric provides the percentage of hits against the cache.

Computation:
(data['aws.elasticache.cachehits'].actual + data['aws.elasticache.cachemisses'].actual) == 0 ? 0 : 100 * (data['aws.elasticache.cachehits'].actual / (data['aws.elasticache.cachehits'].actual + data['aws.elasticache.cachemisses'].actual))

percent 0 100 AWS Elasticache Redis - Low Cache Hit Rate
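
The cache hit rate computation guards against division by zero when there were no cache lookups at all. A minimal Python sketch of the same logic, with hypothetical names, looks like this:

```python
def cache_hit_rate(hits: float, misses: float) -> float:
    """Percentage of cache lookups that were hits; 0 when there were no lookups."""
    total = hits + misses
    if total == 0:
        return 0.0
    return 100.0 * hits / total
```

Note that an idle cache reports 0 percent under this formula, so a low value is only meaningful when the cache is actually receiving requests.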

ELB

Fully Qualified Name (FQN) Description Units Min Max BASE CORR UTIL Related Global Policies
netuitive.aws.elb.unhealthyhostpercent

This metric expresses the percentage of hosts behind an ELB that are not healthy. When an ELB determines that an EC2 instance is "unhealthy", it stops directing requests to that instance, which effectively decreases the configuration's capacity to service requests. This metric is useful for monitoring the actual delivered service capacity of an ELB and its associated EC2 instances.

Computation:
(Unhealthy Host Count + Healthy Host Count) == 0 ? 0 : ((Unhealthy Host Count)/(Unhealthy Host Count + Healthy Host Count)) * 100

percent 0 100  
netuitive.aws.elb.backendconnectionerrorpercent

This metric expresses the percentage of all requests for which a connection was not successfully established between the load balancer and the registered instances. Because the load balancer retries the connection when there are errors, the error count can exceed the request count, so this percentage can exceed 100. Any value over 100% indicates significant retries and serious connectivity issues.

Computation:
(Request Count == 0) ? 0 : (Backend Connection Errors / Request Count) * 100

percent 0 100  
netuitive.aws.elb.totalelbhttperrors

This metric expresses the total number of HTTP 4XX and 5XX errors that were generated by the ELB. This metric is useful for detecting connection faults between clients and the services associated with the ELB.

Computation:
HTTPCode ELB 4XX Response Count + HTTPCode ELB 5XX Response Count

count 0 none  
netuitive.aws.elb.httpcodeelberrorpercent

This metric expresses the percentage of all requests for which HTTP 4XX and 5XX errors were generated by the ELB. This metric is useful for detecting connection faults between clients and the services associated with the ELB.

Computation:
(Total ELB HTTP Errors / Request Count) * 100

percent 0 100  
netuitive.aws.elb.httpcodelb4xxerrorpercent

This metric expresses the percentage of all requests for which HTTP 4xx errors were generated by the ELB.

Computation:
(HTTPCode ELB 4xx Response Count / Request Count) * 100

percent 0 100  
netuitive.aws.elb.httpcodelb5xxerrorpercent

This metric expresses the percentage of all requests for which HTTP 5xx errors were generated by the ELB.

Computation:
(HTTPCode ELB 5xx Response Count / Request Count) * 100

percent 0 100  
netuitive.aws.elb.totalbackendhttperrors

This metric expresses the total number of HTTP 4XX and 5XX errors that were generated by hosts for which the ELB is balancing load. This metric is useful for detecting potential service faults.

Computation:
HTTPCode Backend 4XX Response Count + HTTPCode Backend 5XX Response Count

count 0 none  
netuitive.aws.elb.httpcodebackenderrorpercent

This metric expresses the percentage of all requests for which HTTP 4XX and 5XX errors were generated by hosts for which the ELB is balancing load. This metric is useful for detecting potential service faults.

Computation:
(Total Backend HTTP Errors / Request Count) * 100

percent 0 100
  • Elevated Backend Error Rate (Low Volume)
  • Elevated Backend Error Rate (High Volume, Low Error Rate)
  • Elevated Backend Error Rate (High Volume, High Error Rate)
netuitive.aws.elb.httpcodebackend4xxerrorpercent

This metric expresses the percentage of all requests for which HTTP 4xx errors were generated by hosts for which the ELB is balancing load.

Computation:
(HTTPCode Backend 4xx Response Count / Request Count) * 100

percent 0 100  
netuitive.aws.elb.httpcodebackend5xxerrorpercent

This metric expresses the percentage of all requests for which HTTP 5xx errors were generated by hosts for which the ELB is balancing load.

Computation:
(HTTPCode Backend 5xx Response Count / Request Count) * 100

percent 0 100  
netuitive.aws.elb.concurrency

This metric expresses the level of concurrency that an ELB and its associated EC2 instances are currently delivering. Concurrency is given by Little's Law and indicates the degree of parallelism supported by the ELB configuration. When the capacity of the system to process requests in parallel is consistently exceeded, queues grow, latency increases, and the system begins to reject requests. This metric may be valuable for offline analytics to determine system capacity.

Computation:
(Request Count * Average Latency) / 300

requests 0 none  
netuitive.aws.elb.surgequeueutilization

This metric expresses the percentage of surge queue capacity (1024 requests) that is currently consumed by waiting requests. It can be a leading indicator of latency issues as well as of increases in spillover (requests rejected due to high demand), and it can be used to characterize one aspect of an ELB's overall utilization.

Computation:
(Max Surge Queue Length/1024) * 100

percent 0 100
  • Surge Queue Utilization Above 5%
  • Surge Queue Utilization Above 50%
netuitive.aws.elb.requestspersecond

This metric reports the number of requests per second being handled by the ELB.

Computation:
Request Count / 300

requests/second 0 none  
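
Several of the ELB computations above can be sketched together in Python. The concurrency formula is Little's Law (items in the system = arrival rate × time in the system). The sketch assumes the 300 divisor is the collection interval in seconds and that average latency is expressed in seconds; all function names are hypothetical.

```python
INTERVAL_SECONDS = 300       # assumed: five-minute collection interval
SURGE_QUEUE_CAPACITY = 1024  # surge queue capacity stated in the description


def concurrency(request_count: float, avg_latency_seconds: float) -> float:
    """Little's Law: arrival rate (requests/second) times average latency."""
    arrival_rate = request_count / INTERVAL_SECONDS
    return arrival_rate * avg_latency_seconds


def surge_queue_utilization(max_surge_queue_length: float) -> float:
    """Percent of surge queue capacity consumed by waiting requests."""
    return max_surge_queue_length / SURGE_QUEUE_CAPACITY * 100


def unhealthy_host_percent(unhealthy: float, healthy: float) -> float:
    """Percent of registered hosts that are unhealthy; 0 when none are registered."""
    total = unhealthy + healthy
    return 0.0 if total == 0 else unhealthy / total * 100
```

For instance, 3,000 requests in one interval at an average latency of 0.5 seconds gives a concurrency of 5, meaning roughly five requests are in flight at any moment.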

EMR

None.

Kinesis

Name Fully Qualified Name (FQN) Description Units Min Max BASE CORR UTIL Related Global Policies
GetRecords Failure Percent netuitive.aws.kinesis.getrecords.failurepercent

The percentage of GetRecords calls which failed.

Computation:
(data['aws.kinesis.getrecords.records'] == null | data['aws.kinesis.getrecords.records'].actual == 0) ? 0 : ((data['aws.kinesis.getrecords.records'].actual - data['aws.kinesis.getrecords.success'].actual) / data['aws.kinesis.getrecords.records'].actual) * 100

percent 0 100  
PutRecords Failure Percent netuitive.aws.kinesis.putrecords.failurepercent

The percentage of PutRecords calls which failed.

Computation:
(data['aws.kinesis.putrecords.records'] == null | data['aws.kinesis.putrecords.records'].actual == 0) ? 0 : ((data['aws.kinesis.putrecords.records'].actual - data['aws.kinesis.putrecords.success'].actual) / data['aws.kinesis.putrecords.records'].actual) * 100

percent 0 100  

Lambda

Name FQN Computation Units Min Max BASE CORR UTIL Description Related Global Policies
Error Percent netuitive.aws.lambda.errorpercent (data['aws.lambda.invocations'] == null | data['aws.lambda.invocations'].actual == 0) ? 0 : (data['aws.lambda.errors'].actual / data['aws.lambda.invocations'].actual) * 100 percent 0 100 The percentage of function invocations which resulted in the function returning an error.  
Throttle Percent netuitive.aws.lambda.throttlepercent (data['aws.lambda.invocations'] == null | data['aws.lambda.invocations'].actual == 0) ? 0 : (data['aws.lambda.throttles'].actual / (data['aws.lambda.invocations'].actual + data['aws.lambda.throttles'].actual)) * 100 percent 0 100 The percentage of total attempted calls to the function which were throttled. Note that the AWS invocation metric does not include a count of requests which were throttled; thus, we must add "throttles" to "invocations" to get the true total number of calls for computing our percentage against.  
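
The two Lambda computations can be sketched in Python as shown below. The function names are hypothetical, and a missing data point is modeled as None. The throttle calculation adds throttles to invocations in the denominator, as the description explains.

```python
from typing import Optional


def error_percent(invocations: Optional[float], errors: float) -> float:
    """Percent of invocations that returned an error; 0 when there is no data."""
    if invocations is None or invocations == 0:
        return 0.0
    return errors / invocations * 100


def throttle_percent(invocations: Optional[float], throttles: float) -> float:
    """Percent of attempted calls (invocations plus throttles) that were throttled."""
    if invocations is None or invocations == 0:
        return 0.0
    return throttles / (invocations + throttles) * 100
```

One consequence of the guard as written in the computation: if every attempted call in an interval is throttled, invocations is 0 and the formula reports 0 percent rather than 100.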

RDS

Fully Qualified Name (FQN) Description Units Min Max BASE CORR UTIL Related Global Policies
netuitive.aws.rds.totalthroughput Computation:
(Read Throughput + Write Throughput)
bytes/second 0 none Elevated RDS Latency
netuitive.aws.rds.totaliops Computation:
(Read IOPS + Write IOPS)
operations/second 0 none  
netuitive.aws.rds.averagereadsize Computation:
Read Throughput / Read IOPS
bytes/operation 0 none  
netuitive.aws.rds.averagewritesize Computation:
Write Throughput / Write IOPS
bytes/operation 0 none  
netuitive.aws.rds.connectionthroughpututilizationpercent Computation:
100 * ((Read Latency * Read IOPS) + (Write Latency * Write IOPS)) / Database Connections
percent 0 100  
netuitive.aws.rds.diskspacepercentused Computation:
((Allocated Storage - (Free Storage Space / 1024 / 1024 / 1024)) / Allocated Storage) * 100
percent 0 100  
netuitive.aws.rds.iopsutilization Computation:
min(100, (attribute[IOPS] == NULL ? data[Total IOPS] / 300 : data[Total IOPS] / attribute[IOPS]) * 100)
percent 0 100  
netuitive.aws.rds.overallutilization Computation:
min(100, max(Connection Utilization Percent, CPU Utilization Percent, IOPS Utilization))
percent 0 100  
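
The disk space and overall utilization computations can be sketched in Python as follows. The sketch assumes that Allocated Storage is expressed in GiB and Free Storage Space in bytes, which is what the division by 1024 three times suggests; the function names are hypothetical.

```python
BYTES_PER_GIB = 1024 ** 3


def disk_space_percent_used(allocated_gib: float, free_bytes: float) -> float:
    """Percent of allocated storage in use, converting free bytes to GiB first."""
    free_gib = free_bytes / BYTES_PER_GIB
    return (allocated_gib - free_gib) / allocated_gib * 100


def overall_utilization(connection_pct: float, cpu_pct: float, iops_pct: float) -> float:
    """Highest of the three utilization figures, capped at 100."""
    return min(100.0, max(connection_pct, cpu_pct, iops_pct))
```

For example, an instance with 100 GiB allocated and 25 GiB free is 75 percent used, and its overall utilization is whichever of connection, CPU, or IOPS utilization is currently the bottleneck.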

Redshift

Fully Qualified Name (FQN) Description Units Min Max BASE CORR UTIL Related Global Policies
netuitive.aws.redshift.totalthroughput

This metric represents the total throughput, obtained by adding the read and write throughputs.

Computation:
(ReadThroughput + WriteThroughput)

Bps 0 none  

SQS

Fully Qualified Name (FQN) Description Units Min Max BASE CORR UTIL Related Global Policies
netuitive.aws.sqs.arrivalrate

The number of messages per second being added to the queue.

Computation:
aws.sqs.numberofmessagessent / 300

ops 0 none  
netuitive.aws.sqs.completionrate

The number of messages per second processed by consumers.

Computation:
aws.sqs.numberofmessagesdeleted / 300

ops 0 none  
netuitive.aws.sqs.emptyreceivepercent

Percentage of times consumers tried to read the queue and got an empty response.

Computation:
(aws.sqs.numberofemptyreceives == 0) ? 0 : (aws.sqs.numberofemptyreceives / (aws.sqs.numberofemptyreceives + aws.sqs.numberofmessagesreceived)) * 100

percent 0 100  
netuitive.aws.sqs.totalmessagesinqueue

The total number of messages in queue, including visible (ready to be read), not visible (being processed), and delayed (not ready to be read).

Computation:
aws.sqs.approximatenumberofmessagesvisible + aws.sqs.approximatenumberofmessagesnotvisible + aws.sqs.approximatenumberofmessagesdelayed

count 0 none
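
The SQS computations above can be sketched in Python as follows. The function names are hypothetical, and the 300 divisor is assumed to be the collection interval in seconds.

```python
INTERVAL_SECONDS = 300  # assumed: five-minute collection interval


def arrival_rate(messages_sent: float) -> float:
    """Messages added to the queue per second."""
    return messages_sent / INTERVAL_SECONDS


def empty_receive_percent(empty_receives: float, messages_received: float) -> float:
    """Percent of receive attempts that returned no messages."""
    if empty_receives == 0:
        return 0.0
    return empty_receives / (empty_receives + messages_received) * 100


def total_messages_in_queue(visible: float, not_visible: float, delayed: float) -> float:
    """Visible, in-flight (not visible), and delayed messages combined."""
    return visible + not_visible + delayed
```

Comparing the arrival rate with the completion rate (computed the same way from deleted messages) shows whether consumers are keeping up with producers.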