Azure Cosmos DB Garnet Cache provides comprehensive monitoring capabilities through Azure Monitor, enabling you to track performance, troubleshoot issues, and optimize your cache deployment.
Azure Cosmos DB Garnet Cache automatically collects metrics and sends them to Azure Monitor. Most metrics support Min, Max, and Average aggregations and can be split by Node for per-node insight into cluster performance.
| Metric | Additional Information |
|---|---|
| Connected Clients | |
| Metric | Additional Information |
|---|---|
| Command Process Rate | The total number of commands processed per second. |
| Query Latency P99 | The 99th-percentile (P99) server-side processing latency per network call received, counting only non-admin requests. Reported in microseconds. |
| Query Latency Mean | The mean server-side processing latency per network call received, counting only non-admin requests. Reported in microseconds. |
| Read Command Process Rate | The number of read commands processed per second. |
| Write Command Process Rate | The number of write commands processed per second. |
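
Because both latency metrics are reported in microseconds, it can help to convert them before comparing against a millisecond-based target. The following is a minimal sketch that assumes you have already exported metric values from Azure Monitor. The function names and sample values are hypothetical and are not part of any Azure SDK.

```python
def microseconds_to_milliseconds(latency_us: float) -> float:
    """Convert a latency reported in microseconds to milliseconds."""
    return latency_us / 1000.0


def read_share(read_rate: float, write_rate: float) -> float:
    """Return the fraction of read + write commands that are reads.

    Returns 0.0 when no read or write commands were processed.
    """
    total = read_rate + write_rate
    if total == 0:
        return 0.0
    return read_rate / total


# Hypothetical sample values, not real measurements.
p99_ms = microseconds_to_milliseconds(2500.0)
share = read_share(read_rate=900.0, write_rate=100.0)
print(f"P99 latency: {p99_ms} ms, read share: {share:.0%}")
```

The read share is computed against read plus write traffic only, since the relationship between those two rates and the total Command Process Rate is not stated here.
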
| Metric | Additional Information |
|---|---|
| Index Size | The size of the index in the main store in bytes. |
| Log Size | The size of the log in the main store in bytes. |
| Main Store Size | Total size of the main store including index, log, and overflow in bytes. |
| Read Cache Size | Size of read cache in the main store in bytes. |
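
Since Main Store Size is described as covering the index, log, and overflow, the overflow portion can be estimated from the other three metrics. This is a rough sketch under the assumption that Main Store Size is exactly the sum of those three components and that all values come from the same point in time. The function names and sample values are hypothetical.

```python
def overflow_bytes(main_store: int, index: int, log: int) -> int:
    """Estimate overflow size as main store size minus index and log sizes.

    Assumes Main Store Size is exactly index + log + overflow.
    """
    remainder = main_store - index - log
    # Samples taken at slightly different times can make the remainder
    # negative, so clamp to zero rather than report a negative size.
    return max(remainder, 0)


def store_breakdown(main_store: int, index: int, log: int) -> dict[str, float]:
    """Return each component's share of the main store as a fraction."""
    if main_store <= 0:
        return {"index": 0.0, "log": 0.0, "overflow": 0.0}
    overflow = overflow_bytes(main_store, index, log)
    return {
        "index": index / main_store,
        "log": log / main_store,
        "overflow": overflow / main_store,
    }


# Hypothetical sample values in bytes, not real measurements.
print(store_breakdown(main_store=1_000_000, index=250_000, log=500_000))
```
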
| Metric | Additional Information |
|---|---|
| Average CPU Usage Active | Average active CPU usage across all CPUs. |
| CPU Usage Active | |
| Memory Utilization | |
| Network Received Bytes | |
| Network Received Packets | |
| Network Transmitted Bytes | |
| Network Transmitted Packets | |
The Azure Monitor activity log contains entries for control plane events from Azure Cosmos DB Garnet Cache resources. It includes information like when a cache cluster is created, when scaling operations occur, or when RBAC permissions are granted. Use the activity log to review or audit administrative actions on your Azure Cosmos DB Garnet Cache resources, or create alerts to be proactively notified when control plane events occur.
High CPU usage can indicate various performance bottlenecks in your cluster. Use the Average CPU Usage Active and CPU Usage Active metrics to identify and resolve CPU-related issues.
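
One way to use these two metrics together is to compare each node's CPU usage against the cluster-wide mean and flag outliers. The sketch below assumes you have per-node values obtained by splitting CPU Usage Active by Node. The function name, node names, and the 20-point margin are hypothetical choices for illustration, not recommended thresholds.

```python
from statistics import mean


def find_hot_nodes(cpu_by_node: dict[str, float], margin: float = 20.0) -> list[str]:
    """Return nodes whose CPU usage exceeds the cluster mean by more than
    `margin` percentage points, sorted by node name."""
    if not cpu_by_node:
        return []
    cluster_mean = mean(list(cpu_by_node.values()))
    return sorted(
        node for node, cpu in cpu_by_node.items() if cpu - cluster_mean > margin
    )


# Hypothetical per-node CPU percentages, not real measurements.
sample = {"node-0": 35.0, "node-1": 40.0, "node-2": 90.0}
print(find_hot_nodes(sample))
```

A single node far above the mean may suggest uneven load, whereas uniformly high usage across all nodes would not be flagged by this check and would need a separate absolute threshold.
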
Memory utilization issues can lead to performance degradation and potential data eviction. Monitor Memory Utilization alongside store-specific metrics to maintain optimal performance.
Connection problems can manifest through various network and client metrics. Use Connected Clients, Network Received/Transmitted Bytes, and Network Received/Transmitted Packets to diagnose connectivity issues.
Performance issues can affect various aspects of your cache operations. Use latency and throughput metrics to identify and resolve performance bottlenecks.
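
As one illustration, comparing Query Latency P99 to Query Latency Mean can hint at whether slowness affects most requests or only a small tail. The following sketch assumes both values are in microseconds, as reported by the metrics. The function names and the ratio threshold of 10 are hypothetical, not documented guidance.

```python
def tail_latency_ratio(p99_us: float, mean_us: float) -> float:
    """Return how many times larger the P99 latency is than the mean."""
    if mean_us <= 0:
        raise ValueError("mean latency must be positive")
    return p99_us / mean_us


def has_heavy_tail(p99_us: float, mean_us: float, threshold: float = 10.0) -> bool:
    """Return True when P99 exceeds the mean by more than `threshold` times."""
    return tail_latency_ratio(p99_us, mean_us) > threshold


# Hypothetical sample values in microseconds, not real measurements.
print(has_heavy_tail(p99_us=5000.0, mean_us=250.0))
```
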