Ensure that CloudMonitor is enabled for your Container Service for Kubernetes (ACK) clusters. CloudMonitor relies on a dedicated agent to access additional system resources and application services within virtual machine instances. The agent enables monitoring of metrics such as CPU utilization, disk traffic, network traffic, and disk I/O. These metrics are essential for observing cluster behavior and supporting operational activities within your ACK clusters.
Enabling CloudMonitor provides access to both system metrics and custom metrics. System metrics measure the cluster's infrastructure, such as CPU or memory usage. To capture them, a monitoring agent is installed that periodically connects to each node to collect data. The agent gathers metrics for the Pods and containers on each node and sends them to the CloudMonitor server. The collected metrics include CPU usage, memory utilization, evictable memory, non-evictable memory, and disk usage.
Audit
To determine whether CloudMonitor is enabled for your ACK clusters, perform the following operations:
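As one possible way to script this check, the sketch below inspects the JSON returned by the DescribeClusterNodePools API (listed under References) and reports node pools where the CloudMonitor agent does not appear to be enabled. It assumes the response contains a `nodepools` array whose entries carry a `kubernetes_config.cms_enabled` flag and a `nodepool_info.nodepool_id` field; verify these names against the current API reference before relying on them. The sample payload and the function name `nodepools_without_cloudmonitor` are hypothetical and exist only for illustration.

```python
import json

# Hypothetical, trimmed-down DescribeClusterNodePools response used for illustration.
# Assumption: real responses expose "kubernetes_config.cms_enabled" per node pool.
SAMPLE_RESPONSE = """
{
  "nodepools": [
    {"nodepool_info": {"nodepool_id": "np-example-1", "name": "default"},
     "kubernetes_config": {"cms_enabled": true}},
    {"nodepool_info": {"nodepool_id": "np-example-2", "name": "batch"},
     "kubernetes_config": {"cms_enabled": false}},
    {"nodepool_info": {"nodepool_id": "np-example-3", "name": "legacy"},
     "kubernetes_config": {}}
  ]
}
"""


def nodepools_without_cloudmonitor(response):
    """Return the IDs of node pools whose cms_enabled flag is not explicitly true."""
    flagged = []
    for pool in response.get("nodepools", []):
        info = pool.get("nodepool_info", {})
        config = pool.get("kubernetes_config", {})
        # A missing flag is treated the same as false, so the pool is reported.
        if config.get("cms_enabled") is not True:
            flagged.append(info.get("nodepool_id", "<unknown>"))
    return flagged


if __name__ == "__main__":
    for nodepool_id in nodepools_without_cloudmonitor(json.loads(SAMPLE_RESPONSE)):
        print(f"CloudMonitor agent not enabled for node pool: {nodepool_id}")
```

In practice the input would come from the API or CLI output for each cluster rather than from the embedded sample. Treating a missing flag as non-compliant is a deliberately conservative choice for an audit.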
Remediation / Resolution
To enable the CloudMonitor service for your ACK clusters, perform the following operations:
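A minimal sketch of how the remediation request could be assembled is shown here. It builds a request body for the ModifyClusterNodePool API (listed under References) that sets `kubernetes_config.cms_enabled` to true, and composes a matching Alibaba Cloud CLI command line. The field name, the `PUT /clusters/{cluster_id}/nodepools/{nodepool_id}` path, and the `Content-Type` header value are assumptions to confirm against the current API and CLI documentation; the helper names and the example IDs are hypothetical. The code only constructs the command and does not execute it.

```python
import json


def build_enable_cloudmonitor_body():
    """Request body that asks for the CloudMonitor agent on a node pool.

    Assumption: ModifyClusterNodePool accepts kubernetes_config.cms_enabled.
    """
    return {"kubernetes_config": {"cms_enabled": True}}


def build_cli_command(cluster_id, nodepool_id):
    """Compose (but do not run) an aliyun CLI invocation as an argument list.

    Assumption: the ROA-style path below matches ModifyClusterNodePool.
    """
    body = json.dumps(build_enable_cloudmonitor_body(), separators=(",", ":"))
    return [
        "aliyun", "cs", "PUT",
        f"/clusters/{cluster_id}/nodepools/{nodepool_id}",
        "--header", "Content-Type=application/json",
        "--body", body,
    ]


if __name__ == "__main__":
    # Placeholder identifiers; substitute real cluster and node pool IDs.
    print(" ".join(build_cli_command("c-example", "np-example-2")))
```

Returning the command as an argument list keeps quoting explicit if it is later passed to `subprocess.run`. Whether the change applies to existing nodes or only to newly added ones is not covered here and should be checked in the documentation.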
References
- Alibaba Cloud Documentation
  - Best Practices
  - Enable Kubernetes Monitoring for a Kubernetes cluster
  - Monitor basic resources
- Alibaba Cloud CLI Documentation
  - View all clusters
  - DescribeClusterNodePools
  - ModifyClusterNodePool