Ensure that your Amazon OpenSearch clusters (domains) are healthy, i.e. they all have shard allocation status set to "Green". When an Amazon OpenSearch domain is unhealthy, the shard allocation status is set to "Red", which means that at least one primary shard and its replicas are not allocated to a node. The most common cause of an OpenSearch cluster with the status set to "Red" is the one with the failed cluster nodes (or when some process crashes due to a continuous heavy processing load). To get notified when your Amazon OpenSearch clusters become unhealthy and implement a plan to recover them, Trend Cloud One™ – Conformity recommends creating CloudWatch alarms that get triggered whenever your OpenSearch clusters health status becomes "Red" for longer than one minute.
The Amazon CloudWatch metric used to detect unhealthy OpenSearch clusters (Red) is:
ClusterStatus.red – which indicates that the primary and replica shards of at least one index are not allocated to nodes within an OpenSearch cluster. Relevant statistic: Maximum. Units: Count.
This rule can help you with the following compliance standards:
- NIST4
For further details on compliance standards supported by Conformity, see here.
This rule can help you work with the AWS Well-Architected Framework.
This rule resolution is part of the Conformity Security & Compliance tool for AWS.
excellence
Detecting unhealthy Amazon OpenSearch clusters with the status set to "Red" is imperative for your OpenSearch application availability. Also, the OpenSearch service stops taking automatic snapshots while the cluster status is set to "Red" and when this status persists for more than 16 days, permanent data loss can occur.
Audit
To identify unhealthy Amazon OpenSearch domains (clusters), perform the following operations:
Remediation / Resolution
Step 1: Create and configure the CloudWatch alarm required to send alert notifications whenever the health status of your Amazon OpenSearch cluster becomes Red for more than one minute:
Step 2: Recovering unhealthy Amazon OpenSearch clusters can be a complex task, therefore you may need the AWS Support team to assist. To create a support case for recovering unhealthy OpenSearch clusters, perform the following operations:
Creating a support case to request recovering unhealthy OpenSearch resources using the AWS Command Line Interface (AWS CLI) is not currently supportedReferences
- AWS Documentation
- Amazon OpenSearch Service
- What is Amazon OpenSearch Service?
- Recommended CloudWatch alarms for Amazon OpenSearch Service
- Troubleshooting Amazon OpenSearch Service
- AWS Command Line Interface (CLI) Documentation
- es
- list-domain-names
- sns
- create-topic
- subscribe
- confirm-subscription
- cloudwatch
- get-metric-statistics
- put-metric-alarm