Ensure that the value configured for the maximum concurrent requests per instance is optimal in order to improve application responsiveness and scalability during traffic spikes, and enhance user experience. Maximum concurrent requests per instance refer to the maximum number of incoming HTTP requests that can be processed simultaneously by a single container instance running on Google Cloud Run. The maximum concurrent requests per instance value (concurrency threshold) must be defined in the conformity rule settings, on the Trend Cloud One™ – Conformity account console.
Configuring a higher concurrent requests per instance for Google Cloud Run services allows your application to handle more simultaneous user interactions, improving responsiveness and user experience. It can optimize resource utilization, reducing latency during traffic spikes, and ensure better scalability for applications with varying workloads.
Audit
To determine the maximum concurrent requests per instance configured for Cloud Run services, perform the following actions:
Remediation / Resolution
To configure the maximum concurrent requests per instance for your Cloud Run managed service, you have to create a new service revision with the appropriate configuration. To deploy a new revision for your Cloud Run service, perform the following actions:
References
- Google Cloud Platform (GCP) Documentation
- Cloud Run
- Access control with IAM
- Deploying to Cloud Run
- GCP Command Line Interface (CLI) Documentation
- gcloud projects list
- gcloud services list
- gcloud beta run services describe
- gcloud beta run services update