Architect Flow 503 Error During High Concurrency JMeter Test

Does anyone know why we are getting 503 Service Unavailable errors when simulating high volume inbound calls through Genesys Cloud Architect flows using JMeter? We are running a load test to validate the system capacity for IVR navigation and queue distribution. Environment is Genesys Cloud US1. Tool is JMeter 5.6 with the HTTP Request Sampler configured to POST to /api/v2/architect/flows. We are simulating 200 concurrent call legs initiating flow executions. The test setup includes a CSV Data Set Config for unique call IDs and a Thread Group with 200 threads and a ramp-up period of 10 seconds. The initial requests succeed with 200 OK status, but after about 15 seconds, we start seeing a spike in 503 errors. The error response body indicates that the Architect engine is unable to process the request due to high load. We have checked the API rate limits and ensured that we are not exceeding the standard limits for flow execution APIs. The WebSocket connections are stable, and there are no network timeouts observed in the JMeter logs. We suspect that the issue might be related to the call capacity planning or the way the Architect engine handles concurrent flow executions. We are looking for advice on how to optimize our JMeter test configuration or if there are specific settings in Genesys Cloud that need to be adjusted to handle higher concurrency. We have tried increasing the thread pool size in JMeter, but the issue persists. Any insights on handling 503 errors during high concurrency load tests for Architect flows would be greatly appreciated. We need to ensure that our IVR can handle peak traffic without dropping calls. The current setup seems to bottleneck at around 200 concurrent executions, which is below our expected peak volume. We are also monitoring the CPU and memory usage of the Genesys Cloud platform, but there are no alerts indicating resource exhaustion. We are using the latest version of the Genesys Cloud API and have verified that the flow logic is correct and does not contain any infinite loops or excessive delays. The flow includes standard elements like Get Input, Set Variable, and Queue Interaction. We are particularly concerned about the reliability of the IVR during peak hours and want to ensure that we have a robust testing strategy in place. We have also tried staggering the request start times using a Constant Throughput Timer, but the 503 errors still occur. We are open to any suggestions on best practices for load testing Architect flows in Genesys Cloud.