Has anyone seen Architect bots return a 503 error when we spike the concurrent call volume?
We are running a performance test for our new AI-driven support flow. The environment is a Genesys Cloud org with a BYOC Edge in AWS us-east-1. We are using JMeter to simulate 200 concurrent inbound calls triggering the same bot flow. The bot uses a simple “Greeting → Intent Classification → Transfer to Agent” pattern.
When the concurrent session count hits around 150, we start seeing 503 Service Unavailable errors on the WebSocket connection for the bot interactions. The error occurs before the bot even sends the first message. The calls that do not fail are handled correctly, but the failed ones drop to the IVR fallback.
Here is the error snippet from our logs:
WebSocket Error: 1006 Abnormal Closure
HTTP Response: 503 Service Unavailable
Payload: {"error": "service_unavailable", "message": "Bot service temporarily overloaded"}
We are using the latest Architect version. We have checked the API rate limits, but this seems to be a connection limit issue rather than a REST API throttling issue. The WebSocket connection limit for our org is set to 1000, so we are well within the documented limits.
Is this a known issue with Architect bot capacity under high concurrency? Are there any specific configuration settings for bot flows that we are missing? We need to stabilize this before our go-live date. Any help is appreciated.