Running a load test to validate Bot Studio performance under concurrent user pressure. The setup involves a custom JMeter script hitting the Genesys Cloud Bot API endpoint /api/v2/bots/sessions to initiate conversations. The goal is to simulate 100 concurrent users interacting with a simple FAQ bot flow.
The problem emerges almost immediately. Once the virtual user count hits around 30-40 concurrent sessions, the API starts returning 429 Too Many Requests errors consistently. This happens even though the platform dashboard shows plenty of available bot capacity and no obvious CPU spikes on the edge nodes. The error payload includes a Retry-After header, but the values are inconsistent, ranging from 1s to 5s, making it hard to implement a reliable retry logic in the JMeter test plan.
Here are the specifics:
- Endpoint:
POST /api/v2/bots/sessions - JMeter Config: 100 threads, ramp-up 10s, loop count 10. Using a CSV dataset for unique user IDs.
- Bot Version: v2.3.1 (Bot Studio)
- Error:
429 Too Many Requests - Environment: Genesys Cloud (US-East), BYOC Edge deployment.
I’ve checked the API documentation for rate limits, but the standard limits seem high enough for this volume. Is there a specific bot session rate limit that is lower than the general API limits? Or is this related to WebSocket connection pooling on the Edge?
The test fails completely because the JMeter script doesn’t handle the 429s gracefully, causing the entire batch to error out. Looking for advice on how to structure the JMeter script to handle these retries correctly, or if there’s a known bottleneck in the Bot Studio API during high concurrency. Also, curious if anyone has seen this behavior with similar load patterns. Is it better to stagger the requests more aggressively in the test setup to avoid triggering these limits?