Configuration is broken for some reason… Trying to spin up 50 concurrent users for a stress test in the Asia/Singapore region using the Genesys Cloud Python SDK 2.1.0. The goal is to hit the /api/v2/users endpoint with POST requests to simulate a massive onboarding event. JMeter is configured with a thread group of 50 users, looping 10 times, with a constant timer set to 500ms. The requests are authenticating via OAuth2 service account tokens generated just before the test run.
After the first 12 successful user creations, the API starts returning HTTP 503 Service Unavailable errors. The response body indicates "error": "service_unavailable", "message": "The service is currently unavailable.". This happens consistently around the 13th request in each thread. The latency spikes to over 3000ms before the failure. I have verified the token is valid and has the users:write scope. The environment is a standard sandbox instance with default capacity limits.
I tried adding a random timer between 1000ms and 2000ms to spread out the load, but the 503s still appear, just later in the sequence. The Architect flow triggering this is a simple webhook listener that forwards the request to the platform API. No complex logic is involved. The error log shows no specific rate limit headers like X-RateLimit-Remaining hitting zero, which is confusing. It feels like the backend service is dropping connections rather than throttling them gracefully.
Is there a hidden concurrency limit for user provisioning in the sandbox environment? Or is this a known issue with the Python SDK handling WebSocket handshakes during bulk operations? The documentation mentions rate limits for read operations but is vague on write bursts for identity management. Need to know if I should lower the thread count or if this is a platform-side bug affecting load testing in the APAC region.