Can anyone clarify the rate limiting behavior for the screen recording endpoints in Genesys Cloud when subjected to high concurrent load? We are running performance tests using JMeter 5.6 to simulate a peak volume of 300 simultaneous agents initiating screen recording sessions via the Architect flow. The setup involves triggering a recording start through the Web SDK, followed immediately by an API call to fetch the recording metadata.
The issue manifests as a steady stream of HTTP 429 Too Many Requests errors returned by the /api/v2/recordings/screens endpoint. This happens almost immediately after the load test begins, even though the total number of requests per second is well below the general API limit of 1,000 requests per minute for our organization tier. The error response body indicates a retry-after header of 5 seconds, but spreading the requests out does not resolve the issue because the load test requires near-instantaneous initiation for accurate simulation.
Here is what we have tried so far:
- Adjusted the JMeter thread group to introduce a random delay of 200-500ms between iterations, which reduced the 429 errors but did not eliminate them, and it significantly skewed the performance metrics we are trying to validate.
- Verified the OAuth token scope includes
recording:readandrecording:write, and confirmed that the tokens are valid and not expiring during the test window. The single-user test runs without any issues, suggesting the problem is strictly related to concurrency.
The environment is Genesys Cloud v2024.2. We suspect there might be a specific throttling rule for screen recording metadata retrieval that is not documented in the standard API rate limit guide. Is there a known limit on how many recording sessions can be queried or initiated concurrently from the same org? Any insight into how to structure the load test to avoid these false-positive rate limit hits would be appreciated.