WFM Schedule API 429 Rate Limit during JMeter Load Test for 500 Agents

I’m completely stumped as to why the WFM schedule API is returning 429 Too Many Requests when I am trying to simulate a bulk schedule update for 500 agents using JMeter from Singapore.

Running a load test against the Genesys Cloud WFM API endpoints. Targeting /api/v2/wfm/schedules. Using a basic JMeter script with the HTTP Request sampler. Configured a thread group with 10 concurrent users. Each user sends 50 sequential POST requests to update agent schedules. Total expected requests: 500. The requests are spaced with a constant timer of 100ms between each call. The goal is to test the system’s capacity to handle bulk schedule changes without hitting rate limits too early. However, the test fails after only 15 requests. The response code is 429. The response body contains a message saying “Rate limit exceeded. Please retry after 5 seconds.” This happens even though I am not hitting the documented rate limit of 100 requests per minute for this endpoint. I have checked the documentation and it says the limit is 100 req/min for schedule updates. With 10 users sending 1 request every 100ms, that is 600 requests per minute total. Wait, 10 users * 60 requests/min = 600 req/min. That is over the limit. But the error comes after only 15 requests. That is way too early. The error response does not include a Retry-After header. It just says 429. I have tried adding a longer delay of 200ms between requests. The error still occurs after 15 requests. I have tried using a single thread with 500 requests. The error occurs after 15 requests. I have tried using different OAuth tokens for each thread. The error occurs after 15 requests. The environment is Genesys Cloud EU. The API version is v2. The JMeter version is 5.6.2. The Java version is 17. The issue is consistent across multiple test runs. I need to understand if there is a per-tenant rate limit that is lower than the documented limit. Or if there is a specific rate limit for bulk schedule operations. Or if the WFM API has a different rate limiting strategy than the standard REST APIs. Any insights on how to structure JMeter tests for WFM API load testing would be appreciated. I am new to load testing Genesys Cloud APIs and want to make sure I am not misinterpreting the error messages. The documentation is not clear on whether rate limits are per-user, per-tenant, or per-endpoint. I have also noticed that the 429 errors start appearing even when the request rate is well below 100 req/min. This suggests that there might be a sliding window or a burst limit that is not documented. I am looking for best practices on how to handle rate limiting in JMeter scripts for Genesys Cloud WFM APIs. Specifically, how to implement exponential backoff and retry logic. And how to determine the actual rate limits for WFM endpoints. Any help or pointers to relevant documentation would be great. I am stuck and need to move forward with the performance testing phase.