I’ve spent hours trying to figure out why the Workforce Management API is throttling our load tests so aggressively. We are trying to validate capacity for shift swapping during peak hours in the Singapore region.
Using JMeter 5.6, we set up a thread group with 200 concurrent users hitting POST /api/v2/wfm/schedules/shifts. The goal is to simulate a sudden surge in agent availability changes. Within the first 30 seconds, the response rate drops to near zero, and the error log fills with 429 Too Many Requests.
We added the Retry-After header parsing, but the values are inconsistent, jumping between 2 and 15 seconds. This breaks the test flow.
“Rate limits for WFM endpoints are calculated per organization and are stricter for write operations to prevent database contention.”
Is there a specific endpoint configuration or a hidden limit for schedule/shifts that isn’t documented? The standard application/json content type is being used. We are not touching the scheduling engine itself, just the shift records. Any tips on pacing the requests to stay under the radar?