Analytics api 429 throttling on multi-tenant partner app

just noticed that our premium app is hitting consistent 429 errors when querying conversation details across multiple partner orgs. we are building a unified reporting dashboard that aggregates historical interaction data for several enterprise clients via the appfoundry platform. the issue specifically manifests when we execute bulk queries against the /api/v2/analytics/conversations/queries endpoint using the partner org context.

the environment details are as follows: genesys cloud platform version 2024-10, us-east-1 region. we are using the official genesys cloud rest api client library (version 2.15.0) for node.js. the app is registered as a premium app in appfoundry with multi-org oauth enabled. each partner org has approximately 500 concurrent agents and generates roughly 10,000 conversations per day. our query logic pulls data in 1-hour intervals for the previous 7 days, resulting in about 168 api calls per org per refresh cycle. with 15 active partner orgs, this totals around 2,520 calls within a short window during peak dashboard load times.

the error response consistently returns:

{
 "code": "too_many_requests",
 "message": "rate limit exceeded",
 "status": 429
}

we have implemented exponential backoff and jitter as recommended in the platform documentation, but the retry logic fails after three attempts because the rate limit window does not reset quickly enough for our aggregation requirements. we are observing this behavior primarily during business hours (pst) when user activity is highest. the standard rate limit for this endpoint is 30 requests per second for partner apps, but it appears the effective burst capacity is significantly lower when querying across multiple org contexts simultaneously.

does anyone have insights into the exact burst limits for the conversation details query endpoint when operating under a partner org context? we are considering switching to a batched export approach via the /api/v2/analytics/conversations/exports endpoint, but that introduces latency issues for near-real-time dashboard updates. any guidance on optimizing query patterns or understanding the underlying rate limit mechanics for multi-tenant partner apps would be greatly appreciated.

Make sure you check the rate limits for partner orgs because they are shared across tenants. this is a common gotcha when aggregating data from multiple clients.