Just noticed that…
- Environment: Genesys Cloud EU-West (Ireland), BYOC configured with AWS Route 53
- Tenant Region: Europe/Paris (Local Edge)
- Dashboard View: Performance > Queue Activity > Real-Time
- Flow Version: v4.2.1 (Architect)
- Issue: Intermittent 504 Gateway Timeout on inbound SIP trunks
The Performance Dashboard indicates a stable queue activity level with an average handle time of 120 seconds. However, the BYOC edge logs show a consistent 504 Gateway Timeout occurring approximately every 45 minutes during peak load. The timeout happens before the call reaches the Architect flow logic, suggesting the issue resides within the edge routing or the SIP trunk registration rather than the flow execution itself.
The latency metrics in the Performance Dashboard show a slight increase in wait time coinciding with these timeouts, but the queue occupancy remains below 80%. This discrepancy suggests that the edge is dropping calls before they are counted in the queue activity, or the dashboard is not reflecting the real-time edge failures accurately.
Has anyone encountered a similar pattern where BYOC edge timeouts correlate with minor fluctuations in queue wait time metrics? The current configuration uses a standard SIP trunk with a 30-second keepalive. The edge health status reports all services as healthy, yet the 504 errors persist.
The goal is to determine if the timeout threshold on the BYOC edge needs adjustment to align with the Performance Dashboard’s view of queue activity. Alternatively, is there a known limitation in how the dashboard reports edge-level failures versus flow-level failures? The business impact is significant, as these dropped calls are not reflected in the abandoned call metrics, leading to inaccurate service level reporting.
Any insights into the expected behavior of BYOC edge timeouts in relation to queue performance metrics would be appreciated. The current workaround involves monitoring the edge logs manually, which is not sustainable for real-time operations.