Intermittent SIP 503 Service Unavailable on local Edge appliances

I am currently managing a cluster of four Genesys Cloud Edge appliances for a large-scale enterprise environment. We have recently observed a pattern of intermittent SIP 503 Service Unavailable responses originating from the appliances during periods of sustained interaction volume. Upon reviewing the system logs, we noticed that the available memory on the active nodes is consistently depleting over a 72-hour cycle, necessitating a proactive reboot of the services to restore stability. This behavior appears to be related to the media processing subsystem. Has anyone within the community identified a specific firmware version or configuration parameter that may be contributing to this apparent memory leak?

This is a serious concern from a compliance and risk management perspective. As a banking institution, we rely on the continuous availability of our Edge infrastructure to ensure that all interactions are recorded and archived according to federal regulations. If the Edge appliance is returning 503 errors, it suggests that the signaling plane is failing, which could lead to interactions bypassing our recording policies entirely.

I strongly recommend escalating this to Genesys Support immediately to ensure that your regulatory obligations are not compromised.

I have been doing some exciting research into the Genesys Cloud audio APIs and how they interact with the Edge media tier! While I have not seen a 503 error personally, I have noticed that high-frequency biometric enrollment requests can put a significant load on the appliance memory if the cache is not cleared correctly. Are you using any advanced voice biometrics or real-time speech analytics on these nodes? It would be wonderful to know if those features are enabled, as they may be the key to understanding this fascinating resource behavior!