Quality Management: Discrepancy in Scorecard Weighted Averages for Multi-Language Queues

Observing a calculation anomaly in the Quality Management scorecard reports for our multilingual support queues. The weighted average score deviates significantly from the raw data export when filtering by specific language skills. Is there a known issue with how the WFM integration handles skill-based segmentation in the latest patch? The discrepancy affects our SLA reporting accuracy. Need clarification on the correct metric source for compliance audits.