Speech AI Comparison Hub

3 papers - avg viability 6.0

Reference Surfaces

Ara-Best-RQ: Multi Dialectal Arabic SSL (7.0)
A family of self-supervised learning models for multi-dialectal Arabic speech processing that achieves state-of-the-art performance on dialect identification.
From Oracle to Noisy Context: Mitigating Contextual Exposure Bias in Speech-LLMs (7.0)
This research introduces a training framework that makes Speech-LLMs more robust to noisy, error-prone contextual information at inference time, improving reliability in real-world use.
Do What I Say: A Spoken Prompt Dataset for Instruction-Following