Papers
1–2 of 2
Research Paper·Feb 23, 2026
StyleStream: Real-Time Zero-Shot Voice Style Conversion
Voice style conversion aims to transform an input utterance to match a target speaker's timbre, accent, and emotion, with a central challenge being the disentanglement of linguistic content from style...
6.0 viability
Research Paper·Feb 19, 2026
The Cascade Equivalence Hypothesis: When Do Speech LLMs Behave Like ASR→LLM Pipelines?
Current speech LLMs largely perform implicit ASR: on tasks solvable from a transcript, they are behaviorally and mechanistically equivalent to simple Whisper→LLM cascades. We show this through mat...
3.0 viability