Transformers
Top papers
- PhaseCoder: Microphone Geometry-Agnostic Spatial Audio Understanding for Multimodal LLMs(8.0)
- AudioCapBench: Quick Evaluation on Audio Captioning across Sound, Music, and Speech(5.0)
- Towards Explicit Acoustic Evidence Perception in Audio LLMs for Speech Deepfake Detection(3.0)
- Spatial Audio Question Answering and Reasoning on Dynamic Source Movements(3.0)