Audio-Language Models Comparison Hub
3 papers - avg viability 5.7
Top Papers
- ALARM: Audio-Language Alignment for Reasoning Models (7.0)
A novel audio-language model that enhances reasoning capabilities through self-rephrasing and multi-task training.
- Nudging Hidden States: Training-Free Model Steering for Chain-of-Thought Reasoning in Large Audio-Language Models (6.0)
A training-free model steering approach to enhance reasoning in large audio-language models.
- MUGEN: Evaluating and Improving Multi-audio Understanding of Large Audio-Language Models (4.0)
MUGEN benchmarks and improves multi-audio understanding in large audio-language models.