4 papers - avg viability 6.5
A benchmark and analysis revealing directional task interference in multimodal LLMs, enabling targeted improvements for more robust conversational AI.
An uncertainty-aware knowledge distillation framework that adaptively balances data and teacher supervision to improve multimodal vision-language models.
A method to enhance multimodal large language models' performance on visual-text tasks by addressing the modality gap.
A multimodal LLM decoder enhancement tool that uses LoRA interventions to improve the emotional accessibility of model outputs.