Video Reasoning Comparison Hub
4 papers - avg viability 5.3
Top Papers
- Are Video Reasoning Models Ready to Go Outside? (7.0)
ROVA enhances the robustness of vision-language models against real-world disturbances through a novel training framework.
- VisionCoach: Reinforcing Grounded Video Reasoning via Visual-Perception Prompting (7.0)
VisionCoach enhances video reasoning by using visual prompting to improve spatio-temporal grounding during training.
- Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning (4.0)
Applies a video generation model to visual reasoning tasks such as maze navigation and tangram puzzles.
- Clue Matters: Leveraging Latent Visual Clues to Empower Video Reasoning (3.0)
ClueNet enhances video question answering by improving visual clue extraction and reasoning alignment.