Chain-of-ThoughtGRPO
Top papers
- ClueTracer: Question-to-Vision Clue Tracing for Training-Free Hallucination Suppression in Multimodal Reasoning(7.0)
- Concise Geometric Description as a Bridge: Unleashing the Potential of LLM for Plane Geometry Problem Solving(6.0)
- Evolving from Tool User to Creator via Training-Free Experience Reuse in Multimodal Reasoning(5.0)
- Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning(2.0)