Top papers
- KLong: Training LLM Agent for Extremely Long-horizon Tasks(8.0)
- ShipTraj-R1: Reinforcing Ship Trajectory Prediction in Large Language Models via Group Relative Policy Optimization(7.0)
- No One Size Fits All: QueryBandits for Hallucination Mitigation(7.0)
- CodeTaste: Can LLMs Generate Human-Level Code Refactorings?(7.0)