Qwen3-8B
Qwen3-8B is a model in our research taxonomy.
Related papers
- Ada-RS: Adaptive Rejection Sampling for Selective Thinking
- CoDiQ: Test-Time Scaling for Controllable Difficult Question Generation
- Training-Trajectory-Aware Token Selection
- Robust Tool Use via Fission-GRPO: Learning to Recover from Execution Errors
- HiMAP-Travel: Hierarchical Multi-Agent Planning for Long-Horizon Constrained Travel
- Miner: Mining Intrinsic Mastery for Data-Efficient RL in Large Reasoning Models
- BoRP: Bootstrapped Regression Probing for Scalable and Human-Aligned LLM Evaluation
- Probing the Trajectories of Reasoning Traces in Large Language Models
- Private LLM Inference on Consumer Blackwell GPUs: A Practical Guide for Cost-Effective Local Deployment in SMEs