Language Model Optimization Comparison Hub
4 papers - avg viability 4.0
Top Papers
- Predictive Batch Scheduling: Accelerating Language Model Training Through Loss-Aware Sample Prioritization(5.0)
Develop a tool that accelerates language model training by prioritizing high-loss samples for improved convergence speed.
- Lost in the Prompt Order: Revealing the Limitations of Causal Attention in Language Models(4.0)
Leverage causal attention insights to improve language model prompt performance for multiple-choice QA applications.
- Breaking the Overscaling Curse: Thinking Parallelism Before Parallel Thinking(4.0)
Optimize parallelism in language models to reduce computational costs and maintain performance.
- Can I Have Your Order? Monte-Carlo Tree Search for Slot Filling Ordering in Diffusion Language Models(3.0)
McDiffuSE enhances slot infilling order in diffusion language models using Monte Carlo Tree Search to improve generation quality.