3 papers - avg viability 5.0
A novel learning rate scheduler to stabilize RL training for large language models by dynamically adjusting based on model response length.
Manifold Aware Denoising Score Matching optimizes manifold learning in data distributions using a computationally efficient score decomposition approach.
Accelerate the training of robust machine learning models by safely screening out irrelevant data points.