Muon
Muon is a model in our research taxonomy.
Related papers
- Preconditioning Benefits of Spectral Orthogonalization in Muon
- PRISM: Distribution-free Adaptive Computation of Matrix Functions for Accelerating Neural Network Training
- Adaptive Batch Sizes Using Non-Euclidean Gradient Noise Scales for Stochastic Sign and Spectral Descent
- Advancing Model Refinement: Muon-Optimized Distillation and Quantization for LLM Deployment