NorMuon: Making Muon More Efficient and Scalable

Publication
Preprint