qtris123/Muon-vs-AdamW

File descriptions:

Muon optimizer.py is taken from the paper "Muon is Scalable for LLM Training" by Moonshot AI. The training pipeline is the main workspace: all configuration and hyperparameters are chosen there, and the main experiments are run from it.
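
For readers unfamiliar with Muon, the sketch below illustrates the Newton-Schulz orthogonalization step that the optimizer applies to each 2D momentum matrix before the weight update. It is a dependency-free illustration, not the code in this repository: the function names, the step count, and the coefficients (taken from the public Muon reference implementation) are assumptions to check against the actual optimizer file.

```python
import math

# Quintic iteration coefficients from the public Muon reference code
# (an assumption here; verify against this repository's optimizer file).
A_COEF, B_COEF, C_COEF = 3.4445, -4.7750, 2.0315


def matmul(x, y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*y)] for row in x]


def transpose(x):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*x)]


def newton_schulz(g, steps=5, eps=1e-7):
    """Approximately orthogonalize g, pushing its singular values toward 1."""
    tall = len(g) > len(g[0])
    # Work on the wide orientation so that x @ x^T is the smaller product.
    x = transpose(g) if tall else [row[:] for row in g]
    # Scale by the Frobenius norm so all singular values start at or below 1.
    norm = math.sqrt(sum(v * v for row in x for v in row)) + eps
    x = [[v / norm for v in row] for row in x]
    for _ in range(steps):
        a = matmul(x, transpose(x))
        aa = matmul(a, a)
        n = len(a)
        b = [[B_COEF * a[i][j] + C_COEF * aa[i][j] for j in range(n)] for i in range(n)]
        bx = matmul(b, x)
        # x <- a*x + (b*A + c*A^2) @ x, with A = x @ x^T
        x = [[A_COEF * x[i][j] + bx[i][j] for j in range(len(x[0]))] for i in range(len(x))]
    return transpose(x) if tall else x


if __name__ == "__main__":
    # Hypothetical 2x2 "gradient" with very different singular values.
    print(newton_schulz([[3.0, 0.0], [0.0, 1.0]]))
```

The result is only approximately orthogonal: the iteration trades exactness for speed, so the singular values land in a band around 1 rather than at exactly 1.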

About

CF on vertical learning with maths qa
