Open
Description
🚀 The feature, motivation and pitch
The Muon optimizer appears to converge faster and more stably than the Adam optimizer. Could you please consider adding it to the torch optimizers? Write-up here. Implementation here.
Thanks, PyTorch team. You guys rock!
Alternatives
No response
Additional context
No response