Skip to content

OptimAI-Lab/large_lr_sgd

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

Revisiting the Adam-SGD Gap in LLM Pre-Training: The Role of Large Effective Learning Rates

Preliminary code release for our paper "Revisiting the Adam-SGD Gap in LLM Pre-Training: The Role of Large Effective Learning Rates", by Athanasios Glentis, Dawei Li, Chung-Yiu Yau and Mingyi Hong.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages