Fix division by zero in LR scheduler when max_steps equals warmup_steps by Br1an67 · Pull Request #2212 · Lightning-AI/litgpt

Br1an67 · 2026-03-09T02:47:16Z

Summary

This PR fixes a division by zero error that occurs in the learning rate scheduler when max_steps equals lr_warmup_steps. The issue was caused by the CosineAnnealingLR scheduler receiving a T_max value of 0, which results in a ZeroDivisionError during training.

Changes

Added validation in the get_lr_scheduler function across all finetune modules (adapter.py, adapter_v2.py, full.py, lora.py, lora_legacy.py)
The validation ensures that max_steps > warmup_steps before creating the scheduler
If validation fails, a clear error message is raised indicating the problematic values

Testing

The fix was verified by:

Ensuring all modified Python files parse correctly
The validation will catch the problematic case early with a clear error message instead of failing during training with a cryptic division by zero error

Files Changed

 litgpt/finetune/adapter.py     | 2 ++
 litgpt/finetune/adapter_v2.py  | 2 ++
 litgpt/finetune/full.py        | 2 ++
 litgpt/finetune/lora.py        | 2 ++
 litgpt/finetune/lora_legacy.py | 2 ++
 5 files changed, 10 insertions(+)

… LR scheduler

fix: validate max_steps > warmup_steps to prevent division by zero in…

870e21b

… LR scheduler

Br1an67 requested review from KaelanDt, andyland, k223kim, lantiga, lianakoleva and t-vi as code owners March 9, 2026 02:47

Br1an67 mentioned this pull request Mar 9, 2026

LR scheduler can result in a division by 0 #1393

Open

Br1an67 force-pushed the fix/issue-1393-lr-scheduler-validation branch from a88745a to 870e21b Compare March 18, 2026 00:45

lianakoleva approved these changes Mar 21, 2026

View reviewed changes

Merge branch 'main' into fix/issue-1393-lr-scheduler-validation

972e63f

alvinttang mentioned this pull request Apr 25, 2026

Fix lr_warmup_fraction silently rejected by default lr_warmup_steps=100 #2243

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix division by zero in LR scheduler when max_steps equals warmup_steps#2212

Fix division by zero in LR scheduler when max_steps equals warmup_steps#2212
Br1an67 wants to merge 2 commits into
Lightning-AI:mainfrom
Br1an67:fix/issue-1393-lr-scheduler-validation

Br1an67 commented Mar 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Br1an67 commented Mar 9, 2026

Summary

Changes

Testing

Files Changed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants