Pull requests: EleutherAI/gpt-neox

Pull requests list

Fix warmup LR being zero at step 0
#1379 opened Apr 8, 2026 by Robby955
fix: gMLP uses full bias instead of truncated bias
#1371 opened Feb 11, 2026 by Mr-Neutr0n
fix: correct inverted Sinkhorn early-stop condition
#1370 opened Feb 11, 2026 by Mr-Neutr0n
Bump pip from 23.3.2 to 26.0 in /requirements (labels: dependencies, python)
#1369 opened Feb 3, 2026 by dependabot bot
Add assert to catch case when neither epochs or iters is set
#1368 opened Jan 11, 2026 by aflah02 Contributor
Update local_setup.yml
#1366 opened Nov 6, 2025 by dannyandchantellebailey-design
L1 Norm Logging for Activations
#1362 opened Jun 9, 2025 by Quentin-Anthony Member
Log from all ranks
#1361 opened Jun 9, 2025 by Quentin-Anthony Member
Add verbosity and refactor startup logging
#1359 opened Jun 9, 2025 by Quentin-Anthony Member
Add unit tests for dmoe, context parallelism, and muP
#1358 opened Jun 7, 2025 by Quentin-Anthony Member
Update Evaluation Logic to Latest lm_eval (0.4.8) and Support Automatic Benchmark Evals w/o Validation Set (label: dependencies)
#1348 opened Mar 21, 2025 by Kyle1668 Contributor
update deeperspeed version
#1333 opened Jan 29, 2025 by Quentin-Anthony Member Draft
CUDA/Pytorch multiprocessing workaround and test fixes
#1304 opened Oct 14, 2024 by AI-WAIFU Contributor Draft
Add support for context parallelism
#1299 opened Oct 1, 2024 by bclyang Contributor
TE integration via full TransformerLayer
#1297 opened Sep 30, 2024 by tf-nv Contributor
Add initial ring flash attention support
#1266 opened Sep 4, 2024 by dmahan93 Contributor Draft
Add tensor parallelism for RWKV
#1237 opened Jun 19, 2024 by jahatef Collaborator
Add lora support
#1225 opened May 20, 2024 by mkerin Contributor Draft
Adding replay into GPT-NeoX
#1200 opened Apr 13, 2024 by AIproj Contributor
Added infinite lr schedules
#1194 opened Mar 25, 2024 by kshitijkg Contributor