Skip to content

Pull requests: AI-Hypercomputer/maxtext

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add eval code owners
#3892 opened May 13, 2026 by dipannita08 Collaborator Loading…
4 tasks done
Experimental skills directory
#3891 opened May 13, 2026 by dipannita08 Collaborator Loading…
4 tasks done
mHC optimization for expansion rate 4 (flag controlled) gemini-review
#3890 opened May 12, 2026 by dandragona-dev Collaborator Loading…
4 tasks done
Add nnx weight init logic when vocab tiling is enabled
#3889 opened May 12, 2026 by NuojCheng Collaborator Draft
4 tasks done
Support pytest markers regardless of decorator order
#3888 opened May 12, 2026 by igorts-git Collaborator Draft
4 tasks done
Add OLMo-3 7B stage-1 pretraining scripts gemini-review
#3886 opened May 12, 2026 by gagika Collaborator Loading…
4 tasks done
Do not block Add Label due to skipped checkRun pull ready
#3885 opened May 12, 2026 by charlesli640 Collaborator Loading…
4 tasks done
Plumbing and core MoE logic for router replay
#3881 opened May 12, 2026 by xuefgu Collaborator Draft
4 tasks done
Improve error message when tokenize_data config doesn't match dataset gemini-review
#3879 opened May 12, 2026 by aireenmei Collaborator Loading…
4 tasks done
enable aot identification test
#3876 opened May 11, 2026 by NuojCheng Collaborator Loading…
4 tasks done
write dequantization scripts for DeepSeek V4 FP4/FP8 weights
#3873 opened May 11, 2026 by snehalv2002 Collaborator Loading…
4 tasks done
Add zero1 aot support in train compile
#3872 opened May 11, 2026 by NuojCheng Collaborator Loading…
4 tasks done
Implement custom MoE HashRouter, TopKRouter, and sqrtsoftplus
#3871 opened May 11, 2026 by parambole Collaborator Draft
4 tasks
Conditionally branch tokamax.ragged_dot calls based on use_manual_quantization
#3869 opened May 11, 2026 by zxhe-sean Collaborator Loading…
4 tasks done
DeepSeek V4 Integration
#3867 opened May 11, 2026 by parambole Collaborator Draft
4 tasks
Implement DeepSeek-V4 Compressed Attention Layers
#3866 opened May 11, 2026 by parambole Collaborator Draft
4 tasks
DeepSeek-V4 Core Primitives
#3865 opened May 11, 2026 by parambole Collaborator Draft
4 tasks
[DeepSeek v3] Add grad mask and update MLA init gemini-review
#3864 opened May 10, 2026 by gagika Collaborator Loading…
4 tasks done
Enable Qwen3-Omni SFT on ChartQA
#3863 opened May 10, 2026 by hengtaoguo Collaborator Draft
4 tasks
Optimize MaxText unit and integration test suite runtime gemini-review pull ready
#3860 opened May 9, 2026 by shralex Collaborator Loading…
4 tasks done
Update optimization docs and add TPU v7x guide
#3857 opened May 8, 2026 by jacoguzo Collaborator Loading…
4 tasks done
Update docker image guide
#3855 opened May 8, 2026 by melissawm Collaborator Loading…
1 task done
Update JAX to 0.10.0 for pre-training
#3854 opened May 8, 2026 by SurbhiJainUSC Collaborator Draft
4 tasks done
ProTip! Add no:assignee to see everything that’s not assigned.