Skip to content

Pull requests: NVIDIA/Model-Optimizer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix debugger server failing to detect editable-installed modelopt
#1270 opened Apr 16, 2026 by cjluo-nv Collaborator Loading…
2 tasks done
Jingyux/diffusion skip softmax 2
#1269 opened Apr 15, 2026 by jingyu-ml Contributor Draft
Centralize 'trtexec' subprocess runs in ONNX into a single function
#1268 opened Apr 15, 2026 by gcunhase Contributor Loading…
Fix DeepEP TMA constraint violation for MoE CUDA graph batch sizes
#1267 opened Apr 15, 2026 by kevalmorabia97 Collaborator Loading…
1 task
[chore]: weekly bump of uv.lock on main (2026-04-15)
#1266 opened Apr 15, 2026 by github-actions bot Loading…
Add ResNet50 support for torch_onnx quantization workflow
#1263 opened Apr 14, 2026 by ajrasane Contributor Loading…
2 tasks done
Exclude small-k and small-n Matmul nodes from Int8 quantization
#1256 opened Apr 14, 2026 by nv-samcheng Contributor Loading…
Add EfficientViT support for torch_onnx quantization workflow
#1254 opened Apr 14, 2026 by ajrasane Contributor Loading…
3 tasks done
Add a standalone monitor skill for persistent job tracking
#1252 opened Apr 14, 2026 by kaix-nv Contributor Loading…
Add layerwise calibration for large models
#1251 opened Apr 13, 2026 by realAsma Contributor Loading…
1 task
fix(launcher): use afterany dependency for allow_to_fail pipelines
#1248 opened Apr 13, 2026 by yeyu-nvidia Contributor Loading…
3 tasks
Add LAQ (Learnable Amax Quantization) algorithm
#1247 opened Apr 13, 2026 by realAsma Contributor Loading…
4 tasks
vLLM fakequant fold weight_quantizer for megatron export
#1246 opened Apr 13, 2026 by kinjalpatel27 Contributor Loading…
vLLM fakequant export update for AWQ checkpoint
#1242 opened Apr 13, 2026 by kinjalpatel27 Contributor Loading…
Add dep check for ptq and runtime check for evaluation/deployment
#1240 opened Apr 12, 2026 by kaix-nv Contributor Loading…
support Qwen3.5 quantization
#1230 opened Apr 10, 2026 by deepindeed2022 Loading…
[2/3] Implicit Gemm NVFP4
#1227 opened Apr 9, 2026 by jingyu-ml Contributor Loading…
GPTQ vector
#1223 opened Apr 9, 2026 by sugunav14 Contributor Loading…
ProTip! Follow long discussions with comments:>50.