NVIDIA / Model-Optimizer Public

Notifications You must be signed in to change notification settings
Fork 357
Star 2.5k

Code
Issues 61
Pull requests 135
Actions
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Security and quality
Insights

Pull requests: NVIDIA/Model-Optimizer

Labels 31 Milestones 0

New pull request New

135 Open 788 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Refactor: code reuse for dflash/eagle; Deprecate parallel draft

#1271 opened Apr 16, 2026 by h-guo18 Contributor • Draft

Fix debugger server failing to detect editable-installed modelopt

#1270 opened Apr 16, 2026 by cjluo-nv Collaborator

Loading…

2 tasks done

Jingyux/diffusion skip softmax 2

#1269 opened Apr 15, 2026 by jingyu-ml Contributor • Draft

Centralize 'trtexec' subprocess runs in ONNX into a single function

#1268 opened Apr 15, 2026 by gcunhase Contributor

Loading…

Fix DeepEP TMA constraint violation for MoE CUDA graph batch sizes

#1267 opened Apr 15, 2026 by kevalmorabia97 Collaborator

Loading…

1 task

[chore]: weekly bump of uv.lock on main (2026-04-15)

#1266 opened Apr 15, 2026 by github-actions bot

Loading…

Handle zero-amax per-channel activation scaling for MoE export

#1265 opened Apr 15, 2026 by AEON-7

Loading…

Fix non-scalar input amax in preprocess_linear_fusion for MoE export

#1264 opened Apr 15, 2026 by AEON-7

Loading…

Add ResNet50 support for torch_onnx quantization workflow

#1263 opened Apr 14, 2026 by ajrasane Contributor

Loading…

2 tasks done

Exclude small-k and small-n Matmul nodes from Int8 quantization

#1256 opened Apr 14, 2026 by nv-samcheng Contributor

Loading…

Add EfficientViT support for torch_onnx quantization workflow

#1254 opened Apr 14, 2026 by ajrasane Contributor

Loading…

3 tasks done

Add a general composable $import system for YAML configs, and use it to implement composable recipes

#1253 opened Apr 14, 2026 by shengliangxu Collaborator

Loading…

Add a standalone monitor skill for persistent job tracking

#1252 opened Apr 14, 2026 by kaix-nv Contributor

Loading…

Add layerwise calibration for large models

#1251 opened Apr 13, 2026 by realAsma Contributor

Loading…

1 task

fix(launcher): use afterany dependency for allow_to_fail pipelines

#1248 opened Apr 13, 2026 by yeyu-nvidia Contributor

Loading…

3 tasks

Add LAQ (Learnable Amax Quantization) algorithm

#1247 opened Apr 13, 2026 by realAsma Contributor

Loading…

4 tasks

vLLM fakequant fold weight_quantizer for megatron export

#1246 opened Apr 13, 2026 by kinjalpatel27 Contributor

Loading…

vLLM fakequant export update for AWQ checkpoint

#1242 opened Apr 13, 2026 by kinjalpatel27 Contributor

Loading…

feat: parallelize fakequant export across GPUs via ThreadPoolExecutor

#1241 opened Apr 13, 2026 by kinjalpatel27 Contributor

Loading…

Add dep check for ptq and runtime check for evaluation/deployment

#1240 opened Apr 12, 2026 by kaix-nv Contributor

Loading…

[1/N] Polish evaluation skills and common skills based on an E2E workflow testing

#1239 opened Apr 12, 2026 by Edwardf0t1 Contributor

Loading…

[1/N] Polish deployment skills - Add a debug loop for unsupported models

#1236 opened Apr 11, 2026 by Edwardf0t1 Contributor

Loading…

support Qwen3.5 quantization

#1230 opened Apr 10, 2026 by deepindeed2022

Loading…

[2/3] Implicit Gemm NVFP4

#1227 opened Apr 9, 2026 by jingyu-ml Contributor

Loading…

GPTQ vector

#1223 opened Apr 9, 2026 by sugunav14 Contributor

Loading…

Previous 1 2 3 4 5 6 Next

Previous Next

ProTip! Follow long discussions with comments:>50.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!