
feat: add ONNX Runtime inference backend with PyTorch fallback (#12) #32

Open

jshaofa-ui wants to merge 1 commit into Climate-Vision:main from jshaofa-ui:feature/onnx-runtime-inference

Conversation

@jshaofa-ui

[Good First Issue] Add ONNX Runtime inference path with PyTorch fallback

Resolves #12

Summary

Implements a complete ONNX Runtime inference backend for ClimateVision with automatic PyTorch fallback, enabling faster inference on CPU and edge devices while maintaining full compatibility with existing PyTorch models.

Changes

  • onnx_runtime.py (540 lines) - ONNX Runtime engine with session management, inference, and benchmarking
  • onnx_export.py (422 lines) - PyTorch to ONNX model export for U-Net and Siamese networks
  • __init__.py (67 lines) - Unified module API combining PyTorch and ONNX inference
  • test_onnx_runtime.py (873 lines) - 32 unit tests across 11 test classes
  • onnx-runtime-guide.md - Complete usage documentation

Core Features

  1. ONNXSession - Cached session manager with automatic CPU/CUDA provider selection
  2. run_onnx_inference() - Batch inference with latency tracking
  3. benchmark_onnx_model() - Full benchmarking (p50/p95/p99 latency + FPS)
  4. export_unet_to_onnx() / export_siamese_to_onnx() - Dynamic axis, configurable opset
  5. run_inference_with_fallback() - Automatic ONNX to PyTorch fallback
  6. validate_onnx_model() - Cross-validation with PyTorch output
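
For orientation, a minimal usage sketch of the API above. The function names come from this feature list, but the import path, exact signatures, and return shapes are assumptions rather than confirmed details (analysis_type is borrowed from the fallback discussion in the review below):

    import numpy as np
    from climatevision.inference import run_inference_with_fallback, benchmark_onnx_model

    # NCHW float32 batch, as U-Net style models typically expect.
    image = np.random.rand(4, 3, 256, 256).astype(np.float32)

    # Tries the ONNX Runtime session first; falls back to PyTorch on failure.
    result = run_inference_with_fallback(image, analysis_type="segmentation")

    # Benchmark a model file; the PR reports p50/p95/p99 latency and FPS.
    stats = benchmark_onnx_model("models/unet.onnx")
    print(stats)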

Test Coverage

  • 11 test classes: ONNXSession caching, device selection, inference, benchmarking, export, validation, fallback, integration
  • 32 unit tests total
  • Graceful skip when torch/onnx not available

Technical Details

  • Zero breaking changes to existing inference pipeline
  • Automatic provider selection based on hardware availability
  • Session caching for repeated inference calls
  • Full numerical validation against PyTorch baseline
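
As a sketch of what automatic provider selection typically looks like against the public onnxruntime API (an illustration of the technique, not the PR's actual code; note that device_id must be an int, which the review below flags):

    import onnxruntime as ort

    def select_providers():
        # Prefer CUDA when the runtime reports it, otherwise CPU only.
        available = ort.get_available_providers()
        providers = []
        if "CUDAExecutionProvider" in available:
            # Options dict values must have the right types: device_id is an int.
            providers.append(("CUDAExecutionProvider", {"device_id": 0}))
        providers.append("CPUExecutionProvider")
        return providers

    # InferenceSession accepts a mixed list of names and (name, options) tuples.
    session = ort.InferenceSession("models/unet.onnx", providers=select_providers())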

Closes Climate-Vision#12

@Goldokpa (Member) left a comment


Thanks for the substantial PR! Solid scope and good test coverage. Before this can move forward there are a few real bugs and a few things worth questioning:

Bugs that will break at runtime:

  1. run_onnx_inference warm-up call is malformed. In onnx_runtime.py:

    session.run(session.input_name, {session.input_name: image[:1]})

    ONNXSession.run only takes input_data (one positional arg), and the underlying _session.run expects (output_names, input_feed) where output_names is None or a list, not a string. This will throw on every call. It should just be session.run(image[:1]); see the sketch after this list.

  2. all_outputs.extend(outputs) followed by all_outputs[0]. session.run() returns a list of arrays (one per model output). For a single-output model with one batch, extend flattens this to [array], but with multiple batches it stitches outputs from different batches together as siblings instead of concatenating along the batch axis. The downstream argmax/softmax then runs only on the first batch's logits and silently drops the rest. Use np.concatenate([out[0] for out in batched_outputs], axis=0) or similar; see the sketch after this list.

  3. Fallback path calls pytorch_inference(image, analysis_type=...) but image is a numpy array. Worth confirming run_inference in pipeline.py accepts that — if it expects a tensor or a file path, the fallback will always fail.

  4. _EXECUTION_PROVIDERS is malformed. The CUDA entry is ("CUDAExecutionProvider", {"device_id": "0"}) — but device_id should be an int, not a string, per the ORT API. Also ort.InferenceSession(providers=...) expects either a list of strings or a list of (name, options_dict) tuples; the code mixes both formats and the selected_providers filter only checks the name, so it passes the malformed tuple through unchanged.
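
To make bugs 1 and 2 concrete, a corrected version of the batched-inference wiring might look like this (a sketch against the described ONNXSession API, assuming run(input_data) returns onnxruntime's usual list of output arrays):

    import numpy as np

    def run_batched(session, image, batch_size=8):
        # Bug 1 fix: warm-up passes only the input array; ONNXSession.run
        # builds output_names and the input_feed dict internally.
        session.run(image[:1])

        # Bug 2 fix: collect each batch's output list, then concatenate the
        # first (and only) model output along the batch axis instead of
        # flattening sibling lists together with extend().
        batched_outputs = []
        for start in range(0, len(image), batch_size):
            batched_outputs.append(session.run(image[start:start + batch_size]))
        return np.concatenate([out[0] for out in batched_outputs], axis=0)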

Things that look suspicious / worth challenging:

  1. A hardcoded developer path leaks into the PR: pip install -e /home/fa/projects/climatevision-work appears in docs/onnx-runtime-guide.md, and the docstring of onnx_runtime.py references the same checkout. That path should be removed. Related: _DEFAULT_ONNX_DIR = parents[3] is worth verifying. The file sits at src/climatevision/inference/onnx_runtime.py, so parents[3] does resolve to the repo root and is correct here, but the same index should be double-checked in onnx_export.py.

  2. An 873-line test file with 40+ test cases for a new module is unusual. Many tests follow the pattern try: import onnxruntime / except: pytest.skip(), meaning that in CI without onnxruntime installed (a new dependency this PR introduces), almost every test silently skips. test_session_raises_without_onnxruntime literally contains pass # Skip this test as it requires complex mocking, so it asserts nothing. Worth pruning to focused tests that actually run, or pinning onnxruntime as a test dependency; see the sketch after this list.

  3. Dependencies aren't added to pyproject.toml / requirements.txt. The PR introduces onnx>=1.14.0 and onnxruntime>=1.15.0 but only mentions them in the markdown doc. Imports will fail in any environment that hasn't been manually prepared.

  4. run_onnx_inference has no path that returns a dict[str, Any] despite the return type annotation ONNXInferenceResult | dict[str, Any]. Either the dict branch is missing or the annotation is wrong.

  5. export_model_from_checkpoint calls torch.load(...) without weights_only=True. Recent PyTorch versions warn on this, and it is a security concern for untrusted checkpoints (see the sketch after this list).
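
On points 2 and 5, the usual fixes look roughly like this (a sketch; the file and checkpoint names are placeholders, not paths from the PR):

    import pytest
    import torch

    # Point 5: weights_only=True restricts unpickling to tensors and
    # primitives, so an untrusted checkpoint cannot execute code on load.
    state_dict = torch.load("checkpoints/unet.pt", map_location="cpu", weights_only=True)

    # Point 2: a collection-time skip reports the whole module as skipped
    # when onnxruntime is missing, instead of per-test try/except blocks
    # that can silently pass while asserting nothing.
    onnxruntime = pytest.importorskip("onnxruntime")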

Happy to re-review once these are addressed. The overall architecture (session caching, fallback path, benchmark dataclass) is reasonable — the issues are mostly in the wiring.
