Model Optimizations

In addition to tuning performance using ONNX Runtime configurations, there are techniques that can be applied to reduce model size and/or complexity to improve performance.