Performance Issues
ONNX Runtime
How to profile my super slow WebNN implementation?
The WebNN execution provider (EP) of ONNX Runtime Web requires fixed integer values to be specified via freeDimensionOverrides
for all symbolic dimensions; otherwise, the optimizations will not be applied.
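As a minimal sketch of what this can look like, the helper below builds session options that pin every symbolic dimension to a fixed integer. The helper `buildWebNNSessionOptions`, the local `WebNNSessionOptions` interface, and the dimension names `batch_size` and `sequence_length` are hypothetical; substitute the symbolic dimension names your model actually declares.

```typescript
// Minimal local shape of the options used here. In a real project the type
// comes from onnxruntime-web (ort.InferenceSession.SessionOptions).
interface WebNNSessionOptions {
  executionProviders: Array<{ name: "webnn"; deviceType: "cpu" | "gpu" | "npu" }>;
  freeDimensionOverrides: Record<string, number>;
}

// Hypothetical helper: validates that every override is a positive integer
// and returns options that select the WebNN EP.
function buildWebNNSessionOptions(
  overrides: Record<string, number>,
  deviceType: "cpu" | "gpu" | "npu" = "gpu",
): WebNNSessionOptions {
  for (const [name, value] of Object.entries(overrides)) {
    if (!Number.isInteger(value) || value <= 0) {
      throw new Error(`Dimension "${name}" must be a positive integer, got ${value}`);
    }
  }
  return {
    executionProviders: [{ name: "webnn", deviceType }],
    freeDimensionOverrides: { ...overrides },
  };
}

// Assumed dimension names, for illustration only.
const options = buildWebNNSessionOptions({ batch_size: 1, sequence_length: 128 });

// With onnxruntime-web loaded as `ort`, the options would be passed like this:
// const session = await ort.InferenceSession.create("model.onnx", options);
```

Validating the overrides up front is a design choice of this sketch, not a requirement of ONNX Runtime Web; it simply surfaces a missing or non-integer dimension before the session is created.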
Oh wow! Fixing the argument names makes it 100x faster!
See Also:
Why am I not seeing performance benefits on Intel GPUs?
DirectML metacommand execution is disabled by default on Intel GPUs with driver versions between 31.0.101.5518 and 32.0.101.6083. To enable metacommands and evaluate performance on affected GPU drivers, add the following flag:
--disable_dml_meta_commands_for_gpu=0
Note: This flag only affects GPU inference and has no impact on NPU operations.
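To decide whether a given machine falls in the affected range, the driver version can be compared component by component. The sketch below is illustrative only: the function names are hypothetical, the bounds are treated as inclusive (an assumption, since the text above only says "between"), and obtaining the driver version string is left to the caller.

```typescript
// Bounds taken from the text above; treating them as inclusive is an assumption.
const AFFECTED_MIN = "31.0.101.5518";
const AFFECTED_MAX = "32.0.101.6083";

// Parse a four-part driver version such as "31.0.101.5518" into numbers.
function parseDriverVersion(version: string): number[] {
  const trimmed = version.trim();
  if (!/^\d+(\.\d+){3}$/.test(trimmed)) {
    throw new Error(`Unexpected driver version format: ${version}`);
  }
  return trimmed.split(".").map((part) => Number(part));
}

// Return a negative number, zero, or a positive number when a is lower than,
// equal to, or higher than b.
function compareDriverVersions(a: string, b: string): number {
  const left = parseDriverVersion(a);
  const right = parseDriverVersion(b);
  for (let i = 0; i < left.length; i++) {
    if (left[i] !== right[i]) {
      return left[i] - right[i];
    }
  }
  return 0;
}

// Hypothetical helper: true when the version lies within the affected range.
function isInAffectedDriverRange(version: string): boolean {
  return (
    compareDriverVersions(version, AFFECTED_MIN) >= 0 &&
    compareDriverVersions(version, AFFECTED_MAX) <= 0
  );
}
```

Comparing numeric components rather than raw strings avoids ordering mistakes such as "101.999" sorting after "101.5518" lexicographically.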
LiteRT