
How to fuse a QuantizeLinear node with my custom op when converting ONNX to a TensorRT engine #4270

Open
AnnaTrainingG opened this issue Dec 5, 2024 · 1 comment
Labels: ONNX (Issues relating to ONNX usage and import), triaged (Issue has been triaged by maintainers)

Comments

AnnaTrainingG commented Dec 5, 2024

How can I fuse the QuantizeLinear node with my custom op when converting ONNX to a TensorRT engine?
I see that QuantizeLinear is launched as its own kernel:
[image: trace showing the QuantizeLinear kernel call]
It takes a long time. How can I fuse it into my custom kernel that runs right before the QuantizeLinear node? And what exactly does QuantizeLinear compute?
The fusion I want:
my_custom_op - Q -- DQ --- Conv  =>  my_custom_op_with_Q --- (DQ_Conv)
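
(For reference, the ONNX operator spec defines QuantizeLinear as y = saturate(round(x / y_scale) + y_zero_point), with round-half-to-even and saturation to the output type's range, [-128, 127] for int8. A minimal CUDA sketch of the int8 case:

```cuda
#include <cstdint>

// ONNX QuantizeLinear, int8 case:
//   y = saturate(round(x / y_scale) + y_zero_point)
__device__ __forceinline__ int8_t quantize_linear(float x, float y_scale,
                                                  int32_t y_zero_point) {
    // __float2int_rn rounds to nearest-even, matching ONNX rounding.
    int32_t q = __float2int_rn(x / y_scale) + y_zero_point;
    return static_cast<int8_t>(max(-128, min(127, q)));
}
```
)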

@lix19937

First, build with --best to check whether your model's fusion behavior matches your expected goal.
Then, fuse the scale into your plugin.
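
To make "fuse the scale into your plugin" concrete, here is a minimal CUDA sketch under stated assumptions: my_custom_op_elem and all other names are hypothetical stand-ins for whatever the plugin actually computes. The quantize step is applied as an epilogue, so the result is written once as int8 instead of a float output plus a separate QuantizeLinear kernel pass:

```cuda
#include <cstdint>

// Hypothetical placeholder for the plugin's real per-element math.
__device__ __forceinline__ float my_custom_op_elem(float x) {
    return x;
}

// my_custom_op_with_Q: the custom op with the QuantizeLinear epilogue fused in.
__global__ void my_custom_op_with_q(const float* in, int8_t* out, int n,
                                    float y_scale, int32_t y_zero_point) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n) return;
    float v = my_custom_op_elem(in[i]);
    // Same saturate(round(v / scale) + zero_point) as the standalone Q node.
    int32_t q = __float2int_rn(v / y_scale) + y_zero_point;
    out[i] = static_cast<int8_t>(max(-128, min(127, q)));
}
```

After fusing, the standalone Q node would be removed from the ONNX graph (e.g. with onnx-graphsurgeon) and the plugin would declare an int8 output, leaving the DQ + Conv pair to TensorRT's built-in fusion.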

asfiyab-nvidia added the triaged and ONNX labels Dec 16, 2024