I am using ONNX Runtime to generate a TensorRT engine from an ONNX model file.
I am using trt_options.trt_engine_cache_enable = 1; and trt_options.trt_engine_cache_path = "./path"; to generate the engine file once and load it on later runs, which works well and gives a large speedup.
However, I'm not sure what exactly that engine file is.
Does it include the model's weights?
Can it be used to load the full model without access to the .onnx model file?
My question arises because we encrypt our model files and load them from RAM at runtime, so that other people with access to the system don't have access to our models. If the cache effectively gives full access to the model, we will need a different solution.
Does it include the model's weights?
By default the engine will include the weights.
Can it be used to load the full model without access to the .onnx model file?
Once the engine is built, the ONNX model is no longer needed for inference. Note that the engine only works for the configuration specified at build time (optimization profile, precision, GPU version, etc.).
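For readers using the Python API, the cache settings from the question can be sketched roughly as below. This is a minimal sketch, not the original poster's code: the helper names `tensorrt_providers` and `create_session` are hypothetical, the CUDA fallback provider is an assumption, and the model is assumed to be passed as decrypted bytes held in RAM, as described in the question. The option keys are the Python-side counterparts of the `trt_options` fields.

```python
def tensorrt_providers(cache_dir):
    """Build a providers list that enables the TensorRT engine cache.

    Hypothetical helper: the option keys mirror the trt_options fields
    from the question; the CUDA fallback is an assumption.
    """
    trt_options = {
        # Build the engine once and reuse it on later runs.
        "trt_engine_cache_enable": True,
        # Directory where the engine file is written. By default the
        # engine includes the weights, so treat this directory as
        # sensitive as the model itself.
        "trt_engine_cache_path": cache_dir,
    }
    return [
        ("TensorrtExecutionProvider", trt_options),
        "CUDAExecutionProvider",
    ]


def create_session(model_bytes, cache_dir):
    """Create a session from an in-memory model (hypothetical helper)."""
    # Imported lazily so the options helper works without onnxruntime installed.
    import onnxruntime as ort

    return ort.InferenceSession(model_bytes, providers=tensorrt_providers(cache_dir))
```

With this setup the first call to `create_session` builds the engine and writes it under `cache_dir`; later calls with the same configuration should load it from there. Because the cached engine holds the weights in unencrypted form, anyone who can read that directory can read the engine.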