-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
不同个数的输出导致运行结果不一致 #4284
Comments
Maybe the fusion methods(tactic choice) are different. |
How to solve it? |
You can compare the build logs of two onnx by |
Thanks! There is one other problem in Tensorrt 10.7.0 : when building the trt engine, we set the parameter value of config: |
Description
(部分linear attetion代码)
hidden_states = query_KV / query_Z
return hidden_states, query_KV, query_Z
和
hidden_states = query_KV / query_Z
return hidden_states
上面两者方式onnx转tensorrt时,两者结果不一样,前者是正确的,后者会出现nan;
这相当于输出了中间状态会导致结果的正确性,该怎么解决这种问题哇?
后面测试:tensorrt 10.7会出现这种问题,10.6是正确的
The text was updated successfully, but these errors were encountered: