Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RT-DETR FP16 inference get correct result on v100 but weird result on a10 #3652

Closed
xiaochus opened this issue Feb 5, 2024 · 5 comments
Closed
Assignees
Labels
triaged Issue has been triaged by maintainers

Comments

@xiaochus
Copy link

xiaochus commented Feb 5, 2024

Description

I am using tritonserver:23.10 to deploy RT-DETR model. The onnxruntime fp32, onnxruntime fp16 and Tesla V100 TRT 8.6.1 FP16/F32 both get the correct result. But Tesla A10 TRT 8.6.1 get correct result in FP32 and weird result in FP16. The FP16 result should be same both in V100 and A10 with the same code.

A10 - TRT - FP16
image

V100 - TRT - FP16
image

Environment

TensorRT Version: TensoRT 8.6.1

NVIDIA GPU: Tesla V100 / A10

NVIDIA Driver Version: 515.65.01

CUDA Version: 12.2

CUDNN Version: v8

Operating System: Ubuntu 22.04

Python Version (if applicable): 3.10

Tensorflow Version (if applicable):

PyTorch Version (if applicable): 2.0.1

Baremetal or Container (if so, version): tritonserver 23.10

Relevant Files

https://github.com/lyuwenyu/RT-DETR

Steps To Reproduce

/usr/src/tensorrt/bin/trtexec --onnx=model.onnx --saveEngine=model.plan --fp16

@zerollzeng
Copy link
Collaborator

  1. Did you test metric like mAP?
  2. Could you please share the onnx here for reproduce.

Thanks!

@zerollzeng zerollzeng self-assigned this Feb 7, 2024
@zerollzeng zerollzeng added the triaged Issue has been triaged by maintainers label Feb 7, 2024
@zerollzeng
Copy link
Collaborator

FP16 may introduce accuracy drop so it's hard to said whether it's bug unless we have generic metric like mAP.

@ttyio
Copy link
Collaborator

ttyio commented Mar 5, 2024

closing since no activity for more than 3 weeks, thanks all!

@ttyio ttyio closed this as completed Mar 5, 2024
@chinakook
Copy link

#3700

@chinakook
Copy link

Please reopen this issue to track Ampere accuracy lose issue on all detr like models.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
triaged Issue has been triaged by maintainers
Projects
None yet
Development

No branches or pull requests

4 participants