Description
I am using tritonserver:23.10 to deploy the RT-DETR model. ONNX Runtime FP32, ONNX Runtime FP16, and TensorRT 8.6.1 FP32/FP16 on a Tesla V100 all produce correct results. On a Tesla A10, however, TensorRT 8.6.1 produces correct results in FP32 but wrong results in FP16. With the same code, the FP16 results should be the same on the V100 and the A10.
[screenshot] A10 - TRT - FP16
[screenshot] V100 - TRT - FP16
Environment
TensorRT Version: 8.6.1
NVIDIA GPU: Tesla V100 / A10
NVIDIA Driver Version: 515.65.01
CUDA Version: 12.2
CUDNN Version: v8
Operating System: Ubuntu 22.04
Python Version (if applicable): 3.10
Tensorflow Version (if applicable):
PyTorch Version (if applicable): 2.0.1
Baremetal or Container (if so, version): tritonserver 23.10
Relevant Files
https://github.com/lyuwenyu/RT-DETR
Steps To Reproduce
/usr/src/tensorrt/bin/trtexec --onnx=model.onnx --saveEngine=model.plan --fp16
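A common cause of correct-FP32/wrong-FP16 behavior is intermediate overflow: FP16's largest finite value is about 65504, and different GPUs (V100 vs. A10) can select different kernels and layer fusions, so an intermediate activation may overflow on one device but not the other. A minimal numpy sketch of the failure mode (illustrative only, not taken from the model above):

```python
import numpy as np

# FP16 tops out near 65504; an FP32 intermediate above that
# overflows to inf as soon as a layer runs in FP16.
x = np.float32(70000.0)
print(np.isfinite(x))           # True: fits comfortably in FP32
print(np.isinf(np.float16(x)))  # True: overflows in FP16

# Once inf appears, common ops (e.g. subtracting the max in a
# softmax) turn it into NaN, which then propagates downstream.
y = np.float16(x) - np.float16(x)
print(np.isnan(y))              # True
```

If this is the cause, keeping the sensitive layers in FP32 should restore accuracy; TensorRT 8.6's trtexec exposes `--precisionConstraints` and `--layerPrecisions` for that purpose (which layers to pin would have to be found by inspecting where the outputs first diverge).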