Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question: Will TensorRT 9 be available in the 23.11 NGC container ? #3442

Closed
MatthieuToulemont opened this issue Nov 9, 2023 · 8 comments
Closed
Assignees
Labels
triaged Issue has been triaged by maintainers

Comments

@MatthieuToulemont
Copy link

Hello, thanks for all the great work !

Some of my models require bfloat16 at inference time, I saw it was added in TensorRT 9 with TensorRT-LLM, and I was wondering when TensorRT 9 will be available as the default version in the NGC containers.

Thank you,

@zerollzeng
Copy link
Collaborator

@nvpohanh ^ ^

@zerollzeng zerollzeng added the triaged Issue has been triaged by maintainers label Nov 12, 2023
@nvpohanh
Copy link
Collaborator

23.11 NGC container will still use TRT 8.6 since all the TRT 9.x releases are only recommended for LLM workloads. We will update the TRT version in the NGC container when there is a general TRT release for all use cases. Thanks

@MatthieuToulemont
Copy link
Author

MatthieuToulemont commented Nov 13, 2023

Thank you, do you know when bfloat16 will be added to TensorRT then ?

bloat16 has become so widespread in training that it feels unfortunate not to have it available on the fastest inference framework. Even more so when a model trained in bfloat16 does not work in float16. Happened to me a couple of times :/

@MatthieuToulemont
Copy link
Author

We will update the TRT version in the NGC container when there is a general TRT release for all use cases

Do you have an estimated released date ?

@david-PHR
Copy link

Having any guideline on how adapting a BF16 trained model for a TensorRT (version < 9) would be super useful in the meantime

@copasseron
Copy link

Any update on this ?

can we hope for the 24.02 NGC container ?

it's mainly related to this issue for my side:
#3124

@nvpohanh
Copy link
Collaborator

There won't be NGC container with TRT 9+ until Q2. Until then, please build the container using: https://github.com/NVIDIA/TensorRT/blob/v9.3.0/docker/ubuntu-22.04.Dockerfile

Thanks

@ttyio
Copy link
Collaborator

ttyio commented Apr 16, 2024

closing since no activity for more than 3 weeks, thanks all

@ttyio ttyio closed this as completed Apr 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
triaged Issue has been triaged by maintainers
Projects
None yet
Development

No branches or pull requests

6 participants