- Python 3
- TensorFlow 1.15
- NumPy
- scikit-learn
Export the model to ONNX and then modify the graph:

```shell
python convert2onnx_v2.py
python modify_onnx_gs.py
```
Build the OneHot plugin:

```shell
git clone https://github.com/NVIDIA/trt-samples-for-hackathon-cn.git
cd build
make
```

Then copy OnehotPlugin.so into the ConvBert folder.
Then generate the .trt engine file:

```shell
trtexec --onnx=ConvBert_onehot.onnx --plugins=OnehotPlugin.so --saveEngine=ConvBert_onehot.trt --verbose
```
Run the comparison script:

```shell
python test_tf_trt_infer.py
```
Measured inference times:

- TF execution time: 367.3338 ms
- TRT execution time: 9.16735 ms

Each value is an average over repeated inference runs. The TF number is likely over-estimated because it may include CPU time; proper profiling would be needed for a precise figure. The resulting speed-up ratio is 367.3338 / 9.16735 ≈ 40.07.
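The averaging and the speed-up arithmetic above can be sketched as follows. The `infer` callable here is a placeholder for a real inference call, not the actual API of `test_tf_trt_infer.py`:

```python
import time

def average_ms(infer, runs=30, warmup=5):
    """Average wall-clock latency of `infer` in milliseconds."""
    for _ in range(warmup):
        infer()  # warm-up runs are excluded from the measurement
    start = time.perf_counter()
    for _ in range(runs):
        infer()
    return (time.perf_counter() - start) / runs * 1e3

# Speed-up computed from the reported averages.
tf_ms, trt_ms = 367.3338, 9.16735
speedup = tf_ms / trt_ms
print(f"speed-up: {speedup:.2f}x")  # ~40x
```

Note that wall-clock timing like this includes host-side overhead, which is one reason the TF figure may be over-estimated relative to a profiler-based measurement.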
Pull a Docker image with the CUDA and TensorFlow environment:

```shell
sudo docker pull registry.cn-hangzhou.aliyuncs.com/hackathon-fighters/21.03-tf1-py3-trt:v1
```
Here are some great resources we benefited from:

- Codebase: our model codebase is based on ConvBERT.
- ConvBERT: the NeurIPS 2020 paper "ConvBERT: Improving BERT with Span-based Dynamic Convolution".
- Dynamic convolution: implementation from "Pay Less Attention with Lightweight and Dynamic Convolutions".