Release v0.9.0-beta
dsuthar-nvidia
released this
28 Jun 22:16
·
20 commits
to main
since this release
v0.9.0-beta
Release Highlights
CV-CUDA v0.9.0 includes the following changes:
-
New Features:
-
Improved Resize performance (up to 4x for u8 inputs, up to 3x for RGB8)
-
Improved performance of cubic interpolation, eg in Rotate, WarpAffine and WarpPerspective (up to 2x faster)
-
Added optional scaling to ResizeCropConvertReformat fused operator
-
Improved structure of Python documentation and optimized its generation (>5min to <30s) by removing the Exhale index
-
Added 64bit stride support to various operators
- limited to 32bit strides to avoid performance regressions: AdaptiveThreshold, AdvCvtColor, AverageBlur, BilateralFilter, BrightnessContrast, ColorTwist, BoxBlur, CenterCrop, ConvertTo, CopyMakeBorder, CustomCrop, GaussianNoise, Gaussian, Flip, HistogramEq, JointBilateralFilter, Laplacian, Morphology, Normalize, RandomResizedCrop, Reformat, Remap, Resize, Rotate, SIFT, WarpAffine, WarpPerspective
-
-
Bug Fixes:
- Added exception handling on CApi in Python: now forward C/C++exceptions to Python
- Fixed coordinate rounding bug in Resize operator with nearest neighbor interpolation
Compatibility and Known Limitations
- Documentation built on Ubuntu 20.04 needs an up-to-date version of sphinx (
pip install --upgrade sphinx
) as well as explicitly parsing the system's default python version./ci/build_docs path/to/build -DPYTHON_VERSIONS="<py_ver>"
. - Python bindings installed via Debian packages and Python tests fail with Numpy 2.0. We recommend using an older version of Numpy (e.g. 1.26) until we have implemented a fix.
- The Resize and RandomResizedCrop operators incorrectly interpolate pixel values near the boundary of an image or tensor when using linear and cubic interpolation. This will be fixed in an upcoming release.
See main README on CV-CUDA GitHub.
License
CV-CUDA is licensed under the Apache 2.0 license.
Resources
- CV-CUDA GitHub
- CV-CUDA Increasing Throughput and Reducing Costs for AI-Based Computer Vision with CV-CUDA
- NVIDIA Announces Microsoft, Tencent, Baidu Adopting CV-CUDA for Computer Vision AI
- CV-CUDA helps Tencent Cloud audio and video PaaS platform achieve full-process GPU acceleration for video enhancement AI
Acknowledgements
CV-CUDA is developed jointly by NVIDIA and the ByteDance Machine Learning team.