Releases: saadrahim/rocBLAS
rocBLAS-2.24.0 for ROCm 3.6.0
New Features
- Improvements to User Guide and Design Document
- L1 dot function optimized to utilize shuffle instructions ( improvements on bf16, f16, f32 data types )
- L1 dot function added x dot x optimized kernel
- Standardization of L1 rocblas-bench to use device pointer mode to focus on GPU memory bandwidth
- Adjustments for hipcc (hip-clang) compiler as standard build compiler and Centos8 support
- Added Fortran interface for all rocBLAS functions
Known Issues
None
rocBLAS-2.24.0 for ROCm 3.6.0
New Features
No new features
Known Issues
None
rocBLAS-2.24.0 for ROCm 3.6.0
New Features
- Improvements to User Guide and Design Document * L1 dot function optimized to utilize shuffle instructions ( improvements on bf16, f16, f32 data types ) * L1 dot function added x dot x optimized kernel * Standardization of L1 rocblas-bench to use device pointer mode to focus on GPU memory bandwidth * Adjustments for hipcc (hip-clang) compiler as standard build compiler and Centos8 support * Added Fortran interface for all rocBLAS functions
Known Issues
None
rocBLAS 3.5.0
testtag3.5.0 Updating to rocm2.2 for Jenkins Builds
rocBLAS-2.22.0 for ROCm 3.5.0
New Features
-
add geam complex, geam_batched, and geam_strided_batched
-
add dgmm, dgmm_batched, and dgmm_strided_batched
-
Optimized performance
- ger
- rocblas_sger, rocblas_dger
- rocblas_sger_batched, rocblas_dger_batched
- rocblas_sger_strided_batched, rocblas_dger_strided_batched
- geru
- rocblas_cgeru, rocblas_zgeru
- rocblas_cgeru_batched, rocblas_zgeru_batched
- rocblas_cgeru_strided_batched, rocblas_zgeru_strided_batched
- gerc
- rocblas_cgerc, rocblas_zgerc
- rocblas_cgerc_batched, rocblas_zgerc_batched
- rocblas_cgerc_strided_batched, rocblas_zgerc_strided_batched
- symv
- rocblas_ssymv, rocblas_dsymv, rocblas_csymv, rocblas_zsymv
- rocblas_ssymv_batched, rocblas_dsymv_batched, rocblas_csymv_batched, rocblas_zsymv_batched
- rocblas_ssymv_strided_batched, rocblas_dsymv_strided_batched, rocblas_csymv_strided_batched, rocblas_zsymv_strided_batched
- sbmv
- rocblas_ssbmv, rocblas_dsbmv
- rocblas_ssbmv_batched, rocblas_dsbmv_batched
- rocblas_ssbmv_strided_batched, rocblas_dsbmv_strided_batched
- spmv
- rocblas_sspmv, rocblas_dspmv
- rocblas_sspmv_batched, rocblas_dspmv_batched
- rocblas_sspmv_strided_batched, rocblas_dspmv_strided_batched
- ger
-
Improved documentation
-
Fix argument checking in functions to match legacy BLAS
-
Fixed conjugate-transpose version of geam
Known Issues
None
rocBLAS-2.22.0 for ROCm 3.5.0
New Features
-
add geam complex, geam_batched, and geam_strided_batched
-
add dgmm, dgmm_batched, and dgmm_strided_batched
-
Optimized performance
- ger
- rocblas_sger, rocblas_dger
- rocblas_sger_batched, rocblas_dger_batched
- rocblas_sger_strided_batched, rocblas_dger_strided_batched
- geru
- rocblas_cgeru, rocblas_zgeru
- rocblas_cgeru_batched, rocblas_zgeru_batched
- rocblas_cgeru_strided_batched, rocblas_zgeru_strided_batched
- gerc
- rocblas_cgerc, rocblas_zgerc
- rocblas_cgerc_batched, rocblas_zgerc_batched
- rocblas_cgerc_strided_batched, rocblas_zgerc_strided_batched
- symv
- rocblas_ssymv, rocblas_dsymv, rocblas_csymv, rocblas_zsymv
- rocblas_ssymv_batched, rocblas_dsymv_batched, rocblas_csymv_batched, rocblas_zsymv_batched
- rocblas_ssymv_strided_batched, rocblas_dsymv_strided_batched, rocblas_csymv_strided_batched, rocblas_zsymv_strided_batched
- sbmv
- rocblas_ssbmv, rocblas_dsbmv
- rocblas_ssbmv_batched, rocblas_dsbmv_batched
- rocblas_ssbmv_strided_batched, rocblas_dsbmv_strided_batched
- spmv
- rocblas_sspmv, rocblas_dspmv
- rocblas_sspmv_batched, rocblas_dspmv_batched
- rocblas_sspmv_strided_batched, rocblas_dspmv_strided_batched
- ger
-
Improved documentation
-
Fix argument checking in functions to match legacy BLAS
-
Fixed conjugate-transpose version of geam
Known Issues
None
rocBLAS-2.22.0 for ROCm 3.5.0
New Features
-
add geam complex, geam_batched, and geam_strided_batched
-
add dgmm, dgmm_batched, and dgmm_strided_batched
-
Optimized performance
- ger
- rocblas_sger, rocblas_dger
- rocblas_sger_batched, rocblas_dger_batched
- rocblas_sger_strided_batched, rocblas_dger_strided_batched
- geru
- rocblas_cgeru, rocblas_zgeru
- rocblas_cgeru_batched, rocblas_zgeru_batched
- rocblas_cgeru_strided_batched, rocblas_zgeru_strided_batched
- gerc
- rocblas_cgerc, rocblas_zgerc
- rocblas_cgerc_batched, rocblas_zgerc_batched
- rocblas_cgerc_strided_batched, rocblas_zgerc_strided_batched
- symv
- rocblas_ssymv, rocblas_dsymv, rocblas_csymv, rocblas_zsymv
- rocblas_ssymv_batched, rocblas_dsymv_batched, rocblas_csymv_batched, rocblas_zsymv_batched
- rocblas_ssymv_strided_batched, rocblas_dsymv_strided_batched, rocblas_csymv_strided_batched, rocblas_zsymv_strided_batched
- sbmv
- rocblas_ssbmv, rocblas_dsbmv
- rocblas_ssbmv_batched, rocblas_dsbmv_batched
- rocblas_ssbmv_strided_batched, rocblas_dsbmv_strided_batched
- spmv
- rocblas_sspmv, rocblas_dspmv
- rocblas_sspmv_batched, rocblas_dspmv_batched
- rocblas_sspmv_strided_batched, rocblas_dspmv_strided_batched
- ger
-
Improved documentation
-
Fix argument checking in functions to match legacy BLAS
-
Fixed conjugate-transpose version of geam
Known Issues
None
rocBLAS-2.22.0 for ROCm 3.5.0
New Features
Known Issues
rocBLAS-2.22.0 for ROCm 3.5.0
New Features
Known Issues
rocBLAS-2.22.0 for ROCm 3.5.0
test_tag_3.5.0_ver2 Updating to rocm2.2 for Jenkins Builds