
Releases: saadrahim/rocBLAS

rocBLAS-2.24.0 for ROCm 3.6.0

10 Jul 23:03

New Features

  • Improvements to User Guide and Design Document
  • L1 dot function optimized to utilize shuffle instructions (improvements on bf16, f16, f32 data types)
  • Added an optimized kernel to the L1 dot function for the x dot x case
  • Standardization of L1 rocblas-bench to use device pointer mode to focus on GPU memory bandwidth
  • Adjustments for hipcc (hip-clang) compiler as standard build compiler and CentOS 8 support
  • Added Fortran interface for all rocBLAS functions

Known Issues

None
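
The shuffle-based dot reduction mentioned above can be illustrated with a small CPU simulation. This is only a sketch of the general shuffle-down reduction pattern, not the rocBLAS kernel: the 64-lane width, the `shfl_down` stand-in, and the name `wavefront_dot` are assumptions made for illustration. Each simulated lane accumulates a strided partial sum, and the lanes are then combined in log2(64) = 6 steps without going through shared memory.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <vector>

// Assumed wavefront width for this sketch.
constexpr std::size_t kLanes = 64;

// CPU stand-in for a shuffle-down: every lane reads the register held by
// lane + offset. In this sketch, a lane whose source would be out of range
// keeps its own value.
static std::array<float, kLanes> shfl_down(const std::array<float, kLanes>& regs,
                                           std::size_t offset)
{
    std::array<float, kLanes> out{};
    for(std::size_t lane = 0; lane < kLanes; ++lane)
        out[lane] = (lane + offset < kLanes) ? regs[lane + offset] : regs[lane];
    return out;
}

// Hypothetical helper: dot product of x and y computed the way one
// wavefront would, with per-lane partial sums followed by a tree reduction.
float wavefront_dot(const std::vector<float>& x, const std::vector<float>& y)
{
    assert(x.size() == y.size());

    // Step 1: each lane accumulates the elements it would own (stride kLanes).
    std::array<float, kLanes> acc{};
    for(std::size_t i = 0; i < x.size(); ++i)
        acc[i % kLanes] += x[i] * y[i];

    // Step 2: halve the offset each round; after the last round lane 0
    // holds the sum over all lanes.
    for(std::size_t offset = kLanes / 2; offset > 0; offset /= 2)
    {
        const std::array<float, kLanes> shifted = shfl_down(acc, offset);
        for(std::size_t lane = 0; lane < kLanes; ++lane)
            acc[lane] += shifted[lane];
    }
    return acc[0];
}
```

Passing the same vector as both arguments, `wavefront_dot(x, x)`, corresponds to the x dot x case listed above. A dedicated kernel for that case presumably avoids loading the vector twice, though the release notes do not say how it is implemented.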

rocBLAS-2.24.0 for ROCm 3.6.0

10 Jul 22:58

New Features

No new features

Known Issues

None

rocBLAS-2.24.0 for ROCm 3.6.0

10 Jul 22:55

New Features

  • Improvements to User Guide and Design Document
  • L1 dot function optimized to utilize shuffle instructions (improvements on bf16, f16, f32 data types)
  • Added an optimized kernel to the L1 dot function for the x dot x case
  • Standardization of L1 rocblas-bench to use device pointer mode to focus on GPU memory bandwidth
  • Adjustments for hipcc (hip-clang) compiler as standard build compiler and CentOS 8 support
  • Added Fortran interface for all rocBLAS functions

Known Issues

None

rocBLAS 3.5.0

29 May 17:33
testtag3.5.0

Updating to rocm2.2 for Jenkins Builds

rocBLAS-2.22.0 for ROCm 3.5.0

01 Jun 19:02

New Features

  • Added geam complex, geam_batched, and geam_strided_batched

  • Added dgmm, dgmm_batched, and dgmm_strided_batched

  • Optimized performance

    • ger
      • rocblas_sger, rocblas_dger
      • rocblas_sger_batched, rocblas_dger_batched
      • rocblas_sger_strided_batched, rocblas_dger_strided_batched
    • geru
      • rocblas_cgeru, rocblas_zgeru
      • rocblas_cgeru_batched, rocblas_zgeru_batched
      • rocblas_cgeru_strided_batched, rocblas_zgeru_strided_batched
    • gerc
      • rocblas_cgerc, rocblas_zgerc
      • rocblas_cgerc_batched, rocblas_zgerc_batched
      • rocblas_cgerc_strided_batched, rocblas_zgerc_strided_batched
    • symv
      • rocblas_ssymv, rocblas_dsymv, rocblas_csymv, rocblas_zsymv
      • rocblas_ssymv_batched, rocblas_dsymv_batched, rocblas_csymv_batched, rocblas_zsymv_batched
      • rocblas_ssymv_strided_batched, rocblas_dsymv_strided_batched, rocblas_csymv_strided_batched, rocblas_zsymv_strided_batched
    • sbmv
      • rocblas_ssbmv, rocblas_dsbmv
      • rocblas_ssbmv_batched, rocblas_dsbmv_batched
      • rocblas_ssbmv_strided_batched, rocblas_dsbmv_strided_batched
    • spmv
      • rocblas_sspmv, rocblas_dspmv
      • rocblas_sspmv_batched, rocblas_dspmv_batched
      • rocblas_sspmv_strided_batched, rocblas_dspmv_strided_batched
  • Improved documentation

  • Fixed argument checking in functions to match legacy BLAS

  • Fixed conjugate-transpose version of geam

Known Issues

None
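
For readers unfamiliar with the geam and dgmm extensions listed in this release, the following CPU reference sketches what they compute: geam forms C = alpha * op(A) + beta * op(B), and dgmm multiplies a matrix by a diagonal matrix built from a vector x, on the left or the right. The names `geam_ref`, `dgmm_ref`, `Op`, and `Side` are hypothetical and are not rocBLAS identifiers. The sketch assumes column-major storage and a unit stride for x, and it is not how the library implements these routines on the GPU.

```cpp
#include <cassert>
#include <complex>
#include <cstddef>
#include <vector>

// Hypothetical names for illustration only.
enum class Op { None, Transpose, ConjTranspose };
enum class Side { Left, Right };

using cfloat = std::complex<float>;

// Element (i, j) of op(M), with M stored column-major, leading dimension ld.
static cfloat op_elem(const std::vector<cfloat>& M, std::size_t ld, Op op,
                      std::size_t i, std::size_t j)
{
    if(op == Op::None)
        return M[i + j * ld];
    const cfloat v = M[j + i * ld]; // transposed access
    return op == Op::ConjTranspose ? std::conj(v) : v;
}

// geam reference: C (m x n) = alpha * op(A) + beta * op(B).
void geam_ref(Op opA, Op opB, std::size_t m, std::size_t n,
              cfloat alpha, const std::vector<cfloat>& A, std::size_t lda,
              cfloat beta, const std::vector<cfloat>& B, std::size_t ldb,
              std::vector<cfloat>& C, std::size_t ldc)
{
    assert(ldc >= m && C.size() >= ldc * n);
    for(std::size_t j = 0; j < n; ++j)
        for(std::size_t i = 0; i < m; ++i)
            C[i + j * ldc] = alpha * op_elem(A, lda, opA, i, j)
                           + beta * op_elem(B, ldb, opB, i, j);
}

// dgmm reference: Side::Right gives C = A * diag(x) (scales columns),
// Side::Left gives C = diag(x) * A (scales rows). A and C are m x n.
void dgmm_ref(Side side, std::size_t m, std::size_t n,
              const std::vector<cfloat>& A, std::size_t lda,
              const std::vector<cfloat>& x,
              std::vector<cfloat>& C, std::size_t ldc)
{
    assert(lda >= m && ldc >= m && C.size() >= ldc * n);
    assert(x.size() >= (side == Side::Left ? m : n));
    for(std::size_t j = 0; j < n; ++j)
        for(std::size_t i = 0; i < m; ++i)
            C[i + j * ldc] = A[i + j * lda] * (side == Side::Left ? x[i] : x[j]);
}
```

Calling `geam_ref` with `Op::ConjTranspose`, alpha = 1, and beta = 0 yields the conjugate transpose of A, which is the case the geam fix in this release refers to.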

rocBLAS-2.22.0 for ROCm 3.5.0

01 Jun 18:59

New Features

  • Added geam complex, geam_batched, and geam_strided_batched

  • Added dgmm, dgmm_batched, and dgmm_strided_batched

  • Optimized performance

    • ger
      • rocblas_sger, rocblas_dger
      • rocblas_sger_batched, rocblas_dger_batched
      • rocblas_sger_strided_batched, rocblas_dger_strided_batched
    • geru
      • rocblas_cgeru, rocblas_zgeru
      • rocblas_cgeru_batched, rocblas_zgeru_batched
      • rocblas_cgeru_strided_batched, rocblas_zgeru_strided_batched
    • gerc
      • rocblas_cgerc, rocblas_zgerc
      • rocblas_cgerc_batched, rocblas_zgerc_batched
      • rocblas_cgerc_strided_batched, rocblas_zgerc_strided_batched
    • symv
      • rocblas_ssymv, rocblas_dsymv, rocblas_csymv, rocblas_zsymv
      • rocblas_ssymv_batched, rocblas_dsymv_batched, rocblas_csymv_batched, rocblas_zsymv_batched
      • rocblas_ssymv_strided_batched, rocblas_dsymv_strided_batched, rocblas_csymv_strided_batched, rocblas_zsymv_strided_batched
    • sbmv
      • rocblas_ssbmv, rocblas_dsbmv
      • rocblas_ssbmv_batched, rocblas_dsbmv_batched
      • rocblas_ssbmv_strided_batched, rocblas_dsbmv_strided_batched
    • spmv
      • rocblas_sspmv, rocblas_dspmv
      • rocblas_sspmv_batched, rocblas_dspmv_batched
      • rocblas_sspmv_strided_batched, rocblas_dspmv_strided_batched
  • Improved documentation

  • Fixed argument checking in functions to match legacy BLAS

  • Fixed conjugate-transpose version of geam

Known Issues

None

rocBLAS-2.22.0 for ROCm 3.5.0

01 Jun 17:53

New Features

  • Added geam complex, geam_batched, and geam_strided_batched

  • Added dgmm, dgmm_batched, and dgmm_strided_batched

  • Optimized performance

    • ger
      • rocblas_sger, rocblas_dger
      • rocblas_sger_batched, rocblas_dger_batched
      • rocblas_sger_strided_batched, rocblas_dger_strided_batched
    • geru
      • rocblas_cgeru, rocblas_zgeru
      • rocblas_cgeru_batched, rocblas_zgeru_batched
      • rocblas_cgeru_strided_batched, rocblas_zgeru_strided_batched
    • gerc
      • rocblas_cgerc, rocblas_zgerc
      • rocblas_cgerc_batched, rocblas_zgerc_batched
      • rocblas_cgerc_strided_batched, rocblas_zgerc_strided_batched
    • symv
      • rocblas_ssymv, rocblas_dsymv, rocblas_csymv, rocblas_zsymv
      • rocblas_ssymv_batched, rocblas_dsymv_batched, rocblas_csymv_batched, rocblas_zsymv_batched
      • rocblas_ssymv_strided_batched, rocblas_dsymv_strided_batched, rocblas_csymv_strided_batched, rocblas_zsymv_strided_batched
    • sbmv
      • rocblas_ssbmv, rocblas_dsbmv
      • rocblas_ssbmv_batched, rocblas_dsbmv_batched
      • rocblas_ssbmv_strided_batched, rocblas_dsbmv_strided_batched
    • spmv
      • rocblas_sspmv, rocblas_dspmv
      • rocblas_sspmv_batched, rocblas_dspmv_batched
      • rocblas_sspmv_strided_batched, rocblas_dspmv_strided_batched
  • Improved documentation

  • Fixed argument checking in functions to match legacy BLAS

  • Fixed conjugate-transpose version of geam

Known Issues

None

rocBLAS-2.22.0 for ROCm 3.5.0

01 Jun 17:50

New Features

Known Issues

rocBLAS-2.22.0 for ROCm 3.5.0

29 May 17:40

New Features

Known Issues

rocBLAS-2.22.0 for ROCm 3.5.0

29 May 17:37
test_tag_3.5.0_ver2

Updating to rocm2.2 for Jenkins Builds