{2023.06}[foss/2023a] OSU-Microbenchmarks v7.2 w/ CUDA 12.1.1 (rebuild) #716

Merged

eessi-bot bot commented Sep 18, 2024

Instance eessi-bot-mc-aws is configured to build for:

  • architectures: x86_64/generic, x86_64/intel/haswell, x86_64/intel/skylake_avx512, x86_64/amd/zen2, x86_64/amd/zen3, aarch64/generic, aarch64/neoverse_n1, aarch64/neoverse_v1
  • repositories: eessi-hpc.org-2023.06-compat, eessi-hpc.org-2023.06-software, eessi.io-2023.06-software, eessi.io-2023.06-compat

eessi-bot bot commented Sep 18, 2024

Instance eessi-bot-mc-azure is configured to build for:

  • architectures: x86_64/amd/zen4
  • repositories: eessi-hpc.org-2023.06-compat, eessi.io-2023.06-compat, eessi-hpc.org-2023.06-software, eessi.io-2023.06-software

Instance boegel-bot-deucalion is configured to build for:

  • architectures: aarch64/a64fx
  • repositories: eessi.io-2023.06-software

@casparvl
Collaborator Author

bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen2 accel:nvidia/cc80

Updates by the bot instance boegel-bot-deucalion (click for details)
  • account casparvl has NO permission to send commands to the bot

eessi-bot bot commented Sep 19, 2024

Updates by the bot instance eessi-bot-mc-aws (click for details)
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen2 accel:nvidia/cc80 from casparvl

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen2 accelerator:nvidia/cc80
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen2 accelerator:nvidia/cc80 resulted in:

eessi-bot bot commented Sep 19, 2024

Updates by the bot instance eessi-bot-mc-azure (click for details)
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen2 accel:nvidia/cc80 from casparvl

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen2 accelerator:nvidia/cc80
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen2 accelerator:nvidia/cc80 resulted in:

    • no jobs were submitted
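The "expanded format" lines in the bot replies above show short argument keys (`repo`, `arch`, `accel`) being rewritten to their long forms. A minimal sketch of that mapping follows; `expand_bot_command` and `ABBREVIATIONS` are hypothetical names for illustration, not the EESSI bot's actual code.

```python
# Hypothetical helper, for illustration only (not the EESSI bot's implementation).
# The three key pairs are the ones visible in the bot replies in this thread.
ABBREVIATIONS = {
    "repo": "repository",
    "arch": "architecture",
    "accel": "accelerator",
}


def expand_bot_command(line: str) -> str:
    """Expand short argument keys in a 'bot: <action> key:value ...' line."""
    prefix = "bot:"
    if not line.startswith(prefix):
        raise ValueError("not a bot command")
    # First token after the prefix is the action, the rest are key:value pairs.
    action, *args = line[len(prefix):].split()
    expanded = []
    for arg in args:
        # Split on the first ':' only, so values may contain ':' themselves.
        key, _, value = arg.partition(":")
        expanded.append(f"{ABBREVIATIONS.get(key, key)}:{value}")
    return " ".join([action, *expanded])


print(expand_bot_command(
    "bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen2 accel:nvidia/cc80"
))
```

Unknown keys are passed through unchanged in this sketch; whether the real bot rejects them is not shown in this thread.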

eessi-bot bot commented Sep 19, 2024

New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-amd-zen2 and accelerator nvidia/cc80 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.09/pr_716/19106

date job status comment
Sep 19 07:19:04 UTC 2024 submitted job id 19106 awaits release by job manager
Sep 19 07:19:47 UTC 2024 released job awaits launch by Slurm scheduler
Sep 19 07:26:02 UTC 2024 running job 19106 is running
Sep 19 07:31:26 UTC 2024 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-19106.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen2-1726730776.tar.gz
size: 0 MiB (45 bytes)
entries: 0
modules under 2023.06/software/linux/x86_64/amd/zen2/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/amd/zen2/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/amd/zen2
no other files in tarball
Sep 19 07:31:26 UTC 2024 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 9/9 test case(s) from 9 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-19106.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
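The build checks listed under Details for job 19106 are text matches against the Slurm output file. A minimal sketch of that kind of check is given below; the pattern strings are copied from the report, while the function names, the sample logs, and the way the patterns are applied are assumptions, since the bot's implementation is not shown in this thread.

```python
import re

# Patterns copied from the bot report; treating them as regular expressions
# searched anywhere in the log is an assumption.
MUST_NOT_MATCH = [r"ERROR:", r"FAILED:", r"required modules missing:"]
MUST_MATCH = [r"No missing installations", r"\.tar\.gz created!"]


def check_build_log(text: str) -> dict:
    """Return one boolean per check, keyed by a human-readable description."""
    results = {}
    for pattern in MUST_NOT_MATCH:
        results[f"no message matching {pattern}"] = re.search(pattern, text) is None
    for pattern in MUST_MATCH:
        results[f"found message matching {pattern}"] = re.search(pattern, text) is not None
    return results


def build_succeeded(text: str) -> bool:
    """A build counts as successful only if every individual check passes."""
    return all(check_build_log(text).values())


# Hypothetical log excerpts, not taken from slurm-19106.out.
good_log = "No missing installations\n/tmp/example.tar.gz created!\n"
bad_log = good_log + "ERROR: something went wrong\n"

print(build_succeeded(good_log), build_succeeded(bad_log))
```

Checks of this kind only look at log text, which is consistent with job 19106 being reported as SUCCESS even though its tarball has 0 entries.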

@casparvl
Collaborator Author

bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen2 accel:nvidia/cc80

eessi-bot bot commented Sep 19, 2024

Updates by the bot instance eessi-bot-mc-aws (click for details)
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen2 accel:nvidia/cc80 from casparvl

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen2 accelerator:nvidia/cc80
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen2 accelerator:nvidia/cc80 resulted in:

Updates by the bot instance boegel-bot-deucalion (click for details)
  • account casparvl has NO permission to send commands to the bot

eessi-bot bot commented Sep 19, 2024

Updates by the bot instance eessi-bot-mc-azure (click for details)
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen2 accel:nvidia/cc80 from casparvl

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen2 accelerator:nvidia/cc80
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen2 accelerator:nvidia/cc80 resulted in:

    • no jobs were submitted

eessi-bot bot commented Sep 19, 2024

New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-amd-zen2 and accelerator nvidia/cc80 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.09/pr_716/19114

date job status comment
Sep 19 08:46:01 UTC 2024 submitted job id 19114 awaits release by job manager
Sep 19 08:46:23 UTC 2024 released job awaits launch by Slurm scheduler
Sep 19 08:47:31 UTC 2024 running job 19114 is running
Sep 19 08:57:00 UTC 2024 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-19114.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen2-1726735836.tar.gz
size: 3 MiB (3524173 bytes)
entries: 95
modules under 2023.06/software/linux/x86_64/amd/zen2/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/amd/zen2/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/amd/zen2
accel/nvidia/cc80/modules/all/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1.lua
accel/nvidia/cc80/modules/perf/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1.lua
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/easybuild/
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/easybuild/easybuild-OSU-Micro-Benchmarks-7.2-20240919.084849.log.bz2
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/easybuild/easybuild-OSU-Micro-Benchmarks-7.2-20240919.084849_test_report.md
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/easybuild/OSU-Micro-Benchmarks-7.2-gompi-2023a-CUDA-12.1.1-easybuild-devel
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/easybuild/OSU-Micro-Benchmarks-7.2-gompi-2023a-CUDA-12.1.1.eb
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/easybuild/reprod/
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/easybuild/reprod/easyblocks/
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/easybuild/reprod/easyblocks/configuremake.py
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/easybuild/reprod/hooks/
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/easybuild/reprod/hooks/eb_hooks.py
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/easybuild/reprod/OSU-Micro-Benchmarks-7.2-gompi-2023a-CUDA-12.1.1.eb
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/easybuild/reprod/OSU-Micro-Benchmarks-7.2-gompi-2023a-CUDA-12.1.1.env
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/get_local_rank
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_allgather
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_allgatherv
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_allreduce
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_alltoall
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_alltoallv
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_alltoallw
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_barrier
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_bcast
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_gather
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_gatherv
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_iallgather
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_iallgatherv
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_iallreduce
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_ialltoall
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_ialltoallv
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_ialltoallw
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_ibarrier
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_ibcast
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_igather
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_igatherv
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_ineighbor_allgather
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_ineighbor_allgatherv
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_ineighbor_alltoall
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_ineighbor_alltoallv
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_ineighbor_alltoallw
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_ireduce
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_ireduce_scatter
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_iscatter
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_iscatterv
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_neighbor_allgather
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_neighbor_allgatherv
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_neighbor_alltoall
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_neighbor_alltoallv
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_neighbor_alltoallw
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_reduce
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_reduce_scatter
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_scatter
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/collective/osu_scatterv
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/one-sided/
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/one-sided/osu_acc_latency
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/one-sided/osu_cas_latency
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/one-sided/osu_fop_latency
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/one-sided/osu_get_acc_latency
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/one-sided/osu_get_bw
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/one-sided/osu_get_latency
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/one-sided/osu_put_bibw
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/one-sided/osu_put_bw
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/one-sided/osu_put_latency
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/pt2pt/
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/pt2pt/osu_bibw
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/pt2pt/osu_bw
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/pt2pt/osu_latency
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/pt2pt/osu_latency_mp
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/pt2pt/osu_latency_mt
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/pt2pt/osu_mbw_mr
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/pt2pt/osu_multi_lat
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/pt2pt/persistent/
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/pt2pt/persistent/osu_bibw_persistent
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/pt2pt/persistent/osu_bw_persistent
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/pt2pt/persistent/osu_latency_persistent
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/startup/
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/startup/osu_hello
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/startup/osu_init
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/nccl/
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/nccl/collective/
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/nccl/collective/osu_nccl_allgather
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/nccl/collective/osu_nccl_allreduce
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/nccl/collective/osu_nccl_alltoall
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/nccl/collective/osu_nccl_bcast
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/nccl/collective/osu_nccl_reduce
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/nccl/collective/osu_nccl_reduce_scatter
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/nccl/pt2pt/
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/nccl/pt2pt/osu_nccl_bibw
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/nccl/pt2pt/osu_nccl_bw
accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/nccl/pt2pt/osu_nccl_latency
Sep 19 08:57:00 UTC 2024 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 9/9 test case(s) from 9 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-19114.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
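The report for job 19114 lists every `accel/nvidia/cc80/...` file as "other", whereas the reports for jobs 20007 and 20008 further down list the same installation under modules and software. One way to read this, which is an assumption and not confirmed in this thread, is that entries are grouped relative to a prefix and the prefix changed to include `accel/nvidia/cc80`. The sketch below illustrates that grouping with a hypothetical `classify_entries` function.

```python
def classify_entries(entries, prefix):
    """Group tarball member paths by their location relative to `prefix`.

    Hypothetical illustration only; the bot's real grouping logic is not
    shown in this thread.
    """
    modules, software, other = set(), set(), set()
    for path in entries:
        if not path.startswith(prefix + "/"):
            continue
        rel = path[len(prefix) + 1:]
        if rel.startswith("modules/all/"):
            modules.add(rel[len("modules/all/"):])
        elif rel.startswith("software/"):
            # Keep only <name>/<version>, as in the reports.
            software.add("/".join(rel.split("/")[1:3]))
        else:
            other.add(rel)
    return {
        "modules": sorted(modules),
        "software": sorted(software),
        "other": sorted(other),
    }


cpu_prefix = "2023.06/software/linux/x86_64/amd/zen2"
accel_prefix = cpu_prefix + "/accel/nvidia/cc80"
pkg = "OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1"
entries = [
    f"{accel_prefix}/modules/all/{pkg}.lua",
    f"{accel_prefix}/software/{pkg}/libexec/osu-micro-benchmarks/mpi/pt2pt/osu_bw",
]

print(classify_entries(entries, cpu_prefix))    # everything lands in "other"
print(classify_entries(entries, accel_prefix))  # module and software are recognised
```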

boegel added the 2023.06-software.eessi.io label (2023.06 version of software.eessi.io) on Sep 25, 2024
boegel changed the title from "{2023.06}[foss/2023a] OSU-Microbenchmarks v7.2 w/ CUDA 12.1.1" to "{2023.06}[foss/2023a] OSU-Microbenchmarks v7.2 w/ CUDA 12.1.1 (rebuild)" on Sep 25, 2024
@casparvl
Collaborator Author

bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen2 accel:nvidia/cc80
bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3 accel:nvidia/cc80

Updates by the bot instance boegel-bot-deucalion (click for details)
  • account casparvl has NO permission to send commands to the bot

eessi-bot bot commented Sep 26, 2024

Updates by the bot instance eessi-bot-mc-aws (click for details)
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen2 accel:nvidia/cc80 from casparvl

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen2 accelerator:nvidia/cc80
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3 accel:nvidia/cc80 from casparvl

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3 accelerator:nvidia/cc80
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen2 accelerator:nvidia/cc80 resulted in:

  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3 accelerator:nvidia/cc80 resulted in:

eessi-bot bot commented Sep 26, 2024

Updates by the bot instance eessi-bot-mc-azure (click for details)
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen2 accel:nvidia/cc80 from casparvl

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen2 accelerator:nvidia/cc80
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/amd/zen3 accel:nvidia/cc80 from casparvl

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3 accelerator:nvidia/cc80
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen2 accelerator:nvidia/cc80 resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen3 accelerator:nvidia/cc80 resulted in:

    • no jobs were submitted

eessi-bot bot commented Sep 26, 2024

New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-amd-zen2 and accelerator nvidia/cc80 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.09/pr_716/20007

date job status comment
Sep 26 11:08:51 UTC 2024 submitted job id 20007 awaits release by job manager
Sep 26 11:09:06 UTC 2024 released job awaits launch by Slurm scheduler
Sep 26 11:15:19 UTC 2024 running job 20007 is running
Sep 26 11:34:56 UTC 2024 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-20007.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen2-1727349726.tar.gz
size: 3 MiB (3524501 bytes)
entries: 95
modules under 2023.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc80/modules/all
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1.lua
software under 2023.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc80/software
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1
other under 2023.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc80
no other files in tarball
Sep 26 11:34:56 UTC 2024 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 9/9 test case(s) from 9 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-20007.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
Sep 26 14:22:12 UTC 2024 uploaded transfer of eessi-2023.06-software-linux-x86_64-amd-zen2-1727349726.tar.gz to S3 bucket succeeded

eessi-bot bot commented Sep 26, 2024

New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-amd-zen3 and accelerator nvidia/cc80 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.09/pr_716/20008

date job status comment
Sep 26 11:08:55 UTC 2024 submitted job id 20008 awaits release by job manager
Sep 26 11:09:09 UTC 2024 released job awaits launch by Slurm scheduler
Sep 26 11:10:11 UTC 2024 running job 20008 is running
Sep 26 11:25:45 UTC 2024 finished
😁 SUCCESS (click triangle for details)
Details
✅ job output file slurm-20008.out
✅ no message matching ERROR:
✅ no message matching FAILED:
✅ no message matching required modules missing:
✅ found message(s) matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen3-1727349308.tar.gz
size: 3 MiB (3526552 bytes)
entries: 95
modules under 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/modules/all
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1.lua
software under 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/software
OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1
other under 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80
no other files in tarball
Sep 26 11:25:45 UTC 2024 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 9/9 test case(s) from 9 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-20008.out
✅ no message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
Sep 26 14:22:31 UTC 2024 uploaded transfer of eessi-2023.06-software-linux-x86_64-amd-zen3-1727349308.tar.gz to S3 bucket succeeded

@casparvl
Collaborator Author

For zen2:

[casparvl@login1 20007]$ readelf -d 2023.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/pt2pt/osu_latency | grep RPATH | grep CUDA
 0x000000000000000f (RPATH)              Library rpath: [/cvmfs/software.eessi.io/host_injections/2023.06/software/linux/x86_64/amd/zen2/rpath_overrides/OpenMPI/system/lib:/cvmfs/software.eessi.io/host_injections/2023.06/software/linux/x86_64/amd/zen2/rpath_overrides/OpenMPI/system/lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/lib64:$ORIGIN:$ORIGIN/../lib:$ORIGIN/../lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc80/software/NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc80/software/CUDA/12.1.1/lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc80/software/UCC-CUDA/1.2.0-GCCcore-12.3.0-CUDA-12.1.1/lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/GCCcore/12.3.0/lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/GCCcore/12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/OpenMPI/4.1.5-GCC-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/hwloc/2.9.1-GCCcore-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/libevent/2.1.12-GCCcore-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/GDRCopy/2.3.1-GCCcore-12.3.0/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/UCC/1.2.0-GCCcore-12.3.0/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/PMIx/4.2.4-GCCcore-12.3.0/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/so
ftware/linux/x86_64/amd/zen2/software/libfabric/1.18.0-GCCcore-12.3.0/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/UCX/1.14.1-GCCcore-12.3.0/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/OpenSSL/1.1/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/libpciaccess/0.17-GCCcore-12.3.0/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/libxml2/2.11.4-GCCcore-12.3.0/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/numactl/2.0.16-GCCcore-12.3.0/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/GCCcore/12.3.0/lib/gcc/x86_64-pc-linux-gnu/12.3.0:/cvmfs/software.eessi.io/versions/2023.06/compat/linux/x86_64/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/compat/linux/x86_64/usr/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/compat/linux/x86_64/lib:/cvmfs/software.eessi.io/versions/2023.06/compat/linux/x86_64/usr/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/GDRCopy/2.3.1-GCCcore-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/UCC/1.2.0-GCCcore-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/PMIx/4.2.4-GCCcore-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/libfabric/1.18.0-GCCcore-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/UCX/1.14.1-GCCcore-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/OpenSSL/1.1/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/libpciaccess/0.17-GCCcore-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/libxml2/2.11.4-GCCcore
-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/numactl/2.0.16-GCCcore-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/accel/nvidia/cc80/software/CUDA/12.1.1/lib]

For zen3:

$ readelf -d 2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/libexec/osu-micro-benchmarks/mpi/pt2pt/osu_latency | grep RPATH | grep CUDA
 0x000000000000000f (RPATH)              Library rpath: [/cvmfs/software.eessi.io/host_injections/2023.06/software/linux/x86_64/amd/zen3/rpath_overrides/OpenMPI/system/lib:/cvmfs/software.eessi.io/host_injections/2023.06/software/linux/x86_64/amd/zen3/rpath_overrides/OpenMPI/system/lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/software/OSU-Micro-Benchmarks/7.2-gompi-2023a-CUDA-12.1.1/lib64:$ORIGIN:$ORIGIN/../lib:$ORIGIN/../lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/software/NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/software/CUDA/12.1.1/lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/software/UCC-CUDA/1.2.0-GCCcore-12.3.0-CUDA-12.1.1/lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/GCCcore/12.3.0/lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/GCCcore/12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/OpenMPI/4.1.5-GCC-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/hwloc/2.9.1-GCCcore-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/libevent/2.1.12-GCCcore-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/GDRCopy/2.3.1-GCCcore-12.3.0/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/UCC/1.2.0-GCCcore-12.3.0/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/PMIx/4.2.4-GCCcore-12.3.0/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/so
ftware/linux/x86_64/amd/zen3/software/libfabric/1.18.0-GCCcore-12.3.0/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/UCX/1.14.1-GCCcore-12.3.0/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/OpenSSL/1.1/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/libpciaccess/0.17-GCCcore-12.3.0/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/libxml2/2.11.4-GCCcore-12.3.0/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/numactl/2.0.16-GCCcore-12.3.0/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/GCCcore/12.3.0/lib/gcc/x86_64-pc-linux-gnu/12.3.0:/cvmfs/software.eessi.io/versions/2023.06/compat/linux/x86_64/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/compat/linux/x86_64/usr/lib/../lib64:/cvmfs/software.eessi.io/versions/2023.06/compat/linux/x86_64/lib:/cvmfs/software.eessi.io/versions/2023.06/compat/linux/x86_64/usr/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/GDRCopy/2.3.1-GCCcore-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/UCC/1.2.0-GCCcore-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/PMIx/4.2.4-GCCcore-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/libfabric/1.18.0-GCCcore-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/UCX/1.14.1-GCCcore-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/OpenSSL/1.1/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/libpciaccess/0.17-GCCcore-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/libxml2/2.11.4-GCCcore
-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/software/numactl/2.0.16-GCCcore-12.3.0/lib:/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/software/CUDA/12.1.1/lib]

Also fine!
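The RPATH above is one long colon-separated string, which is hard to check by eye. A minimal sketch of how it could be split and filtered for the accelerator-specific entries follows; the helper name `split_rpath` and the shortened sample line are illustrative assumptions, built only from paths that appear in the output above.

```python
def split_rpath(readelf_line):
    """Return the directories listed inside 'Library rpath: [...]'."""
    start = readelf_line.index("[") + 1
    end = readelf_line.rindex("]")
    return readelf_line[start:end].split(":")


# Shortened sample (assumption): three entries taken from the zen3 output.
PREFIX = "/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3"
sample = (
    " 0x000000000000000f (RPATH)              Library rpath: ["
    + PREFIX + "/accel/nvidia/cc80/software/CUDA/12.1.1/lib64:"
    + PREFIX + "/software/UCX/1.14.1-GCCcore-12.3.0/lib:"
    + PREFIX + "/accel/nvidia/cc80/software/CUDA/12.1.1/lib]"
)

entries = split_rpath(sample)
# Keep only entries that live under the accelerator-specific prefix.
accel_entries = [e for e in entries if "/accel/nvidia/cc80/" in e]
for entry in accel_entries:
    print(entry)
```

In practice the full `readelf -d ... | grep RPATH` line would be passed in instead of the shortened sample.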

@casparvl casparvl added the bot:deploy Ask bot to deploy missing software installations to EESSI label Sep 26, 2024

Label bot:deploy has been set by user casparvl, but this person does not have permission to trigger deployments

@boegel
Contributor

boegel commented Sep 26, 2024

Tested on our zen3 + A100 cluster, in a 2-GPU Slurm job.

Bandwidth:

$ mpirun -np 2 osu_bw -d cuda -m 33554432 D D
# OSU MPI-CUDA Bandwidth Test v7.2
# Send Buffer on DEVICE (D) and Receive Buffer on DEVICE (D)
# Size      Bandwidth (MB/s)
# Datatype: MPI_CHAR.
1                       1.01
2                       1.19
4                       2.36
8                       4.55
16                     16.65
32                     33.78
64                     67.25
128                    84.75
256                   260.92
512                   173.38
1024                  299.62
2048                  451.33
4096                  458.77
8192                  525.73
16384                3857.74
32768                7994.64
65536               15294.18
131072              27755.39
262144              44371.70
524288              61133.24
1048576             73675.19
2097152             82905.64
4194304             87543.13
8388608             90981.81
16777216            92202.43
33554432            93130.52

That's getting close to the maximum bandwidth of 100 GB/s between two A100s connected by 4 NVLinks (25 GB/s each):

$ nvidia-smi topo -m
        GPU0    GPU1    NIC0    CPU Affinity    NUMA Affinity   GPU NUMA ID
GPU0     X      NV4     SYS             3               N/A
GPU1    NV4      X      SYS             5               N/A
NIC0    SYS     SYS      X

Legend:

  X    = Self
  SYS  = Connection traversing PCIe as well as the SMP interconnect between NUMA nodes (e.g., QPI/UPI)
  ...
  NV#  = Connection traversing a bonded set of # NVLinks
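To make the comparison with the NVLink peak explicit, here is a small sketch of the arithmetic. It assumes that the MB/s figure reported by `osu_bw` uses 1 MB = 10^6 bytes; the per-link figure and link count are the ones quoted above.

```python
NVLINKS = 4                    # "NV4" in the nvidia-smi topo output
GB_PER_S_PER_LINK = 25.0       # per-link figure quoted in the comment above
measured_mb_per_s = 93130.52   # osu_bw result for 33554432-byte messages

peak_gb_per_s = NVLINKS * GB_PER_S_PER_LINK
# Assumption: 1 MB = 10**6 bytes in the OSU output.
measured_gb_per_s = measured_mb_per_s / 1000.0
fraction = measured_gb_per_s / peak_gb_per_s

print(f"{measured_gb_per_s:.2f} GB/s of {peak_gb_per_s:.0f} GB/s peak ({fraction:.1%})")
# prints: 93.13 GB/s of 100 GB/s peak (93.1%)
```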

Latency:

$ mpirun -np 2 osu_latency -d cuda D D
# OSU MPI-CUDA Latency Test v7.2
# Send Buffer on DEVICE (D) and Receive Buffer on DEVICE (D)
# Size          Latency (us)
# Datatype: MPI_CHAR.
1                       1.54
2                       2.39
4                       2.35
8                       2.34
16                      1.57
32                      1.54
64                      1.55
128                     2.04
256                     2.00
512                     2.94
1024                    4.91
2048                    6.27
4096                   10.38
8192                   17.47
16384                  10.11
32768                   9.86
65536                  10.42
131072                 11.27
262144                 12.48
524288                 15.34
1048576                20.84
2097152                31.89
4194304                54.41
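For comparing runs like these across builds, the OSU output can be turned into numbers with a few lines of code. The sketch below is an assumption about how one might do that, not part of the PR; `parse_osu` is a hypothetical helper and the sample contains only a few rows copied from the latency table above.

```python
def parse_osu(text):
    """Parse OSU benchmark output into a list of (size_bytes, value) tuples."""
    rows = []
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and the '#' header lines.
        if not line or line.startswith("#"):
            continue
        size, value = line.split()
        rows.append((int(size), float(value)))
    return rows


# A few rows copied from the osu_latency output above.
sample = """\
# OSU MPI-CUDA Latency Test v7.2
# Size          Latency (us)
1                       1.54
8192                   17.47
16384                  10.11
4194304                54.41
"""

rows = parse_osu(sample)
# Flag consecutive rows where latency drops although the message got larger.
drops = [(prev, cur) for prev, cur in zip(rows, rows[1:]) if cur[1] < prev[1]]
for prev, cur in drops:
    print(f"latency drops from {prev[1]} us at {prev[0]} B to {cur[1]} us at {cur[0]} B")
```

On the sample this flags the step from 8192 to 16384 bytes, which is also visible in the full table.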

@boegel boegel merged commit 4c12b5c into EESSI:2023.06-software.eessi.io Sep 26, 2024
35 checks passed

eessi-bot bot commented Sep 26, 2024

PR merged! Moved ['/project/def-users/SHARED/jobs/2024.09/pr_716/19106', '/project/def-users/SHARED/jobs/2024.09/pr_716/19114', '/project/def-users/SHARED/jobs/2024.09/pr_716/20007', '/project/def-users/SHARED/jobs/2024.09/pr_716/20008'] to /project/def-users/SHARED/trash_bin/EESSI/software-layer/2024.09.26

PR merged! Moved [] to $HOME/trash_bin/EESSI/software-layer/2024.09.26


eessi-bot bot commented Sep 26, 2024

PR merged! Moved [] to /project/def-users/SHARED/trash_bin/EESSI/software-layer/2024.09.26

@boegel
Contributor

boegel commented Sep 26, 2024

Just to confirm:

$ ldd $(which osu_bw) | grep CUDA
        libcudart.so.12 => /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/software/CUDA/12.1.1/lib64/libcudart.so.12 (0x00001493be200000)
        libnccl.so.2 => /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/software/NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1/lib/libnccl.so.2 (0x00001493bbb7b000)
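The same check can be scripted. The following is a minimal sketch, assuming the `ldd` output has been captured as text; `parse_ldd` is a hypothetical helper, and the two sample lines are copied from the output above.

```python
def parse_ldd(text):
    """Map library name -> resolved path for 'ldd' lines that contain '=>'."""
    resolved = {}
    for line in text.splitlines():
        if "=>" not in line:
            continue
        name, rest = line.split("=>", 1)
        parts = rest.split()
        # Unresolved libraries ("not found") get None instead of a path.
        path = parts[0] if parts and parts[0].startswith("/") else None
        resolved[name.strip()] = path
    return resolved


sample = """\
        libcudart.so.12 => /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/software/CUDA/12.1.1/lib64/libcudart.so.12 (0x00001493be200000)
        libnccl.so.2 => /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen3/accel/nvidia/cc80/software/NCCL/2.18.3-GCCcore-12.3.0-CUDA-12.1.1/lib/libnccl.so.2 (0x00001493bbb7b000)
"""

libs = parse_ldd(sample)
# Every CUDA-related library should resolve under the accelerator prefix.
all_from_accel = all(
    path is not None and "/accel/nvidia/cc80/" in path for path in libs.values()
)
print(all_from_accel)
```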

Labels
2023.06-software.eessi.io 2023.06 version of software.eessi.io accel:nvidia bot:deploy Ask bot to deploy missing software installations to EESSI