Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Rebuild LAMMPS for */generic targets to make sure that CPU-specific optimizations are disabled #788

Open
wants to merge 4 commits into
base: 2023.06-software.eessi.io
Choose a base branch
from

Conversation

bedroge
Copy link
Collaborator

@bedroge bedroge commented Oct 15, 2024

The easyblock PR is not merged yet, but I want to do some test builds here already.

@bedroge bedroge added the 2023.06-software.eessi.io 2023.06 version of software.eessi.io label Oct 15, 2024
Copy link

eessi-bot bot commented Oct 15, 2024

Instance eessi-bot-mc-aws is configured to build for:

  • architectures: x86_64/generic, x86_64/intel/haswell, x86_64/intel/skylake_avx512, x86_64/amd/zen2, x86_64/amd/zen3, aarch64/generic, aarch64/neoverse_n1, aarch64/neoverse_v1
  • repositories: eessi.io-2023.06-compat, eessi-hpc.org-2023.06-software, eessi-hpc.org-2023.06-compat, eessi.io-2023.06-software

Instance boegel-bot-deucalion is configured to build for:

  • architectures: aarch64/a64fx
  • repositories: eessi.io-2023.06-software

Copy link

eessi-bot bot commented Oct 15, 2024

Instance eessi-bot-mc-azure is configured to build for:

  • architectures: x86_64/amd/zen4
  • repositories: eessi-hpc.org-2023.06-software, eessi-hpc.org-2023.06-compat, eessi.io-2023.06-software, eessi.io-2023.06-compat

@bedroge
Copy link
Collaborator Author

bedroge commented Oct 15, 2024

bot: build repo:eessi.io-2023.06-software arch:aarch64/generic
bot: build repo:eessi.io-2023.06-software arch:x86_64/generic

Copy link

eessi-bot bot commented Oct 15, 2024

Updates by the bot instance eessi-bot-mc-aws (click for details)

Updates by the bot instance boegel-bot-deucalion (click for details)
  • account bedroge has NO permission to send commands to the bot

Copy link

eessi-bot bot commented Oct 15, 2024

Updates by the bot instance eessi-bot-mc-azure (click for details)
  • received bot command build repo:eessi.io-2023.06-software arch:aarch64/generic from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:aarch64/generic
  • received bot command build repo:eessi.io-2023.06-software arch:x86_64/generic from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:x86_64/generic
  • handling command build repository:eessi.io-2023.06-software architecture:aarch64/generic resulted in:

    • no jobs were submitted
  • handling command build repository:eessi.io-2023.06-software architecture:x86_64/generic resulted in:

    • no jobs were submitted

Copy link

eessi-bot bot commented Oct 15, 2024

New job on instance eessi-bot-mc-aws for CPU micro-architecture aarch64-generic for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.10/pr_788/23391

date job status comment
Oct 15 08:48:48 UTC 2024 submitted job id 23391 awaits release by job manager
Oct 15 08:49:05 UTC 2024 released job awaits launch by Slurm scheduler
Oct 15 08:55:19 UTC 2024 running job 23391 is running
Oct 15 09:19:01 UTC 2024 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-23391.out
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-generic-1728983330.tar.gzsize: 0 MiB (45 bytes)
entries: 0
modules under 2023.06/software/linux/aarch64/generic/modules/all
no module files in tarball
software under 2023.06/software/linux/aarch64/generic/software
no software packages in tarball
other under 2023.06/software/linux/aarch64/generic
no other files in tarball
Oct 15 09:19:01 UTC 2024 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ OK ] (1/8) EESSI_OSU_Micro_Benchmarks_coll %benchmark_info=mpi.collective.osu_allreduce %scale=1_node %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %device_type=cpu /31ac6ab9 @BotBuildTests:aarch64-generic-node+default
P: latency: 3.57 us (r:0, l:None, u:None)
[ OK ] (2/8) EESSI_OSU_Micro_Benchmarks_coll %benchmark_info=mpi.collective.osu_allreduce %scale=1_node %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %device_type=cpu /f3be40a2 @BotBuildTests:aarch64-generic-node+default
P: latency: 3.38 us (r:0, l:None, u:None)
[ OK ] (3/8) EESSI_OSU_Micro_Benchmarks_coll %benchmark_info=mpi.collective.osu_alltoall %scale=1_node %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %device_type=cpu /10e66fba @BotBuildTests:aarch64-generic-node+default
P: latency: 5.45 us (r:0, l:None, u:None)
[ OK ] (4/8) EESSI_OSU_Micro_Benchmarks_coll %benchmark_info=mpi.collective.osu_alltoall %scale=1_node %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %device_type=cpu /5be57ae7 @BotBuildTests:aarch64-generic-node+default
P: latency: 5.37 us (r:0, l:None, u:None)
[ OK ] (5/8) EESSI_OSU_Micro_Benchmarks_pt2pt %benchmark_info=mpi.pt2pt.osu_latency %scale=1_node %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %device_type=cpu /c8c9aff5 @BotBuildTests:aarch64-generic-node+default
P: latency: 0.43 us (r:0, l:None, u:None)
[ OK ] (6/8) EESSI_OSU_Micro_Benchmarks_pt2pt %benchmark_info=mpi.pt2pt.osu_latency %scale=1_node %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %device_type=cpu /9795e491 @BotBuildTests:aarch64-generic-node+default
P: latency: 0.47 us (r:0, l:None, u:None)
[ OK ] (7/8) EESSI_OSU_Micro_Benchmarks_pt2pt %benchmark_info=mpi.pt2pt.osu_bw %scale=1_node %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %device_type=cpu /48da21c5 @BotBuildTests:aarch64-generic-node+default
P: bandwidth: 19915.29 MB/s (r:0, l:None, u:None)
[ OK ] (8/8) EESSI_OSU_Micro_Benchmarks_pt2pt %benchmark_info=mpi.pt2pt.osu_bw %scale=1_node %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %device_type=cpu /1b8c1ca2 @BotBuildTests:aarch64-generic-node+default
P: bandwidth: 19798.36 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 8/8 test case(s) from 8 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-23391.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

Copy link

eessi-bot bot commented Oct 15, 2024

New job on instance eessi-bot-mc-aws for CPU micro-architecture x86_64-generic for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.10/pr_788/23392

date job status comment
Oct 15 08:48:52 UTC 2024 submitted job id 23392 awaits release by job manager
Oct 15 08:49:07 UTC 2024 released job awaits launch by Slurm scheduler
Oct 15 08:55:23 UTC 2024 running job 23392 is running
Oct 15 09:19:03 UTC 2024 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-23392.out
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-generic-1728983401.tar.gzsize: 0 MiB (45 bytes)
entries: 0
modules under 2023.06/software/linux/x86_64/generic/modules/all
no module files in tarball
software under 2023.06/software/linux/x86_64/generic/software
no software packages in tarball
other under 2023.06/software/linux/x86_64/generic
no other files in tarball
Oct 15 09:19:03 UTC 2024 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ OK ] (1/8) EESSI_OSU_Micro_Benchmarks_coll %benchmark_info=mpi.collective.osu_allreduce %scale=1_node %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %device_type=cpu /31ac6ab9 @BotBuildTests:x86-64-generic-node+default
P: latency: 5.04 us (r:0, l:None, u:None)
[ OK ] (2/8) EESSI_OSU_Micro_Benchmarks_coll %benchmark_info=mpi.collective.osu_allreduce %scale=1_node %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %device_type=cpu /f3be40a2 @BotBuildTests:x86-64-generic-node+default
P: latency: 5.4 us (r:0, l:None, u:None)
[ OK ] (3/8) EESSI_OSU_Micro_Benchmarks_coll %benchmark_info=mpi.collective.osu_alltoall %scale=1_node %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %device_type=cpu /10e66fba @BotBuildTests:x86-64-generic-node+default
P: latency: 10.48 us (r:0, l:None, u:None)
[ OK ] (4/8) EESSI_OSU_Micro_Benchmarks_coll %benchmark_info=mpi.collective.osu_alltoall %scale=1_node %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %device_type=cpu /5be57ae7 @BotBuildTests:x86-64-generic-node+default
P: latency: 9.85 us (r:0, l:None, u:None)
[ OK ] (5/8) EESSI_OSU_Micro_Benchmarks_pt2pt %benchmark_info=mpi.pt2pt.osu_latency %scale=1_node %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %device_type=cpu /c8c9aff5 @BotBuildTests:x86-64-generic-node+default
P: latency: 0.72 us (r:0, l:None, u:None)
[ OK ] (6/8) EESSI_OSU_Micro_Benchmarks_pt2pt %benchmark_info=mpi.pt2pt.osu_latency %scale=1_node %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %device_type=cpu /9795e491 @BotBuildTests:x86-64-generic-node+default
P: latency: 0.63 us (r:0, l:None, u:None)
[ OK ] (7/8) EESSI_OSU_Micro_Benchmarks_pt2pt %benchmark_info=mpi.pt2pt.osu_bw %scale=1_node %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %device_type=cpu /48da21c5 @BotBuildTests:x86-64-generic-node+default
P: bandwidth: 11221.66 MB/s (r:0, l:None, u:None)
[ OK ] (8/8) EESSI_OSU_Micro_Benchmarks_pt2pt %benchmark_info=mpi.pt2pt.osu_bw %scale=1_node %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %device_type=cpu /1b8c1ca2 @BotBuildTests:x86-64-generic-node+default
P: bandwidth: 11202.81 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 8/8 test case(s) from 8 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-23392.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@bedroge
Copy link
Collaborator Author

bedroge commented Oct 15, 2024

The builds actually completed, and with right/expected CPU type:

x86_64/generic:

== configuring...
== Running pre-configure hook...
== Using Kokkos package with arch: CPU - EASYBUILD_GENERIC, GPU - None

aarch64/generic:

== configuring...
== Running pre-configure hook...
== Using Kokkos package with arch: CPU - ARMV80, GPU - None

However, both of them failed in the install step, because of this annoying issue again where somehow the old installation is showing up again (even though it should have been removed):

== installing...
== ... (took 6 secs)
== FAILED: Installation ended unsuccessfully (build directory: /tmp/bot/easybuild/build/LAMMPS/2Aug2023_update2/foss-2023a-kokkos): build failed (first 300 chars): Failed to remove directory /cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/generic/software/LAMMPS/2Aug2023_update2-foss-2023a-kokkos even after 3 attempts.
Reasons: [OSError(39, 'Directory not empty'), OSError(39, 'Directory not empty'), OSError(39, 'Directory not empty')] (took 9 mins 59 secs)

@bedroge
Copy link
Collaborator Author

bedroge commented Oct 15, 2024

bot: build repo:eessi.io-2023.06-software arch:aarch64/generic

Updates by the bot instance boegel-bot-deucalion (click for details)
  • account bedroge has NO permission to send commands to the bot

Copy link

eessi-bot bot commented Oct 15, 2024

Updates by the bot instance eessi-bot-mc-aws (click for details)

Copy link

eessi-bot bot commented Oct 15, 2024

Updates by the bot instance eessi-bot-mc-azure (click for details)
  • received bot command build repo:eessi.io-2023.06-software arch:aarch64/generic from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:aarch64/generic
  • handling command build repository:eessi.io-2023.06-software architecture:aarch64/generic resulted in:

    • no jobs were submitted

Copy link

eessi-bot bot commented Oct 15, 2024

New job on instance eessi-bot-mc-aws for CPU micro-architecture aarch64-generic for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.10/pr_788/23398

date job status comment
Oct 15 12:00:45 UTC 2024 submitted job id 23398 awaits release by job manager
Oct 15 12:01:42 UTC 2024 released job awaits launch by Slurm scheduler
Oct 15 12:06:44 UTC 2024 running job 23398 is running
Oct 15 12:32:18 UTC 2024 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-23398.out
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-generic-1728994970.tar.gzsize: 0 MiB (45 bytes)
entries: 0
modules under 2023.06/software/linux/aarch64/generic/modules/all
no module files in tarball
software under 2023.06/software/linux/aarch64/generic/software
no software packages in tarball
other under 2023.06/software/linux/aarch64/generic
no other files in tarball
Oct 15 12:32:18 UTC 2024 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ OK ] (1/8) EESSI_OSU_Micro_Benchmarks_coll %benchmark_info=mpi.collective.osu_allreduce %scale=1_node %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %device_type=cpu /31ac6ab9 @BotBuildTests:aarch64-generic-node+default
P: latency: 3.64 us (r:0, l:None, u:None)
[ OK ] (2/8) EESSI_OSU_Micro_Benchmarks_coll %benchmark_info=mpi.collective.osu_allreduce %scale=1_node %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %device_type=cpu /f3be40a2 @BotBuildTests:aarch64-generic-node+default
P: latency: 3.77 us (r:0, l:None, u:None)
[ OK ] (3/8) EESSI_OSU_Micro_Benchmarks_coll %benchmark_info=mpi.collective.osu_alltoall %scale=1_node %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %device_type=cpu /10e66fba @BotBuildTests:aarch64-generic-node+default
P: latency: 5.4 us (r:0, l:None, u:None)
[ OK ] (4/8) EESSI_OSU_Micro_Benchmarks_coll %benchmark_info=mpi.collective.osu_alltoall %scale=1_node %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %device_type=cpu /5be57ae7 @BotBuildTests:aarch64-generic-node+default
P: latency: 5.29 us (r:0, l:None, u:None)
[ OK ] (5/8) EESSI_OSU_Micro_Benchmarks_pt2pt %benchmark_info=mpi.pt2pt.osu_latency %scale=1_node %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %device_type=cpu /c8c9aff5 @BotBuildTests:aarch64-generic-node+default
P: latency: 0.45 us (r:0, l:None, u:None)
[ OK ] (6/8) EESSI_OSU_Micro_Benchmarks_pt2pt %benchmark_info=mpi.pt2pt.osu_latency %scale=1_node %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %device_type=cpu /9795e491 @BotBuildTests:aarch64-generic-node+default
P: latency: 0.47 us (r:0, l:None, u:None)
[ OK ] (7/8) EESSI_OSU_Micro_Benchmarks_pt2pt %benchmark_info=mpi.pt2pt.osu_bw %scale=1_node %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %device_type=cpu /48da21c5 @BotBuildTests:aarch64-generic-node+default
P: bandwidth: 19761.28 MB/s (r:0, l:None, u:None)
[ OK ] (8/8) EESSI_OSU_Micro_Benchmarks_pt2pt %benchmark_info=mpi.pt2pt.osu_bw %scale=1_node %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %device_type=cpu /1b8c1ca2 @BotBuildTests:aarch64-generic-node+default
P: bandwidth: 19711.0 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 8/8 test case(s) from 8 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-23398.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@bedroge
Copy link
Collaborator Author

bedroge commented Oct 15, 2024

bot: build repo:eessi.io-2023.06-software arch:aarch64/generic

Copy link

eessi-bot bot commented Oct 15, 2024

Updates by the bot instance eessi-bot-mc-aws (click for details)

Copy link

eessi-bot bot commented Oct 15, 2024

Updates by the bot instance eessi-bot-mc-azure (click for details)
  • received bot command build repo:eessi.io-2023.06-software arch:aarch64/generic from bedroge

    • expanded format: build repository:eessi.io-2023.06-software architecture:aarch64/generic
  • handling command build repository:eessi.io-2023.06-software architecture:aarch64/generic resulted in:

    • no jobs were submitted

Updates by the bot instance boegel-bot-deucalion (click for details)
  • account bedroge has NO permission to send commands to the bot

Copy link

eessi-bot bot commented Oct 15, 2024

New job on instance eessi-bot-mc-aws for CPU micro-architecture aarch64-generic for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.10/pr_788/23399

date job status comment
Oct 15 12:27:44 UTC 2024 submitted job id 23399 awaits release by job manager
Oct 15 12:28:08 UTC 2024 released job awaits launch by Slurm scheduler
Oct 15 12:29:11 UTC 2024 running job 23399 is running
Oct 15 12:52:40 UTC 2024 finished
😢 FAILURE (click triangle for details)
Details
✅ job output file slurm-23399.out
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-generic-1728996174.tar.gzsize: 0 MiB (45 bytes)
entries: 0
modules under 2023.06/software/linux/aarch64/generic/modules/all
no module files in tarball
software under 2023.06/software/linux/aarch64/generic/software
no software packages in tarball
other under 2023.06/software/linux/aarch64/generic
no other files in tarball
Oct 15 12:52:40 UTC 2024 test result
😁 SUCCESS (click triangle for details)
ReFrame Summary
[ OK ] (1/8) EESSI_OSU_Micro_Benchmarks_coll %benchmark_info=mpi.collective.osu_allreduce %scale=1_node %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %device_type=cpu /31ac6ab9 @BotBuildTests:aarch64-generic-node+default
P: latency: 3.54 us (r:0, l:None, u:None)
[ OK ] (2/8) EESSI_OSU_Micro_Benchmarks_coll %benchmark_info=mpi.collective.osu_allreduce %scale=1_node %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %device_type=cpu /f3be40a2 @BotBuildTests:aarch64-generic-node+default
P: latency: 3.37 us (r:0, l:None, u:None)
[ OK ] (3/8) EESSI_OSU_Micro_Benchmarks_coll %benchmark_info=mpi.collective.osu_alltoall %scale=1_node %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %device_type=cpu /10e66fba @BotBuildTests:aarch64-generic-node+default
P: latency: 5.4 us (r:0, l:None, u:None)
[ OK ] (4/8) EESSI_OSU_Micro_Benchmarks_coll %benchmark_info=mpi.collective.osu_alltoall %scale=1_node %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %device_type=cpu /5be57ae7 @BotBuildTests:aarch64-generic-node+default
P: latency: 5.4 us (r:0, l:None, u:None)
[ OK ] (5/8) EESSI_OSU_Micro_Benchmarks_pt2pt %benchmark_info=mpi.pt2pt.osu_latency %scale=1_node %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %device_type=cpu /c8c9aff5 @BotBuildTests:aarch64-generic-node+default
P: latency: 0.45 us (r:0, l:None, u:None)
[ OK ] (6/8) EESSI_OSU_Micro_Benchmarks_pt2pt %benchmark_info=mpi.pt2pt.osu_latency %scale=1_node %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %device_type=cpu /9795e491 @BotBuildTests:aarch64-generic-node+default
P: latency: 0.47 us (r:0, l:None, u:None)
[ OK ] (7/8) EESSI_OSU_Micro_Benchmarks_pt2pt %benchmark_info=mpi.pt2pt.osu_bw %scale=1_node %module_name=OSU-Micro-Benchmarks/7.2-gompi-2023b %device_type=cpu /48da21c5 @BotBuildTests:aarch64-generic-node+default
P: bandwidth: 19506.8 MB/s (r:0, l:None, u:None)
[ OK ] (8/8) EESSI_OSU_Micro_Benchmarks_pt2pt %benchmark_info=mpi.pt2pt.osu_bw %scale=1_node %module_name=OSU-Micro-Benchmarks/7.1-1-gompi-2023a %device_type=cpu /1b8c1ca2 @BotBuildTests:aarch64-generic-node+default
P: bandwidth: 19531.36 MB/s (r:0, l:None, u:None)
[ PASSED ] Ran 8/8 test case(s) from 8 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-23399.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
2023.06-software.eessi.io 2023.06 version of software.eessi.io
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant