Image update - OpenHPC v3.1 for RL9 #394

Merged: sjpb merged 21 commits into main from update/ohpc-v3.1 on Jun 6, 2024

sjpb (Collaborator) commented May 23, 2024

  • Updates the base image to RL9.4
  • Updates to OpenHPC v3.1 (bumps Slurm from 22.05 to 23.11.5, new compilers, etc.)
  • Upgrades OFED to version 24.04-0.6.6.0
  • Bumps the OSC ondemand role, so the dnf-installed ondemand package will now be the latest available
  • Makes CI run the RL8 tests automatically if the PR is labelled "RL8"

NB: both OFED and non-OFED images now require 15GB root disks.
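
The label-triggered CI behaviour described above can be pictured with a small sketch. This is only an illustration of the decision logic, not the workflow in this PR: the function name `os_targets`, the two constants, and the assumption that RL9 is always tested while RL8 is added only when the label is present are all hypothetical.

```python
# Hypothetical sketch of label-driven CI target selection.
# Assumption: RL9 is always tested; RL8 is added only when the PR carries the "RL8" label.
ALWAYS_TESTED = ["RL9"]           # assumed default CI target
LABEL_TRIGGERED = {"RL8": "RL8"}  # PR label -> extra OS target to test


def os_targets(pr_labels):
    """Return the OS images CI should test for a PR with the given labels."""
    targets = list(ALWAYS_TESTED)
    for label in pr_labels:
        extra = LABEL_TRIGGERED.get(label)
        # Ignore unrelated labels and avoid adding the same target twice.
        if extra is not None and extra not in targets:
            targets.append(extra)
    return targets


if __name__ == "__main__":
    print(os_targets([]))              # unlabelled PR
    print(os_targets(["RL8", "bug"]))  # PR labelled "RL8"
```

Keeping the label-to-target mapping in one table means another label-triggered target could be added without touching the selection logic.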

sjpb commented May 24, 2024

Image build: https://github.com/stackhpc/ansible-slurm-appliance/actions/runs/9220886576
Result: FAILED. The OFED LTS version does not support RL9.4.

sjpb force-pushed the update/ohpc-v3.1 branch from 0568b25 to 9ef2ea0 on June 5, 2024 08:45

sjpb commented Jun 5, 2024

Image build: https://github.com/stackhpc/ansible-slurm-appliance/actions/runs/9381036201

Both RL9 builds succeeded; the RL8 build appears to have been cancelled, cause unknown.

sjpb added the RL8 label Jun 5, 2024
sjpb marked this pull request as ready for review June 6, 2024 08:21
sjpb requested a review from a team as a code owner June 6, 2024 08:21

sjpb commented Jun 6, 2024

RL9 manual tests on Open OnDemand (OOD):

  • shell: OK
  • desktop: OK
  • jupyter: OK
  • monitoring: OK

sjpb commented Jun 6, 2024

Leafcloud pingpong: [image]
vs previous release image: [image]

m-bull (Collaborator) left a comment

LGTM

sjpb merged commit 664e08e into main Jun 6, 2024
2 checks passed
sjpb deleted the update/ohpc-v3.1 branch June 6, 2024 10:46
MaxBed4d pushed a commit that referenced this pull request Oct 15, 2024
* bump Packer source image to RL9.4

* downgrade OFED to LTS to get stable download url

* bump OOD role, now ondemand dnf package installed will be latest

* Revert Packer source image to RL9.3 to avoid hanging after post-update reboot

This reverts commit 851c494.

* bump OFED to get RL9.4-supported version

* bump leafcloud packer vm to 8GB RAM

* DEBUG: disable (working) OFED build

* Revert "DEBUG: disable (working) OFED build"

This reverts commit 45a48c3.

* DEBUG: output builder hostname

* Revert "DEBUG: output builder hostname"

This reverts commit 3f95f8e.

* fix build workflow concurrency

* DEBUG: disable updates

* Revert "DEBUG: disable updates"

This reverts commit 3581a35.

* bump packer build volume size for non-ofed to avoid RL8 build running out of root space

* try to prevent stackhpc env image build connection drops

* bump packer source image to fixed RL9.4 image

* run test CI workflow on RL8 image if PR labeled with 'RL8'

* bump CI images

* bump openhpc role to fix munge checks on key path