Skip to content

Don't start CUDA persistence daemon in image build #283

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Jun 16, 2023
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions ansible/roles/cuda/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,3 +11,4 @@ Requires OFED to be installed to provide required kernel-* packages.
- `cuda_distro`: Optional. Default `rhel8`.
- `cuda_repo`: Optional. Default `https://developer.download.nvidia.com/compute/cuda/repos/{{ cuda_distro }}/x86_64/cuda-{{ cuda_distro }}.repo`
- `cuda_packages`: Optional. Default: `['cuda', 'nvidia-gds']`.
- `cuda_persistenced_state`: Optional. State of systemd `nvidia-persistenced` service. Values as [ansible.builtin.systemd:state](https://docs.ansible.com/ansible/latest/collections/ansible/builtin/systemd_module.html#parameter-state). Default `started`.
1 change: 1 addition & 0 deletions ansible/roles/cuda/defaults/main.yml
Original file line number Diff line number Diff line change
Expand Up @@ -11,3 +11,4 @@ cuda_samples_programs:
- deviceQuery
- bandwidthTest
# cuda_devices: # discovered from deviceQuery run
cuda_persistenced_state: started
2 changes: 1 addition & 1 deletion ansible/roles/cuda/tasks/main.yml
Original file line number Diff line number Diff line change
Expand Up @@ -41,7 +41,7 @@
systemd:
name: nvidia-persistenced
enabled: true
state: started
state: "{{ cuda_persistenced_state }}"

- name: Reboot
ansible.builtin.reboot:
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -12,3 +12,4 @@ grafana_state: stopped # as it tries to listen on the "real" grafana node
block_devices_configurations: [] # as volumes will not be attached to Packer build VMs
mysql_state: stopped # as it tries to connect to real mysql node
opensearch_state: stopped # avoid writing config+certs+db into image
cuda_persistenced_state: stopped # probably don't have GPU in Packer build VMs