Skip to content

Commit b3640df

Browse files
committed
add manila UI for caas
1 parent bad0374 commit b3640df

File tree

1 file changed

+105
-0
lines changed

1 file changed

+105
-0
lines changed
Lines changed: 105 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,105 @@
1+
name: "slurm-manila-preview"
2+
label: "Slurm: Manila Integration Preview"
3+
description: >-
4+
Batch cluster running the Slurm workload manager, the Open
5+
OnDemand web interface, and custom monitoring.
6+
7+
Has /project and /home from CephFS/Manila.
8+
logo: https://upload.wikimedia.org/wikipedia/commons/thumb/3/3a/Slurm_logo.svg/158px-Slurm_logo.svg.png
9+
10+
parameters:
11+
- name: cluster_floating_ip
12+
label: External IP
13+
description: The external IP to use for the login node.
14+
kind: cloud.ip
15+
immutable: true
16+
17+
- name: compute_count
18+
label: Compute node count
19+
description: The number of compute nodes in the cluster.
20+
kind: integer
21+
options:
22+
min: 1
23+
default: 3
24+
25+
- name: compute_flavor
26+
label: Compute node size
27+
description: The size to use for the compute node.
28+
kind: "cloud.size"
29+
immutable: true
30+
options:
31+
min_ram: 2048
32+
min_disk: 20
33+
34+
- name: home_volume_size
35+
label: Home filesystem size (GB)
36+
description: The size of the cloud volume or share to use for home directories
37+
kind: integer
38+
immutable: true
39+
options:
40+
min: 10
41+
default: 100
42+
43+
- name: metrics_db_maximum_size
44+
label: Metrics database size (GB)
45+
description: |
46+
The oldest metrics records in the [Prometheus](https://prometheus.io/) database will be
47+
discarded to ensure that the database does not grow larger than this size.
48+
49+
**A cloud volume of this size +10GB will be created to hold and persist the metrics
50+
database and important Slurm files.**
51+
kind: integer
52+
immutable: true
53+
options:
54+
min: 10
55+
default: 10
56+
57+
- name: cluster_run_validation
58+
label: Post-configuration validation
59+
description: >-
60+
If selected, post-configuration jobs will be executed to validate the core functionality
61+
of the cluster when it is re-configured.
62+
kind: boolean
63+
required: false
64+
default: true
65+
options:
66+
checkboxLabel: Run post-configuration validation?
67+
68+
usage_template: |-
69+
# Accessing the cluster using Open OnDemand
70+
71+
[Open OnDemand](https://openondemand.org/) is a web portal for managing HPC jobs, including graphical
72+
environments such as [Jupyter Notebooks](https://jupyter.org/).
73+
74+
{% if cluster.outputs.openondemand_url %}
75+
The Open OnDemand portal for this cluster is available at
76+
[{{ cluster.outputs.openondemand_url.slice(8) }}]({{ cluster.outputs.openondemand_url }}).
77+
78+
Enter the username `azimuth` and password `{{ cluster.outputs.azimuth_user_password }}` when prompted.
79+
{% else %}
80+
The Open OnDemand portal for this cluster can be accessed from the services list.
81+
{% endif %}
82+
83+
# Accessing the cluster using SSH
84+
85+
The cluster can be accessed over SSH via the external IP. The SSH public key of the user that
86+
deployed the cluster is injected into the `azimuth` user:
87+
88+
```
89+
$ ssh azimuth@{{ cluster.outputs.cluster_access_ip | default('[cluster ip]') }}
90+
[azimuth@{{ cluster.name }}-login-0 ~]$ sinfo
91+
PARTITION AVAIL TIMELIMIT NODES STATE NODELIST
92+
compute* up 60-00:00:0 {{ "%3s" | format(cluster.parameter_values.compute_count) }} idle {{ cluster.name }}-compute-[0-{{ cluster.parameter_values.compute_count - 1 }}]
93+
```
94+
95+
The `rocky` user can be accessed the same way and has passwordless `sudo` enabled.
96+
97+
SSH access can be granted to additional users by placing their SSH public key in `~azimuth/.ssh/authorized_keys`.
98+
99+
services:
100+
- name: ood
101+
label: Open OnDemand
102+
icon_url: https://github.com/stackhpc/ansible-slurm-appliance/raw/main/environments/.caas/assets/ood-icon.png
103+
- name: monitoring
104+
label: Monitoring
105+
icon_url: https://raw.githubusercontent.com/cncf/artwork/master/projects/prometheus/icon/color/prometheus-icon-color.png

0 commit comments

Comments
 (0)