Commit Graph

360 Commits

Author SHA1 Message Date
caab89f642 feat: use central sulrmdbd, and misc fixes 2025-09-27 14:03:51 +02:00
a981f5e7ba fix: fix slurm and munge uid:s and gid:s 2025-09-27 13:42:38 +02:00
34c28e18bf fix: move bin to toplevel and add reuid-slurm.sh 2025-09-27 13:42:09 +02:00
398af17797 fix: slurm updates for rossby 2025-09-26 16:03:31 +02:00
998d551943 fix: use jwt, simplify slurmrestd, and make slurmdbd optional 2025-09-26 16:02:30 +02:00
c9624213ed fix: fix slurmdbd setup 2025-09-25 15:52:30 +02:00
bcff2e6c2f Merge branch 'main' of gitlab.com:oceanbox/clusterfck 2025-09-25 12:38:44 +02:00
3c0a7f91f5 fix: slurm and stuff 2025-09-25 12:28:59 +02:00
46cf9da93f feat: allow tailnet access 2025-09-25 12:16:42 +02:00
2e919182d4 fix: remove /opt/singularity 2025-09-25 12:16:24 +02:00
ff3f897859 fix: rename features for better clarity 2025-09-25 12:15:51 +02:00
9b798444d1 feat: enable slurm jwt and remove slocket proxy 2025-09-25 12:15:24 +02:00
d2e27a7e87 feat: add slurm key generators and remove stale scripts 2025-09-25 12:08:05 +02:00
Jonas Juselius
d5cfcd2bf9 fix: reset systemd slurmrest socketConfig to true 2025-09-24 15:22:33 +02:00
Jonas Juselius
cf4ae97e1c feat: ekman on new cluster setup 2025-09-24 12:24:28 +02:00
Jonas Juselius
96f8215c52 feat: upgrade ekman to new cluster structure 2025-09-23 13:40:16 +02:00
Jonas Juselius
46473c88dd Merge branch 'main' of gitlab.com:oceanbox/nixos-clusters 2025-09-23 12:59:13 +02:00
fac7bdd62e fix: change feature manager to manage 2025-09-23 12:58:57 +02:00
Jonas Juselius
e38b0a2317 fix: change /frontend to /users 2025-09-23 12:30:18 +02:00
82a5328d7f feat: move /home and /opt to cephfs and tweak mounts 2025-09-23 12:11:53 +02:00
8894339216 fix: enable 100GbE and disable net mounts for now 2025-09-16 13:45:37 +02:00
f5679d39f9 fix: add missing nodes and disable net mounts for now 2025-09-16 13:43:00 +02:00
59db74b265 fix: misc fixes and tweaks 2025-09-16 13:42:25 +02:00
65aba0f69d fix: update Mellanox firmware tools 2025-09-16 13:41:01 +02:00
db794e6eea fix: fix extraSANs 2025-09-13 10:13:12 +02:00
4057a00143 fix: /work mount 2025-09-13 07:31:45 +02:00
14b5f07cc6 fix: move apiserver port to standard 6443 on (new) ekman 2025-09-13 07:11:04 +02:00
33a14d1509 fix: move IB network to 10.1.6.0/24 (get it? :) 2025-09-13 07:10:25 +02:00
3af5ba3fbd fix: add fs-work and etcd cluster 2025-09-13 07:03:17 +02:00
6767eb21e6 fix: move apiserver port to standard 6443 2025-09-13 07:00:49 +02:00
eb7b1f8130 fix: fix ekman part of botched merge 2025-09-12 14:38:36 +02:00
fcd136ed4e fix: partially fix a totally botched merge. 2025-09-12 14:32:42 +02:00
Jonas Juselius
c8814ec8d9 Merge remote-tracking branch 'origin/rossby' 2025-09-12 13:52:20 +02:00
f7f6eabb0f fix: misc fixes and tweaks 2025-09-12 13:49:29 +02:00
Jonas Juselius
69e47e60d0 fix: simplify ekman hive 2025-09-12 13:28:38 +02:00
Jonas Juselius
5c72112457 major: grand unified clusterfck (ekman not tested yet) 2025-09-12 13:12:36 +02:00
Jonas Juselius
ba5f1b8add wip: convert ekman to new cluster sturcture (not complete) 2025-09-12 12:53:56 +02:00
e0846164a7 major: initial rossy cluster and biggish refactor 2025-09-12 11:59:15 +02:00
Jonas Juselius
899a7f4338 fix: misc fixes (save for rossby) 2025-09-06 08:01:54 +02:00
Jonas Juselius
8f1048cddc fix: update nixos module 2025-06-30 12:28:15 +02:00
Jonas Juselius
bc3a034654 fix: add k8s and hpc modules to main repo 2025-06-30 12:21:05 +02:00
Jonas Juselius
4aa9fa677a Remove modules as submodule. 2025-06-30 12:20:22 +02:00
Jonas Juselius
15bd8fe978 fix: misc update, don't know what 2025-06-30 12:11:39 +02:00
Jonas Juselius
c080e16f07 cleanup: remove stale files 2024-06-15 23:40:50 +02:00
Jonas Juselius
7ba7bb42e0 feat and fix: add c0-17/18, new fs-work and fs-backup and tweaks 2024-06-15 23:39:44 +02:00
Jonas Juselius
98d2d16d51 wip: restrcturing compute and storage 2024-06-15 06:11:50 +02:00
Jonas Juselius
3ad0687026 feat: refactor and unify network mounts throughout the cluster 2024-05-08 13:16:13 +02:00
Jonas Juselius
84441a1a94 fix: fix kraken home permissions globally 2024-03-08 08:51:58 +01:00
Jonas Juselius
d752928816 fix: fix routing and iptables 2024-03-07 15:41:55 +01:00
Jonas Juselius
e072b8b1c1 fix: remove ekman from etcd cluster 2024-03-07 15:40:38 +01:00