[WIP] add matrix #1923

adrienbernede · 2025-10-01T08:33:09Z

This PR adds testing on Matrix.

Note that this is working and all checks pass. However, we will hold off on merging until we discuss it as a team: limited access to machine resources on Matrix may make this additional testing impractical.

TODO:

- Merge "Update to spack v1" (#1911) first.
- Then merge "[WIP] Update packages and configs in RADIUSS Spack Configs" (#1920).
- Only merge this branch after the first two.

pguthrey · 2025-10-11T17:07:08Z

scripts/gitlab/build_and_test.sh


    # Map CPU core allocations
-    declare -A core_counts=(["lassen"]=40 ["poodle"]=28 ["dane"]=28 ["corona"]=32 ["rzansel"]=48 ["tioga"]=32 ["tuolumne"]=48)
+    declare -A core_counts=(["lassen"]=40 ["poodle"]=28 ["dane"]=28 ["corona"]=32 ["rzansel"]=48 ["tioga"]=32 ["tuolumne"]=48 ["matrix"]=48)


https://hpc.llnl.gov/hardware/compute-platforms lists 112 cores per node, which is what I have been using.

rhornung67 · 2025-10-14

Yes, I know how many cores a node has on the machine. The value is set lower than that because compilation fails frequently if you run parallel make with all cores. We do this on other platforms as well.
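
For readers following the thread, here is a minimal sketch of how a per-machine table like `core_counts` can cap the parallel build width below the physical core count. The function name `cores_for_machine` and the fallback value of 8 are assumptions made for illustration; they are not taken from `build_and_test.sh`. It requires bash 4 or newer for associative arrays.

```bash
#!/usr/bin/env bash
# Sketch only: pick a parallel build width from a per-machine table.
# cores_for_machine and the fallback of 8 are hypothetical, not project code.

# Print the make job count for a machine name such as "matrix".
cores_for_machine() {
    local machine="${1:-unknown}"
    # Values copied from the diff in this thread; they are deliberately
    # lower than the physical core count of each node.
    local -A core_counts=(["lassen"]=40 ["poodle"]=28 ["dane"]=28 ["corona"]=32
                          ["rzansel"]=48 ["tioga"]=32 ["tuolumne"]=48 ["matrix"]=48)
    # Machines missing from the table get a conservative width (assumption).
    printf '%s\n' "${core_counts[$machine]:-8}"
}

# Example: show the make invocation that would be used on matrix.
jobs="$(cores_for_machine "matrix")"
echo "would run: make -j ${jobs}"
```

Keeping the cap in one table makes it easy to lower a single machine's width when builds start failing, without touching the rest of the script.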

pguthrey · 2025-10-11T17:09:47Z

.gitlab/custom-jobs-and-variables.yml

+# Arguments for top level allocation
+  MATRIX_SHARED_ALLOC: "--exclusive --time=60 --nodes=1"
+# Arguments for job level allocation
+  MATRIX_JOB_ALLOC: "--nodes=1"
+# Project specific variants for matrix
+  PROJECT_MATRIX_VARIANTS: "~shared +cuda cuda_arch=75 +tests"
+# Project specific deps for matrix
+  PROJECT_MATRIX_DEPS:


You might need to specify the number of GPUs (possibly unique to matrix). I use

    srun -n1 -p pdebug --gres=gpu:4 --exclusive ...

and

    LLNL_MATRIX_SLURM_SCHEDULER_PARAMETERS:
      value: "--nodes=1 --ntasks-per-node=1 --gres=gpu:4 --time=00:20:00 --cpus-per-task=112 -p pdebug --exclusive"
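
As a rough illustration of the suggestion above, the following sketch assembles Slurm allocation arguments with an explicit GPU request. The helper `slurm_alloc_args` is hypothetical and not part of this repository; it uses only the Slurm options already quoted in this thread (`--nodes`, `--exclusive`, `--time`, `--gres`, `-p`). Whether matrix actually requires `--gres` is the open question here, not something this sketch settles.

```bash
#!/usr/bin/env bash
# Sketch only: build Slurm allocation arguments with an optional GPU request.
# slurm_alloc_args is a hypothetical helper, not part of the project's scripts.

# Usage: slurm_alloc_args <gpus> <minutes> [partition]
slurm_alloc_args() {
    local gpus="$1" minutes="$2" partition="${3:-}"
    local args=(--nodes=1 --exclusive "--time=${minutes}")
    # Request GPUs explicitly only when a positive count is given.
    if [ "${gpus}" -gt 0 ]; then
        args+=("--gres=gpu:${gpus}")
    fi
    # Add a partition only when one is named.
    if [ -n "${partition}" ]; then
        args+=(-p "${partition}")
    fi
    printf '%s\n' "${args[*]}"
}

# Example: print the arguments for 4 GPUs, 20 minutes, on the pdebug partition.
echo "allocation args: $(slurm_alloc_args 4 20 pdebug)"
```

The resulting string could be placed in a variable such as `MATRIX_SHARED_ALLOC` if the team decides the GPU request is needed.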

rhornung67 · 2025-10-17T22:16:55Z

@adrienbernede this is all working now when I test locally. However, it is almost impossible to get an allocation on matrix, and the CI job times out waiting for one almost every time. I'm going to pursue this to see what can be done.

adayton1 · 2025-10-20T22:35:34Z

@rhornung67, is this still a WIP or is it ready for review?

rhornung67 · 2025-10-21T14:49:07Z

@adayton1 please review if you want to. I want to discuss with the team whether it makes sense to merge this since our priority on matrix is very low.

adrienbernede and others added 19 commits October 1, 2025 10:30

Add Matrix

083f26f

Fix oversight

199e120

Fix oversight

d5d53b7

update camp to version used in develop v2025.09.2

223eb23

Bump core count for matrix

9ea3e10

Merge branch 'develop' into woptim/add-matrix

dd5fc02

Use newer cmake on matrix to avoid configure issues

47d55dd

Fix typo

be43f4b

Another attempt

9e4e493

Attempt to fix cmake issue on matrix

f38ef1f

Remove old CMake version

89a3c9f

Pull in changes to RSC

57253b0

remove explicit cmake version for matrix, add newer cmake packages

32e9929

empty commit to trigger CI

b3c685e

Pull in RSC fix

4f9fefe

Point at proper commit

fcd2501

Try to fix commit history

f92885e

Merge branch 'develop' into woptim/add-matrix

a5ea100

Bump time allocation on matrix

3bbc54d

pguthrey reviewed Oct 11, 2025


rhornung67 added 2 commits October 17, 2025 08:28

Add config infor for CUDA and HIP using-with-cmake test

d37bcf3

Merge branch 'develop' into woptim/add-matrix

603ef43

rhornung67 requested review from MrBurmark, adayton1, artv3, davidbeckingsale, rhornung67 and smithsg84 October 20, 2025 17:21

rhornung67 requested review from johnbowen42 and rchen20 October 20, 2025 17:21


5 participants
