Skip to content

Benchmark: [DCGPU] Slurm script for rccl benchmarking #568

Merged
Xiaoming-AMD merged 13 commits intoAMD-AGI:mainfrom
Z-Y00:slurm_bench
Mar 25, 2026
Merged

Benchmark: [DCGPU] Slurm script for rccl benchmarking #568
Xiaoming-AMD merged 13 commits intoAMD-AGI:mainfrom
Z-Y00:slurm_bench

Conversation

@Z-Y00
Copy link
Copy Markdown
Contributor

@Z-Y00 Z-Y00 commented Feb 26, 2026

Adding slurm script for DCGPU cluster rccl benchmarking. Some edits are from Joyce for their cluster setup.

Usage example on DCGPU cluster:

DOCKER_IMAGE=rocm/primus:v26.1 NNODES=2 sbatch -N2 -w smci355-ccs-aus-n04-[25,29] -p Compute-DCPT ./run_slurm.sh

@Z-Y00 Z-Y00 changed the title [DCGPU] Slurm script for rccl benchmarking Benchmark: [DCGPU] Slurm script for rccl benchmarking Feb 26, 2026
@Xiaoming-AMD Xiaoming-AMD merged commit bdc4532 into AMD-AGI:main Mar 25, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants