zohimchandani/cudaq-perlmutter

Executing CUDA-Q on multiple nodes on Perlmutter

Please make sure you work in your global $HOME directory and not $SCRATCH.

On a Perlmutter login node (e.g. zohim@login19:~>), run the following commands:

  1. Pull the latest image:

shifterimg pull nvcr.io/nvidia/nightly/cuda-quantum:latest

  2. Enter the image to add some configuration:

shifter --image=docker:nvcr.io/nvidia/nightly/cuda-quantum:latest --module=cuda-mpich /bin/bash

  3. Copy over the distributed_interfaces folder:

cp -r /opt/nvidia/cudaq/distributed_interfaces/ .

  4. Pip install any packages you would like (optional).

  5. Exit the image:

exit

  6. Activate the native MPI plugin:

export MPI_PATH=/opt/cray/pe/mpich/8.1.27/ofi/gnu/9.1
source distributed_interfaces/activate_custom_mpi.sh

Make sure the distributed_interfaces folder from step 3 above is in your home directory.

  7. Verify the successful creation of the local library and environment variable:

echo $CUDAQ_MPI_COMM_LIB
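
If the activation script succeeded, this prints the path of the plugin library it built inside distributed_interfaces. As an extra sanity check (a suggested addition, not one of the original steps), confirm that the file actually exists:

ls -l "$CUDAQ_MPI_COMM_LIB"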

  8. Shifter into the container again and copy some files:

shifter --image=docker:nvcr.io/nvidia/nightly/cuda-quantum:latest --module=cuda-mpich /bin/bash

cp /usr/local/cuda/targets/x86_64-linux/lib/libcudart.so.11.8.89 ~/libcudart.so

exit

Now you have all the settings required to run CUDA-Q on multiple nodes on Perlmutter.

Create a .py file you would like to execute and a jobscript.sh file. See examples of these in the repo.
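
For orientation, below is a minimal jobscript.sh sketch. The project account, queue, node and GPU counts, and the program name program.py are placeholders; the CUDAQ_MPI_COMM_LIB path and the LD_LIBRARY_PATH line are assumptions based on the steps above. Treat the jobscript.sh in the repo as the authoritative version.

#!/bin/bash
#SBATCH -A <account>                # placeholder: your NERSC project account
#SBATCH -C gpu
#SBATCH -q regular
#SBATCH -N 2                        # number of nodes (adjust as needed)
#SBATCH --ntasks-per-node=4         # one MPI rank per GPU
#SBATCH --gpus-per-node=4
#SBATCH -t 00:10:00
#SBATCH --image=docker:nvcr.io/nvidia/nightly/cuda-quantum:latest
#SBATCH --module=cuda-mpich

# Assumption: this is the library built by activate_custom_mpi.sh in step 6;
# verify the exact path with echo $CUDAQ_MPI_COMM_LIB
export CUDAQ_MPI_COMM_LIB=$HOME/distributed_interfaces/libcudaq_distributed_interface_mpi.so

# Assumption: makes the libcudart.so copied into $HOME in step 8 findable at runtime
export LD_LIBRARY_PATH=$HOME:$LD_LIBRARY_PATH

srun shifter python3 program.py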

Submit the job from the terminal with sbatch jobscript.sh.
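
While the job is queued or running, you can monitor it with standard Slurm commands, for example:

squeue --me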

The output will be written to a slurm-<jobid>.out file, an example of which is also in the repo.
