This is an old revision of the document!
Load modules and declare environment variables as follows:
module purge module load batch/slurm module load compilers/intel17 module load compilers/cuda-7.5 module load mpi/openmpi-2.0.i17.cuda75 export CUDA_LIB=/usr/local/cuda/cuda-7.5/lib64 export CUDA_INC=/usr/local/cuda/cuda-7.5/include
First make sure that required modules are loaded and CUDA_LIB, CUDA_INC environment variables are declared (see previous section). Create a run directory including directories DATABASE_MPI, OUTPUT_FILES, bin and DATA.
In the directory DATA, create CMTSOLUTION, Par_file and STATIONS file (cf., SPECFEM3D_GLOBE documentation).
Change directory to the main specfem3D_globe directory. Copy the Par_file of the run directory into DATA of the main specfem3D_globe directory. You can then run configure:
./configure -with-cuda=cuda5
(the cuda5 option will enable more recent GPU compute capabilities).
Then compile the code:
make clean make all
Copy binaries to the run directory and backup setup files:
cp bin/xmeshfem3D $rundir/bin cp bin/xspecfem3D $rundir/bin cp setup/* $rundir/OUTPUT_FILES/ cp DATA/Par_file $rundir/OUTPUT_FILES/
Link model directories. For example with s362ani:
ln -s DATA/crust2.0 $rundir/DATA/crust2.0 ln -s DATA/s362ani $rundir/DATA/s362ani ln -s DATA/QRFSI12 $rundir/DATA/QRFSI12 ln -s DATA/topo_bathy $rundir/DATA/topo_bathy
Example of slurm script (number of CPU cores should be adapted to NPROC_XI and NPROC_ETA:
#!/bin/bash #SBATCH -p grant -A g2016a68 # Partition / Account #SBATCH -n 96 # Number of CPU cores #SBATCH --job-name=SPECFEM #SBATCH -t 23:00:00 # Wall time echo Master on host `hostname` echo Time is `date` # Start time begin=`date +"%s"` # Run BASEMPIDIR=`grep LOCAL_PATH DATA/Par_file | cut -d = -f 2 ` NPROC_XI=`grep NPROC_XI DATA/Par_file | cut -d = -f 2 ` NPROC_ETA=`grep NPROC_ETA DATA/Par_file | cut -d = -f 2` NCHUNKS=`grep NCHUNKS DATA/Par_file | cut -d = -f 2 ` numcpus=$(( $NCHUNKS * $NPROC_XI * $NPROC_ETA )) # numcpus should be consistent with the -n option mkdir -p OUTPUT_FILES # backup files used for this simulation cp DATA/Par_file OUTPUT_FILES/ cp DATA/STATIONS OUTPUT_FILES/ cp DATA/CMTSOLUTION OUTPUT_FILES/ ## ## mesh generation ## sleep 2 echo echo `date` echo "starting MPI mesher on $numcpus processors" echo mpirun -np $numcpus bin/xmeshfem3D echo " mesher done: `date`" echo # backup important files addressing.txt and list*.txt cp OUTPUT_FILES/*.txt $BASEMPIDIR/ ## ## forward simulation ## # set up addressing #cp $BASEMPIDIR/addr*.txt OUTPUT_FILES/ #cp $BASEMPIDIR/list*.txt OUTPUT_FILES/ sleep 2 echo echo `date` echo starting run in current directory $PWD echo mpirun -np $numcpus bin/xspecfem3D /bin/rm -rf DATABASES_MPI echo "finished successfully" echo `date` # Print time after running echo Time is `date` echo Walltime : $(expr `date +"%s"` - $begin) # Seconds echo CPUtime : $(squeue -j $SLURM_JOBID -o "%M" -h) # HH:MM:SS
Example of slurm script (number of nodes should be adapted to the number of GPU per nodes and to NPROC_XI and NPROC_ETA:
#!/bin/bash #SBATCH -p pri2015gpu -A eost #SBATCH -N 3-3 # Will use 3 nodes #SBATCH --tasks-per-node 8 # 8 tasks per node #SBATCH --gres=gpu:8 # We only want nodes with 8 GPUs #SBATCH --job-name=SPECFEM #SBATCH -t 12:00:00 # Wall time #SBATCH --cpu_bind=verbose # ID of each GPU (should be adapted if using a different number of GPUs) export CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 echo Master on host `hostname` echo Time is `date` # Start time begin=`date +"%s"` # Run BASEMPIDIR=`grep LOCAL_PATH DATA/Par_file | cut -d = -f 2 ` NPROC_XI=`grep NPROC_XI DATA/Par_file | cut -d = -f 2 ` NPROC_ETA=`grep NPROC_ETA DATA/Par_file | cut -d = -f 2` NCHUNKS=`grep NCHUNKS DATA/Par_file | cut -d = -f 2 ` numgpus=$(( $NCHUNKS * $NPROC_XI * $NPROC_ETA )) mkdir -p OUTPUT_FILES # backup files used for this simulation cp -pf DATA/Par_file OUTPUT_FILES/ cp -pf DATA/STATIONS OUTPUT_FILES/ cp -pf DATA/CMTSOLUTION OUTPUT_FILES/ ## ## mesh generation ## sleep 2 echo echo `date` echo "starting MPI mesher on $numgpus processors" echo mpirun -np $numgpus bin/xmeshfem3D echo " mesher done: `date`" echo # backup important files addressing.txt and list*.txt cp OUTPUT_FILES/*.txt $BASEMPIDIR/ ## ## forward simulation ## # set up addressing #cp $BASEMPIDIR/addr*.txt OUTPUT_FILES/ #cp $BASEMPIDIR/list*.txt OUTPUT_FILES/ sleep 2 echo echo `date` echo starting run in current directory $PWD echo mpirun -np $numgpus bin/xspecfem3D /bin/rm -rf DATABASES_MPI echo "finished successfully" echo `date` # Print time after running echo Time is `date` echo Walltime : $(expr `date +"%s"` - $begin) # Seconds echo CPUtime : $(squeue -j $SLURM_JOBID -o "%M" -h) # HH:MM:SS