I am trying to run a simulation written in CUDA on a server with a single 36-core Intel Xeon CPU and a single P4000 GPU. When I ran mpirun -np 16 ./sim, it failed with an error stating that MPI cannot use multiple CPU cores. When I ran ./sim directly, it worked, but it used only one CPU core and the GPU.
I am using OpenMPI. What settings should I configure? Is it necessary to reinstall OpenMPI?