Getting distributed package doesn't have mpi built in error

Question

I have been trying to write a distributed application using pytorch. I have been following tutorial here. Over there, I am using the "MPI Backend" option. According to that, I need to follow the basic steps to install pytorch and then install openmpi as conda install -c conda-forge openmpi

Unfortunately, whenever I try to run a script using mpirun mpiexec -n 2 python ptdist.py, I get the following error RuntimeError: Distributed package doesn't have MPI built in. I believe this is happening because of error in import ProcessGroupMPI code here in python.

I have tried to install openmpi from their source code as well as sudo apt-get install python-mpi4py, but am still facing the same error.

I also tried pip install mpi4py but that also does not help

Does anyone know what is the problem?

Gilles Gouaillardet · Accepted Answer · 2019-08-14 03:11:29Z

1

From https://medium.com/@esaliya/pytorch-distributed-with-mpi-acb84b3ae5fd

The MPI backend, though supported, is not available unless you compile PyTorch from its source

This suggests you should first install your favorite MPI library, and possibly mpi4py built on top of it, and then build pytorch from sources at last.

answered Aug 14, 2019 at 3:11

Gilles Gouaillardet

8,47111 gold badges26 silver badges34 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

brokendreams Over a year ago

@Gilies I followed the same tutorial from medium to setup everything. I am getting the same aforementioned error

Gilles Gouaillardet Over a year ago

there is an other tutorial at pytorch.org/tutorials/intermediate/dist_tuto.html. I suggest you restart from a fresh install (so you do not run python setup.py install too early). Also check the outputs in order to confirm MPI backend is found.

Collectives™ on Stack Overflow

Getting distributed package doesn't have mpi built in error

1 Answer 1

2 Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

2 Comments

Your Answer

Sign up or log in

Post as a guest

Related