Replies: 2 comments
-
Could you please try with the current release, Horovod 0.24 |
Beta Was this translation helpful? Give feedback.
0 replies
-
add a |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I am trying to compile Horovod with the following command:
HOROVOD_NCCL_HOME=/sw/arch/Debian10/EB_production/2021/software/NCCL/2.10.3-GCCcore-10.3.0-CUDA-11.3.1/ HOROVOD_GPU_OPERATIONS=NCCL pip install --no-cache-dir horovod[pytorch]==0.21.0
I already added the environment variables
HOROVOD_WITH_PYTORCH=1 HOROVOD_WITHOUT_PYTORCH=0 HOROVOD_WITH_MXNET=0 HOROVOD_WITHOUT_MXNET=1 HOROVOD_WITH_TENSORFLOW=0 HOROVOD_WITHOUT_TENSORFLOW=1
And also
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/sw/arch/Debian10/EB_production/2021/software/NCCL/2.10.3-GCCcore-10.3.0-CUDA-11.3.1/lib
I get the following error:
I don't understand why I get the error"
CMake Error at horovod/mxnet/CMakeLists.txt:19 (message): Horovod build with GPU support was requested but this MXNet installation does not support CUDA.
I am not even trying to install Horovod with mxnet support?
I have the following modules loaded on the cluster:
`Currently Loaded Modules:
My
pip freeze
gives the followingSuggestions are appreciated!
Beta Was this translation helpful? Give feedback.
All reactions