Skip to content

Failed to build from source (pytorch 1.3.1 + CUDA 11.6) #2321

Discussion options

You must be logged in to vote

Hi all,
Thanks for your attention!
I build from the source successfully!

The root cause of this building issue is because of the wrong environment variable.
In ColossalAI setup pipeline, the script will parse the path of the cuda from torch.utils.cpp_extension.CUDA_HOME.
And the torch.utils.cpp_extension.CUDA_HOME get the path based on the command which nvcc if there is no proper environment variable. (ref. https://github.com/pytorch/pytorch/blob/master/torch/utils/cpp_extension.py#L90)

However, in the anaconda virtual environment with pytorch-cuda=11.6, there is a duplicated nvcc execute file in the bin/, which makes the command which grabs the wrong path before grabbing the correct cuda…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by eric8607242
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
1 participant