报错:
mmcv/_ext.cpython-37m-x86_64-linux-gnu.so: undefined symbol
RuntimeError: NCCL error in: /pytorch/torch/lib/c10d/ProcessGroupNCCL.cpp:38, unhandled cuda error, NCCL version
RuntimeError: NCCL error in: …/torch/csrc/distributed/c10d/ProcessGroupNCCL.cpp:45
原因:torch/torchvision需要更新,mmcv又强烈依赖二者版本,也需要对应更新,并且要注意版本对应
解决:
pip install torch-1.8.1+cu111-cp37-cp37m-linux_x86_64.whl # 文件在官网下载 wget https://download.pytorch.org/whl/cu111/torch-1.8.1%2Bcu111-cp37-cp37m-linux_x86_64.whl
pip install torchvision-0.9.1+cu111-cp37-cp37m-linux_x86_64.whl # wget https://download.pytorch.org/w