Hi,
I tried to pull the docker/apptainer as indicated from the tutorial, but it failed, the link is not responsive.
Then I tried to git clone and create a conda environment but it seems there is a CUDA lib /system mismatch.
" libcusparse.so.12: undefined symbol: __nvJitLinkComplete_12_4, version libnvJitLink.so.12 "
I tried also to build an apptainer from the Singularity file inside the git folder, but it also fails,
" RuntimeError: CUDA error: out of memory
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions. "
I wonder is it possible to update the link so I can try to get the apptainer directly?
Thanks.
Hi,
I tried to pull the docker/apptainer as indicated from the tutorial, but it failed, the link is not responsive.
Then I tried to git clone and create a conda environment but it seems there is a CUDA lib /system mismatch.
" libcusparse.so.12: undefined symbol: __nvJitLinkComplete_12_4, version libnvJitLink.so.12 "
I tried also to build an apptainer from the Singularity file inside the git folder, but it also fails,
" RuntimeError: CUDA error: out of memory
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with
TORCH_USE_CUDA_DSAto enable device-side assertions. "I wonder is it possible to update the link so I can try to get the apptainer directly?
Thanks.