RuntimeError: Error building extension 'slstm_HS128BS8NH4NS4DBfDRbDWbDGbDSbDAfNG4SA1GRCV0GRC0d0FCV0FC0d0' #58
Comments
“subprocess.CalledProcessError: Command '['ninja', '-v']' returned non-zero exit status 1.” |
I have the same issue. I installed ninja with: and in my terminal I can execute successfully: when I execute:
I'm on Windows, but I also tested on Debian and got the same issue. |
Try running the xlstm program directly; `ninja -v` is called as part of that process. Running the command manually fails because no build file is specified. I hope this helps. |
Exactly the same thing happened to me :( Environment:
CalledProcessError Traceback (most recent call last)
File ~/miniconda3/envs/main/lib/python3.10/subprocess.py:526, in run(input, capture_output, timeout, check, *popenargs, **kwargs)
CalledProcessError: Command '['ninja', '-v']' returned non-zero exit status 1.
[1/5] /usr/local/cuda/bin/nvcc -DTORCH_EXTENSION_NAME=slstm_HS256BS8NH8NS4DBfDRbDWbDGbDSbDAfNG4SA1GRCV0GRC0d0FCV0FC0d0 -DTORCH_API_INCLUDE_EXTENSION_H -DPYBIND11_COMPILER_TYPE="gcc" -DPYBIND11_STDLIB="libstdcpp" -DPYBIND11_BUILD_ABI="cxxabi1011" -isystem /home/user/miniconda3/envs/main/lib/python3.10/site-packages/torch/include -isystem /home/user/miniconda3/envs/main/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /home/user/miniconda3/envs/main/lib/python3.10/site-packages/torch/include/TH -isystem /home/user/miniconda3/envs/main/lib/python3.10/site-packages/torch/include/THC -isystem /usr/local/cuda/include -isystem /home/user/miniconda3/envs/main/include/python3.10 -D_GLIBCXX_USE_CXX11_ABI=0 -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ --expt-relaxed-constexpr -gencode=arch=compute_86,code=compute_86 -gencode=arch=compute_86,code=sm_86 --compiler-options '-fPIC' -Xptxas="-v" -gencode arch=compute_80,code=compute_80 -res-usage --use_fast_math -O3 -Xptxas -O3 --extra-device-vectorization -DSLSTM_HIDDEN_SIZE=256 -DSLSTM_BATCH_SIZE=8 -DSLSTM_NUM_HEADS=8 -DSLSTM_NUM_STATES=4 -DSLSTM_DTYPE_B=float -DSLSTM_DTYPE_R=nv_bfloat16 -DSLSTM_DTYPE_W=nv_bfloat16 -DSLSTM_DTYPE_G=nv_bfloat16 -DSLSTM_DTYPE_S=nv_bfloat16 -DSLSTM_DTYPE_A=float -DSLSTM_NUM_GATES=4 -DSLSTM_SIMPLE_AGG=true -DSLSTM_GRADIENT_RECURRENT_CLIPVAL_VALID=false -DSLSTM_GRADIENT_RECURRENT_CLIPVAL=0.0 -DSLSTM_FORWARD_CLIPVAL_VALID=false -DSLSTM_FORWARD_CLIPVAL=0.0 -U__CUDA_NO_HALF_OPERATORS__ -U__CUDA_NO_HALF_CONVERSIONS__ -U__CUDA_NO_BFLOAT16_OPERATORS__ -U__CUDA_NO_BFLOAT16_CONVERSIONS__ -U__CUDA_NO_BFLOAT162_OPERATORS__ -U__CUDA_NO_BFLOAT162_CONVERSIONS__ -std=c++17 -c /home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/slstm_forward.cu -o slstm_forward.cuda.o
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_fp16.cuh(70): error: identifier "__hadd_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_fp16.cuh(76): error: identifier "__hsub_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_fp16.cuh(87): error: identifier "__hmul_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_bf16.cuh(76): error: identifier "__hadd_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_bf16.cuh(83): error: identifier "__hsub_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_bf16.cuh(96): error: identifier "__hmul_rn" is undefined
6 errors detected in the compilation of "/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/slstm_forward.cu".
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_fp16.cuh(70): error: identifier "__hadd_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_fp16.cuh(76): error: identifier "__hsub_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_fp16.cuh(87): error: identifier "__hmul_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_bf16.cuh(76): error: identifier "__hadd_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_bf16.cuh(83): error: identifier "__hsub_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_bf16.cuh(96): error: identifier "__hmul_rn" is undefined
6 errors detected in the compilation of "/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/slstm_pointwise.cu".
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_fp16.cuh(70): error: identifier "__hadd_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_fp16.cuh(76): error: identifier "__hsub_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_fp16.cuh(87): error: identifier "__hmul_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_bf16.cuh(76): error: identifier "__hadd_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_bf16.cuh(83): error: identifier "__hsub_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_bf16.cuh(96): error: identifier "__hmul_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/slstm_backward.cu(54): warning: parameter "num_gates_i" was declared but never referenced
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/slstm_backward.cu(54): warning: parameter "num_gates_t" was declared but never referenced
6 errors detected in the compilation of "/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/slstm_backward.cu".
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_fp16.cuh(70): error: identifier "__hadd_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_fp16.cuh(76): error: identifier "__hsub_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_fp16.cuh(87): error: identifier "__hmul_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_bf16.cuh(76): error: identifier "__hadd_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_bf16.cuh(83): error: identifier "__hsub_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/../util/inline_ops_bf16.cuh(96): error: identifier "__hmul_rn" is undefined
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/slstm_backward_cut.cu(54): warning: parameter "num_gates_i" was declared but never referenced
/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/slstm_backward_cut.cu(54): warning: parameter "num_gates_t" was declared but never referenced
6 errors detected in the compilation of "/home/user/miniconda3/envs/main/lib/python3.10/site-packages/xlstm/blocks/slstm/src/cuda/slstm_backward_cut.cu". |
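The nvcc log above repeats the same diagnostics for every .cu file, which makes it hard to see what is actually missing. A small sketch that may help when reading such a log: it counts the distinct undefined identifiers. The function name `summarize_undefined` is made up for this example, and the regular expression assumes the `file(line): error: identifier "..." is undefined` shape seen in this log.

```python
import re
from collections import Counter

# Assumed diagnostic shape, taken from the log in this thread:
#   path/file.cuh(70): error: identifier "__hadd_rn" is undefined
UNDEFINED_RE = re.compile(
    r'([^\s()]+)\((\d+)\): error: identifier "([^"]+)" is undefined'
)


def summarize_undefined(log: str) -> Counter:
    """Count how often each undefined identifier appears in an nvcc log."""
    return Counter(match.group(3) for match in UNDEFINED_RE.finditer(log))


if __name__ == "__main__":
    sample = (
        '/x/util/inline_ops_fp16.cuh(70): error: identifier "__hadd_rn" is undefined '
        '/x/util/inline_ops_fp16.cuh(76): error: identifier "__hsub_rn" is undefined '
        '/x/util/inline_ops_bf16.cuh(76): error: identifier "__hadd_rn" is undefined'
    )
    for name, count in summarize_undefined(sample).most_common():
        print(name, count)
```

Applied to the full log, this would reduce the output to the three names `__hadd_rn`, `__hsub_rn` and `__hmul_rn`, which is the part worth searching for.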
@WangYLon What do you mean by "run xlstm's program directly"? I simply tried to build an instance like this, and the exception above followed :(
from model.ModelxLSTMMixer import xLSTMMixer
import os
os.environ['CUDA_LIB'] = '/usr/local/cuda/lib64' |
My answer was addressed to @IamYipi. When you run an sLSTM program, Ninja is called automatically, so there is no need to run `ninja -v` manually. Ninja is a build system, similar to GNU Make, that compiles code and manages dependencies efficiently. The sLSTM code contains .cu files (CUDA C/C++ programs). When sLSTM is included in a model, the code automatically calls Ninja to compile the CUDA sources and build the required extension. Judging from your error, @dearsikadeer, Ninja has already started compiling and building slstm, so the failure is probably caused by something else. You could check your gcc/g++ version. |
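To follow up on the suggestion to check compiler versions, here is a minimal sketch that collects the version output of the tools involved in the build. It uses only the Python standard library; the helper names `tool_version` and `toolchain_report` are invented for this example, and it assumes each tool accepts `--version`.

```python
import shutil
import subprocess


def tool_version(cmd: str, flag: str = "--version") -> str:
    """Return the version output of `cmd`, or a short note if it cannot be run."""
    path = shutil.which(cmd)
    if path is None:
        return "not found on PATH"
    try:
        result = subprocess.run(
            [path, flag], capture_output=True, text=True, timeout=30
        )
    except (OSError, subprocess.TimeoutExpired) as exc:
        return f"failed to run: {exc}"
    output = (result.stdout or result.stderr).strip()
    return output if output else "no version output"


def toolchain_report() -> dict:
    """Map each build tool used in this thread to its reported version."""
    return {tool: tool_version(tool) for tool in ("gcc", "g++", "nvcc", "ninja")}


if __name__ == "__main__":
    for tool, version in toolchain_report().items():
        print(f"== {tool} ==\n{version}\n")
```

Posting this output together with the PyTorch version usually makes it easier for others to spot a mismatch.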
Thank you for your suggestions. I will try every possible method. |
Thanks for the reply. I fixed it by setting the CUDA_HOME environment variable to the correct path.
|
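For anyone trying the CUDA_HOME fix, here is a rough sketch of how the variable could be checked and set from Python. The helpers `find_cuda_home` and `ensure_cuda_home` are hypothetical names, and the only check is whether `bin/nvcc` exists in the directory. As far as I know, PyTorch resolves its CUDA location when `torch.utils.cpp_extension` is imported, so this would have to run before that import.

```python
import os
from pathlib import Path
from typing import Iterable, Optional


def find_cuda_home(candidates: Iterable[str]) -> Optional[str]:
    """Return the first candidate directory that contains bin/nvcc (or bin/nvcc.exe)."""
    for candidate in candidates:
        bin_dir = Path(candidate) / "bin"
        if (bin_dir / "nvcc").is_file() or (bin_dir / "nvcc.exe").is_file():
            return str(Path(candidate))
    return None


def ensure_cuda_home(candidates: Iterable[str]) -> Optional[str]:
    """Keep a valid existing CUDA_HOME; otherwise set it from the candidates."""
    current = os.environ.get("CUDA_HOME")
    found = find_cuda_home([current] if current else []) or find_cuda_home(candidates)
    if found is not None:
        os.environ["CUDA_HOME"] = found
    return found


if __name__ == "__main__":
    # /usr/local/cuda is the path that appears in the log in this thread;
    # adjust the candidate list for your own system.
    print(ensure_cuda_home(["/usr/local/cuda"]))
```

If the function returns `None`, no candidate contained an nvcc binary, so the toolkit is probably installed somewhere else.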
@WangYLon Thanks! I still don't know how to fix the problem. My gcc version: gcc (Ubuntu 9.4.0-1ubuntu1~20.04.1) 9.4.0. I added this to my |
You can change the sLSTM backend to 'vanilla', as the author commented.
It will work. |
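One way to apply the 'vanilla' suggestion without hard-coding it is to try the CUDA backend first and fall back when the extension build raises `RuntimeError`, as it does in this issue. This is only a sketch: `build_with_fallback` and the `build` callable are hypothetical names and not part of xlstm.

```python
from typing import Any, Callable, Sequence, Tuple


def build_with_fallback(
    build: Callable[[str], Any],
    backends: Sequence[str] = ("cuda", "vanilla"),
) -> Tuple[str, Any]:
    """Try each backend in order and return (backend, model) for the first that builds.

    `build` is a user-supplied function (hypothetical here) that takes the
    backend name and constructs the model with it.
    """
    last_error = None
    for backend in backends:
        try:
            return backend, build(backend)
        except RuntimeError as exc:
            # The failed extension build in this thread surfaces as RuntimeError.
            last_error = exc
    raise RuntimeError(f"all backends failed: {list(backends)}") from last_error
```

Here `build` would be your own function that puts the given string into the sLSTM layer config's `backend` field and returns the constructed model.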
@HungNguyen4864 Thank you! It works. But can I put the model on the GPU for training in 'vanilla' mode? |
In the In contrast, the If you still want to use a GPU, the source code would need to be adjusted. I can't give specifics on how to make these changes, but I can confirm the information as stated. You can follow this path to read more : I don't know how to fix it myself, so if you manage to fix it and it's not too much trouble, could you please share how? |
How can this be fixed? Do you have an answer? |
Traceback (most recent call last):
File "D:\Anaconda3\envs\DjPytorch\lib\site-packages\torch\utils\cpp_extension.py", line 2107, in _run_ninja_build
subprocess.run(
File "D:\Anaconda3\envs\DjPytorch\lib\subprocess.py", line 528, in run
raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command '['ninja', '-v']' returned non-zero exit status 1.
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "C:\Users\Admin\Desktop\pythonProject(配置GPU环境)\pythonProject1\xlstmtest.py", line 35, in <module>
xlstm_stack = xLSTMBlockStack(cfg)
File "D:\Anaconda3\envs\DjPytorch\lib\site-packages\xlstm\xlstm_block_stack.py", line 84, in __init__
self.blocks = self._create_blocks(config=config)
File "D:\Anaconda3\envs\DjPytorch\lib\site-packages\xlstm\xlstm_block_stack.py", line 105, in _create_blocks
blocks.append(sLSTMBlock(config=config))
File "D:\Anaconda3\envs\DjPytorch\lib\site-packages\xlstm\blocks\slstm\block.py", line 33, in __init__
super().__init__(
File "D:\Anaconda3\envs\DjPytorch\lib\site-packages\xlstm\blocks\xlstm_block.py", line 63, in __init__
self.xlstm = sLSTMLayer(config=self.config.slstm)
File "D:\Anaconda3\envs\DjPytorch\lib\site-packages\xlstm\blocks\slstm\layer.py", line 78, in __init__
self.slstm_cell = sLSTMCell(self.config)
File "D:\Anaconda3\envs\DjPytorch\lib\site-packages\xlstm\blocks\slstm\cell.py", line 780, in __new__
return sLSTMCell_cuda(config, skip_backend_init=skip_backend_init)
File "D:\Anaconda3\envs\DjPytorch\lib\site-packages\xlstm\blocks\slstm\cell.py", line 690, in __init__
self.func = sLSTMCellFuncGenerator(self.training, config)
File "D:\Anaconda3\envs\DjPytorch\lib\site-packages\xlstm\blocks\slstm\cell.py", line 536, in sLSTMCellFuncGenerator
slstm_cuda = sLSTMCellCUDA.instance(config=config)
File "D:\Anaconda3\envs\DjPytorch\lib\site-packages\xlstm\blocks\slstm\cell.py", line 515, in instance
cls.mod[repr(config)] = load(
File "D:\Anaconda3\envs\DjPytorch\lib\site-packages\xlstm\blocks\slstm\src\cuda_init.py", line 84, in load
mod = _load(name + suffix, sources, **myargs)
File "D:\Anaconda3\envs\DjPytorch\lib\site-packages\torch\utils\cpp_extension.py", line 1309, in load
return _jit_compile(
File "D:\Anaconda3\envs\DjPytorch\lib\site-packages\torch\utils\cpp_extension.py", line 1719, in _jit_compile
_write_ninja_file_and_build_library(
File "D:\Anaconda3\envs\DjPytorch\lib\site-packages\torch\utils\cpp_extension.py", line 1832, in _write_ninja_file_and_build_library
_run_ninja_build(
File "D:\Anaconda3\envs\DjPytorch\lib\site-packages\torch\utils\cpp_extension.py", line 2123, in _run_ninja_build
raise RuntimeError(message) from e
RuntimeError: Error building extension 'slstm_HS128BS8NH4NS4DBfDRbDWbDGbDSbDAfNG4SA1GRCV0GRC0d0FCV0FC0d0'
This problem has been bothering me for three days. Does anyone know how to solve it? I really need a fix.
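The traceback above only shows that `ninja -v` exited with status 1, not the compiler error behind it. A sketch of one way to dig further, assuming the build output was captured on the underlying `CalledProcessError`: walk the exception chain and return that output. `find_build_output` is a name invented for this example.

```python
import subprocess
from typing import Optional


def find_build_output(exc: Optional[BaseException]) -> Optional[str]:
    """Follow __cause__/__context__ links and return the captured output of
    the first CalledProcessError found, or None if there is none."""
    seen = set()
    while exc is not None and id(exc) not in seen:
        seen.add(id(exc))
        if isinstance(exc, subprocess.CalledProcessError):
            out = exc.stdout if exc.stdout is not None else exc.stderr
            if isinstance(out, bytes):
                out = out.decode(errors="replace")
            return out
        exc = exc.__cause__ or exc.__context__
    return None
```

To use it, wrap the failing line (`xLSTMBlockStack(cfg)` in the traceback above) in `try: ... except RuntimeError as err:` and print `find_build_output(err)`. If it prints `None`, the output was not captured and has to be read from the console instead.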