feat: Add environment variable to specify the number of layers to offload to GPU #2055

nopperl · 2024-05-05T16:17:11Z

This PR allows to set the LLAMA_CPP_N_GPU_LAYERS env var to specify the number of layers to offload to GPU (using the llama.cpp --n-gpu-layers flag). Fixes #892.

add env var to specify the number of layers to offload to GPU

e6f40eb

nopperl changed the title ~~Add environment variable to specify the number of layers to offload to GPU~~ feat: Add environment variable to specify the number of layers to offload to GPU May 5, 2024

wsxiaoys merged commit 2785a9a into TabbyML:main May 5, 2024
1 of 5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add environment variable to specify the number of layers to offload to GPU #2055

feat: Add environment variable to specify the number of layers to offload to GPU #2055

nopperl commented May 5, 2024

feat: Add environment variable to specify the number of layers to offload to GPU #2055

feat: Add environment variable to specify the number of layers to offload to GPU #2055

Conversation

nopperl commented May 5, 2024