Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Llama cuda support? #138

Open
jojo2357 opened this issue May 4, 2024 · 1 comment
Open

Llama cuda support? #138

jojo2357 opened this issue May 4, 2024 · 1 comment

Comments

@jojo2357
Copy link

jojo2357 commented May 4, 2024

I have compiled llama.cpp with the LLAMA_CUDA option and I notice that running an edge model does not use the GPU at all. Is there something I should look for in my config?

Also, would it be possible to download models other than the LIBERTY - EDGE models? I assume that I could get more inference earnings if I had a more popular model, too.

(Running on Ubuntu Linux with proprietary nvidia drivers)

@mhtvsSFrpHdE
Copy link

mhtvsSFrpHdE commented Jul 14, 2024

Hi, I've just discovered how to hack llama startup arguments

#144

Can you try and share what arguments do you use so you can enable GPU acceleration?
I also don't know how to compile the one yet on Windows... If any information, steps, commands is so thankful.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants