Skip to content

Does 4bit support offloading yet? #370

Answered by BetaDoggo
ye7iaserag asked this question in Q&A
Discussion options

You must be logged in to vote

It's now possible as of 7618f3f using the --gptq-pre-layer <number of layers> argument.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@ye7iaserag
Comment options

Answer selected by ye7iaserag
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants