feat: pass the max_lora_rank parameter to vLLM backend #3794
What does this PR do?
If the user trains an LLM with LoRA using a rank greater than 16, the vLLM backend fails with the error `ValueError: LoRA rank XXX is greater than max_lora_rank 16`. This commit passes the `max_lora_rank` parameter to the vLLM backend to support a higher LoRA rank (currently up to 64 due to vLLM constraints).
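For context, here is a minimal sketch of what setting a higher LoRA rank looks like on the vLLM side. The `enable_lora` and `max_lora_rank` arguments are vLLM's own engine parameters; the model path, adapter path, and rank value are placeholders for illustration:

```python
from vllm import LLM
from vllm.lora.request import LoRARequest

lora_rank = 64  # placeholder: the rank used when training the adapter
model_path = "meta-llama/Llama-2-7b-hf"  # placeholder base model

llm = LLM(
    model=model_path,
    enable_lora=True,
    # Without this, vLLM defaults to max_lora_rank=16 and raises
    # "ValueError: LoRA rank XXX is greater than max_lora_rank 16"
    # when loading an adapter trained with a higher rank.
    max_lora_rank=lora_rank,
)

outputs = llm.generate(
    ["Hello, world!"],
    lora_request=LoRARequest("my_adapter", 1, "/path/to/lora_adapter"),
)
```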