
Add lora_path to chat completion #2438

Merged: 9 commits merged into sgl-project:main from ccchow:chat_lora on Dec 17, 2024
Conversation

@ccchow (Contributor) commented Dec 11, 2024

Motivation

Add lora_path to ChatCompletionRequest for the OpenAI chat completion API. It was previously added to the OpenAI completion API in #2243.

Modifications

Added lora_path to ChatCompletionRequest
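
For reference, a minimal sketch of what the protocol change looks like, assuming the request models in sglang's OpenAI-compatible layer are pydantic models (the exact type annotation below is an assumption, not the PR's verbatim diff):

```python
from typing import List, Optional, Union

from pydantic import BaseModel


class ChatCompletionRequest(BaseModel):
    # Existing OpenAI chat fields, heavily elided for the sketch.
    model: str
    messages: List[dict]
    max_tokens: Optional[int] = None
    # New in this PR: selects which LoRA adapter(s) to apply. A single
    # string for one adapter, or a list for per-request adapters in a batch.
    lora_path: Optional[Union[List[Optional[str]], str]] = None
```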

Checklist

  • Format your code according to the Contributor Guide.
  • Add unit tests as outlined in the Contributor Guide.
  • Update documentation as needed, including docstrings or example tutorials.

@ccchow requested a review from merrymercy on December 13, 2024
@merrymercy merged commit 33c5ff2 into sgl-project:main on Dec 17, 2024
1 of 14 checks passed
@merrymercy (Contributor)

Even though you added this, it seems it is still not used.

@ccchow (Contributor, Author) commented Dec 17, 2024

We are working on a project to provide multi-LoRA serving via the OpenAI-compatible API, and we have validated this fix by adding lora_path to the OpenAI protocol and serving a batch with different LoRA adapters.

@ccchow (Contributor, Author) commented Dec 17, 2024

Thanks for merging this change!

@qingzhong1

Hello, how do I use url = "http://localhost:8000/v1/chat/completions" to request a configured LoRA? For example, with data = {"model": "Qwen2.5-7B-Instruct", "messages": [{"role": "user", "content": "What is the capital of France?"}]} and LoRA name 'aa'.

@ccchow (Contributor, Author) commented Dec 19, 2024

> Hello, how do I use url = "http://localhost:8000/v1/chat/completions" to request a configured LoRA? For example, with data = {"model": "Qwen2.5-7B-Instruct", "messages": [{"role": "user", "content": "What is the capital of France?"}]} and LoRA name 'aa'.

curl -X POST http://127.0.0.1:30000/v1/chat/completions \
  -d '{"model": "meta-llama/Llama-3.2-1B",
       "messages": [{"role": "system", "content": "You are a happy assistant that puts a positive spin on everything."},
                    {"role": "user", "content": "I fell off my bike today."}],
       "lora_path": "lora1",
       "max_tokens": 64}'
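
The same call from Python, for convenience (a sketch using the requests library; the URL, model, and lora_path value are the placeholders from the curl command above, and the adapter is assumed to have been registered when the server was launched, e.g. via --lora-paths):

```python
import requests

# Mirrors the curl command above: "lora_path" names a LoRA adapter the
# server already knows about; everything else is a standard OpenAI
# chat completion payload.
resp = requests.post(
    "http://127.0.0.1:30000/v1/chat/completions",
    json={
        "model": "meta-llama/Llama-3.2-1B",
        "messages": [
            {"role": "system", "content": "You are a happy assistant that puts a positive spin on everything."},
            {"role": "user", "content": "I fell off my bike today."},
        ],
        "lora_path": "lora1",
        "max_tokens": 64,
    },
)
print(resp.json()["choices"][0]["message"]["content"])
```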

@qingzhong1

v1/completions can successfully call LoRA, but v1/chat/completions cannot. Why? Comparing v1_generate_request and v1_chat_generate_request, we found that v1_chat_generate_request does not have the lora_paths variable.

@ccchow (Contributor, Author) commented Dec 19, 2024

> v1/completions can successfully call LoRA, but v1/chat/completions cannot. Why? Comparing v1_generate_request and v1_chat_generate_request, we found that v1_chat_generate_request does not have the lora_paths variable.

You are right. I missed that when cherry-picking the changes. I'll open another PR.
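
For anyone hitting this before the follow-up lands, the gap is that v1_chat_generate_request builds the internal generate request without forwarding the new field, while v1_generate_request does. A self-contained sketch of the missing plumbing, using simplified stand-ins for sglang's real classes (the names follow this thread, but the actual fix in the follow-up PR may differ):

```python
from dataclasses import dataclass
from typing import List, Optional

# Simplified stand-ins for sglang's request types; the real classes live
# in the OpenAI adapter and carry many more fields.

@dataclass
class ChatCompletionRequest:
    model: str
    messages: List[dict]
    lora_path: Optional[str] = None  # the field added by this PR

@dataclass
class GenerateReqInput:
    text: List[str]
    lora_path: Optional[List[Optional[str]]] = None

def v1_chat_generate_request(all_requests: List[ChatCompletionRequest]) -> GenerateReqInput:
    # Trivial prompt rendering just for the sketch; the real code applies
    # the model's chat template to the messages.
    prompts = ["\n".join(m["content"] for m in r.messages) for r in all_requests]
    # The missing plumbing: collect each request's adapter and forward it,
    # mirroring what v1_generate_request already does for /v1/completions.
    lora_paths = [r.lora_path for r in all_requests]
    return GenerateReqInput(text=prompts, lora_path=lora_paths)
```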

@ccchow deleted the chat_lora branch on December 19, 2024
@ccchow (Contributor, Author) commented Dec 19, 2024

> v1/completions can successfully call LoRA, but v1/chat/completions cannot. Why? Comparing v1_generate_request and v1_chat_generate_request, we found that v1_chat_generate_request does not have the lora_paths variable.

#2529
