vLLM support #41
Comments
Are there any updates on this one?
+1
4 similar comments
+1
+1
+1
+1
So, could you provide advice on how I can make custom modifications to vLLM myself (for Llama 2 70B)?
FWIW, I know this issue is about vLLM, but you can run Medusa on TGI using `--speculate 3`.
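A minimal launch sketch for the above, assuming a Docker setup; `<medusa-model-repo>` is a placeholder for a Medusa checkpoint on the Hub, and the image tag, port, and volume path are illustrative:

```bash
# Hedged sketch: launch TGI with Medusa speculative decoding enabled.
# <medusa-model-repo> is a placeholder, not a real repo id; adjust the
# image tag, port mapping, and volume path to your environment.
docker run --gpus all --shm-size 1g -p 8080:80 \
  -v $PWD/data:/data \
  ghcr.io/huggingface/text-generation-inference:latest \
  --model-id <medusa-model-repo> \
  --speculate 3
```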
Hello, how can I pass the Medusa model and base model args when I use Medusa on TGI?
Just pass the Medusa model repo (as you would with any other model) and then add on the `--speculate` flag (e.g. `--speculate 3`, as above). You can try this template: https://runpod.io/gsc?template=2xpg09eenv&ref=jmfkcdio
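Once the server is up, speculation is transparent to clients; here is a sketch of an ordinary request against TGI's standard `/generate` route, assuming the host and port from the launch command above:

```bash
# Hedged sketch: query the TGI server; Medusa speculation needs no
# client-side changes, so this is a plain /generate call.
curl http://localhost:8080/generate \
  -X POST \
  -H 'Content-Type: application/json' \
  -d '{"inputs": "What is speculative decoding?", "parameters": {"max_new_tokens": 64}}'
```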
Thanks a lot!!!!
How can I use Medusa with vLLM or SGLang?
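vLLM later gained Medusa support through its general speculative decoding options; a hedged sketch of the OpenAI-compatible server launch, where both model ids are placeholders and the flag names follow older vLLM releases (they may have moved under a consolidated speculative config in newer versions, so check the server's `--help` output):

```bash
# Hedged sketch: vLLM OpenAI-compatible server with a Medusa draft model.
# <base-model-repo> and <medusa-heads-repo> are placeholders; flag names
# may differ across vLLM versions.
python -m vllm.entrypoints.openai.api_server \
  --model <base-model-repo> \
  --speculative-model <medusa-heads-repo> \
  --num-speculative-tokens 3
```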