Vllm get tokenizer #1794

AguirreNicolas · 2024-05-06T17:57:35Z

Issue:

It is possible to change the model names served by vllm, and then have it not respond to any Huggingface repository making it impossible to obtain the tokenizer and therefore run lm-eval-harness.

Features:

This PR works in conjunction with another PR in the vllm repository to enable such features. The vllm server (optionally for whoever runs it) will send the *.json generated after using tokenizer.save_pretrained() and then the tokenizer is instantiated locally.

When running the tests, simply add the tokenizer_backend=vllm option to the model arguments.

CLAassistant · 2024-05-06T17:57:40Z

All committers have signed the CLA.

AguirreNicolas added 2 commits May 6, 2024 14:41

vllm tokenizer + doc update

22f08df

formating

c55ea8e

AguirreNicolas requested review from haileyschoelkopf and lintangsutawika as code owners May 6, 2024 17:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vllm get tokenizer #1794

Vllm get tokenizer #1794

AguirreNicolas commented May 6, 2024

CLAassistant commented May 6, 2024 •

edited

Loading

Vllm get tokenizer #1794

Are you sure you want to change the base?

Vllm get tokenizer #1794

Conversation

AguirreNicolas commented May 6, 2024

Issue:

Features:

CLAassistant commented May 6, 2024 • edited Loading

CLAassistant commented May 6, 2024 •

edited

Loading