
AutoTokenizer error when loading the llama3 70B model #3348

Closed
1 task done
ArcherShirou opened this issue Apr 19, 2024 · 8 comments
Labels
solved This problem has been already solved

Comments

@ArcherShirou

Reminder

  • I have read the README and searched the existing issues.

Reproduction

(screenshot of the error; not included in this text export)
How can I resolve this?

Expected behavior

No response

System Info

No response

Others

No response

@ArcherShirou ArcherShirou changed the title from "Autotokenizer error when loading the llama3 70B model" to "AutoTokenizer error when loading the llama3 70B model" Apr 19, 2024
@hiyouga
Owner

hiyouga commented Apr 19, 2024

The error message is incomplete.

@hiyouga hiyouga added the pending This problem is yet to be addressed label Apr 19, 2024
@ArcherShirou
Author

Hello, here is the full error message:
(screenshot of the traceback; not included in this text export)
tokenizers 0.19.1
torch 2.2.2
transformers 4.40.0
When running vllm inference with:
export CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7
API_PORT=8000 python src/api_demo.py \
    --model_name_or_path Meta-Llama-3-70B-Instruct \
    --template llama3 \
    --infer_backend vllm \
    --vllm_enforce_eager
the same error occurs:
(screenshot of the traceback; not included in this text export)

@hiyouga
Owner

hiyouga commented Apr 19, 2024

The model files are incomplete.

@hiyouga hiyouga added solved This problem has been already solved and removed pending This problem is yet to be addressed labels Apr 19, 2024
@hiyouga hiyouga closed this as completed Apr 19, 2024
@ArcherShirou
Author

Hello, no model files are missing; inference works with the official vllm:
(screenshot; not included in this text export)

@hiyouga
Owner

hiyouga commented Apr 19, 2024

Update the LLaMA-Factory code.

hiyouga added a commit that referenced this issue Apr 20, 2024
@ArcherShirou
Author

ArcherShirou commented Apr 22, 2024

Found the real cause: the officially uploaded model files are incomplete. The tokenizer_config.json file is missing. Copying tokenizer_config.json from llama-8b-instruct into llama-70b-instruct resolves the tokenizer error.
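The workaround above can be sketched as a single copy command. The directory names below are assumptions for illustration; substitute the paths where your 8B and 70B checkpoints are actually downloaded:

```shell
# Copy the tokenizer_config.json shipped with the 8B Instruct checkpoint
# into the 70B Instruct checkpoint directory, where it is missing.
# Paths are illustrative; adjust to your local download locations.
cp Meta-Llama-3-8B-Instruct/tokenizer_config.json \
   Meta-Llama-3-70B-Instruct/tokenizer_config.json
```

After the copy, AutoTokenizer can resolve the tokenizer class for the 70B model the same way it does for the 8B model.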

@luolanfeixue

> Found the real cause: the officially uploaded model files are incomplete. The tokenizer_config.json file is missing. Copying tokenizer_config.json from llama-8b-instruct into llama-70b-instruct resolves the tokenizer error.

That is not necessary; updating the transformers library to 4.41.0 also fixes it.

@luolanfeixue

Change
LlamaTokenizer.from_pretrained(model_name_or_path, **tokenizer_kwargs)
to
AutoTokenizer.from_pretrained(model_name_or_path, **tokenizer_kwargs)
