Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GLM4-9B-chat模型推理报错 #4917

Closed
1 task done
shatang123 opened this issue Jul 21, 2024 · 5 comments
Closed
1 task done

GLM4-9B-chat模型推理报错 #4917

shatang123 opened this issue Jul 21, 2024 · 5 comments
Labels
solved This problem has been already solved

Comments

@shatang123
Copy link

Reminder

  • I have read the README and searched the existing issues.

System Info

llamafactory version: 0.8.4.dev0

  • Platform: Linux-3.10.0-1127.19.1.el7.x86_64-x86_64-with-glibc2.17
  • Python version: 3.10.9
  • PyTorch version: 2.3.1+cu121 (GPU)
  • Transformers version: 4.41.2
  • Datasets version: 2.20.0
  • Accelerate version: 0.32.1
  • PEFT version: 0.11.1
  • TRL version: 0.9.6
  • GPU type: NVIDIA A100-SXM4-80GB

Reproduction

代码版本:LLaMA-Factory代码和GLM4-9B-chat模型是7-21日拉取的。
报错日志:too many values to unpack (expected 2)
image

依赖版本
transformers==4.41.2
image

目前GLM4-9B-chat官方仓库中要求transformers==4.42.4,而LLaMA-Factory安装该版本后提示版本不兼容
image

目前不知道如何解决

Expected behavior

No response

Others

No response

@github-actions github-actions bot added the pending This problem is yet to be addressed label Jul 21, 2024
@DentistCode
Copy link

+1,同样的问题

@0zyl
Copy link

0zyl commented Jul 22, 2024

+1,急求解决办法
0007310E

@hiyouga
Copy link
Owner

hiyouga commented Jul 22, 2024

fixed

@hiyouga hiyouga added solved This problem has been already solved and removed pending This problem is yet to be addressed labels Jul 22, 2024
@reyvan14
Copy link

+1

@AlexYoung757
Copy link

修改glm4模型的modeling_chatglm.py文件中的
model_kwargs["past_key_values"] = self._extract_past_from_model_output(
outputs, standardize_cache_format=standardize_cache_format
)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
solved This problem has been already solved
Projects
None yet
Development

No branches or pull requests

6 participants