微调llava1.5时报错图片数量不匹配，猜测llava_plugin中img_token的处理存在问题 #5344

wwwbq · 2024-09-03T15:37:41Z

我理解的LlavaPlugin中处理img_token的逻辑为（1）找到content中所有即对应的img token并且替换为{{image}}（2）把每个{{image}}替换为image_seqlen个img token。在llava中img token的数量为576，即image_seqlen为576。如果content中只有1个img token，那么经过LlavaPlugin的处理后，content会带有576个img token。但是查看LlavaForConditionalGeneration的源码发现，其实模型假定的是只会输入一个img token，找到img token位置后再一次性把图片的所有576个token插入进来，这样的话就和llama-factory源码的逻辑不太符合？

实际上我遇到的报错为：
File "/root/miniconda3/lib/python3.10/site-packages/transformers/models/llava/modeling_llava.py", line 339, in _merge_input_ids_with_image_features raise ValueError( ValueError: The input provided to the model are wrong. The number of image tokens is 576 while the number of image given to the model is 1. This prevents correct indexing and breaks batch generation.
或许可以理解为图片输入了576个img token，理应对应576张图片，但是数据集只输入了一张图，所以报错。在更新qwen-vl以前似乎img token的处理和现在的llama factory版本不太一样，是为了兼容qwen-vl吗

The text was updated successfully, but these errors were encountered:

Yangruipis · 2024-09-03T15:51:59Z

遇到了同样的问题

wwwbq · 2024-09-03T15:57:00Z

遇到了同样的问题

我正在尝试在LlavaPlugin的process_message中什么也不做，只比较图片数量和img token数量是不是相同，理论上这样是可行的，以前的llama factory貌似也是类似的逻辑

hiyouga · 2024-09-03T19:04:35Z

需要更新 transformers 至最新的 4.35.0.dev0
pip install git+https://github.com/huggingface/transformers.git

iie-ycx · 2024-09-05T07:46:30Z

这个问题咋解决呀，一直改不好

iie-ycx · 2024-09-05T07:47:08Z

遇到了同样的问题

我正在尝试在LlavaPlugin的process_message中什么也不做，只比较图片数量和img token数量是不是相同，理论上这样是可行的，以前的llama factory貌似也是类似的逻辑

想请教下大佬咋弄的

wwwbq · 2024-09-05T10:26:48Z

这个问题咋解决呀，一直改不好

将llava plugin中的message["content"] = content.replace("{{image}}", self.image_token * image_seqlen) 换成message["content"] = content.replace("{{image}}", IMAGE_PLACEHOLDER)就行，前者是插入image_seqlen个img token，后者是插入一个。

或者transformers升级到4.35.0.dev0，但是这个我还没测试

wangranran-neu · 2024-10-11T06:50:41Z

不知道大家是什么情况，报了这个错后，发现是content中的IMAGE_PLACEHOLDER写少了，用了几张图片，就应该写几个，改了之后正确执行了

xiehust · 2024-10-21T11:31:15Z

我也碰到同样的问题 transfomers 4.45。
按照上面那位兄弟的办法修改之后能work：

将llava plugin中的message["content"] = content.replace("{{image}}", self.image_token * image_seqlen) 换成message["content"] = content.replace("{{image}}", IMAGE_PLACEHOLDER)就行，前者是插入image_seqlen个img token，后者是插入一个。

zsworld6 · 2024-10-21T11:58:52Z

我也碰到同样的问题 transfomers 4.45。按照上面那位兄弟的办法修改之后能work：

将llava plugin中的message["content"] = content.replace("{{image}}", self.image_token * image_seqlen) 换成message["content"] = content.replace("{{image}}", IMAGE_PLACEHOLDER)就行，前者是插入image_seqlen个img token，后者是插入一个。

可以具体描述一下做法吗

zsworld6 · 2024-10-21T11:59:51Z

将llava plugin中的message["content"] = content.replace("{{image}}", self.image_token * image_seqlen) 换成message["content"] = content.replace("{{image}}", IMAGE_PLACEHOLDER)就行，前者是插入image_seqlen个img token，后者是插入一个

可以具体描述一下做法吗

github-actions bot added the pending This problem is yet to be addressed label Sep 3, 2024

hiyouga added solved This problem has been already solved and removed pending This problem is yet to be addressed labels Sep 3, 2024

hiyouga closed this as completed in d41d43a Sep 3, 2024

yuwangnexusera pushed a commit to yuwangnexusera/LLaMA-Factory that referenced this issue Sep 5, 2024

fix hiyouga#5344

628890f

yuwangnexusera pushed a commit to yuwangnexusera/LLaMA-Factory that referenced this issue Sep 5, 2024

fix hiyouga#5344

7a39aa6

yuwangnexusera pushed a commit to yuwangnexusera/LLaMA-Factory that referenced this issue Sep 5, 2024

fix hiyouga#5344

d0b6cac

yuwangnexusera pushed a commit to yuwangnexusera/LLaMA-Factory that referenced this issue Sep 5, 2024

fix hiyouga#5344

47a6105

yuwangnexusera pushed a commit to yuwangnexusera/LLaMA-Factory that referenced this issue Sep 5, 2024

fix hiyouga#5344

d5ebb20

yuwangnexusera pushed a commit to yuwangnexusera/LLaMA-Factory that referenced this issue Sep 5, 2024

fix hiyouga#5344

a070d7c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

微调llava1.5时报错图片数量不匹配，猜测llava_plugin中img_token的处理存在问题 #5344

微调llava1.5时报错图片数量不匹配，猜测llava_plugin中img_token的处理存在问题 #5344

wwwbq commented Sep 3, 2024 •

edited

Loading

Yangruipis commented Sep 3, 2024

wwwbq commented Sep 3, 2024

hiyouga commented Sep 3, 2024

iie-ycx commented Sep 5, 2024

iie-ycx commented Sep 5, 2024

wwwbq commented Sep 5, 2024

wangranran-neu commented Oct 11, 2024

xiehust commented Oct 21, 2024

zsworld6 commented Oct 21, 2024

zsworld6 commented Oct 21, 2024

微调llava1.5时报错图片数量不匹配，猜测llava_plugin中img_token的处理存在问题 #5344

微调llava1.5时报错图片数量不匹配，猜测llava_plugin中img_token的处理存在问题 #5344

Comments

wwwbq commented Sep 3, 2024 • edited Loading

Yangruipis commented Sep 3, 2024

wwwbq commented Sep 3, 2024

hiyouga commented Sep 3, 2024

iie-ycx commented Sep 5, 2024

iie-ycx commented Sep 5, 2024

wwwbq commented Sep 5, 2024

wangranran-neu commented Oct 11, 2024

xiehust commented Oct 21, 2024

zsworld6 commented Oct 21, 2024

zsworld6 commented Oct 21, 2024

wwwbq commented Sep 3, 2024 •

edited

Loading