[Bugfix] Fix PP for ChatGLM and Molmo, and weight loading for Qwen2.5-Math-RM #9422

DarkLight1337 · 2024-10-16T13:49:02Z

#9242 accidentally removed PP support from ChatGLM. On the other hand, MolmoModel already supports PP but the top-level MolmoForCausalLM does not. This PR fixes both of these issues.

The input processors of these models and Qwen2-VL have also been cleaned up a bit.

Also fixes #9090 (comment)

github-actions · 2024-10-16T13:49:16Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

Add ready label to the PR
Enable auto-merge.

🚀

DarkLight1337 · 2024-10-17T03:54:18Z

Sorry for breaking your code! @lixiaolx can you test out this PR and see if you can load Qwen/Qwen2.5-Math-RM-72B now?

Isotr0py

GLM4-V and Molmo PP tests all passed on my devices. LGTM once Qwen2-RM is also confirmed to work.

lixiaolx · 2024-10-17T07:14:27Z

Sorry for breaking your code! @lixiaolx can you test out this PR and see if you can load Qwen/Qwen2.5-Math-RM-72B now?

@DarkLight1337 , I tried adding lm_head filtering to the model.py file of qwen_rm.
loader = AutoWeightsLoader(self,skip_prefixes=["lm_head"])
Now I can load the model, but the final results of my test reference are all nan #8896. I tried printing the weights of the model, but it seemed that the weights were not loaded correctly.

add print code in
https://github.com/vllm-project/vllm/blob/main/vllm/model_executor/model_loader/loader.py#L413
for name, param in model.named_parameters(): print(f"Parameter '{name}' =====tensor=======: {param}")

DarkLight1337 · 2024-10-17T08:16:56Z

Sorry for breaking your code! @lixiaolx can you test out this PR and see if you can load Qwen/Qwen2.5-Math-RM-72B now?

@DarkLight1337 , I tried adding lm_head filtering to the model.py file of qwen_rm.
loader = AutoWeightsLoader(self,skip_prefixes=["lm_head"])
Now I can load the model, but the final results of my test reference are all nan #8896. I tried printing the weights of the model, but it seemed that the weights were not loaded correctly.

add print code in
https://github.com/vllm-project/vllm/blob/main/vllm/model_executor/model_loader/loader.py#L413
for name, param in model.named_parameters(): print(f"Parameter '{name}' =====tensor=======: {param}")

Can you add print statements inside AutoWeightLoader and list out the weight names in the iterator? This should provide enough information for me to figure out the discrepancies and fix them.

DarkLight1337 · 2024-10-21T05:18:01Z

Sorry for breaking your code! @lixiaolx can you test out this PR and see if you can load Qwen/Qwen2.5-Math-RM-72B now?

@DarkLight1337 , I tried adding lm_head filtering to the model.py file of qwen_rm.
loader = AutoWeightsLoader(self,skip_prefixes=["lm_head"])
Now I can load the model, but the final results of my test reference are all nan #8896. I tried printing the weights of the model, but it seemed that the weights were not loaded correctly.
add print code in
https://github.com/vllm-project/vllm/blob/main/vllm/model_executor/model_loader/loader.py#L413
for name, param in model.named_parameters(): print(f"Parameter '{name}' =====tensor=======: {param}")

Can you add print statements inside AutoWeightLoader and list out the weight names in the iterator? This should provide enough information for me to figure out the discrepancies and fix them.

Any update? Otherwise, I'll merge this PR first and fix the model in another PR.

DarkLight1337 · 2024-10-24T04:13:43Z

Sorry for breaking your code! @lixiaolx can you test out this PR and see if you can load Qwen/Qwen2.5-Math-RM-72B now?

@DarkLight1337 , I tried adding lm_head filtering to the model.py file of qwen_rm.
loader = AutoWeightsLoader(self,skip_prefixes=["lm_head"])
Now I can load the model, but the final results of my test reference are all nan #8896. I tried printing the weights of the model, but it seemed that the weights were not loaded correctly.
add print code in
https://github.com/vllm-project/vllm/blob/main/vllm/model_executor/model_loader/loader.py#L413
for name, param in model.named_parameters(): print(f"Parameter '{name}' =====tensor=======: {param}")

Can you add print statements inside AutoWeightLoader and list out the weight names in the iterator? This should provide enough information for me to figure out the discrepancies and fix them.

Any update? Otherwise, I'll merge this PR first and fix the model in another PR.

I think I fixed the problem, so I'm going to merge this now. Please tell me and share the logs (set the environment variable VLLM_LOGGING_LEVEL=DEBUG) if it still doesn't work.

Signed-off-by: Alvant <[email protected]>

Signed-off-by: Erkin Sagiroglu <[email protected]>

Signed-off-by: Shanshan Wang <[email protected]>

Signed-off-by: qishuai <[email protected]>

Signed-off-by: NickLucche <[email protected]>

Signed-off-by: Sumit Dubey <[email protected]>

Signed-off-by: Maxime Fournioux <[email protected]>

Signed-off-by: Tyler Michael Smith <[email protected]>

DarkLight1337 added 2 commits October 16, 2024 13:46

Add PP to ChatGLM and Molmo

7ef61a9

Cleanup

318cbed

DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 16, 2024

DarkLight1337 requested review from Isotr0py and ywang96 October 16, 2024 13:49

DarkLight1337 requested a review from youkaichao as a code owner October 16, 2024 13:49

Fix typo

96d10aa

DarkLight1337 mentioned this pull request Oct 17, 2024

[Model] PP support for embedding models and update docs #9090

Merged

Fix Qwen2-RM

3760cae

Isotr0py approved these changes Oct 17, 2024

View reviewed changes

DarkLight1337 mentioned this pull request Oct 21, 2024

[Model] Support Qwen2.5-Math-RM-72B #8896

Merged

DarkLight1337 added 2 commits October 24, 2024 03:48

Debug weight loading

f5d1fb4

Merge branch 'main' into vlm-pp

5da62f8

DarkLight1337 enabled auto-merge (squash) October 24, 2024 04:13

DarkLight1337 changed the title ~~[Bugfix] Fix PP for ChatGLM and Molmo~~ [Bugfix] Fix PP for ChatGLM and Molmo, and Qwen2.5-Math-RM weight loading Oct 24, 2024

DarkLight1337 changed the title ~~[Bugfix] Fix PP for ChatGLM and Molmo, and Qwen2.5-Math-RM weight loading~~ [Bugfix] Fix PP for ChatGLM and Molmo, and weight loading for Qwen2.5-Math-RM Oct 24, 2024

DarkLight1337 merged commit 836e8ef into main Oct 24, 2024
61 checks passed

DarkLight1337 deleted the vlm-pp branch October 24, 2024 06:55

Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024

[Bugfix] Fix PP for ChatGLM and Molmo (vllm-project#9422)

a0ac193

Signed-off-by: Alvant <[email protected]>

MErkinSag pushed a commit to MErkinSag/vllm that referenced this pull request Oct 26, 2024

[Bugfix] Fix PP for ChatGLM and Molmo (vllm-project#9422)

cee6f39

Signed-off-by: Erkin Sagiroglu <[email protected]>

DarkLight1337 mentioned this pull request Oct 28, 2024

[Bug]: Loading qwen2.5-math-rm-72b encountered an exception #9755

Closed

1 task

cooleel pushed a commit to cooleel/vllm that referenced this pull request Oct 28, 2024

[Bugfix] Fix PP for ChatGLM and Molmo (vllm-project#9422)

1dfdc13

Signed-off-by: Shanshan Wang <[email protected]>

cooleel pushed a commit to cooleel/vllm that referenced this pull request Oct 28, 2024

[Bugfix] Fix PP for ChatGLM and Molmo (vllm-project#9422)

4df5378

Signed-off-by: Shanshan Wang <[email protected]>

FerdinandZhong pushed a commit to FerdinandZhong/vllm that referenced this pull request Oct 29, 2024

[Bugfix] Fix PP for ChatGLM and Molmo (vllm-project#9422)

f96b436

Signed-off-by: qishuai <[email protected]>

NickLucche pushed a commit to NickLucche/vllm that referenced this pull request Oct 31, 2024

[Bugfix] Fix PP for ChatGLM and Molmo (vllm-project#9422)

e8a2a1b

Signed-off-by: NickLucche <[email protected]>

NickLucche pushed a commit to NickLucche/vllm that referenced this pull request Oct 31, 2024

[Bugfix] Fix PP for ChatGLM and Molmo (vllm-project#9422)

914ddb0

Signed-off-by: NickLucche <[email protected]>

Isotr0py mentioned this pull request Nov 14, 2024

[Misc] Add uninitialized params tracking for AutoWeightsLoader #10327

Merged

sumitd2 pushed a commit to sumitd2/vllm that referenced this pull request Nov 14, 2024

[Bugfix] Fix PP for ChatGLM and Molmo (vllm-project#9422)

ddaf258

Signed-off-by: Sumit Dubey <[email protected]>

KuntaiDu pushed a commit to KuntaiDu/vllm that referenced this pull request Nov 20, 2024

[Bugfix] Fix PP for ChatGLM and Molmo (vllm-project#9422)

55f5803

mfournioux pushed a commit to mfournioux/vllm that referenced this pull request Nov 20, 2024

[Bugfix] Fix PP for ChatGLM and Molmo (vllm-project#9422)

f58fc4d

Signed-off-by: Maxime Fournioux <[email protected]>

tlrmchlsmth pushed a commit to neuralmagic/vllm that referenced this pull request Nov 23, 2024

[Bugfix] Fix PP for ChatGLM and Molmo (vllm-project#9422)

9dcadc6

Signed-off-by: Tyler Michael Smith <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bugfix] Fix PP for ChatGLM and Molmo, and weight loading for Qwen2.5-Math-RM #9422

[Bugfix] Fix PP for ChatGLM and Molmo, and weight loading for Qwen2.5-Math-RM #9422

DarkLight1337 commented Oct 16, 2024 •

edited

Loading

github-actions bot commented Oct 16, 2024

DarkLight1337 commented Oct 17, 2024

Isotr0py left a comment

lixiaolx commented Oct 17, 2024 •

edited

Loading

DarkLight1337 commented Oct 17, 2024 •

edited

Loading

DarkLight1337 commented Oct 21, 2024

DarkLight1337 commented Oct 24, 2024 •

edited

Loading

[Bugfix] Fix PP for ChatGLM and Molmo, and weight loading for Qwen2.5-Math-RM #9422

[Bugfix] Fix PP for ChatGLM and Molmo, and weight loading for Qwen2.5-Math-RM #9422

Conversation

DarkLight1337 commented Oct 16, 2024 • edited Loading

github-actions bot commented Oct 16, 2024

DarkLight1337 commented Oct 17, 2024

Isotr0py left a comment

Choose a reason for hiding this comment

lixiaolx commented Oct 17, 2024 • edited Loading

DarkLight1337 commented Oct 17, 2024 • edited Loading

DarkLight1337 commented Oct 21, 2024

DarkLight1337 commented Oct 24, 2024 • edited Loading

DarkLight1337 commented Oct 16, 2024 •

edited

Loading

lixiaolx commented Oct 17, 2024 •

edited

Loading

DarkLight1337 commented Oct 17, 2024 •

edited

Loading

DarkLight1337 commented Oct 24, 2024 •

edited

Loading