
[Bug]: load_weights() does not work for RobertaModel embeddings since weights start with "roberta." #11821

Closed
chmeyers opened this issue Jan 8, 2025 · 5 comments · Fixed by #11940
Labels: bug (Something isn't working)

Comments


chmeyers commented Jan 8, 2025

Your current environment

No response

Model Input Dumps

No response

🐛 Describe the bug

The RobertaEmbeddingModel here:

return BertModel(vllm_config=vllm_config,

just uses the base BertModel class, so when model.load_weights() is called the checkpoint's parameter names (which start with "roberta.") don't match the model's params_dict, leading to this stack trace:

File "/home/ray/anaconda3/lib/python3.10/site-packages/vllm/model_executor/models/bert.py", line 448, in load_weights
self.model.load_weights(weights)
File "/home/ray/anaconda3/lib/python3.10/site-packages/vllm/model_executor/models/bert.py", line 394, in load_weights
param = params_dict[name]
KeyError: 'roberta.embeddings.LayerNorm.weight'

I think it should rename the weights to strip the "roberta." prefix, similar to

def weight_filter():
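
For illustration, a minimal sketch of that renaming, assuming the Roberta wrapper delegates to an inner BertModel via self.model as in the stack trace above (rename_roberta_weights is a hypothetical helper, not vLLM's actual fix):

from typing import Iterable, Tuple

import torch

def rename_roberta_weights(
        weights: Iterable[Tuple[str, torch.Tensor]],
) -> Iterable[Tuple[str, torch.Tensor]]:
    # Strip the checkpoint's "roberta." prefix so names line up with
    # BertModel's params_dict; names without the prefix pass through unchanged.
    prefix = "roberta."
    for name, tensor in weights:
        yield (name[len(prefix):] if name.startswith(prefix) else name,
               tensor)

# The wrapper's load_weights could then call:
#     self.model.load_weights(rename_roberta_weights(weights))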

DarkLight1337 (Member)

cc @maxdebayser

chmeyers (Author) commented Jan 8, 2025

Script for Repro:

import os

import torch
from torch import nn
from huggingface_hub import snapshot_download
from safetensors import safe_open


from vllm import EngineArgs, LLMEngine
from vllm.config import (LoadConfig, ModelConfig, VllmConfig)
from vllm.model_executor.model_loader.loader import _initialize_model
from vllm.model_executor.model_loader.utils import set_default_torch_dtype
from vllm.model_executor.model_loader import BaseModelLoader

DOWNLOAD_PATTERN = ["*.json", "*.py", "*.safetensors", "*.txt", "*.model"]
model_dir = snapshot_download("FacebookAI/roberta-base", allow_patterns=DOWNLOAD_PATTERN)

class WeightsLoader(BaseModelLoader):
    def __init__(self, load_config: LoadConfig):
        super().__init__(load_config)

    def download_model(self, model_config: ModelConfig) -> None:
        pass

    def load_model(self, *, vllm_config: VllmConfig) -> nn.Module:
        device_config = vllm_config.device_config
        model_config = vllm_config.model_config

        with set_default_torch_dtype(model_config.dtype):
            with torch.device(device_config.device):
                model = _initialize_model(vllm_config=vllm_config)
            safetensorfile = os.path.join(model_dir, "model.safetensors")
            with safe_open(safetensorfile, framework="pt", device="cpu") as f:
                for name in f.keys():
                    buf = f.get_tensor(name)
                    print(name)
                    model.load_weights([(name, buf)])

        return model.eval()

engine_args = EngineArgs(model=model_dir, load_format=WeightsLoader, device="cpu")
LLMEngine.from_engine_args(engine_args)
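
Note that this loader feeds model.load_weights() one (name, tensor) pair at a time, so the first checkpoint key missing from params_dict raises the KeyError immediately.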

chmeyers (Author) commented Jan 8, 2025

Actually, after wondering how it was working on the default load path, I realized that it wasn't working there, either.

So a simpler repro:

from huggingface_hub import snapshot_download
from vllm import EngineArgs, LLMEngine

DOWNLOAD_PATTERN = ["*.json", "*.py", "*.safetensors", "*.txt", "*.model"]
model_dir = snapshot_download("FacebookAI/roberta-base", allow_patterns=DOWNLOAD_PATTERN)

engine_args = EngineArgs(model=model_dir, device="cpu")
LLMEngine.from_engine_args(engine_args)

noooop (Contributor) commented Jan 9, 2025

@maxdebayser

  • FacebookAI/roberta-base: weight names without the "roberta." prefix

  • BAAI/bge-m3: weight names with the "roberta." prefix

Very dirty.
refer to

NickLucche (Contributor)

I can look into this
