PPO merge fails #4609
Labels: solved (This problem has been already solved)

Comments
After testing, it turns out the weights saved in the multi-GPU environment are corrupted, but I don't know why.

I ran into the same problem: the model weights saved from the PPO run are corrupted. I ran the same code on five machines and found that the environments that saved usable weights had CUDA 11, while the ones with corrupted weights had CUDA 12.

My CUDA version is 11.8, and my single-GPU and multi-GPU environments are identical.

PPO still has some issues; it runs for some people and fails for others.

Fixed.
hiyouga added the solved label and removed the pending label on Jul 3, 2024
It seems that saving adapter_model.safetensors in a multi-GPU environment is still broken.
xtchen96 pushed a commit to xtchen96/LLaMA-Factory that referenced this issue on Jul 17, 2024: unwrap_model_for_generation(reward_model) is necessary for zero3 training
Reminder
System Info
After PPO training, I want to merge the LoRA adapter into the base model.
```
06/28/2024 10:37:27 - INFO - llamafactory.model.model_utils.attention - Using vanilla attention implementation.
Traceback (most recent call last):
  File "/root/anaconda3/envs/llm/bin/llamafactory-cli", line 8, in <module>
    sys.exit(main())
  File "/root/workspace/project/llm/LLaMA-Factory/src/llamafactory/cli.py", line 87, in main
    export_model()
  File "/root/workspace/project/llm/LLaMA-Factory/src/llamafactory/train/tuner.py", line 73, in export_model
    model = load_model(tokenizer, model_args, finetuning_args)  # must after fixing tokenizer to resize vocab
  File "/root/workspace/project/llm/LLaMA-Factory/src/llamafactory/model/loader.py", line 160, in load_model
    model = init_adapter(config, model, model_args, finetuning_args, is_trainable)
  File "/root/workspace/project/llm/LLaMA-Factory/src/llamafactory/model/adapter.py", line 311, in init_adapter
    model = _setup_lora_tuning(
  File "/root/workspace/project/llm/LLaMA-Factory/src/llamafactory/model/adapter.py", line 191, in _setup_lora_tuning
    model: "LoraModel" = PeftModel.from_pretrained(model, adapter, **init_kwargs)
  File "/root/anaconda3/envs/llm/lib/python3.9/site-packages/peft/peft_model.py", line 430, in from_pretrained
    model.load_adapter(model_id, adapter_name, is_trainable=is_trainable, **kwargs)
  File "/root/anaconda3/envs/llm/lib/python3.9/site-packages/peft/peft_model.py", line 984, in load_adapter
    adapters_weights = load_peft_weights(model_id, device=torch_device, **hf_hub_download_kwargs)
  File "/root/anaconda3/envs/llm/lib/python3.9/site-packages/peft/utils/save_and_load.py", line 444, in load_peft_weights
    adapters_weights = safe_load_file(filename, device=device)
  File "/root/anaconda3/envs/llm/lib/python3.9/site-packages/safetensors/torch.py", line 311, in load_file
    with safe_open(filename, framework="pt", device=device) as f:
safetensors_rust.SafetensorError: Error while deserializing header: InvalidHeaderDeserialization
```
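The `InvalidHeaderDeserialization` error above means the `adapter_model.safetensors` file itself is malformed (for example, a truncated or zero-filled write from the multi-GPU save), not that PEFT is misconfigured. One way to check a suspect checkpoint without loading it into PEFT is to parse the safetensors header by hand: the format starts with an 8-byte little-endian header length followed by a JSON header. This is a diagnostic sketch; `check_safetensors_header` is a hypothetical helper, not part of LLaMA-Factory or the safetensors library:

```python
import json
import struct
from pathlib import Path

def check_safetensors_header(path):
    """Validate the header of a .safetensors file.

    Layout: 8-byte little-endian unsigned header length, then a JSON
    header of that length, then the raw tensor data. A partial or
    zero-filled write fails one of these checks, which is the same
    condition that surfaces as InvalidHeaderDeserialization at load time.
    """
    data = Path(path).read_bytes()
    if len(data) < 8:
        return False, "file shorter than the 8-byte length prefix"
    (header_len,) = struct.unpack("<Q", data[:8])
    if header_len == 0 or 8 + header_len > len(data):
        return False, f"header length {header_len} inconsistent with file size {len(data)}"
    try:
        header = json.loads(data[8 : 8 + header_len])
    except json.JSONDecodeError as exc:
        return False, f"header is not valid JSON: {exc}"
    tensors = [k for k in header if k != "__metadata__"]
    return True, f"{len(tensors)} tensors declared in header"
```

A healthy file reports its tensor count; the corrupted saves described in this thread typically fail at the header-length or JSON step, which localizes the problem to the save path rather than the merge step.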
Reproduction
```yaml
### model
model_name_or_path: model_zoos/shenzhi-wang/Llama3-8B-Chinese-Chat
adapter_name_or_path: saves/llama3-8b/lora/ppo_fdc/checkpoint-160
template: llama3
finetuning_type: lora

### export
export_dir: saves/llama3-8b/lora/ppo_fdc_model
export_size: 2
export_device: cpu
export_legacy_format: false
```
Expected behavior
No response
Others
No response