Skip to content

Issues: hiyouga/LLaMA-Factory

🚨FAQs | 常见问题🚨
#4614 opened Jun 28, 2024 by hiyouga
Open
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[HELP] Does this repo support Llama-2-7b or 13b to sft or predict? pending This problem is yet to be addressed
#6115 opened Nov 23, 2024 by ZijunSong
1 task done
多机训练的训练速度和单机一样 pending This problem is yet to be addressed
#6111 opened Nov 22, 2024 by Wiselnn570
1 task done
再次求助。。wandb断点延伸曲线 pending This problem is yet to be addressed
#6110 opened Nov 22, 2024 by Saberlve
1 task done
求助:模型qwen2.5-7b-instruct全量sft的时候,训练过程中loss突然变为0。 pending This problem is yet to be addressed
#6109 opened Nov 22, 2024 by Chtholly1
1 task done
mllm 数据格式如果存在问题,如何设置忽略该样本 pending This problem is yet to be addressed
#6096 opened Nov 21, 2024 by DietDietDiet
1 task done
LoRA微调Qwen2-VL-2B时,Loss一直为0,grad_norm为nan pending This problem is yet to be addressed
#6092 opened Nov 20, 2024 by Tian-ye1214
1 task done
BAdam算法finetune的迭代轮次和论文不是很符合 pending This problem is yet to be addressed
#6088 opened Nov 20, 2024 by PhzCode
1 task done
训练参数以及训练时间疑问求解 pending This problem is yet to be addressed
#6087 opened Nov 20, 2024 by Beyond0831
1 task done
Maybe memory leak leak occurs after evaluation when using enable_liger_kernel. pending This problem is yet to be addressed
#6085 opened Nov 20, 2024 by upskyy
1 task done
关于 llamafactory-cli train 和 torchrun 训练耗费时间以及效果均不同的疑惑 pending This problem is yet to be addressed
#6080 opened Nov 19, 2024 by Maydaytyh
1 task done
昇腾910B3 两机16卡 lora sft Qwen2-72b报OOM npu This problem is related to NPU devices pending This problem is yet to be addressed
#6074 opened Nov 19, 2024 by hangxu124
1 task done
晟腾910b训练多卡报错 npu This problem is related to NPU devices pending This problem is yet to be addressed
#6072 opened Nov 19, 2024 by Ottomachine1
1 task done
无法在生成的 generated_predictions.jsonl 中保留额外字段并丢失 <image> 标记 pending This problem is yet to be addressed
#6070 opened Nov 19, 2024 by enerai
1 task done
损失函数阶段式下降 pending This problem is yet to be addressed
#6069 opened Nov 19, 2024 by Fan0fan
1 task done
function call 模型能支持流式输出吗? pending This problem is yet to be addressed
#6063 opened Nov 18, 2024 by SafeCool
1 task done
[Bug]: assert len(indices) == len(inputs) with Qwen/Qwen2-VL-2B-Instruct pending This problem is yet to be addressed
#6062 opened Nov 18, 2024 by sssunXw
1 task done
无缘无故被kill掉了,大佬能帮忙看看吗 pending This problem is yet to be addressed
#6058 opened Nov 18, 2024 by 1615070057
1 task done
量化qwen2.5-32b时出错,但7b没问题 pending This problem is yet to be addressed
#6048 opened Nov 16, 2024 by czhcc
1 task done
ProTip! no:milestone will show everything without a milestone.