You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This issue may be fixed in #6628 . However, we have observed another issue in the latest version of transformers, we will merge #6628 after the next transformers release.
Reminder
System Info
llamafactory
version: 0.9.1Reproduction
Others
由于transformers在4.46后的版本更新了梯度累计的计算公式,其需要额外计算num_items_in_batch来进行梯度累计计算。但当sft训练时,如果epoch>1,且训练数据条数不能正好整除梯度累计数时,其在跨越epoch的时候num_items_in_batch的计算存在问题,这直接影响了最后的loss计算,同时我们测试了transformers最新版本中该问题已经被修复,但llama-factory无法兼容到4.48的版本,有什么好的解决方案嘛
The text was updated successfully, but these errors were encountered: