ValueError: Attempting to unscale FP16 gradients. #1764
Comments
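I ran into this problem too, also after making the training dataset a bit larger. I'm using chatglm3.
Same here when scaling up the train set.
+1
I added my own dataset after alpaca_zh.
I'm using a server provided by modelscope, and I have to reinstall the dependencies every time I log in. Is it the same for everyone?
I'm running on my own local machine.
Previously I always created a separate JSON file myself, but that was on a much earlier version.
I created my own JSON file. On Tuesday, with a dataset of about 50,000 records, training worked; on Thursday I expanded the dataset to over 70,000 records and the problem above appeared.
I train on RunPod cloud GPUs, and yes, I have to reinstall the dependencies every time.
It is unlikely to be a dataset problem; it looks more like a dependency version issue.
You are right; testing with a smaller dataset still does not work.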
Please provide your system info.
Linux CentOS 7, torch 1.13.1
Ubuntu 22.04, PyTorch 2.0.1
We recommend using peft==0.6.0
Thank you for your reply, the error has been resolved.
Thanks for the answer. After switching to peft==0.6.0 it runs.
Thank you for replying.
Thank you very much, I was able to solve the problem.
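Since several people here reinstall dependencies on every session, a quick check of the installed peft version before training may help. The following is only a minimal sketch using the standard-library `importlib.metadata`; the helper names `installed_version` and `matches_recommended` are hypothetical and not part of this project, and the pinned value is simply the one recommended above.

```python
from importlib import metadata

# Version recommended by the maintainers in this thread.
RECOMMENDED_PEFT = "0.6.0"


def installed_version(package):
    """Return the installed version string of `package`, or None if it is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def matches_recommended(version, recommended=RECOMMENDED_PEFT):
    """Return True only if `version` is exactly the recommended version string."""
    return version == recommended


# Hypothetical pre-training check: report whether the environment matches the pin.
found = installed_version("peft")
if found is None:
    print("peft is not installed; try: pip install peft==" + RECOMMENDED_PEFT)
elif not matches_recommended(found):
    print("peft " + found + " is installed; this thread recommends peft==" + RECOMMENDED_PEFT)
else:
    print("peft " + found + " matches the recommended version")
```

On environments that are rebuilt each session (modelscope, RunPod), pinning the version in the install step, for example `pip install peft==0.6.0`, avoids silently picking up a newer release.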
I ran this command.
It worked fine when I used it yesterday, but after I changed the dataset size today this problem appeared. What happened?