[BUG] wandb job not created during the PPO stage #1026
Comments
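After calling wandb.init in the workflow code, using wandb.log in the custom trainer does create a run and sync the logs, but it creates one run per GPU. Shouldn't the logged result be aggregated across all GPUs?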
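One common way to avoid one run per GPU is to log from the main process only. Below is a minimal sketch, assuming the launcher (for example torchrun) sets the `RANK` environment variable; `is_main_process` and `log_on_main` are hypothetical helper names, not part of trl or this project. Note that this logs rank 0's values only; truly aggregated stats would additionally need a gather or reduce step across ranks.

```python
import os
from typing import Callable, Dict


def is_main_process() -> bool:
    """Return True on global rank 0, or when RANK is unset (single process).

    Assumption: the distributed launcher exports RANK for every worker.
    """
    return int(os.environ.get("RANK", "0")) == 0


def log_on_main(log_fn: Callable[[Dict[str, float]], None],
                metrics: Dict[str, float]) -> bool:
    """Call log_fn(metrics) only on the main process.

    Returns True if the metrics were logged, False if this rank skipped them.
    """
    if not is_main_process():
        return False
    log_fn(metrics)
    return True
```

In a trainer this could be used as `log_on_main(wandb.log, stats)`, with the `wandb.init` call guarded by `is_main_process()` in the same way.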
trl has its own logging parameters. First, you need to enable logging with wandb in
Next, you should add some code in trainer.py to log the PPO stats:
Finally, you should adjust a few details to avoid errors, because when you use wandb, trl converts the stats from scalars to numpy values.
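The exact changes are not shown here, so the following is only a sketch of one way to deal with that last point: turning array-like stat values back into plain Python values before passing them to your own logging code. `to_loggable` is a hypothetical helper, and it assumes the converted values expose a `tolist()` method, as numpy arrays and numpy scalars do.

```python
from typing import Any, Dict


def to_loggable(stats: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of stats with array-like values converted to plain Python.

    Values with a tolist() method (assumed: numpy arrays/scalars) are unpacked;
    plain ints and floats become float; everything else is passed through.
    """
    out: Dict[str, Any] = {}
    for key, value in stats.items():
        if hasattr(value, "tolist"):
            value = value.tolist()
        # bool is a subclass of int, so exclude it explicitly.
        if isinstance(value, (int, float)) and not isinstance(value, bool):
            value = float(value)
        out[key] = value
    return out
```

The original stats dict is left unmodified, so it can still be handed to trl's own logging afterwards.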
I raised this issue before, and the solution was the same as the one above, but it has never been fixed.
The training script is as follows:
Logs are displayed normally, but wandb does not create a run or sync the logs.