You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When running the training code in the rlhflow environment, I encountered a TypeError with the message: DPOTrainer.init() got an unexpected keyword argument 'beta'. It seems like there might be an issue with the compatibility of the arguments in the DPOTrainer initialization.
Could you provide information on the specific version of trl used in the rlhflow environment to investigate this issue further?
Thank you for your support and guidance. Your assistance in clarifying this matter is greatly appreciated.
Warm regards,
Yiju Guo
The text was updated successfully, but these errors were encountered:
When running the training code in the rlhflow environment, I encountered a TypeError with the message: DPOTrainer.init() got an unexpected keyword argument 'beta'. It seems like there might be an issue with the compatibility of the arguments in the DPOTrainer initialization.
Could you provide information on the specific version of trl used in the rlhflow environment to investigate this issue further?
Thank you for your support and guidance. Your assistance in clarifying this matter is greatly appreciated.
Warm regards,
Yiju Guo
The text was updated successfully, but these errors were encountered: