Skip to content
View hanyang1999's full-sized avatar

Highlights

  • Pro

Block or report hanyang1999

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. Preference-Tuning-with-Human-Feedback Preference-Tuning-with-Human-Feedback Public

    Githun Repo for “Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey”

    4

  2. RainbowPO RainbowPO Public

    Implementation of RainbowPO based on TRL

    Python 1

  3. Improved-RLHF-for-Diffusion-Models Improved-RLHF-for-Diffusion-Models Public

    Code implementation for "Improved techniques in RLHF for Diffusion Models"

    Python 2