RLHF-Reward-Modeling
RLHF-Reward-Modeling is an open-source tool that provides recipes for training reward models for Reinforcement Learning from Human Feedback (RLHF). It includes techniques such as Bradley-Terry and pairwise preference models.
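For readers unfamiliar with the Bradley-Terry approach named above, here is a minimal sketch of the objective such a reward model is typically trained on. It is not taken from the tool itself: the function names `bradley_terry_prob` and `bradley_terry_loss` are hypothetical, and the scalar rewards stand in for the outputs of a real model scoring a preferred ("chosen") and a dispreferred ("rejected") response.

```python
import math


def bradley_terry_prob(reward_chosen: float, reward_rejected: float) -> float:
    """Probability that the chosen response is preferred: sigmoid(r_chosen - r_rejected)."""
    diff = reward_chosen - reward_rejected
    # Branch on the sign so math.exp never receives a large positive argument.
    if diff >= 0:
        return 1.0 / (1.0 + math.exp(-diff))
    e = math.exp(diff)
    return e / (1.0 + e)


def bradley_terry_loss(reward_chosen: float, reward_rejected: float) -> float:
    """Negative log-likelihood of the observed preference: -log(sigmoid(diff))."""
    diff = reward_chosen - reward_rejected
    # -log(sigmoid(diff)) = log(1 + exp(-diff)), written in a numerically stable form.
    return max(-diff, 0.0) + math.log1p(math.exp(-abs(diff)))


if __name__ == "__main__":
    # Hypothetical reward scores for one preference pair.
    print(bradley_terry_prob(1.5, 0.2))
    print(bradley_terry_loss(1.5, 0.2))
```

The loss shrinks as the model scores the chosen response further above the rejected one, which is what pushes the reward model to agree with the human preference labels.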