Reward-Bench
Visit ToolRewardBench is an open-source benchmark designed to evaluate the capabilities and safety of reward models, including those trained with Direct Preference Optimization (DPO). It provides common inference code, dataset formatting, and analysis tools for fair reward model assessment.
At a glance
Trending