ShypdShypd.ai

Safe-Rlhf

Visit Tool

safe-rlhf is an open-source framework for Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback. It provides a reproducible code pipeline for alignment research, supporting SFT, RLHF, and Safe RLHF training methods.

At a glance

Pricing
Open Source
Free tier
Yes
API
No
Skill level
Technical

Trending

      

Also listed in

This tool also appears in

Explore

Browse AI tools by category