HALOs
Visit ToolHALOs is an open-source library for aligning Large Language Models (LLMs) using various human-aware loss functions like DPO, KTO, PPO, and ORPO. It offers modular, extensible, and simple implementations for training and evaluation.
At a glance
Trending