HALOs


HALOs is an open-source library for aligning Large Language Models (LLMs) using various human-aware loss functions like DPO, KTO, PPO, and ORPO. It offers modular, extensible, and simple implementations for training and evaluation.


At a glance

Pricing: Open Source
Free tier: Yes
API: not stated
Skill level: Technical

About

What is HALOs?

HALOs (Human-Aware Loss Functions) is a Python library for aligning Large Language Models (LLMs) with human preferences. It provides extensible implementations of popular alignment methods such as DPO, KTO, PPO, and ORPO. The design is modular, with dataloading, training, and sampling kept separate, and extensible, so users can quickly implement custom dataloaders or new alignment losses. The code is kept simple enough to hack on and has been tested with LLMs ranging from 1B to 30B parameters. HALOs supports LoRA training and caching of reference-model log probabilities to reduce memory use, and it integrates with Hydra for configuration and Accelerate for launching jobs with FSDP. The repository also includes scripts for evaluation with AlpacaEval and LMEval.

Best used for

Ideal for developers and data scientists who need to align LLMs using advanced human-aware loss functions like DPO, KTO, and ORPO. Especially valuable for researchers and practitioners looking for a modular and extensible framework to experiment with and evaluate different alignment methods.

Common actions

align LLMs

train LLMs

implement loss functions

evaluate LLMs

fine-tune LLMs


Capabilities

Key features

DPO, KTO, PPO, ORPO implementations
LoRA training support
Reference logit caching
Modular dataloading
Extensible loss functions
FSDP training support
AlpacaEval, LMEval integration

Target Audience

developer, data scientist

Integrations

Not yet documented

Pricing & Plans

Open Source

Free

FAQs

What alignment methods does HALOs support?

HALOs provides extensible implementations for several human-aware loss functions, including DPO (Direct Preference Optimization), KTO (Kahneman-Tversky Optimization), PPO (Proximal Policy Optimization), and ORPO (Odds Ratio Preference Optimization), among others.
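To make one of these losses concrete, here is a minimal sketch of the standard DPO objective for a single preference pair, written in plain Python on scalar sequence log-probabilities. It is not taken from the HALOs codebase; the function name dpo_loss, its argument names, and the default beta of 0.1 are assumptions chosen for illustration.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for one (chosen, rejected) pair of sequence log-probabilities.

    Hypothetical helper for illustration; not the HALOs API.
    """
    # Implicit rewards: scaled log-ratio of policy to reference for each response.
    chosen_reward = beta * (policy_chosen_logp - ref_chosen_logp)
    rejected_reward = beta * (policy_rejected_logp - ref_rejected_logp)
    margin = chosen_reward - rejected_reward

    # loss = -log(sigmoid(margin)) = log(1 + exp(-margin)),
    # computed in a form that avoids overflow for large |margin|.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))
```

When the policy and the reference assign identical log-probabilities, the margin is zero and the loss equals log 2. The loss falls below that value as the policy shifts probability toward the chosen response relative to the reference.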

Can I use HALOs for LoRA training?

Yes, HALOs supports LoRA (Low-Rank Adaptation) training. You can enable it by setting the 'use_peft' flag and customize the LoRA hyperparameters as needed. The LoRA weights are merged into the base model before the final model is saved.
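What "merging" means can be sketched with the standard LoRA formulation, in which the adapted weight is W + (alpha / r) * B A. The snippet below uses plain nested lists so it runs without dependencies. The names merge_lora and matmul are hypothetical, and this is not how HALOs itself performs the merge, which this listing does not describe.

```python
def matmul(a, b):
    """Multiply two matrices given as nested lists (rows of a by columns of b)."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def merge_lora(weight, lora_a, lora_b, alpha, rank):
    """Return the merged weight W + (alpha / rank) * B @ A.

    weight: out x in, lora_b: out x rank, lora_a: rank x in.
    Hypothetical helper for illustration; not the HALOs API.
    """
    scale = alpha / rank
    delta = matmul(lora_b, lora_a)  # low-rank update with the same shape as weight
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


# Toy example: a 2x2 identity weight with a rank-1 adapter.
merged = merge_lora(
    weight=[[1, 0], [0, 1]],
    lora_a=[[1, 2]],      # rank x in
    lora_b=[[1], [0]],    # out x rank
    alpha=2,
    rank=1,
)
```

After merging, the model carries a single dense weight per layer, so it can be saved and loaded like an ordinary checkpoint without the adapter matrices.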

How does HALOs help with memory efficiency during training?

HALOs allows you to precompute and cache the log probabilities of the reference model. This feature, enabled by setting 'cache_reference_logprobs=true', can substantially reduce memory consumption, especially when using the same reference model across multiple jobs.
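The idea behind caching can be sketched as follows: the reference model's log probability for each (prompt, completion) pair is computed once and then looked up, so the reference model presumably does not need to stay in memory during training. The class ReferenceLogprobCache and the callable it wraps are hypothetical and do not reflect the HALOs implementation or its on-disk cache format.

```python
import hashlib
import json


class ReferenceLogprobCache:
    """Memoize reference-model log probabilities per (prompt, completion) pair.

    Hypothetical sketch for illustration; not the HALOs API.
    """

    def __init__(self, reference_logprob_fn):
        # reference_logprob_fn(prompt, completion) -> float; assumed to be expensive.
        self._fn = reference_logprob_fn
        self._store = {}
        self.misses = 0  # number of times the reference model was actually called

    @staticmethod
    def _key(prompt, completion):
        # Stable key derived from the example's text.
        payload = json.dumps([prompt, completion]).encode("utf-8")
        return hashlib.sha256(payload).hexdigest()

    def get(self, prompt, completion):
        key = self._key(prompt, completion)
        if key not in self._store:
            self.misses += 1
            self._store[key] = self._fn(prompt, completion)
        return self._store[key]
```

Because the key depends only on the example text, a cache filled in one job could in principle be persisted and reused by later jobs that share the same reference model, which is the scenario the answer above highlights.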
