LLM-Dojo

Visit Tool

LLM-Dojo is an open-source AI Frameworks & Infra tool that provides a lightweight platform for LLM post-training experiments. It supports SFT, RLVR, On-Policy KD, Guide KD, and mixed training.

Claim this tool

2Views

At a glance

Pricing

Open Source

Free tier

Yes

API

Skill level

Technical

About

What is LLM-Dojo?

LLM-Dojo is a lightweight, open-source framework designed for post-training large language models (LLMs). It offers comprehensive support for various training methodologies, including Supervised Fine-Tuning (SFT), Reinforcement Learning from Human Feedback with Value Regularization (RLVR), On-Policy Knowledge Distillation (On-Policy KD), and Guide Knowledge Distillation (Guide KD). The platform also facilitates mixed training approaches, enabling single-round or multi-round Guide distillation, multi-teacher distillation, and reward mixed training. A key feature is its automated data shunting capabilities. Built on a refactored OpenRLHF core, LLM-Dojo streamlines the framework by retaining only the essential RLVR components and integrating advanced KD and Guide-KD techniques, making it suitable for rapid fine-tuning experiments with features like Deepspeed support, LoRA/QLoRA, and automatic chat template adaptation.

Best used for

Ideal for researchers and developers who need to conduct advanced experiments in LLM post-training, including supervised fine-tuning and various knowledge distillation techniques. Especially valuable for those looking to integrate RLHF, KD, and Guide-KD methods with automated data management in a lightweight framework.

Common actions

fine-tune LLMs

experiment with RLHF

implement knowledge distillation

develop training frameworks

workflowscollaborationopen-sourceautomated workflowlow-code/no-codeface swappinggithub copilot"AI Agents"deepfake

Capabilities

Key features

SFT training
RLVR support
On-Policy KD
Guide KD
Mixed training
Automated data shunting
Deepspeed support

Target Audience

professor

Integrations

Not yet documented

Pricing & Plans

Open Source

Free

FAQs

What types of LLM post-training does LLM-Dojo support?

LLM-Dojo supports a variety of post-training methods including Supervised Fine-Tuning (SFT), Reinforcement Learning from Human Feedback with Value Regularization (RLVR), On-Policy Knowledge Distillation (On-Policy KD), and Guide Knowledge Distillation (Guide KD). It also allows for mixed training strategies.

Does LLM-Dojo support distributed training?

Yes, LLM-Dojo supports distributed training through Deepspeed. This allows users to efficiently train large language models across multiple GPUs or machines, which is crucial for handling complex models and large datasets.

Can LLM-Dojo handle different types of fine-tuning?

LLM-Dojo is designed for flexible fine-tuning experiments. It supports LoRA, QLoRA, and full-parameter fine-tuning. Additionally, it automatically adapts to various chat templates, simplifying the process of preparing models for conversational AI tasks.

Trending

Subcategories trending in AI Agents & Automation

Chatbots & Conversational AI General-Purpose Agents Workflow Agents Personal Assistants RAG & Document AI Voice Agents

Trending

Also listed in

This tool also appears in

Research & Education › Course Creation Coding & Development › Open Source & Models

Explore

Browse AI tools by category

Content & Design Productivity & Business Coding & Development AI Agents & Automation Research & Education Wellness & Lifestyle Career Development Marketing & Growth Data & Analytics Customer Support & CX Finance E-commerce