LLM-Dojo
Visit ToolLLM-Dojo is an open-source AI Frameworks & Infra tool that provides a lightweight platform for LLM post-training experiments. It supports SFT, RLVR, On-Policy KD, Guide KD, and mixed training.
At a glance
Trending
Also listed in
LLM-Dojo is an open-source AI Frameworks & Infra tool that provides a lightweight platform for LLM post-training experiments. It supports SFT, RLVR, On-Policy KD, Guide KD, and mixed training.
Trending
Also listed in
About
LLM-Dojo is a lightweight, open-source framework designed for post-training large language models (LLMs). It offers comprehensive support for various training methodologies, including Supervised Fine-Tuning (SFT), Reinforcement Learning from Human Feedback with Value Regularization (RLVR), On-Policy Knowledge Distillation (On-Policy KD), and Guide Knowledge Distillation (Guide KD). The platform also facilitates mixed training approaches, enabling single-round or multi-round Guide distillation, multi-teacher distillation, and reward mixed training. A key feature is its automated data shunting capabilities. Built on a refactored OpenRLHF core, LLM-Dojo streamlines the framework by retaining only the essential RLVR components and integrating advanced KD and Guide-KD techniques, making it suitable for rapid fine-tuning experiments with features like Deepspeed support, LoRA/QLoRA, and automatic chat template adaptation.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending