Tunix

Visit Tool

Tunix is a JAX-based library designed to streamline the post-training of Large Language Models (LLMs). It provides efficient and scalable support for supervised fine-tuning, reinforcement learning, and agentic RL on TPUs.

Claim this tool

10Views

At a glance

Pricing

Open Source

Free tier

Yes

API

Skill level

Technical

About

What is tunix?

Tunix (Tune-in-JAX) is a JAX-based library developed by Google, specifically engineered to optimize the post-training phase of Large Language Models (LLMs). It offers efficient and scalable support for various advanced training methodologies, including Supervised Fine-Tuning (SFT), Reinforcement Learning (RL), and Agentic RL. Leveraging the power of JAX, Tunix ensures accelerated computation and seamless integration with JAX-based modeling frameworks like Flax NNX. It also integrates with high-performance inference engines such as vLLM and SGLang-JAX for efficient rollout. Tunix is designed to work within the JAX training stack, utilizing foundational tools like Flax and Optax, and streamlining tuning workflows on XLA and JAX infrastructure. It supports a growing list of models including Gemma, Llama, and Qwen families.

Best used for

Ideal for developers who need to efficiently fine-tune Large Language Models, implement advanced reinforcement learning algorithms, and develop agentic AI systems. Especially valuable for those working with JAX and requiring high-performance training on TPUs for scalable and reproducible experiments.

Common actions

fine-tune LLMs

optimize LLM performance

implement reinforcement learning

develop agentic AI

open-sourcedeepfakeautomated workflowworkflowscollaborationlow-code/no-codeface swapping"AI Agents"github copilot

Capabilities

Key features

Supervised Fine-Tuning (SFT)
Reinforcement Learning (RL)
Agentic RL
JAX-based acceleration
TPU optimization
Multi-host distributed training

Target Audience

developer

Integrations

vllmsglang-jaxflaxoptaxorbaxmaxtextmaxdiffusion

Pricing & Plans

Open Source

Free

FAQs

What types of post-training does Tunix support for LLMs?

Tunix supports Supervised Fine-Tuning (SFT) with options like Full Weights Fine-Tuning and PEFT, various Reinforcement Learning (RL) algorithms including PPO and DPO, and Agentic RL for multi-turn tool use and asynchronous rollout.

What are the key performance advantages of using Tunix?

Tunix leverages JAX for accelerated computation and offers SOTA training performance on TPUs. It integrates with high-performance inference engines like vLLM and SGLang-JAX, and supports micro-batching and seamless multi-host distributed training for scalability.

Which LLM models are compatible with Tunix?

Tunix supports a growing list of popular LLM models, including the Gemma, Llama, and Qwen families. The library is actively developed to expand its capabilities and model compatibility, with new updates and features regularly released.

Trending

Subcategories trending in Coding & Development

Open Source & Models DevOps & Infrastructure No-Code / Low-Code Testing & QA Backend & APIs Prompt Engineering

Trending

Also listed in

This tool also appears in

AI Agents & Automation › AI Frameworks & Infra

Explore

Browse AI tools by category

Content & Design Productivity & Business Coding & Development AI Agents & Automation Research & Education Wellness & Lifestyle Career Development Marketing & Growth Data & Analytics Customer Support & CX Finance E-commerce