ShypdShypd.ai

Exllamav3

Visit Tool

ExLlamaV3 is an optimized quantization and inference library for running LLMs locally on modern consumer GPUs. It features a new EXL3 quantization format based on QTIP and supports flexible tensor-parallel inference.

At a glance

Pricing
Open Source
Free tier
Yes
API
Yes
Skill level
Technical

Trending

      

Also listed in

This tool also appears in

Explore

Browse AI tools by category