Auto-Round
Auto-Round is an open-source quantization toolkit for Large Language Models (LLMs) and Vision-Language Models (VLMs). It achieves high accuracy at ultra-low bit widths (2–4 bits) with minimal tuning and broad hardware compatibility.
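To make "bit width" concrete, here is a minimal sketch of plain round-to-nearest quantization in pure Python. It is not Auto-Round's own algorithm or API; the function names and sample weights are hypothetical and chosen only for illustration. It shows how the number of bits limits the available levels and therefore the reconstruction error that a tuned method tries to reduce.

```python
# Illustrative sketch only: baseline round-to-nearest (RTN) quantization.
# This is NOT Auto-Round's tuned rounding; all names here are hypothetical.

def quantize_rtn(weights, bits):
    """Map floats onto 2**bits evenly spaced integer codes (asymmetric min/max)."""
    levels = 2 ** bits - 1                      # highest integer code
    lo, hi = min(weights), max(weights)
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = [round((w - lo) / scale) for w in weights]
    return codes, scale, lo

def dequantize(codes, scale, lo):
    """Recover approximate floats from the integer codes."""
    return [c * scale + lo for c in codes]

def max_abs_error(weights, bits):
    """Largest absolute difference between original and restored weights."""
    codes, scale, lo = quantize_rtn(weights, bits)
    restored = dequantize(codes, scale, lo)
    return max(abs(w - r) for w, r in zip(weights, restored))

# Hypothetical sample weights, made up for the demonstration.
weights = [-0.82, -0.31, -0.05, 0.0, 0.12, 0.47, 0.9]

for bits in (2, 3, 4):
    print(f"{bits}-bit max abs error: {max_abs_error(weights, bits):.4f}")
```

With round-to-nearest, the error of each weight is bounded by half a quantization step, so fewer bits mean coarser steps and larger worst-case error. That gap is what makes the 2–4 bit range hard for simple baselines like this one.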
At a glance
Trending