Mixtral-Offloading
mixtral-offloading is an AI Agents & Automation tool for efficient inference of Mixtral-8x7B models. It combines mixed quantization with mixture-of-experts (MoE) offloading so the models can run on Google Colab or consumer desktop hardware.
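As a rough illustration of the MoE offloading idea, the sketch below keeps only a few experts resident on the fast device and evicts the least recently used one when a new expert is needed. It is a minimal sketch under stated assumptions, not the tool's actual implementation: `ExpertCache`, `load_expert`, and the string "weights" are hypothetical names chosen for this example, and a real system would move tensors between CPU and GPU memory instead.

```python
from collections import OrderedDict


class ExpertCache:
    """Hypothetical LRU cache: at most `capacity` experts stay resident."""

    def __init__(self, capacity, load_expert):
        self.capacity = capacity
        self.load_expert = load_expert  # assumed callback: expert_id -> weights
        self.resident = OrderedDict()   # expert_id -> weights, oldest first
        self.hits = 0
        self.misses = 0

    def get(self, expert_id):
        if expert_id in self.resident:
            # Expert is already on the fast device: mark it most recently used.
            self.hits += 1
            self.resident.move_to_end(expert_id)
            return self.resident[expert_id]
        # Expert is offloaded: make room, then fetch it.
        self.misses += 1
        if len(self.resident) >= self.capacity:
            self.resident.popitem(last=False)  # evict least recently used
        weights = self.load_expert(expert_id)
        self.resident[expert_id] = weights
        return weights


# Stand-in loader; a real one would copy quantized weights to the GPU.
cache = ExpertCache(capacity=2, load_expert=lambda i: f"weights-{i}")
for expert_id in [0, 1, 0, 2, 1]:
    cache.get(expert_id)
print(cache.hits, cache.misses, list(cache.resident))
```

The design choice here is that recently routed experts are likely to be reused soon, so keeping them resident avoids repeated transfers; whether that holds depends on the model's routing behavior.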