GPTQ-For-LLaMa
Visit ToolGPTQ-for-LLaMa is an AI Agents & Automation tool that provides 4-bit quantization for LLaMA models using the GPTQ method. It is a one-shot weight quantization method, primarily designed for Linux environments.
At a glance
Trending