Llama.Cpp
Visit Toolllama.cpp is an open-source C/C++ inference engine that enables LLM inference with minimal setup and state-of-the-art performance on a wide range of hardware. It supports various quantization methods for faster inference and reduced memory use.
At a glance
Trending