KVCache-Factory
Visit ToolKVCache-Factory is an AI Frameworks & Infra tool that provides unified KV Cache compression methods for auto-regressive models. It supports multi-GPU inference with large language models like Llama-3-70B-Instruct.
At a glance
Trending