T-MAC
T-MAC is an AI Frameworks & Infra tool that enables low-bit LLM inference on CPUs and NPUs using a lookup-table approach. It supports efficient execution of large language models on resource-constrained devices and offers faster inference than dequantization-based methods.
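To illustrate the general idea behind lookup-table inference, here is a minimal NumPy sketch. It is not T-MAC's implementation: it assumes 1-bit weights (bit 0 means -1, bit 1 means +1), a group size of 4, and hypothetical function names chosen for this example. For each group of activations, every possible signed sum is precomputed once, so the matrix-vector product reduces to table lookups and additions rather than dequantizing the weights and multiplying.

```python
import numpy as np

G = 4  # activations per lookup group (an assumption for this sketch)

def build_tables(x):
    """For each group of G activations, precompute all 2**G signed sums."""
    groups = x.reshape(-1, G)                        # (n_groups, G)
    patterns = np.arange(2 ** G)                     # every G-bit weight pattern
    bits = (patterns[:, None] >> np.arange(G)) & 1   # (2**G, G); bit j of each pattern
    signs = 2 * bits - 1                             # bit 0 -> -1, bit 1 -> +1
    return groups @ signs.T                          # (n_groups, 2**G)

def pack_weights(w_bits):
    """Pack each run of G weight bits into a single table index."""
    rows, cols = w_bits.shape
    grouped = w_bits.reshape(rows, cols // G, G)
    return (grouped << np.arange(G)).sum(axis=-1)    # (rows, n_groups)

def lut_matvec(w_idx, tables):
    """Matrix-vector product via lookups: select one table entry per group, then add."""
    n_groups = tables.shape[0]
    return tables[np.arange(n_groups), w_idx].sum(axis=-1)

def dequant_matvec(w_bits, x):
    """Reference path: expand the bits to +/-1 weights, then multiply."""
    return (2 * w_bits - 1) @ x

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    w_bits = rng.integers(0, 2, size=(8, 16))        # 8 output rows, 16 inputs
    x = rng.standard_normal(16)
    y_lut = lut_matvec(pack_weights(w_bits), build_tables(x))
    print(np.allclose(y_lut, dequant_matvec(w_bits, x)))
```

The tables depend only on the activations, so they are built once per input vector and reused by every weight row; that reuse is what lets the lookup path skip per-weight multiplications in this sketch.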