Onnxruntime-Genai
Visit Toolonnxruntime-genai is an open-source tool that provides generative AI extensions for ONNX Runtime. It offers a flexible and performant way to run large language models (LLMs) on devices, including pre and post-processing, inference, and KV cache management.
At a glance
Trending