Jlama
Visit ToolJlama is an open-source LLM inference engine for Java that allows developers to integrate large language models directly into their applications. It supports various models including Llama, Mistral, and Gemma, and offers features like paged attention and distributed inference.
At a glance
Trending