LMFlow
Visit ToolLMFlow is an extensible toolkit for finetuning and inference of large foundation models. It is designed to be user-friendly, speedy, and reliable for the entire community.
At a glance
Trending
LMFlow is an extensible toolkit for finetuning and inference of large foundation models. It is designed to be user-friendly, speedy, and reliable for the entire community.
Trending
About
LMFlow is an open-source, extensible toolkit designed for the finetuning and inference of large machine learning models. It emphasizes user-friendliness, speed, and reliability, making large models accessible to a broad community. Key features include support for various finetuning methods like Full Finetuning, LISA (Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning), and LoRA (Low-Rank Adaptation). The toolkit also offers acceleration and memory optimization techniques such as FlashAttention (versions 1 and 2), Gradient Checkpointing, and Deepspeed Zero3 Offload. For inference, LMFlow supports CPU inference for LLaMA models via 4-bit quantization and integrates with vLLM for fast serving. It also provides long context support through position interpolation for LLaMA models and includes a Gradio-based UI for local chatbot deployment.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending