LLaMA-O1
LLaMA-O1 is an open-source framework for training, inference, and evaluation of large reasoning models. It supports PyTorch and Hugging Face for model development and deployment.
About
LLaMA-O1 is an open-source framework for developing, deploying, and evaluating large reasoning models, built on PyTorch and Hugging Face. It includes resources for supervised fine-tuning and base pretraining, with the OpenLongCoT-SFT and OpenLongCoT-Pretrain-1202 datasets available on Hugging Face. LLaMA-O1 also offers pre-trained models and a CPU-only online demo, so it can be tried without a GPU. Planned additions include reinforcement learning with self-play and inference-time reasoning enhancement frameworks.
Pricing & Plans
Open Source
Free