ShypdShypd.ai

LMM-R1 is an Open Source research tool that extends OpenRLHF to support Large Multimodal Model (LMM) Reinforcement Learning (RL) training. It empowers 3B LMMs with strong reasoning abilities through a two-stage rule-based RL framework.

No Views Yet

At a glance

Pricing
Open Source
Free tier
Yes
API
No
Skill level
Technical

Trending

      

Explore

Browse AI tools by category