LMM-R1
LMM-R1 is an open-source research tool that extends OpenRLHF to support reinforcement learning (RL) training of large multimodal models (LMMs). It gives 3B-parameter LMMs strong reasoning abilities through a two-stage, rule-based RL framework.