Simple_GRPO
Visit Toolsimple_GRPO is an open-source research and education tool that provides a very simple implementation of the GRPO algorithm. It is designed for reproducing r1-like LLM thinking and efficient training.
At a glance
Trending
Also listed in