GRPO-Zero
Visit ToolGRPO-Zero is an open-source Code Assistants tool that implements DeepSeek R1's GRPO algorithm from scratch. It is designed for training large language models with minimal dependencies and low GPU memory usage.
At a glance
Trending