ReasonFlux
Visit ToolReasonFlux is an open-source LLM post-training suite that enhances reasoning capabilities through data selection, reinforcement learning, and inference scaling. It features specialized models for process reward modeling and code generation.
At a glance
Trending