Official Implementation of Skeleton2Stage
-
Updated
Mar 19, 2026 - Python
Official Implementation of Skeleton2Stage
RFT with GRPO: RFT helps adapt LLMs to complex reasoning tasks like math and coding by using RL, enabling models to develop their own strategies instead of mimicking examples as in SFT. GRPO, a tailored RL algorithm, excels in tasks with verifiable outcomes and works well with small datasets.
Add a description, image, and links to the rlft topic page so that developers can more easily learn about it.
To associate your repository with the rlft topic, visit your repo's landing page and select "manage topics."