Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Created Mar 15, 2026 · Modified Mar 22, 2026 · 1 min read

Notes

TL;DR: RL actually worsens Pass@K at large K. The base model already contains the reasoning paths that RL-trained models use.
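To make the Pass@K claim concrete, here is a minimal sketch of how the metric is commonly computed. It assumes the standard unbiased estimator, 1 - C(n-c, k) / C(n, k), for n samples per problem of which c are correct; the note itself does not say which estimator is used. The per-problem counts are made-up illustrative numbers, not results from the paper: a hypothetical base model that is rarely right but covers every problem, and a hypothetical RL-trained model that is often right on fewer problems.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem.

    n: samples drawn, c: samples that were correct, k: attempt budget.
    """
    if not (0 <= c <= n and 1 <= k <= n):
        raise ValueError("need 0 <= c <= n and 1 <= k <= n")
    if n - c < k:
        # Fewer than k incorrect samples: any k-subset contains a correct one.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def mean_pass_at_k(correct_counts: list[int], n: int, k: int) -> float:
    """Average pass@k over a set of problems."""
    return sum(pass_at_k(n, c, k) for c in correct_counts) / len(correct_counts)


# Hypothetical correct-sample counts per problem, out of n = 100 samples each.
N_SAMPLES = 100
base_correct = [5, 3, 2, 1]   # assumed: low accuracy, but every problem is solvable
rl_correct = [60, 50, 0, 0]   # assumed: high accuracy, narrower coverage

for k in (1, 10, 100):
    base = mean_pass_at_k(base_correct, N_SAMPLES, k)
    rl = mean_pass_at_k(rl_correct, N_SAMPLES, k)
    print(f"k={k:>3}  base={base:.3f}  rl={rl:.3f}")
```

With these assumed counts, the RL-trained model is ahead at k = 1 while the base model is ahead at k = 100, which is the shape of the crossover the TL;DR describes.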
