If this was just the effect you mention you would not expect the base model to s...

		Certhas 4 months ago \| parent \| context \| favorite \| on: Does RL Incentivize Reasoning in LLMs Beyond the B... If this was just the effect you mention you would not expect the base model to surpass the RL model though. Plus their k are much smaller than that. I think it's a very interesting and meaningful study.