Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
Certhas
4 months ago
|
parent
|
context
|
favorite
| on:
Does RL Incentivize Reasoning in LLMs Beyond the B...
If this was just the effect you mention you would not expect the base model to surpass the RL model though. Plus their k are much smaller than that.
I think it's a very interesting and meaningful study.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
I think it's a very interesting and meaningful study.