This seems like anthropomorphizing the model... Occam's Razor says that the improvement from iterative requests to improve the code comes from the incremental iteration itself, not from incentivizing the model to do its best. If the latter were the case, then one could get the best version on the first attempt by telling the model your grandmother's life was on the line, or whatever.
Reasoning is a known weakness of these models, so jumping straight from requirements to a fully optimized implementation that groks the solution space is probably too much to expect - iterative improvement is much easier.
>If the latter were the case, then one could get the best version on the first attempt by telling the model your grandmother's life was on the line, or whatever.
Setting aside the fact that "best" is ambiguous, why would this get you the best version?
If you told a human this, you wouldn't be guaranteed the best version at all. You would probably get a better version, sure, but that's true of LLMs as well: emotionally charged statements often yield improvements even when there's nothing to iterate on (e.g., re-running a benchmark with an emotion prompt added).
The thesis of the article is that the code keeps getting better because the model keeps getting told to do better - that it needs more motivation/criticism. A logical conclusion of this, if it were true, is that the model would generate its best version on the first attempt if only we could motivate it to do so! I'm not sure what motivations/threats work best with LLMs - there was a time when offering to pay the LLM was popular, but "my grandma will die if you don't" was another popular genre of prompts.
If it's not clear, I disagree with the idea that ANY motivational prompt (we can disagree over what would be best to try) could get the model to produce a solution of the same quality as it will when allowed to iterate on it a few times and make incremental improvements. I think it's being allowed to iterate that is improving the solution, not the motivation to "do better!".
>If it's not clear, I disagree with the idea that ANY motivational prompt (we can disagree over what would be best to try) could get the model to produce a solution of the same quality as it will when allowed to iterate on it a few times and make incremental improvements.
OK, I agree, but... this would be the case with people as well? If you can't iterate, the quality of your response will be limited no matter how motivated you are.
"Solve the Riemann hypothesis or your mother dies" - but you can't write anything down on paper. Even if such a person could solve it, it's not happening under those conditions.
Iteration is probably the bulk of the improvement, but I think there's a "motivation" aspect as well.
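For concreteness, here's a minimal sketch of the two prompting strategies the thread is comparing: a single "motivational" prompt versus the iterative "write better code" loop. It assumes the OpenAI Python client; the model name, task, and prompt wording are illustrative placeholders, not the article's actual setup. The structural difference is that the loop lets the model condition on its own previous attempt, which is exactly the mechanism the iteration argument points to.

    # Minimal sketch of the two strategies under debate, using the OpenAI
    # Python client. Model name, task, and prompts are placeholders.
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment

    TASK = "Write a Python function that returns the n-th prime number."

    def ask(messages):
        resp = client.chat.completions.create(model="gpt-4o", messages=messages)
        return resp.choices[0].message.content

    # Strategy 1: one-shot "motivational" prompt -- all pressure, no iteration.
    one_shot = ask([{"role": "user",
                     "content": TASK + " Lives depend on this; do your absolute best."}])

    # Strategy 2: iterative refinement -- each round the model sees its own
    # previous attempt, so it has concrete output to criticize and improve.
    messages = [{"role": "user", "content": TASK}]
    attempt = ask(messages)
    for _ in range(3):
        messages.append({"role": "assistant", "content": attempt})
        messages.append({"role": "user", "content": "Write better code."})
        attempt = ask(messages)
    iterated = attempt  # compare against one_shot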