Hacker News new | past | comments | ask | show | jobs | submit login

And I suspect o3 is something like monte carlo: generates tons of CoTs, with most of them are junk, but some hit the answer.



Sounds plausible given I’ve recently observed a ton of research papers in the space that in some way or another incorporate MCTS




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: