
Too long and complex for my morning, but ultimately we’re at a point of hardware/computing commodification and ubiquitous data where online RL can start to provide meaningful efficiencies across any possible real task.

Here are two good papers to start with:

https://arxiv.org/abs/2410.14606

https://storage.googleapis.com/deepmind-media/Era-of-Experie...



Thanks, will read them.



