Hacker News new | past | comments | ask | show | jobs | submit login

This says nothing on how RLHF works, but a lot on what can be the results.



You can check here for an explanation (with some helpful figures) https://www.assemblyai.com/blog/the-full-story-of-large-lang...


Yes! I came to make the same comment.

It's got a catchy title but it leaves much to be resolved.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: