Hacker News new | past | comments | ask | show | jobs | submit login

Relevant thread on /r/locallama ([1]). A relevant quote from the comments:

> There's a Twitter thread from one of the authors ([2]). This part seems pretty important: "The models in this paper were done training 5 months ago. We've progressed significantly since then."

1. https://www.reddit.com/r/LocalLLaMA/comments/1ctsala/newly_p...

2. https://x.com/ArmenAgha/status/1791275549815648473




Thanks for sharing! It's encouraging to see the authors actively improving their models post-publication. Exciting to witness science in action!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: