Hacker News new | past | comments | ask | show | jobs | submit login

> Karpathy's TinyStories

Did you mean Karpathy's tinyllamas? [1][2]

Or did you mean Ronen Eldan and Yuanzhi Li's "TinyStories: How Small Can Language Models Be and Still Speak Coherent English?" [3][4]

[1]: https://huggingface.co/karpathy/tinyllamas/tree/main

[2]: https://github.com/karpathy/llama2.c

[3]: https://www.microsoft.com/en-us/research/publication/tinysto...

[4]: https://arxiv.org/abs/2305.07759




Karpathy's tinyllamas are based on the tiny stories dataset. I could have phrased it better, sorry.




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: