Ideally a lot of prose that explains the concepts, and not many graphs/code blocks/math formulas.
https://arxiv.org/pdf/1706.03762.pdf
The attention is all you need paper that tikkun linked is great, but not exactly a gentle start. This might help a little: https://sebastianraschka.com/blog/2023/llm-reading-list.html
https://arxiv.org/pdf/1706.03762.pdf