> In this section, we aim to understand the sources of GPT’s predictive ability.
Oh boy... I wonder how a neural net trained with unsupervised learning ends up with predictive ability. I wonder where that comes from... Unfortunately, the article doesn't seem to reach a conclusion.
> We implement the CoT prompt as follows. We instruct the model to take on the role of a financial analyst whose task is to perform financial statement analysis. The model is then instructed to (i) identify notable changes in certain financial statement items, and (ii) compute key financial ratios without explicitly limiting the set of ratios that need to be computed. When calculating the ratios, we prompt the model to state the formulae first, and then perform simple computations. The model is also instructed to (iii) provide economic interpretations of the computed ratios.
Who will tell them how an LLM works, and that the neural net does not actually calculate anything? It only predicts the next token in the text of a calculation, and only if it has been loss-minimized for that specific calculation.
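For concreteness, here is a rough sketch of what a prompt along the lines they describe might look like when assembled for a generic chat-completion API. The wording, message structure, and the toy statement format are my own guesses, not the paper's actual prompt; from the model's side it is all just text to be continued token by token.

```python
# Sketch of a chain-of-thought prompt in the spirit of the quoted description.
# Everything here (wording, format, field names) is assumed for illustration.

SYSTEM_PROMPT = (
    "You are a financial analyst. Your task is to perform financial "
    "statement analysis on the standardized statements provided."
)

USER_TEMPLATE = """\
Standardized balance sheet and income statement:

{financial_statements}

Step 1: Identify notable changes in the financial statement items.
Step 2: Compute key financial ratios (your choice which). For each ratio,
        state the formula first, then perform the computation.
Step 3: Provide an economic interpretation of each computed ratio.
"""


def build_messages(financial_statements: str) -> list[dict]:
    """Assemble chat messages for a generic chat-completion API."""
    return [
        {"role": "system", "content": SYSTEM_PROMPT},
        {
            "role": "user",
            "content": USER_TEMPLATE.format(
                financial_statements=financial_statements
            ),
        },
    ]


if __name__ == "__main__":
    demo = "Revenue: 100 -> 120\nCOGS: 60 -> 80\nTotal assets: 500 -> 510"
    for message in build_messages(demo):
        print(f"--- {message['role']} ---\n{message['content']}")
```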
It looks like these authors are discovering large language models as if they were some alien animal, when they are in fact mathematically describable and not-so-mysterious prediction machines.
At least the article is fairly benign. It's the type of article that would pass as research at my MBA school as well... It doesn't reach any groundbreaking conclusions beyond demonstrating that the authors have "probed" the model, which I think is good. It's uninformed but not very misleading.
I had heard of generalization vs. memorization before, but the article you shared is very high quality. Thank you.
I do not think that SOTA LLMs demonstrate grokking for most math problems. While I am a bit surprised to read how little training is necessary to achieve grokking in a toy setting (one specific math problem), the domain of all math problems is much larger. Also, the complexity of an applied mathematics problem is much higher than a simple mod problem. That seems to be what the author of the first article you quoted thinks as well.
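For reference, the toy setting in question is usually something like this: a small network trained on modular addition with strong weight decay, where validation accuracy jumps long after training accuracy has saturated. The sketch below is my own minimal reconstruction with guessed hyperparameters (modulus, architecture, optimizer settings), not the setup from the article, and it may need longer training or tuning to show the sharp transition.

```python
# Minimal sketch of the modular-arithmetic toy setting used in grokking
# experiments: learn (a + b) mod P from half of all pairs.
# All hyperparameters here are illustrative guesses.
import torch
import torch.nn as nn

P = 97  # modulus; the dataset is every pair (a, b) with a, b in [0, P)
pairs = torch.cartesian_prod(torch.arange(P), torch.arange(P))  # (P*P, 2)
labels = (pairs[:, 0] + pairs[:, 1]) % P

perm = torch.randperm(len(pairs))
split = len(pairs) // 2  # train on half the pairs, hold out the rest
train_idx, val_idx = perm[:split], perm[split:]


class ToyNet(nn.Module):
    """Embed the two operands, concatenate, and classify the sum mod P."""

    def __init__(self, dim: int = 128):
        super().__init__()
        self.embed = nn.Embedding(P, dim)
        self.mlp = nn.Sequential(
            nn.Linear(2 * dim, 256), nn.ReLU(), nn.Linear(256, P)
        )

    def forward(self, ab: torch.Tensor) -> torch.Tensor:
        e = self.embed(ab)                       # (batch, 2, dim)
        return self.mlp(e.flatten(start_dim=1))  # (batch, P) logits


model = ToyNet()
# Heavy weight decay is commonly reported as important for grokking.
opt = torch.optim.AdamW(model.parameters(), lr=1e-3, weight_decay=1.0)
loss_fn = nn.CrossEntropyLoss()


def accuracy(idx: torch.Tensor) -> float:
    with torch.no_grad():
        return (model(pairs[idx]).argmax(-1) == labels[idx]).float().mean().item()


for epoch in range(10_000):  # the interesting part only shows up after long training
    opt.zero_grad()
    loss = loss_fn(model(pairs[train_idx]), labels[train_idx])
    loss.backward()
    opt.step()
    if epoch % 500 == 0:
        print(
            f"epoch {epoch:5d}  train acc {accuracy(train_idx):.2f}  "
            f"val acc {accuracy(val_idx):.2f}"
        )
```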
Our public models fail a lot in that larger domain, for example on tasks like counting the elements of a set (the words in a paragraph), not to mention complex applied-mathematics tasks. If they have been loss-minimized for a specific calculation to the point that they exhibit this phase change, then that would be an exception.
But in the financial statement analysis article, the authors say explicitly that they do not limit the types of calculations they ask the model to perform. That set of problems is very, very irregular, and there is no guarantee that the model has generalized over it. In fact, it is much more likely that it hasn't, in my opinion.
In any case, thank you again for the article. It's just such a massive contrast with the MBA article above.
Phase changes and grokking make me nervous... It seems that once you reach a certain threshold of training, you can continually "phase-change" and generate these emergent capabilities. This does not bode well for alignment.