They literally "do context": in-context learning is exactly that. And factuality can be substantially mitigated (if not fully solved) by chain-of-thought prompting[0] and the use of external tools[1]; see toolformerzero[2] for a simple example.
The "predicting words" part of the language models is a building block, but the actual applications being built around the LLMs are using that predictive ability in very dynamic ways along with external memory, feedback and sophisticated control methods to attenuate and channel their behavior.