A lot of people cite these numbers out of cynicism or to dampen others' optimism, but if you actually work in the field and remember what things were like just 5 years ago, these numbers should be extremely worrying to anyone afraid of automation. Even that linked paper from 2021 is already outdated. We don't need another revolution; we just need maybe a dozen small-to-medium insights. The steps from GPT-3 to 3.5 alone were pretty straightforward, and yet they created a small revolution in LLM usefulness. Model scale slowed the pace of research for a while, but with so many big companies jumping on the train now, you can expect research to accelerate again.
The training data contains tons of false information, and the training objective is simply to reproduce that information. It's not at all surprising that these models fail to distinguish truth from falsehood, and no incremental change will fix that. The problem is paradigmatic. And calling people cynics for pointing out the obvious and serious shortcomings of these models is poor form, IMO.
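To make that point concrete, here's a minimal sketch (in PyTorch, with a toy vocabulary I made up) of the standard next-token objective: the loss only rewards reproducing whatever token actually followed in the corpus, and no term anywhere asks whether the text is true.

```python
import torch
import torch.nn.functional as F

# Toy setup: a "model" producing logits over a tiny vocabulary.
# The details don't matter; what matters is the loss function.
vocab_size = 100
batch, seq_len = 2, 8

logits = torch.randn(batch, seq_len, vocab_size)          # model predictions
targets = torch.randint(0, vocab_size, (batch, seq_len))  # observed next tokens from the corpus

# Standard language-modeling objective: cross-entropy against the
# observed next token. A false sentence in the training data yields
# exactly the same kind of gradient as a true one; "truth" never
# appears anywhere in this computation.
loss = F.cross_entropy(logits.reshape(-1, vocab_size), targets.reshape(-1))
print(loss.item())
```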
The large corpus of text is only necessary to grasp the structure and nuance of language itself. Answering questions (1) in a friendly manner and (2) truthfully is a matter of fine-tuning, as the latest developments around GPT-3.5 clearly show. And with approaches like indexGPT, using external knowledge bases that can even be corrected after the fact is already a thing; we just need this at scale and with the right fine-tuning. The tech is way further along than those cynics realize.
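A rough sketch of the retrieval idea (the names and the lookup function here are mine, not from any particular system): fetch passages from a correctable external store and prepend them to the prompt, so fixing an entry in the knowledge base fixes future answers without retraining the model.

```python
from dataclasses import dataclass

@dataclass
class Passage:
    source: str
    text: str

# Stand-in for a real knowledge base (could be a search index or a
# vector store). Because it lives outside the model's weights,
# correcting an entry immediately corrects future answers.
KNOWLEDGE_BASE = [
    Passage("wiki/Python", "Python was created by Guido van Rossum and released in 1991."),
    Passage("wiki/Rust", "Rust is a systems language focused on memory safety."),
]

def retrieve(query: str, k: int = 2) -> list[Passage]:
    """Naive keyword-overlap retrieval; a real system would use embeddings."""
    scored = [(sum(w in p.text.lower() for w in query.lower().split()), p)
              for p in KNOWLEDGE_BASE]
    return [p for score, p in sorted(scored, key=lambda s: -s[0])[:k] if score > 0]

def build_prompt(question: str) -> str:
    passages = retrieve(question)
    context = "\n".join(f"[{p.source}] {p.text}" for p in passages)
    return (f"Answer using only the sources below; cite them.\n\n"
            f"{context}\n\nQuestion: {question}\nAnswer:")

# The resulting prompt would then be sent to the language model.
print(build_prompt("Who created Python?"))
```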
I'm sure you can add constraints of some sort to build internally consistent world models. Or add stochastic outputs, as has been done in computer vision, to assign e.g. variances to the predicted probabilities and detect when the model is out of its depth (and then automatically query external databases to resolve the uncertainty / read up on the topic).
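Something like the Monte Carlo dropout trick from computer vision, sketched here with a placeholder model, a made-up threshold, and a hypothetical external-lookup fallback: sample the network several times with dropout left on, treat the variance across samples as an out-of-depth signal, and defer to an external source when it's too high.

```python
import torch
import torch.nn as nn

class TinyClassifier(nn.Module):
    """Placeholder model; any network with dropout layers works."""
    def __init__(self, dim=16, classes=4):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(dim, 32), nn.ReLU(),
                                 nn.Dropout(0.3), nn.Linear(32, classes))

    def forward(self, x):
        return self.net(x)

@torch.no_grad()
def mc_dropout_predict(model, x, samples=50):
    """Run the model repeatedly with dropout active; the spread across
    samples approximates the model's uncertainty (MC dropout)."""
    model.train()  # keeps dropout on at inference time
    probs = torch.stack([model(x).softmax(-1) for _ in range(samples)])
    return probs.mean(0), probs.std(0)

model = TinyClassifier()
x = torch.randn(1, 16)
mean_probs, std_probs = mc_dropout_predict(model, x)

UNCERTAINTY_THRESHOLD = 0.15  # made-up value; would be tuned on validation data
if std_probs.max() > UNCERTAINTY_THRESHOLD:
    # Hypothetical fallback: defer to an external knowledge source.
    print("Model is out of its depth -> query external database instead.")
else:
    print("Confident prediction:", mean_probs.argmax().item())
```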
If you actually follow the literature, you'll find plenty of evidence that the seemingly "simple" transformer architecture might actually work quite similarly to the way the human brain is believed to work.