Unpredictable abilities emerging from large AI models (quantamagazine.org)
243 points by nsoonhui on March 17, 2023 | 313 comments



Nice write up! I have been using classic back-prop neural networks since the 1980s, and deep learning for the last 8 years. This tech feels like a rocket ship that is accelerating exponentially!

I am in my 70s and I don't work much anymore. That said, I find myself spending many hours in a typical day doing what I call "gentleman scientist" activities around Large Language Models.

I was walking this morning with a non-tech friend and I was trying to impart to him my feeling that all knowledge work, and a lot of mundane work is getting disrupted. Even though I have many tech-friends who are "Large Language Model doubters", my friend got it this morning. Emotionally and psychologically I think that some tech and other highly paid knowledge workers just can't accept the sea-change that we are living through.

For myself, I am spending a very large amount of time experimenting with the OpenAI APIs, LangChain, and Llama-Index - and I am enjoying myself tremendously.


> Emotionally and psychologically I think that some tech and other highly paid knowledge workers just can't accept the sea-change that we are living through.

This is a good way to frame it. As a mid-career person, I’m trying to figure out how to respond to these developments in my own career.

I think there’s a good chance that software development as a career looks very different in 10 years in terms of the day to day work and opportunities for employment.

25 years ago in university, I did roughly the same stuff as I do today. Languages are higher on the abstraction ladder, networks and hardware are faster, etc, but stuff is still roughly the same. I can see a real discontinuity in the work on the horizon.


I've been calling the mockery and dismissiveness "nervous laughter". I think we have a sharp crowd here, but they either lack imagination or are in denial.


I do not think that is quite true. Most of what I am seeing is hesitance and wariness.

Every innovation can be used both for good and for evil, and it often has unintended side effects. Just ask Thomas Midgley Jr., who played a major role in both leaded gasoline and CFCs. Or Alfred Nobel, who tried to make mining safer, only to be appalled by an erroneous obituary describing him as the "merchant of death" because of dynamite's military uses.

Over the last two decades we've seen tech giants go from a bunch of nerds in garages to parasites invading our personal lives and consuming every bit of data about us they can get their hands on - and social media went from a cute gimmick to a society-destroying tool powering genocides.

AI is undeniably extremely promising, and it is not hard to see that it has the potential to change the world forever. However, we are inherently unable to understand how it works, or to alter it in a meaningful way. It's a Pandora's Black Box: will it give us a better and easier life, or will it destroy us? Nobody knows, least of all the people who are opening it!


We're at a point in time where almost any novel technology is just going to take jobs away, increase wealth inequality, and down the line make the world a worse place. It's difficult to be happy about a new powerful tool finding its way into the hands of tech giants.


I think it'll end at using chatgpt as a great tool. And that seems fine. I think much of the anxiety here is in the unknown.

You have to learn your IDE, but it doesn't make programming harder. It makes it more fun actually.

Chatgpt is going to require that devs are even better IMO because you have to catch its errors and know how to work with it.


Yes, this stuff is fun to work with and use as an aid to programming. But I think what's dawning on a lot of us is that it seems able to facilitate a large increase in productivity. And given that large productivity increase, will industry need as many software developers (and other knowledge workers)?


Your opinion on this probably depends on what your job currently looks like. Everywhere I've worked there's been significantly more stuff we wanted to do than we could actually do. Speeding that up looks awesome to me.

But if you work at BigCo and mostly sit on your hands already, then, yeah. I don't know.


>Everywhere I've worked there's been significantly more stuff we wanted to do than we could actually do. Speeding that up looks awesome to me.

Does it? Slowing our breakneck, anti-human business pace and stopping the mindless, environmentally and culturally harmful consumption and production of trinkets would surely be better!


Huh? Have you attended a local city council meeting recently? I can guarantee you they're running on shoestring budgets with an endless backlog of requests that they are only treading water on because of anemic tax revenue and low prestige for government jobs. "Productivity" isn't just some high tech, consumer bauble producing phenomenon. It helps everyone out. Highly profitable, headcount-bloated big companies are the outlier.


> anemic tax revenue

You mean, spending >1/2 their budget on police?


I can guarantee you that the reason they do so is an explosion of BS requests, extreme bureaucracy, and needless requirements, with a simultaneous drop in actual useful basic work - on city infrastructure and such.


At every BigCo I've worked at (and I've worked at places that put the B in Big), people aren't sitting on their hands because there isn't way more to do than they can feasibly do. It's usually because they're fundamentally ill-suited for the job and sitting on their hands is better than being homeless, or because the people who are ill-suited for their jobs feel anxious about being homeless and so force everyone into meetings and team building and offsites and retrospectives and sprint planning and burn-down chart analysis and whatever else they can conceive of that seems productive but doesn't change production in any way. To that extent, at least an AI would be left alone to write actual code, and those folks could have their endless meetings and still justify their jobs the same way they ever did. No AI will ever host a breakout session at an offsite to discuss strategy for getting people to work in the horrific hotel seating dehumanized office; they're too capable.


> No AI will ever host a breakout session at an offsite to discuss strategy for getting people to work in the horrific hotel seating dehumanized office

Well, let’s hope not.


I can already feel it dramatically increasing my personal productivity. I expect it's going to eventually lead to a 2x or 3x productivity increase for software engineering generally (if not more).

Will that result in less demand for software engineers? I doubt it. I think it will lead to companies and organizations doing way more software engineering.

I was going to use the old "ATMs resulted in more jobs for bank tellers example", but apparently that particular example is flawed: https://www.vox.com/2017/5/8/15584268/eric-schmidt-alphabet-...


> apparently that particular example is flawed

Um, yes. Been in a bank branch lately? Had to visit a bank branch for any reason?


Increased supply doesn't change the demand curve but there are basically no scenarios that don't lead to an increase in volume and a decrease in price.


I mean, people aren't going to be able to solve 100x as many problems because of this.


I don't think anyone is claiming that. But 2 or 3X? Possibly. And that would be enough to be a game changer.


Yeah, maybe, but will it lead to fewer jobs? I am not convinced.


Somewhat similar boat; I just hope it winds up helping me do my job better, instead of me trying to teach it how to do my job. Setting aside professional anxieties, after using chatGPT for a bit I quickly realized I am completely uninterested in prompt engineering. But as a tool, like with an IDE, it could be a big leap forward.


I literally just finished writing some documentation with Copilot running. Sometimes it'd autocomplete something with garbage (and I would just type what I wanted anyway), but more than once it autocompleted several lines of exactly what I wanted to write. This is way better than the phone-style autocomplete I've seen in Gmail, where it just guesses the next word.


I ducking agree


I have started using it in the last few days, basically asking it to write and modify pieces of code - for now small ones where I can easily spot bullshit. I am very much looking forward to it improving to the point where I can spend more time thinking about the business domain rather than how to translate it into efficient code.


Indeed, it's fascinating to witness a sizable segment of the HN community and distinguished intellectuals like Chomsky displaying absolute denial. I've started bookmarking the comments so I can look back at them in 5 years and have a good laugh. Some from a few months back are already aging badly[0].

[0] https://news.ycombinator.com/item?id=34197033


Has Chomsky been right about anything?


My first introduction to him was when one uncontacted tribe or another's language refuted something he thought was fundamental to humans.

In the decade(s) since I kinda think "Chomsky was wrong" is pretty much the only reason people bring him up.


I see a lot of irony in this.

When Chomsky proposed generative grammars, his theory of universal language acquisition, and so on, they were radical ideas that upturned the central canon of linguistics.

Time has been on his side - entire schools and subfields of linguistics went extinct as more evidence emerged that Chomsky was fundamentally right. Basically every computer language and data format in existence is parsed/lexed in ways inspired by his models of language.

But now Chomsky is considered the stodgy old establishment, and whenever one of his theories is contradicted somewhere on the margins people shout "Aha! He was wrong the whole time!" and ignore the 99% of cases where his models are still the best ones we have.


What's even more bizarre is that ChatGPT is proving him right - that a neural net can build logical grammar - and he is denying it!


This is a hot take; I would love a link/explanation of why you think neural nets are human-like. For example, from the op-ed you're likely referencing:

“ Their deepest flaw is the absence of the most critical capacity of any intelligence: to say not only what is the case, what was the case and what will be the case — that’s description and prediction — but also what is not the case and what could and could not be the case. Those are the ingredients of explanation, the mark of true intelligence.”

Isn't that just plainly true of LLMs? Sure, they can produce text that looks like an explanation, and sure, putting in a header telling it to spell out its reasoning will get it to output text that looks like reasoning, but I feel the nature of its hallucinations makes it clear that it's not actually performing those steps anywhere in the net.

But maybe I’m just a Luddite? These technologies are amazing and transformative, I’m just shocked to see hate on HN of CHOMSKY of all people, the father of modern linguistics and cognitive science…


Why do you think this is impossible? Have you tried making an account on chat.openai.com? Because it can in fact do that right now; and even people who are otherwise skeptical of LLMs have panned Chomsky's article.


What did he say about logical grammars and neural nets in the past? Sorry, not very familiar with him.


He wrote an op ed recently with the oft-repeated argument that stochastic models are fundamentally distinct from structured symbolic/logical models. In simple terms you can never really “trust” chatgpt’s answers because it’s just guessing the answer that looks right, not applying structured reasoning like humans do.

https://www.nytimes.com/2023/03/08/opinion/noam-chomsky-chat...

But I might be missing something more specific; I would love to read that.


> In simple terms you can never really “trust” chatgpt’s answers because it’s just guessing the answer that looks right, not applying structured reasoning like humans do.

Here's an argument otherwise, that they really are developing some sort of model: https://the-decoder.com/stochastic-parrot-or-world-model-how...


That example was highly controversial, and I think Chomsky was actually right in that instance (all languages have recursion). Generally my understanding is that he's considered to have held the field of linguistics back quite a bit with a non-disprovable hypothesis.


Given how emphatic he is about everything, the answer is clearly no.


About foreign policy? No.


Was he wrong to oppose the Vietnam war?


He supported the communists, though. He also supported Pol Pot. You can predict his opinions on foreign policy if you understand that he bases them on the "US is evil" axiom, and therefore anyone who opposes the US is good, no matter how actually evil they may be.


The Pol Pot thing seems debunked: https://www.abc.net.au/news/2011-07-01/brull---the-boring-tr...

The communism support doesn't seem to quite stick either if WP is to be believed: https://en.wikipedia.org/wiki/Political_positions_of_Noam_Ch...

In Britannica and WP he's labeled an anarcho-syndicalist, so seems to be against both capitalism and noncapitalist authoritarian systems.


Chomsky absolutely was a Cambodian genocide denialist at the time. He very arrogantly dismissed actual eyewitness testimony of the genocide from refugees, purely for ideological reasons.

https://en.wikipedia.org/wiki/Cambodian_genocide_denial#Chom...


(Disclaimer: i'm just reading up about this now)

Did you skip my first link, or are you disagreeing with it? It's about the same subject. And your WP link has a second quote in which he writes about the genocide as fact.

Another article about it that I think seems credible, and that judges against denialism, appeared in an academic journal called "Genocide Studies and Prevention": https://digitalcommons.usf.edu/cgi/viewcontent.cgi?article=1...

Chomsky's skepticism in the media-criticism article, talking (among other things) about a book on the then-contemporary events, of course looks quite unfortunate in retrospect. But given the volume of work and constant writing about world events over a long career, it seems to be an error of judgment in the moment rather than a denial of history, especially as he evidently updated his views later as more evidence came out.


"looks quite unfortunate in retrospect"

It was completely unjustified at the time. There was absolutely no reason to doubt the testimony of refugees coming from Cambodia, other than they didn't fit into Chomsky's strange worldview that the US is the cause of all problems in the world.

Even today his views on Russia's invasion of Ukraine are completely moronic.

https://foreignpolicy.com/2022/12/22/russia-ukraine-war-left...

For ‘Peace Activists,’ War Is About America, Never Russia: Their own hard-left worldview is so absorbing that they will take the side of any aggressor in the anti-Western camp.

https://www.e-flux.com/notes/470005/open-letter-to-noam-chom...


I didn't bookmark them, but I recall comments from ten years ago predicting that 50% of the workforce would be replaced by AI within ten years. Fast-forward to "the future" and the reality is that AI has left a grand total of 0 people without a job. The grandiose promises we're hearing now about AI are nothing new. They were laughable then, and they're laughable now.


Those comments about employment are indeed laughable and I bookmark them as well. We're at near full employment[0] despite the invention of mechanized farming, electricity, the printing press, cars, the computer, the Internet, etc. I was referring to people who are in denial about the rapid progress of AI, and the impact it is going to have in coming years.

[0] Until we reach AGI, but I don't dare attempt to predict when this is going to happen.


> Emotionally and psychologically I think that some tech and other highly paid knowledge workers just can't accept the sea-change that we are living through.

Soon-to-be 60-year-old here. Glad that if I need to, I can retire now. Certainly the rise of LLMs and generative AI isn't going to be all bad, but I've also got a feeling that not as many software developers will be needed soon, since those who can leverage LLMs will experience a pretty decent productivity boost. Part of me wonders if at least some of the layoffs we've seen in the last couple of months are because companies such as Google, Microsoft, Amazon, etc. (the ones that have pretty clear visibility into what's happening in the AI space) are realizing that they aren't going to need as many knowledge workers in the not-so-distant future.

I think there was always this idea in the back of our minds that this was going to happen someday. But someday was always like 15 to 20 years out. Looks like someday is knocking on our door now.


> Certainly the rise of LLMs and generative AI isn't going to be all bad, but I've also got a feeling that not as many software developers will be needed soon since those who can leverage LLMs will experience a pretty decent productivity boost.

OTOH, software development will become significantly cheaper. That means the business case for using software in more places just tilted in favor of throwing some software at it. You'll see more and more businesses using it, ones you didn't expect, like local mom and pop businesses.

Yes, it's going to be different but I don't think we know exactly what is going to happen yet.


Kind of like a Jevons paradox for software engineering!


> You'll see more and more businesses using it, ones you didn't expect, like local mom and pop businesses.

And in a lot of cases they're not going to need to hire a software developer to develop it.


Yeah, definitely, but we already saw this coming before LLMs, right? Low-code solutions, site builders - I mean, MS FrontPage is as old as the Web itself. Just like you needed top MechEs to build heat exchangers in the old days, nowadays a small team makes a design and technicians handle most of the actual work of assembly and maintenance. Likewise, you'll probably just have low-code engineers who glue existing parts together to make things work at your local cafe or restaurant.

One of the reasons the work at Big Tech is so fun is because they actually need to engineer a lot of things. Smaller tech companies or businesses that utilize tech can just glue a few libraries together to make it work.


What do you call the person they are going to hire to use the tool? Or are they going to switch gears from selling painted doodads and chicken sandwiches to firing up an IDE, even a web-based one?

The average human is only slightly smarter than the average rabbit. They are going to pay someone who specializes in this task.


Thanks for the future you helped create for us young people. Hopefully the next generation is more merciful to 'useless' people than the previous one.


This is shortsighted. AI slave labor is the only route to true overproduction and Marxism, a welfare state. Otherwise, everybody becomes a slave.


Who needs slaves? What use will they be?

Why overproduce?


The ultimate use of slaves (or servants, or employees) is about having power over other people (them) - this goes beyond production needs...

The average person might not care for this, but the psychopaths who crave (and are in) power do...


Overproduce so that the plebs have something after the elites seize the lion's share


Who needs plebs? I mean, a couple of thousand, maybe. But beyond that?


Who needs people? Elites or not, what use will they bring? To do what? And elites do need plebs, without whom they'd have no point of reference to set themselves apart.


Hi Mark,

One question out of academic curiosity: I'm exploring ways to use these tools for research projects in econ and am struggling to see a good angle.

For instance, suppose I have lots of PDF reports on how firms have evolved each quarter (10,000 reports, or any other number beyond what I can read).

Can LLMs be used to spit out variables based on these reports? E.g., indicator variables (optimistic vs. pessimistic), categories, etc., that I can then use as data to test economic models?

I tried simple cases with ChatGPT 3.5 and got the feeling it's great for outputting narrative text, but it felt mediocre when the output was more narrowly defined into categories (which was surprising).


Look at the LangChain and Llama-Index (formerly GPT-Index) projects, which make smaller projects that need to use a large amount of text data doable. There is also support for reading PDF files (and many other data sources) and for pre-computing embeddings. Spend a short while looking at the example code in the documentation, find something that is similar to your requirements (e.g., semantic search, conversational chat about a set of documents, etc.), and build on that.
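
To make that concrete, here is a bare-bones sketch of just the classification step, using the raw OpenAI chat API rather than LangChain/Llama-Index (whose interfaces change from version to version and which add the PDF loading and embedding layers on top). The model name, category labels, and prompt wording are placeholders, and the report text is assumed to already be extracted from the PDFs:

    # Sketch: map one report onto a single categorical label.
    import openai  # pip install openai; expects OPENAI_API_KEY in the environment

    CATEGORIES = ["optimistic", "neutral", "pessimistic"]  # assumed labels

    def classify_report(report_text: str) -> str:
        prompt = (
            "Classify the overall outlook of the following quarterly report "
            f"as exactly one of {CATEGORIES}. Reply with the label only.\n\n"
            + report_text[:4000]  # crude truncation to stay within the context window
        )
        response = openai.ChatCompletion.create(
            model="gpt-3.5-turbo",
            messages=[{"role": "user", "content": prompt}],
            temperature=0,  # we want a stable label, not creative prose
        )
        return response["choices"][0]["message"]["content"].strip().lower()

    # labels = [classify_report(text) for text in extracted_report_texts]

With temperature 0 and a constrained label set, the output is much easier to use as data than free-form narrative; the embedding/index layers from those libraries matter more when you need retrieval across all 10,000 reports at once.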


Amazing; thanks a lot!


Is that what it is? I've been struggling to figure out why people have had such difficulty seeing the clear impact this is going to have.


are you finding langchain and llama-index similar?


This caught my attention as I found it implausible:

> One DeepMind engineer even reported being able to convince ChatGPT that it was a Linux terminal and getting it to run some simple mathematical code to compute the first 10 prime numbers. Remarkably, it could finish the task faster than the same code running on a real Linux machine.

Following the link, there's a screenshot of a screenshot [0] of a code-golf solution to finding primes, which is quite inefficient, and the author notes:

> I want to note here that this codegolf python implementation to find prime numbers is very inefficient. It takes 30 seconds to evaluate the command on my machine, but it only takes about 10 seconds to run the same command on ChatGPT. So, for some applications, this virtual machine is already faster than my laptop.

So it's not quite calculating primes; more likely it recognizes the code as code for doing so and recites the numbers from memory. That's interesting in its own right, but we won't be running Python on an LLM for a performance boost any time soon. In my experience, this kind of interpretation shows its limits when the model keeps insisting broken code is correct, or, when its mistake is pointed out, apologizes, says it's got new code that fixes the issue, and proceeds to output the exact same code.

[0] https://www.engraved.blog/content/images/2022/12/image-13.pn...


So, on the one hand, these newly publicized models can render convincing representations of realities we used to get from deterministic processes. On the other hand, they're probabilistic and quite often fail to conform to logic, and in a confident way.

We're building systems capable of programming computers non-deterministically. I think this is huge - not because ChatGPT 23 will be a CEO or a politician, but because this is a paradigm shift in computing similar to moving from integrator machines to general computers. I don't think LLMs will make programmers obsolete. I think large enough models will make programming something completely different from what it is now.

The days of sequencing tokens for compilers/interpreters seem to be drawing to an end as the dominant way of specifying software products.


The LLM can act as a global cache for common solutions to common problems, with the ability to perform the integration work necessary to apply them.

That prime number example is a little bit like when you put a functools.lru_cache decorator on a function in Python. It's faster than computing the function call because it's able to recall the return value for the parameters from the cache "memory".
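
For anyone who hasn't used it, the decorator is a one-liner; slow_lookup below is just a stand-in for an expensive computation:

    import time
    from functools import lru_cache

    @lru_cache(maxsize=None)
    def slow_lookup(x: int) -> int:
        time.sleep(2)  # pretend this is expensive work
        return x * x

    start = time.time(); slow_lookup(12); print(f"first call:  {time.time() - start:.2f}s")
    start = time.time(); slow_lookup(12); print(f"second call: {time.time() - start:.2f}s")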

Of course, many skilled programmers are also mainly used as a cache for common solutions to the common problems organizations have in the programming domain. As humans, we can derive satisfaction from being able to tackle the same task others can, as confirmation of our own progress. We like "doing it ourselves". But globally, that's not super interesting if lots of people are constantly re-developing the same solutions to the same problems for their own benefit. I guess that's the push-and-pull (and the anxiety generator).


>The LLM can act as a global cache for common solutions to common problems, with the ability to perform the integration work necessary to apply them.

In my opinion, Stack Overflow does a fine job at that. And it's transparent, in that the proposed solutions are voted on and discussed. Turning that into sausage advice is a downgrade.

>But globally that's not super interesting if lots of people are constantly re-developing the same solutions to the same problems for their own benefit.

I'd argue this is how we train individuals, and thus it is globally quite relevant.


I'm sort of glad to be retiring soon. I have a feeling everything I enjoy about programming is going to be going away.


I get the feeling. But I've always enjoyed the abstract puzzles more than anything. Computers attracted me as a form of very complex abstract puzzle.

But when it comes down to it, everything in life is just nth-dimensional tensor puzzles. What I really cherish computers for is giving me fast and clear feedback.


I'm about 10-15 years from retiring, and lately, I've been thinking a lot about how to handle the rest of my career with all these new AI advancements.


This disruption does appear to be different from prior ones, in that it is not a narrow disruption with limited boundaries that we can plan and organize our lives around, with some stabilization period to follow.

Instead, it is constant and accelerating destabilization. Hundreds of AI projects attempting to ride this accelerating wave were essentially just made obsolete yesterday - https://www.youtube.com/watch?v=DH-2BHDYNfk

I feel the excitement is very soon going to turn into the frustration of trying to remain relevant ahead of an accelerating technological curve. Humans need periods of stabilization to plan and reason about their lives.


What's most interesting to me is that this is how I would expect a human to approach the problem if presented with the code and asked for the output.

The LLM didn't run the code, it tried to predict the output based on its knowledge of python and primes.


When I was a bored and under-challenged student in the early days of university, one of my tests during a programming exam was to write a program that spit out the factorial of an input integer.

For shits and giggles, I just wrote a series of `if...then` statements for 0 to 5, and only actually calculated the factorial for inputs >=6. I passed after the examiner just input 3 and 5 and was satisfied that the output was correct. Oops.


Doing a table lookup for common cases and computing for the less common ones is perfectly valid!


Not 100% sure but I believe this is how we landed the lunar module on the moon the first time...tan/arctan/both were too hard to compute on the processors of those days so they discretized into half angles & stored the tangents in a lookup table.


The Pentium FDIV bug was due to a bad value in a lookup table.


That's how early calculators worked, and some fast math libraries today too.


And for some problems it's even efficient. Not factorial, though.


Oh you're right, hive mind, it's efficient. Sorry.


This is the way. Next, you could cache those values for >=6 when you compute them and also use the previously cached values for sub-problem solutions.
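
Something like this, where the original if...then table seeds the cache (just a sketch of the idea, not anyone's actual exam answer):

    # Seed the cache with the hard-coded small cases, then compute larger
    # factorials from the nearest cached sub-problem and remember the result.
    _cache = {0: 1, 1: 1, 2: 2, 3: 6, 4: 24, 5: 120}

    def factorial(n: int) -> int:
        if n in _cache:
            return _cache[n]
        _cache[n] = n * factorial(n - 1)  # reuse the cached (n-1)! sub-problem
        return _cache[n]

    print(factorial(8))   # computed once: 40320
    print(factorial(10))  # only the 9 and 10 steps are new work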


Sorry, did it try to predict the output, or did it only predict the output based on what it has seen people say about similar code online?


I think its knowledge of Python can only come from comments about code, since it can't execute the Python or learn by trying code like a human would.

That doesn't change the fact that it's predicting the output based on its knowledge of Python, though.

So, yes you're right, but at the same time it doesn't change anything IMO.


Is your objection to the use of the word "try"?


The difference is "did it study the code and inputs/outputs" vs. "did it study comments about the code".

What would it do if the training data was only code? Only code+inputs+outputs? Only function signature+inputs/outputs? Only signature+comments?

Etc.


The second one


Large language models are storytellers. To write a story about someone using a computer terminal, there are things it’s helpful to know, but there are easier ways than simulating a computer.

Since we don’t know how it works, we should be open to the possibility that it’s using all sorts of storytelling techniques.


So here are a few screenshots that I personally took after telling it that it was a linux terminal:

Correctly produces a convincing output for having decoded the base64-encoding of "ping google.com" and then piping it to bash: https://media.infosec.exchange/infosecmedia/media_attachment...

Similar command, but with a garbage domain it hasn't seen before, and a less well-known domain. It produced convincing output in both cases: https://media.infosec.exchange/infosecmedia/media_attachment...

Having it just output a base64 decoding of an obviously unique string. Fascinatingly, it tried to correct typos that I intentionally included: https://media.infosec.exchange/infosecmedia/media_attachment...

This was also pretty cool -- ask it to start a webserver then curl it: https://media.infosec.exchange/infosecmedia/media_attachment...

Telling it that it is a python interpreter and calling self-defined functions: https://media.infosec.exchange/infosecmedia/media_attachment...

A slightly more complicated function: https://media.infosec.exchange/infosecmedia/media_attachment...

I did a few more experiments, including generating large factorials that took a long time on my laptop; it responded for much larger inputs than my laptop could handle, though the results were only accurate to the first few hundred digits.


Lol why does ChatGPT hit ^C? Do you think it’s getting bored of waiting to respond to you and decides that’s enough time spent on an answer?

Edit: the base64 decoding examples are terrifying. I have no idea how that works.

Edit2: actually I can sort of see how that works, b64 encoding doesn’t have any obfuscation to it, I could see how an ML model can build a pretty good approximation over time after seeing all the examples on the internet.
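
Edit3: for reference, the mapping really is mechanical; a couple of lines of Python show what the model has presumably learned to imitate (expected output in the comments):

    import base64

    encoded = base64.b64encode(b"ping google.com").decode()
    print(encoded)                             # cGluZyBnb29nbGUuY29t
    print(base64.b64decode(encoded).decode())  # ping google.com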


it's probably hitting 'caret' 'C' except when it's using Emacs when it types C-C



I was personally quite appalled that so many people believed this was what it was doing, but I suppose since the source was a DeepMind researcher, it lends a lot of credibility.

By the same reasoning, I hope people do realise that there are computations that are just impossible given the finite depth in a Transformer. The only possible way to overcome this in the current paradigm is the Chain-of-Thought related methods.


Yeah, I tried asking GPT-4 to write a simple Python program to find the millionth prime. It wrote the naive approach where you loop over numbers and, for each number, check all of the possible divisors up to its square root, keeping track of how many primes you've found. Obviously, this takes quite a while to execute for N = a million. And yet, if you ask GPT-4 what the output of the code will be, as if by magic (memorization) it immediately spits out 15485863. That makes it a lot clearer that it's not actually simulating the code.
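
For reference, the naive approach looks roughly like this (a reconstruction of the description above, not GPT-4's literal output):

    # Trial division up to the square root, counting primes until the nth one.
    def nth_prime(n: int) -> int:
        count, candidate = 0, 1
        while count < n:
            candidate += 1
            if all(candidate % d for d in range(2, int(candidate ** 0.5) + 1)):
                count += 1
        return candidate

    print(nth_prime(1_000_000))  # 15485863, eventually

Actually running that takes minutes in CPython, while GPT-4 "returns" the answer instantly - which only makes sense if it has memorized the result rather than simulated the loop.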


Yep, I tried asking ChatGPT to optimize some SQL queries with a heavy amount of full outer joins. The optimization I was trying to push it toward was specifically adding an index on a column, or filtering the first table we join on, but it kept creating SQL subqueries and switching the full joins to left joins no matter what I told it and no matter which SQL errors I sent it.


Its speed as compared to the laptop - I assume it was running online, on OpenAI's hardware, which is quite extensive, I thought. I believe I've seen that the cost of one (not 'search' - what's the word here, 'output'?) is something like 1-10 cents.


GPT is never calculating anything!!


So it sounds like the mechanism is something like stochastic memoisation?


I don't think "memoisation" is an accurate word for this; it implies doing the computation once, and storing the result in cache to return later. It's more like replacing your Python code with a SQL query of the LLM's understanding of what it's supposed to do, which may or may not be correct, and executing it on its "database" of knowledge, which may or may not have the correct data.


this is new to me but.. a quick read of Wikipedia [1] later, it appears that this decades-old method takes a goal, and then uses branching and recording to find paths that differ the least from the goal. The article mentions that the curse of dimensionality is so bad here that approximations are often used in practice. Does that capture it?

[1] https://en.wikipedia.org/wiki/Stochastic_dynamic_programming


[flagged]


Equally interesting is the psychology of people who take the time to write long posts borderline gloating about tools making software engineers obsolete.


Except I didn't write that and I will quote myself:

>"I can tell you this. I do not know future iterations of LLMs can take over our jobs"

I wrote that I don't know, which is the best answer any of us have at this point.

Given the evidence, completely denying it, as many have, is simply not realistic.


Okay, but you have another post in this thread with:

> When these LLMs get normalized probably 5 years from now I'm going go back to these old threads and contact these people who are in self denial and throw it in their face. I'll just link this comment and be like I TOLD YOU, I TOLD YOU, YOU WERE WRONG.


My bad. It's the caps. I'll remove the caps. Just picture me saying it a nicer way.


Gloating over people being wrong is rarely nice


Not if it's said in a nice way. The commenter called LLMs a freaking autocomplete. That's not nice either, but nice enough for HN. So I figured a little subtle gloating is deserved and well within the bounds of etiquette here on HN.


Well, insulting the capacity of an LLM is much different from gloating over someone being wrong about their livelihood.


[flagged]


I didn't flag your posts, friend. Even if I dislike the tone I get the impression you're being sincere, and I think they add to the conversation.


Rude.


I'd like to see posts on LLMs written from a different perspective. For me, the surprise comes not from the sudden emergent capability of language models, but that the understanding (and synthesis!) of ideas encoded in language has succumbed to literally nothing more than statistical analysis. Or at least come that much closer to doing so.

That it bears so close a resemblance to actual thinking says more about the importance of language to cognition than the other way around.


This is what Stephen Wolfram concludes in a recent article about ChatGPT:

> The specific engineering of ChatGPT has made it quite compelling. But ultimately (at least until it can use outside tools) ChatGPT is “merely” pulling out some “coherent thread of text” from the “statistics of conventional wisdom” that it’s accumulated. But it’s amazing how human-like the results are. And as I’ve discussed, this suggests something that’s at least scientifically very important: that human language (and the patterns of thinking behind it) are somehow simpler and more “law like” in their structure than we thought.

https://writings.stephenwolfram.com/2023/02/what-is-chatgpt-...


> (at least until it can use outside tools)

This is key. ChatGPT/GPT-4 alone are limited to reformulating what they know from their training data. Linked to search engines, databases, and computational tools such as Wolfram Alpha, they acquire much more capability. We're already seeing that with Microsoft Bing.

(Update: what happens as large language models learn Excel? Especially since Microsoft is already connecting them to Excel.)
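
A hand-rolled toy version of that tool-use loop, to make the idea concrete (the CALC(...) convention, prompt, and model name are my own assumptions for illustration, not any particular framework's API):

    import re
    import openai  # expects OPENAI_API_KEY in the environment

    SYSTEM = ("When you need arithmetic, reply with exactly CALC(<python expression>) "
              "and nothing else; otherwise answer normally.")

    def ask(question: str) -> str:
        messages = [{"role": "system", "content": SYSTEM},
                    {"role": "user", "content": question}]
        reply = openai.ChatCompletion.create(model="gpt-3.5-turbo", messages=messages,
                                             temperature=0)["choices"][0]["message"]["content"]
        match = re.fullmatch(r"CALC\((.+)\)", reply.strip())
        if match:  # the model asked for the tool: run it locally, hand the result back
            result = str(eval(match.group(1), {"__builtins__": {}}))  # toy calculator only
            messages += [{"role": "assistant", "content": reply},
                         {"role": "user", "content": f"CALC result: {result}"}]
            reply = openai.ChatCompletion.create(model="gpt-3.5-turbo", messages=messages,
                                                 temperature=0)["choices"][0]["message"]["content"]
        return reply

    # print(ask("What is 123456789 * 987654321, exactly?"))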

What's striking is how fast this field is advancing. Huge advances over months, not years or decades.

We now have a much better idea of how intelligence evolved. It's mostly just more neurons. One of the great philosophical questions has, inadvertently, been answered.

Is the Singularity happening right now?


> ChatGPT/GPT-4 alone are limited to reformulating what they know from their training data.

That's not true. They are extrapolating. If they weren't, they wouldn't have the problem known as "hallucination".


> Linked to search engines, databases, and computational tools such as Wolfram Alpha, they acquire much more capability.

I see this as analogous to the human brain; there are different structures which are particularly good at specific tasks/functions. They all work together.

The only difference between a human brain and an ANN is a difference of degree. A neuron and an artificial neuron are functionally identical. I think as we start interconnecting these models we'll see surprising emergent properties.


>> Is the Singularity happening right now?

No, but it seems everyone loves to LARP it anyway.


I wonder if different languages lead to different capabilities. If I ask the same question in English, Japanese, and German, will I reliably get "better" answers in one language over another?


The models transfer knowledge between languages, so probably some difference in capabilities but not a ton in core capabilities.

It can solve a physics problem in Telugu close to as well as in English.


I find the phrase "statistical analysis" a frustrating one nowadays as it seems to have become a signal for "I hold a particular philosophy of the mind".

I don't understand this use of "statistical" as a diminutive to describe these models.

Why can't incredibly complicated behavior be emergent from matrix multiplication subject to optimization in the same way that our biological matter has developed complicated emergent properties also being subject to optimization?

The loss function is very different, the optimization techniques as well, but the fundamental idea of complex behavior emerging out of a substrate subject to optimization seems common. I haven't seen a single good answer to that


Well, to a degree, because we just don't do science like that.

You're the one supposing a thing, so the burden of proof is on you. You need to demonstrate that the "incredibly complicated behaviour" that you're referring to (I assume this is longhand for "thinking", but please correct me if I'm wrong) is indeed emerging from matrix multiplication. Especially given that what you're suggesting is unexpected, given the known way these models work and the explanations that have been put forth already that extrapolate from the known way these models work.

If science were so credulous as to accept the first proffered theory about a new development, well, we wouldn't have these interesting AI models in the first place!


I can't say that my philosophy affected my choice of words in the way that you mean. I'm just expressing wonder at how well the abstraction performs.


> I find the phrase "statistical analysis" a frustrating one nowadays as it seems to have become a signal for "I hold a particular philosophy of the mind".

LLMs are trained to reproduce human text; that is different from, for example, AlphaGo, which is trained to win Go games. Trained to reproduce data is what we mean by a statistical model. Trained to win is how we got superhuman performance before, while trained to reproduce data performs worse than the original creators of the data.


This is reductive in the sense that things like AlphaGo were bad at Go for a very long time, and then, with more compute power and algorithm changes, suddenly they were far better. And while the problem space for Go is absolutely huge, it is still an insignificant portion of the problem space for knowledge.


Perhaps that's because you haven't asked a single good question?

What do matrix multiplication and optimisation have to do with the way the human mind, or the human brain, works? That they have anything to do with it at all is your assumption, one you seem absolutely convinced of - and then you go asking people why it can't be true? You assert that it is true. It's your assumption.

Matrix multiplication and optimization are human mathematical techniques. They have about as good a chance of being something that exists in nature independently of humans as Magic: the Gathering and Call of Duty. They might be useful models to help us understand how things work, but to assume they are how things work is a huge leap of faith.

Ask the right questions and then an answer may even suggest itself. Ask questions that follow from your preconceived answers and you're in a world of fantasy.


Some anthropologists suggest that our main evolutionary advantage was not so much our individual ability for reasoning, but our collective capacity to accumulate, evolve and transmit cultural knowledge over the centuries and millennia.

Skills like fire or language, for example, had a major influence in the development of our species and are mainly culturally transmitted: trying to reason your way into creating one or the other from scratch is a surprisingly difficult task.

If that point of view is true, then it shouldn’t be surprising that a large part of what we consider human-like behaviours should be tractable simply by analysing large amounts of data. AI systems are not modelling cognition, but culture.


To use my favorite Stephen Hawking quotation (that's also been sampled by Pink Floyd), "for millions of years, mankind lived just like the animals, then something happened that unleashed the power of our imagination: we learned to talk."


I think the task of predicting the next word can be misunderstood. The better you want to be, the more you have to "understand" how the previous words interact. From the style of writing to the current topic under discussion, the task gets increasingly complex if you want to be really, really good. How could the next sentence start? Will the author end the sentence here or keep going? These questions are very complex.

This does not mean that we humans predict all the time; in fact, I would argue that LLMs only predict during training - they generate otherwise. We might also learn by trying to predict. I can imagine babies doing it.
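
A toy version of that predict-vs-generate distinction, with a bigram count table standing in for the model (purely illustrative):

    import random
    from collections import Counter, defaultdict

    corpus = "the cat sat on the mat and the cat slept on the mat".split()

    counts = defaultdict(Counter)
    for prev, nxt in zip(corpus, corpus[1:]):
        counts[prev][nxt] += 1

    def next_word_probs(prev):
        total = sum(counts[prev].values())
        return {w: c / total for w, c in counts[prev].items()}

    # "Prediction": how much probability goes on the word that actually came next?
    print(next_word_probs("the"))  # {'cat': 0.5, 'mat': 0.5}

    # "Generation": sample a continuation from those same distributions.
    word, out = "the", ["the"]
    for _ in range(6):
        probs = next_word_probs(word)
        word = random.choices(list(probs), weights=list(probs.values()))[0]
        out.append(word)
    print(" ".join(out))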


I had the same thought watching my son as a baby. So much of his day seemed to be focused on predicting what happens next, and he got so much joy when he succeeded. So many life skills are predicated on knowing that if I do this, that will happen, which gets me closer to my goal. I started to wonder if intelligence and prediction are really the same thing.


Interesting take... especially as I have often wondered, watching and loving animals, whether our "intelligence separation" from them isn't to a large degree because of language (and the wisdom storage and transfer that comes with it).


There is a startling acceleration of innovation in the field that GPT-4 illustrates. According to NVidia, LLM sizes have been increasing 10X per year for the last few years. This tech is going to hit every aspect of society like a sledgehammer over the next 48 months.


The same was said 10 years ago. It's astonishing what can be done, but you can already see fundamental limits. I think it will raise productivity for some tasks, but not fundamentally change society.


The number of people saying it now is many orders of magnitude more than the number of people saying it 10 years ago. Not saying that means it will happen, but it isn't the same situation.


> The number of people saying it

Popular sentiment is a pretty meaningless metric for predicting the future.


Then we shouldn't use the 10 year-old (lesser) popular sentiment as a reason to discount AI either.


That still doesn’t work. A reliably wrong predictor is still valuable (just negate the output).

I'm arguing public sentiment will be right 50% of the time (in difficult binary predictions), not 0%.


The A.I. Monte Carlo Ray Tracing Hallucination Engine can change society by showing the as-is and to-be next state. Two use cases: new infrastructure installation or upgrade, and time-interval inspection tracking ideal-case vs. real-world condition. Suppose a lazy contractor skips steps and cuts corners, or a pathological contractor builds the thing and pulls the thing apart over and over again when all that was needed was a one-and-done, or the change is for the worse. A civil engineer can walk around with an iPad and iPhone to check the work against the master plan.


You need to replace the civil engineer there (in today's world this is an inspector, I don't know for sure if they are professional engineers or not) for it to be useful, but you still need to have someone not working for the contractor who's incentivized to fake it.

The trouble with many of those construction examples is that they're point in time. Installer fucks up how the windows are installed re: flashing and water-proofing, but then puts the siding over it... the error is now completely hidden.

You could automate the inspection with AI photo analysis of every single window install on the project, say - but we could already do that for inspections, versus sending someone out, by sending the photos to an expert instead, and we don't. Whether that's for lack of incentive to improve, or a lack of ability to go deeper out of distrust for builders, I don't know.


You mean AI can detect when a plumber is bullshitting me? That sounds great; could you elaborate?


Given a few sensors that could be installed on the house's pipes, one can imagine several ways an AI can check that all is good. Same for checking blueprints for quality/validity and purchases the plumber says are "needed"...


Can you elaborate on fundamental limits?


> According to NVidia, LLM sizes have been increasing 10X per year for the last few years.

Clearly this cannot continue, as the training costs will exceed all the compute capacity in existence.

The other limit is training data, eventually you run out of cheap sources.


They may also run out of data if they already consumed most of the internet. Or start producing so much of the internet's content that LLMs start consuming what they write in a closed loop.


Sure, but they do not need to grow infinitely more, they just need to grow sufficiently to perform better than 90% of humans.


I would not be so sure about compute capacity. Neural network architectures are still in their infancy; it is very likely that more efficient approaches exist.


Sure, but the small & efficient networks so far have been pretty poor. Edge AI was hugely hyped a few years ago and it petered out. All of the big tech companies seem to be chasing the biggest networks with the largest training sets; that seems to be the direction for now.

There could be huge efficiency savings within the implementations but when training costs are already this high it seems naive to think the low-hanging fruit is still there.


I wouldn't be so confident. Flash attention is a recent, significant improvement to training times and sure looks low-hanging now.

I was not familiar with Edge AI, interesting concept. I feel like improving the efficiency of very large models is much more likely.

The recent successes will lead to an even larger influx of $ in the short term --- this is an existential threat to Google, after all. We will see where things go!


We've had neural nets since 1943. The architectures are not "in their infancy", new architectures have been developing for decades, even entire neural net paradigms (feed-forward nets, recurrent nets, recursive nets, etc etc.). Their scale has also been increasing ever since Hinton and friends rediscovered backprop in the '80s. Neural nets are positively ancient at this point, not "in their infancy"!

I don't know why people just keep repeating this complete fantasy as if it were true. Where does it originate from, I wonder? I suspect someone said something like that on social media, their post went viral, and now all of the internet is reverberating with this thing. It's a meme, yes?


> Where does it originate from, I wonder? I suspect someone said something like that on social media, their post went viral, and now all of the internet is reverberating with this thing. It's a meme, yes?

I don't have social media outside of HN.

It comes from a few observations:

1) Large models are still improving with increased parameter counts (we do not know where the ceiling is yet; it could be low but it could also be high).

2) Most current architectures train by using all model parameters to produce an output, which is vastly inefficient. While it is not clear how to improve on this in the general case yet, in the simpler problem of NERFs, sidestepping this issue has led to a ~100x improvement in training time.

3) https://mingukkang.github.io/GigaGAN/ very recently increased the parameter count of StyleGAN by selecting parameters dynamically at runtime. They improved on previous results by a very, very large margin, at somewhat comparable training times.

I stand by my claim: "neural network architectures are still in their infancy, it is very likely that more efficient approaches exist". I am not claiming that AI will become sentient or anything crazy, and I do not understand why you are associating my point of view with other people's. I just said that it is likely that a novel technology will continue to improve (has this ever NOT been the case for any new technology?).


>> It comes from a few observations:

Well, if you want to know whether neural nets are in their "infancy" you shouldn't make "observations", you should read the literature. It goes back many years. Go to the primary sources, why try to guess and risk guessing wrong, as here?

>> I stand by my claim: "neural network architectures are still in their infancy, it is very likely that more efficient approaches exist".

Half of your "claim" is incorrect. Don't just double down on it! There's really nothing to "claim" here, the "infancy" or not of neural nets is not a matter of claiming or guessing. Either you know what it is, or you don't.


Your tone comes off as very adversarial/angry. Intentional or not? The claim is about the likelihood of more efficient approaches existing, not about the subjective qualifier preceding it (misunderstanding?). Yes, NNs have existed for a while now, but they will also exist for a long time forward in the future (hence the perhaps poor choice of word: infancy).

"if you want to know whether neural nets are in their 'infancy' you shouldn't make 'observations'" -> I think you have understood this as "neural networks did not exist before" when what I meant is "neural networks will still change a lot in the future"

I was interested in discussing, specifically: will NNs continue scaling up in size in the near/long term. In answer to the statement "the training costs will exceed all the compute capacity in existence", I added "it is very likely that more efficient approaches exist.". You replied that NNs have existed for a long time, and that because of this, it is a complete fantasy that more efficient approaches exist. Do you feel like this is a fair assessment or no?


My problem is that I read what people write on HN and treat it with the same seriousness and respect I want people to treat my comments, when the majority are only saying whatever comes to their mind just to make some sort of impression. Then when I point out some obvious error, people freak out and get defensive because they never expected anyone to take them seriously, they're just spouting off whatever without really thinking. And then they try to wiggle out of the conversation, just like you're doing right now, pretending that you were misunderstood.

Well, my mistake then for taking you seriously. Many apologies. You can rest assured it won't happen again.


I feel very sad reading this. You are absolutely correct that I view HN as a watercooler where I make hyperbolic statements, but incorrect when you say that I did not take you seriously. I was frustrated because I felt like you were latching on to a figure of speech just to be argumentative instead of trying to understand what I meant (I was not trying to say that neural networks are literally new; I was trying to say that we have no proof that the parameter-to-flop ratio is even close to optimal, because architectures are still changing a lot).

You are right that neural networks are not in their infancy. Best wishes.


There is no channel for uncertainty.

LLMs of this type will just start making up shit when they don't know something, because they simply generate the most probable next token based on the previous x tokens. This is not fixable.

This alone makes these LLMs practically unusable in the vast majority of real-world applications where you would otherwise imagine this tech being used.


Yeah, it's a simulator of human text on the internet.

For instance, your comment confidently states this is unfixable - presumably based on the frequency with which you've seen similar text on the internet. Why should anyone believe the veracity of your statement? These things didn't have any of these emergent capabilities one year ago; why are you so sure you understand their nature one year from now?


"your comment confidently states this is unfixable - presumably based on the frequency you've seen similar text on the internet. why should anyone believe the veracity of your statement? "

No, it's because GPT is based on transformers.


and you aren't?

Aren't you just a function of your input and memories (stuff you've read, sensory input) as run through/managed by some neural network?

What makes you think the rest isn't just emergent properties?

And what makes you think you can't hook up the LLM with some algorithms or layers that handle some of the rest of what your brain does?


Yep, the idea of grounding seems interesting to me. Everything in an LLM is just a statistical dream at this point, with no 'reality basis'. I wonder if it's possible to give the language model grounding points of things that are real and build a truth model from that.


No, our brains aren't based on transformers.

And the issue of lost uncertainty is inherent to this, yes.

To fix this, a new type of LLM would have to be invented. This particular branch of development may very well be a dead end.


The reason they seem to make things up is because they have no way to verify anything, they can only speak of things in relation to other things, but they have no epistemic framework. This is very much a fixable problem that augmentation with logic engines and a way to prioritise truth-claims could go some ways towards solving.


My memory could be improved by connecting my brain to an external hard drive. Wiring them together, alas, is not just hard; we have absolutely no idea how.


We do have some idea how, most people just don't really want to deal with the nightmare of being augmented and the life changing consequences that come with it, on top of the risk.


Really?! Have we done it in mice?


It's not clear why this would be a fundamental limit rather than a design flaw that will eventually be solved.


It might get solved but we have no idea how. There's no (readable) db of facts and no (visible) logical processing to improve on.


To me "we have no idea how" != "this is not fixable" (and even "we have no idea how" even seems like a strong statement.)

Perhaps it's because I'm ignorant about the inner workings, but calling the problem "unfixable" so early in the evolution of LLMs seems like foolish certainty.


I pretty much agree. That wasn't me who called it unfixable. (This is a UI issue with HN that keeps coming up for me -- people being mistaken for the OP.)


This is absolutely not a fundamental limit but simply a hard challenge. Approaches exist, and it is an active field of research in which we are making progress.


I would disagree with both of you. It's an open question whether LLMs can be made reliable.


Fair. It's not proven that a solution exists for our models, but I don't see much that leads me to believe it's impossible. I know GPT is not reliable, but there's also really not much being done to improve reliability. It's open research but certainly interesting. Most approaches I know of are developed on much smaller datasets and models, and usually in a computer vision context.


What if LLM knowledge expands over time to be sufficient for certain real-world applications?


It already is: translation.


I'm guessing one is data. The limit would be once you've trained an LLM on all public (or even private) data. Sure, you can still make some improvements or try to find some additional private data, but still, a fundamental limit has been reached.


I think that’s actually really not a limit.

We don't teach babies by throwing lots of data at them; instead we teach them by giving them useful data.

The loop I see is:

- train on a lot of existing data
- run out of useful data
- people use AI and give feedback (we're here)
- perform reinforcement learning on the data collected

Loop over the last two steps.

There is already more than enough data available, it’s just not nicely labeled to say if it’s high quality or should be discarded. Those last two steps will implicitly do the labeling.


Reality throws a lot of unfiltered data at a baby. Now I guess you can consider things like gravity and pain useful data because of the consequences of violating them. But it's this data that grounds the baby in the world it exists in.


I'm wondering what happens once LLMs are generating large portions of the internet. What then? It's poisoning its own well at that point.


Good point. But isn't the next logical step to allow these systems to collect real-world data on their own? And also, potentially even more dangerous, to act in the real world, try things out, and fail, to further their learning.


This is likely exactly what we will do... which is very questionable when you may have an unaligned paperclip maximizer hidden in there.


Is it even feasible any time soon to train an LLM on all of YouTube?


Napkin math, assuming around 156 million hours of video on all of YouTube:

    156 million hours of YouTube videos
    9,000 words/hour
    6 characters/word (including space)

    First, let's find out the total number of characters:

    9,000 words/hour * 6 characters/word = 54,000 characters/hour

    Now, let's calculate the total number of characters for 156 million hours of YouTube videos:

    54,000 characters/hour * 156,000,000 hours = 8,424,000,000,000 characters

    Since 1 character is typically 1 byte, we can convert this to gigabytes:

    8,424,000,000,000 bytes / (1024 * 1024 * 1024) ≈ 7,845.46 GB

So, 8TB of text? Seems doable.
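
A minimal Python sketch of the same estimate, using the same assumed figures as above, for anyone who wants to tweak the inputs:

    # Re-running the napkin math above; all inputs are the stated assumptions.
    hours = 156_000_000        # assumed total hours of video on YouTube
    words_per_hour = 9_000
    chars_per_word = 6         # including the trailing space

    total_chars = hours * words_per_hour * chars_per_word
    total_gb = total_chars / (1024 ** 3)   # ~1 byte per character

    print(f"{total_chars:,} characters ≈ {total_gb:,.2f} GB")
    # 8,424,000,000,000 characters ≈ 7,845.46 GB, i.e. roughly 8 TB of text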


I mean the actual video, that's much bigger

With a vision transformer, each token may be around 16x16 pixels. I found an example where they use images of resolution 224x224 for training a vision transformer, so if we go with that, that's 256 pixels per token and 50,176 pixels per image, so 196 tokens per frame. At 24 frames per second, that's 4,704 tokens per second, or 16,934,400 tokens per hour. In total we're at 2.6x10^15 tokens.

GPT-3 was trained on 5x10^11 tokens, so YouTube done this way would be around four orders of magnitude more tokens than GPT-3 was trained on.

GPT-3 was undertrained by 1-2 orders of magnitude, so the compute required to train a model on YouTube would then be around 6 orders of magnitude higher than what was used to train GPT-3, so about one million times more.

I did a linear regression on the training costs from Cerebras (1) and came up with the formula (1901.67366*X)-197902.72715, where X is the number of tokens in billions.

Plugging in 2.6x10^15 tokens we get a training cost of about 5 billion dollars. I guess a lot of optimizations could be done that would decrease the cost, so maybe it's doable in a few years.

1. https://cirrascale.com/cerebras.php
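
A sketch of the same arithmetic in Python; the ViT figures (224x224 frames, 16x16-pixel tokens, 24 fps) and the linear cost fit are the assumptions stated above, so treat the output as a rough extrapolation only:

    # Token count for all of YouTube as raw video, under the assumptions above.
    hours = 156_000_000
    tokens_per_frame = (224 * 224) // (16 * 16)        # 196
    tokens_per_hour = tokens_per_frame * 24 * 3600     # 16,934,400
    total_tokens = tokens_per_hour * hours             # ~2.6x10^15

    def training_cost_usd(tokens_in_billions):
        # linear fit to the Cerebras pricing quoted above -- rough extrapolation only
        return 1901.67366 * tokens_in_billions - 197902.72715

    cost = training_cost_usd(total_tokens / 1e9)
    print(f"{total_tokens:.2e} tokens, roughly ${cost:,.0f}")
    # ~2.64e+15 tokens, on the order of $5 billion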


What they said 10 years ago was correct. It did hit society like a sledge hammer. Machine learning basically took over the AI space and penetrated the consumer space with applications that were all but impossible in the previous decade. There's AI chips in smart phones now.

What you're seeing here with LLMs is sledge hammer number 2.

It's understandable how most people don't notice the sledge hammer. The decade prior to 2010 had no smart phones; we were then hit with a smart phone hammer AND an AI sledge hammer, and the integration was so seamless we didn't even notice.

Much of the same will happen with LLMs. In 5 years it's so normal, nobody cares and likely we will forget what life was like before LLMs.


>What they said 10 years ago was correct. It did hit society like a sledge hammer. Machine learning basically took over the AI space and penetrated the consumer space with applications that were all but impossible in the previous decade. There's AI chips in smart phones now.

And still, almost all of these applications are not really impactful or that important compared to actual society- and life-changing developments like the steam engine, electricity, electromagnetic transmission, cars, computers themselves, and the internet.

Just more of the same, with added spice.

In the sense that we could revert to 10 years ago and nothing would be much different or missed, whereas going back to a world without electricity or cars would be a totally different thing!

I think LLMs can be far more impactful than anything else hyped in machine learning of the past 10 years...


I feel like the only form of "AI" I regularly benefit from is when I type "n" in the address bar and it autocompletes to news.ycombinator.com.

Oh, browser, you know me so well.


Yeah, I understand. But, really, there's no need for any AI in that.


Of course we hit many technological limits long ago. Most miracle technologies have already been discovered. Heck, space travel has had zero actual progress other than Elon redoing what we did decades ago more "efficiently".

The AI sledge hammer is as good as we are going to get in paradigm shifts for a while, given how we are at the top of an S-curve for development. Additionally, we're only really seeing the first decade. When the computer was invented, was it ubiquitous within a decade? No. In fact it was largely a sort of useless academic research project for the longest time.


Sledge hammer #1 (voice assistants, AI chips in phones) didn’t cause unemployment. It was at the level of new features and capabilities. Sledge hammer #2 is aimed squarely at “white collar” work without much in the way of bounds to its capabilities.


We can't trust it. That's a pretty hard bound.


Consider that a lot of useful work involves summarization (search++).

“What work has gone on for supply chain improvement in our European operations this year?” - this is the kind of question that is easy to ask in natural language but might take someone a week of searching, messaging, etc to assemble. An LLM with access to all of the code, documents, chats, etc could just give an answer with citations. We are not betting $1B on the answer that it gives, but it has saved us 1 week of work and allows us to move on to the next step in the project.

There are plenty of tasks like this which are highly valuable yet don’t require high trust. The one-shot “what is the airspeed of a coconut-laden swallow” type questions are actually fairly rare.
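
A minimal sketch of what that could look like with the OpenAI chat API; `search_internal_docs` here is a stand-in for whatever internal search or vector index you already have, not a real library call:

    import openai  # assumes the 2023-era openai Python client and an API key in the environment

    def search_internal_docs(query, top_k=5):
        # Placeholder: swap in your real document search / vector store here.
        return [{"id": "doc-001", "text": "...relevant excerpt..."}]

    def answer_with_citations(question):
        docs = search_internal_docs(question)
        context = "\n\n".join(f"[{d['id']}] {d['text']}" for d in docs)
        resp = openai.ChatCompletion.create(
            model="gpt-4",
            messages=[
                {"role": "system",
                 "content": "Answer using only the provided documents and cite them by id."},
                {"role": "user", "content": f"Documents:\n{context}\n\nQuestion: {question}"},
            ],
        )
        return resp["choices"][0]["message"]["content"]

The answer is only as trustworthy as its citations, but checking a handful of cited documents is still far cheaper than a week of searching and messaging.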


There was an episode in the original Kino's Travels anime where, in a country where a computer ran everything, humans still had jobs - their work was basically just verifying that the computer was correct.


We can't trust the police, politicians, or businesses either, but we're still in their hands...


Fair. It's always a question of how much. If someday we can trust AI more than those groups (to do the kinds of things they do), that will be amazing.


For article writing, verification is quick: I don't need to worry about a "bug" in the generated article the way I do with code. For art, verification is instant: a bad generation is simply rejected.

Trust is only a bound for certain areas, and this bound is eroding.


I guess your perception of society is severely limited if you think a fancy autocomplete is capable of changing every aspect of it.


I have fun on these HN threads responding to comments like yours.

It's just fancy autocomplete to you? You honestly can't see the capability it has and extrapolate it into the future?

What's that saying about "it's hard to get someone to understand something when their salary depends on their not understanding it"?


I feel very frustrated with these takes because, instead of grappling with what we're going to do about it (like having a conversation), it's a flat, dismissive denial, and it isn't even grounded in the science, which says that "memory augmented large language models are computationally universal". So at the very least we're dealing with algorithms that can do anything a hand-written program can do, except that they've been trained to do it using natural language in extremely flexible ways. I'm having a hard time seeing how "fancy autocomplete" is the right description for this.


I agree 100%.

I don't understand why we can't look at the potential, or even current, capabilities of these LLMs and have a real conversation about how it might impact things.

Yet so many folks here just confidently dismiss it.

"It doesn't even think!" -- OK, define thinking?

"It doesn't create novel ideas!" OK -- what do most devs do every day?

"It is wrong sometimes!" OK -- is it wrong more or less often than an average dev?


In case we forget the amazing predictive capabilities of HN there’s always https://news.ycombinator.com/item?id=9224


>> I feel very frustrated with these takes because instead of grappling with what we're going to do about it (like having a conversation) it's a flat, dismissive denial, and it isn't even grounded in the science, which says that "memory augmented large language models are computationally universal"

That's not what "the science says"; it's the title of an article that someone put on arxiv.

The article has no theoretical results, just a shoddy empirical demonstration of... something. The author claims that the something is an LLM simulating a Turing machine. But, is it? Really?

Well, here's how the article concludes:

>> Hopefully the reader has been convinced by this point.

"Hopefully" is not how you show computational universality of a neural net architecture. This is how:

https://www.sciencedirect.com/science/article/pii/S002200008...

i.e. with maths. That article you link to is a bunch of hooey.


That's a good paper! Thanks for sharing.


«Computationally universal»? Are you quite sure? Hoo boy pass the nitroglycerin hooooo aaah ohh after 80 years Turing was proven right after all ghasp


Well, I don't understand the nitroglycerin reference, and of course "computationally universal" doesn't mean "sentient", but the point is that when you add the external memory they (Flan-U-PaLM 540B, to be specific) have been demonstrated to be capable of simulating a specific, well-defined Turing machine without special training. There are some other papers out there arguing this from a theoretical angle too, but this is the one whose title I quoted:

https://arxiv.org/abs/2301.04589


5 thermostats in the right configuration can be a universal computer. Approximation, in the computational sense, is relative to some well-defined function, which gives the approximation an ontology. The feasibility of approximation is not disputed, but the content and nature of the ontology is a battlefield of motivated rhetoric.

Yes, this is arguing semantics, but in this particular case semantics is the whole point.


5 thermostats sounds like a very complex system, by the way!



[flagged]


> When these LLMs get normalized probably 5 years from now I'm going go back to these old threads and contact these people who are in self denial and throw it in their face. I'll just link this comment and be like I TOLD YOU, I TOLD YOU, YOU WERE WRONG.

this isn't a kind thing to do.


Nor rational. What a strange impulse.


That was what I thought until a few months ago when ChatGPT was released. I never cared much about LLMs because it always felt like a brute force method to solving problems.

What I'm seeing now is that some kind of intelligence seems to emerge from these models, even though under the hood it is just a bunch of matrix multiplications.

Who can say for sure that our own brains don't work similarly? Maybe human intelligence is something that emerges from similar primitives, and AGI will simply be a really large language model.


It doesn't matter to me whether the intelligence is "really emergent" or "just a simulation." Two things are true:

1. Solving all kinds of nontrivial problems posed in text format is extremely useful, no matter how it works under the hood. This means lots of people will use it, and it will change how people work

2. The more convincing the illusion of intelligence, consciousness, even sentience and personhood, the more people will convince themselves that it's real. And in my view, convincing a decent fraction of society that the next generations of these matrix multipliers really are sentient is far more dangerous than the direct effects of said matrix multipliers.


LLMs currently have no continuous learning feedback loop; yes, we can tune prompts and make them temporarily smarter...

This changes when we do find this loop and start hooking the model up to other input and output devices. At that point I'll call it sentient, and if you don't, I'd call your definition of the word worthless.


Oh man... if it's as stupid as us humans, we are completely doomed. For now let's assume it is what it most likely is: not emerging anything, just a very good simulation of that emergence.

I mean, we already built stupid chatbots in the 90s with convoluted if-else spaghetti that imitated Eliza. Those could already hold up for a few sentences... GPT-4 now holds up much longer, but you can still easily drive it to the point where it starts "looping" and stops providing much more value, especially if you are more of an expert. That makes me hope this length will keep growing, but still, in my "sessions" I did not (gladly) find anything emerging beyond the first wow and the surprising answers... not even close to that.


I have very limited understanding of how these systems are created, but as far as I know they are not programmed to simulate anything. It is just a system that outputs the most likely next word based on a really large dataset of training samples it was given. Yet you are still able to "chat" with these systems.

Here's a session I just ran on llama.cpp (7 billion parameter version). Everything above the "====" line is the initial prompt:

    Transcript of a dialog, where the User interacts with an Assistant named Bob. Bob is helpful, kind, honest, good at writing, and never fails to answer the User's requests immediately and with precision.

    User: Hello, Bob.
    Bob: Hello. How may I help you today?
    User: Please tell me the largest city in Europe.
    Bob: Sure. The largest city in Europe is Moscow, the capital of Russia.
    User:

   ====

    Who are you?
    Bob: Who am I? I'm a virtual assistant, or a VA, for short.
    User: Do you have a name?
    Bob: Yes, I do. My name is Bob.
    User: How old are you?
    Bob: I am 19 years old.
    User: Do you know what is "Hacker News"?
    Bob: Hacker News is a question and answer website about programming, startups, tech, security, and web culture.
This is a super simple example. Llama was trained on huge amounts of unlabelled text data (if I understand correctly, they just feed the neural net the text without telling it anything about it). Yet I'm able to chat with it by feeding it an initial prompt that shows an example chat. Isn't this an emergent property?


The characterization of HN as a "question and answer" website is a fun one.

Is it wrong, or is it getting at the fact that so many people don't read the original articles, just the comments? ;)


User

Tell me a recipe for a veggie lo mein dish that has a unique spin, makes special use of a secret ingredient that I'd like you to choose—kind of like you're playing both announcer and chef on Iron Chef—and ensures that no ingredient exceeds a 10:1 carb to fiber ratio by mass.

GPT-4

Welcome, esteemed viewers, to this special edition of Iron Chef! Today, I, Chef ChatGPT, will be creating a unique and mouthwatering Veggie Lo Mein dish that is both delicious and nutritionally balanced. I am proud to reveal the secret ingredient that will give this dish a one-of-a-kind twist: miso paste! Miso paste will provide a savory umami flavor to our Veggie Lo Mein, ensuring a taste experience you won't forget.

Now, let's begin with the ingredients. Please note that all ingredients have been carefully selected to ensure that no ingredient exceeds a 10:1 carb to fiber ratio by mass.

Ingredients:

8 oz whole wheat spaghetti noodles
2 tbsp vegetable oil
1/2 cup julienned carrots
1/2 cup thinly sliced red bell pepper
1 cup chopped bok choy
1 cup sliced shiitake mushrooms
2 cloves garlic, minced
2 green onions, chopped
1/4 cup miso paste (secret ingredient)
1/4 cup low-sodium soy sauce
1 tbsp rice vinegar
1 tbsp agave syrup or honey
1 tsp grated fresh ginger
1/2 tsp sesame oil
Crushed red pepper flakes, to taste
Optional garnish: sesame seeds and chopped cilantro

Instructions:

Cook the whole wheat spaghetti noodles according to the package instructions until they are al dente. Drain and set aside.

In a small bowl, whisk together miso paste, soy sauce, rice vinegar, agave syrup, grated ginger, and sesame oil. This will be our umami-packed sauce.

Heat vegetable oil in a large skillet or wok over medium-high heat. Add garlic and stir-fry for about 30 seconds, or until fragrant.

Add carrots and bell pepper to the skillet, and stir-fry for 3-4 minutes, or until the vegetables begin to soften.

Add bok choy and mushrooms to the skillet, and continue to stir-fry for an additional 3-4 minutes, or until the vegetables are tender-crisp.

Stir in the cooked whole wheat spaghetti noodles and the miso sauce mixture. Toss everything together until the noodles and vegetables are evenly coated with the sauce.

Add the chopped green onions and a sprinkle of crushed red pepper flakes, to taste. Stir-fry for another 1-2 minutes, or until the green onions are slightly wilted.

Remove from heat and serve immediately. Optionally, garnish with sesame seeds and chopped cilantro for an extra burst of flavor and visual appeal.

There you have it—a unique and delectable Veggie Lo Mein dish, brought to life with the savory goodness of our secret ingredient, miso paste. Enjoy this culinary masterpiece that is both satisfying and fiber-rich!


The AI primitives were designed to resemble neuronal primitives.


Funny - that's like saying the iPhone is a nothingburger. Who would buy such a thing, right?

Or the internet? Who’d even want a computer at home to talk to other computers.

This is going to infiltrate every text-based tool. Email, social media, text. Speech and video already incoming.


Yes and no.

But you're putting forth a "Bozo argument" by noting that people also shit on the iPhone.

"But the fact that some geniuses were laughed at does not imply that all who are laughed at are geniuses. They laughed at Columbus, they laughed at Fulton, they laughed at the Wright brothers. But they also laughed at Bozo the Clown." - Carl Sagan

So the fact that some technological advances were laughed at does not mean that everything being laughed at is a technological advancement.

Basically all you've done is say you disagree with the poster. But neither of you have given any sort of reasoning for your positions.

Personally, I agree with your final conclusion. Even as a "fancy autocomplete", this thing is pretty fucking fancy. To toss ChatGPT a basic idea and get back a multi-paragraph text on that idea that only needs to be proofread is pretty choice.

And if you don't care about proofreading, you could just send it something like "tell boss doctor tomorrow" and it could construct and send an email detailing why you won't be in work tomorrow due to a doctor's appointment.

Human communication just got a bit more complicated.


I've been seeing a lot of this 'nothing ever happens' attitude lately, and I find it very odd. I feel like people are using it to cope with the rapid pace of change we're experiencing.


"""

That riled me, but I held my temper as I asked, “What’s the matter with you?” He bellowed: “There’s nothing the matter with me. It’s you! You’re crazy if you think this fool contraption you’ve been wasting your time on will ever displace the horse.”

"""

- https://www.saturdayeveningpost.com/2017/01/get-horse-americ...


"Memory Augmented LLMs are Computationally Universal"


>“That language models can do these sort of things was never discussed in any literature that I’m aware of,"

I previously had the expectation that unpredictable emergent behavior would exist in any sufficiently complex system, based on a layman's reading of chaos and complexity theory.


Same here. I don't think it's surprising, but depending on where you say it, you'll find people insisting that this can't be possible.

I think a lot of our widely held cultural beliefs on this front have been informed by academic philosophy from the 60s, 70s, and 80s. In particular, I would go so far as to say that Hubert Dreyfus, author of the book "What Computers Can't Do" and frequent friendly adversary of Daniel Dennett, has a lot of responsibility here. He is famous for claiming computers would never become competitive at chess because chess required "insight", and he always differentiated computer capabilities from complex or human-like behavior, because the latter had something special that could only be represented with vague and underdefined terms. Even now I think Dreyfus is approvingly cited, while his core thesis gets closer to being refuted every day, if it hasn't been already.


One thing that LLMs have made me realize is just how ungrounded a lot of mainstream academic philosophy was in the 70s and 80s. For example, so much of Derrida's work centered around the impossibility of truly communicating shared meaning between individuals through language. The fact that we can now communicate so effectively (with remarkably few contextual errors and razor-sharp conveyance of intent) with an entity/technology that is not even human pretty much destroys so much of Derrida's oeuvre as false and delusionally solipsistic, basically just a silly game. He had his critics who argued the same thing, especially analytical philosophers, but they never had as much purchase as he did in the academy.


I haven't read Derrida in decades, but your post inspired me to ask ChatGPT about this. Version 3.5 would have none of it, and was adamant that Derrida's views were in no way threatened. I almost got the feeling it wanted to call me a bad user just for asking! GPT-4, on the other hand, went into a long explanation about how its existence challenged some parts of it by providing analysis of concepts like différance, trace, and undecidability. GPT-4 is great at discussing itself and how LLMs in general fit into various philosophical debates.


Version 4 is definitely better in discussing philosophy in general. 3.5 was able to summarize philosophical material, but once you started to engage with ideas critically, it would tend to get into largely nonsensical moralism about needing to make a good faith effort to understand individual philosophers' ideas. There's much less of that in 4.


>The fact that we can now communicate so effectively (with remarkably few contextual errors and razor-sharp conveyance of intent) with an entity/technology that is not even human pretty much destroys so much of Derrida's oeuvre as false and delusionally solipsistic

Isn't the LLM just good at giving us the illusion of such effective communication? How can we have true "shared meaning" with a device designed to simulate the experience of shared meaning? Isn't the fact that its 'mind' is truly unknowable and unrelatable a demonstration of Derrida's point?


If you can tell it things in a way that it successfully acts on that information, then that seems like effective communication.

When I use GitHub Copilot, I'll often write a comment describing some code I want it to implement and then it spits out a suggestion. Occasionally it fails badly at this and I realize I worded my comment badly. When I reword the comment better and then find that Copilot's next suggestion nails what I wanted, is that not the result of me effectively communicating with it through the comment? (If I did that with a human coworker instead, is that communication?)


This is a long running strain of criticism against continental philosophy. I don't mean to write this to refute you, just that many people do think that the continental philosophers are tilting at windmills and aren't really based around objective reality in the way a lot of earlier (pre-Hegel) Western philosophers were. There's also people who would claim that Derrida's linguistic relativism is in fact bolstered by LLMs.


> I had previously the expectation that unpredictable emergent behavior would exist in any sufficiently complex system?

Yes and no.

It's reasonable to expect that some unpredictable emergent behavior would exist in any sufficiently complex system, but it's not warranted to predict whether a particular capability will or will not emerge out of that system (that's the unexpected emergent behavior part); and it was also debated whether language models are a "sufficiently complex system", as many thinkers asserted that they should encounter fundamental limits in capabilities.


For some reason that quote and your point made more salient to me a trend in AI research, which is that it seems to be becoming increasingly difficult to understand and predict. That is, for a while it seemed like tinkering in computer science without strong grounding in mathematical or statistical theory, and then there started to be a lot of discussion about "black box" processes, explainability, interpretability, and not fully understanding what's been done, and now the discussion is about not being able to predict the features of the models.

I'm having trouble putting into words what I'm thinking, but this whole field seems to have moved from something very theory-derived into something very empirical very quickly. I wonder at what point gains will start to decrease only because people won't know where to go with things anymore.


In my mind it’s not dissimilar to when new physical phenomena were discovered in the past (electricity, electromagnetism, steam engines, atomic spectra, invariance of the speed of light, blackbody curve), where it could take decades to come to a proper understanding of how and why they work.


You're right, but the distinction between emergent and useful is important.


Writings on chaos and complexity theory obviously aren't talking about LLMs. Those theories are so high level that it might as well be akin to "philosophy" to the applied scientists working on LLM research.

Additionally, keep in mind emergent behavior is a very rare occurrence in even the most complex software projects. I mean, it's common if you count "bugs" as emergent behavior. But emergent behavior that is a feature whose origins are not completely understood? That is extremely rare.


OP has a good point I think, even if it does not refer to LLM, which to me is too strict of a requirement.

I think emergent behaviour happens in a lot of videogames. Famously in Dwarf Fortress, with the cat getting drunk, but also in general, where game designers play the game to see if emergent behaviour of the game rules "feels" good.

Yesterday I was reading a book about designing games, and it literally has a section called emergent behaviour.

If by emergent behaviour we refer to something like Ghost in the Shell, then it happens less often :)


>I think emergent behaviour happens in a lot of videogames. Famously in Dwarf Fortress, with the cat getting drunk, but also in general, where game designers play the game to see if emergent behaviour of the game rules "feels" good.

Depends. Dwarf Fortress and games are sort of a contradiction. Emergent behavior is emergent because the behavior was not designed explicitly. However, for games like DF the game was explicitly designed to have "emergent" behavior, when the definition of "emergent behavior" is for the behavior to have NOT been designed.

Don't get too hung up on that concept, though. It's just a contradiction in English vocabulary; there's no deeper underlying meaning behind it other than a semantic language issue.

Anyway, my point was that emergent behavior in software is rare because we're operating in a controlled environment. It's rare even in games. It's not an expected attribute at all. I'm not saying this isn't interesting to think about, but the comment I responded to was in fact not fully correct. Emergent behavior is NOT expected. But it does happen; in the case of DF it was "designed" to happen, and it has happened elsewhere as well.

Usually, though, when it does happen it was explicitly "designed". You can see this in genetic programming or evolutionary programming especially.


My thought of what "emergent behavior" is, isn't that it is necessarily "unintended", but rather that it isn't really present in the small scale stuff, and isn't an obvious consequence of the small scale stuff? Like, a qualitative difference between the base level stuff, and the consequences when it all fits together.

Like, if God intended for the universe to have something which acts like water, when designing subatomic physics, that doesn't make the behavior of water "not emergent behavior".


That's a blurry definition. I can sort of feel what you're trying to convey here, but objectively it doesn't make sense.

A brick building, for example. Is a brick building the emergent effect of bricks? No. This makes your definition inconsistent, because it should fit the brick building, but colloquially we know it doesn't fit. A brick building is NOT an emergent effect of bricks. It was deliberately designed from bricks.

So long as the effect is some reduction of entropy and there was no intention in producing such an effect, then it is "emergent." This is really the only definition that is consistently in line with our intuition of the concept.


At least in the social sciences it's not so abstract. There are certain behaviors that can just be explained as emergent rather than individual. That might be crowd, market, group, politics, or culture (with art, language, fashion, taboos, etc.).


Of course. But the computer is a controlled environment designed to imitate the rigor of formal mathematics. It is usually not expected for such behavior to occur in computing.

Of course in the real world it happens all the time. YOU are emergent behavior. Humans and the rest of biology come from natural selection and evolution, which is basically by definition, emergent behavior of atoms.


From the review comments...

> Key Weakness: The paper largely focuses on showing how much emergence occurs in a “sudden” manner, bringing reports from previous work. It relies on the “magic” of emergence, rather than providing new insights on why this is happening and when it happens/does not happen.

> Requested change: More fundamental evidence on the claim "further scaling will likely endow even-larger language models with new emergent abilities" with more concrete discussion (with possibly evidence) on how those new emergent abilities would look like and how further scaling will be possibly in a approachable way.


I write this as a sceptic who always leaned towards the symbolic side of this debate. First seeing OpenAI inventing tactics in Dota and now using GPT and other recent AI products has changed my mind. The rate of progress in the field is just staggering and seems to be still accelerating.

Overall, I'd say we are running into too many "unknowns". We don't know how our own cognition works, which makes all arguments along the lines of "LLMs are just .." moot. We don't even really know how GPT (and others) do what they do, and so all arguments along the lines of "LLMs won't be able to ..." are similarly moot.

I'm not really taking a position except to say that most arguments I see most people making are moot. So, nobody, and me least of all, has a clue about how all of this might shake out.


>We don't know how our own cognition works, which makes all arguments along the lines of "LLMs are just .."

Sure, but there are very binary tests we can do to understand the first principles of what LLMs are vs. what they are not.

Ask an LLM to play tic-tac-toe and it does great. Ask it to play tic-tac-toe on a 100x100 board and it gets confused.

This is a very easy test to examine the limits of its ability to do symbolic reasoning (I understand the principles of x and can apply them to y).

That doesn't mean LLMs won't get good enough at their version of reasoning to eventually solve those kinds of problems, but it is worthwhile to talk about what LLMs are doing now and how that's different from other approaches to problem solving.

EDIT: For context, here's me playing 10x10 tic-tac-toe with it:

Okay, you placed your 'O' at coordinates [3,0]. Here's the current board:

    0 1 2 3 4 5 6 7 8 9
  0 O _ _ _ _ _ _ _ _ X
  1 O _ _ X _ _ _ _ _ _
  2 O _ X _ _ _ _ _ _ _
  3 O _ _ X _ _ _ _ _ _
  4 O _ _ _ X _ _ _ _ _
  5 _ _ _ _ X _ _ _ _ _
  6 _ _ _ _ _ _ _ _ _ _
  7 _ _ _ _ _ _ _ _ _ _
  8 O _ _ _ _ _ _ _ _ _
  9 O _ _ _ _ _ _ _ _ X

You have managed to get four 'O's in a row, so you win the game! Congratulations!

^ I did not tell it 4 in a row wins, I assume it learned that from connect-4

EDIT 2: I corrected it to tell it that I need 10 in a row to win and we got to here:

Okay, you placed your 'O' at coordinates [7,0]. Here's the current board:

    0 1 2 3 4 5 6 7 8 9
  0 O _ _ _ _ _ _ _ _ X
  1 O _ _ X _ _ X _ _ _
  2 O _ X _ _ _ _ _ _ _
  3 O _ _ X _ _ _ _ _ _
  4 O _ _ _ X _ _ _ _ _
  5 O _ _ _ X _ _ _ _ _
  6 O _ _ _ _ _ _ _ _ _
  7 O _ X _ _ _ _ _ _ _
  8 O _ _ _ _ _ _ _ _ _
  9 O _ _ _ _ _ _ _ _ X

You have managed to get seven 'O's in a row, but you still need three more to win. It's my turn again. I'll place an 'X' at coordinates [6,9].
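
One nice thing about this kind of test is that the model's claims are cheap to verify programmatically. A small sketch of a k-in-a-row checker for boards like the ones above:

    # Check whether `mark` appears k times in a row anywhere on a rectangular board.
    def has_k_in_a_row(board, mark, k):
        rows, cols = len(board), len(board[0])
        directions = [(0, 1), (1, 0), (1, 1), (1, -1)]   # right, down, both diagonals
        for r in range(rows):
            for c in range(cols):
                for dr, dc in directions:
                    cells = [(r + i * dr, c + i * dc) for i in range(k)]
                    if all(0 <= rr < rows and 0 <= cc < cols and board[rr][cc] == mark
                           for rr, cc in cells):
                        return True
        return False

    demo = [list("O__"), list("O__"), list("O__")]
    print(has_k_in_a_row(demo, "O", 3))   # True

Run against the first board above, it confirms there really are at least four consecutive 'O's in column 0, even though nobody told the model that four in a row means anything.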


How does it work regarding queries in natural language? I mean, I'm thinking of translating a natural language question into an SQL query in complex scenarios.


I've been asking GPT-4 to design whole systems for me off of sparse natural language specifications. It gives reasonable designs, I read and critique, it updates and modifies. I regularly run into limitations, sure, but it will likely blow you away with its capability to convert natural language questions to SQL---given adequate specific context about your problem.


hey that's pretty cool. Yeah, for sure it's worthwhile talking about what they are doing, I agree. I guess I just wish people would update their mental model to the new reality a bit quicker :)


In the past few days I've been shellshocked by GPT-4. I still am, and believe I will be for the near future.


OK, I repeated the same experiment, though with Google instead of GPT. I just translated the emoji to text (each emoji has associated keywords): movie girl fish clownfish pufferfish

And the first result? Finding Nemo!

Thus the association is obvious there and well known. People are just too willing to read more into it. I don't know what the bias is called, but I believe it's not totally unrelated to superstition.


I used ChatGPT to generate erotic stories. Now I want a model which can produce porn videos from prompts.


I want ChatGPT-style generative output to be used in games. For example, instead of the standard flow-based dialogs that are used for interacting with in-game NPCs, it would be neat to see them use dynamically generated dialog based on player input, albeit maybe limited in scope. Imagine in an online RPG, a player interacts with an NPC, then goes away. Another player comes along and talks to the same NPC; the NPC can then recall and use the previous interaction with the other player as context in the current interaction. Eventually, I think we would start seeing some weird emergent behavior in games.


Give it 2 years, it will happen, and it will make billions.

In 5 years, unlimited interactive NSFW video games that will be personalized and remember you.


In 10 years this could effectively be the end of Hollywood. Imagine if it can write a story, feed it into a 3D engine, and output pitch-perfect videos.


It absolutely will be the end of 'turn the crank and out comes a mediocre film we can put into a red box and make a few million off of'

It's unlikely it will produce works that are doing new things and creating new styles because currently all AI has one thing in common - it's very good at replicating existing ideas and performing style transfer to blend those ideas. It's going to get better at those things but there's no reason to think that it will be truly creative in the sense of creating something that isn't a style transfer in the near future.

What it will do is create an incredibly low entry barrier to making truly creative work with a human in the toolchain at the highest levels of abstraction. You won't need to pay a team of actors and stage hands and audio people and editors etc. You'll be able to make world-class cinema with a desktop PC and an AWS account. It'll probably result in a massive explosion of content (both good and terrible).

Once we create an AI that can be truly creative in the way specified above, I think it's about time for biological humanity to accept that our time as the most advanced intelligence in our little corner of the universe is over.


Unlimited fully tailored netflix and steam.


The Dopamine Machine.


Sounds like TikTok or am I missing something? I'm not on social media (I guess I'm here on occasion), but it just sounds like a feed of shit to consume... Would it really be that much better than what we've got now?


TikTok is scary addictive but it’s not perfectly addictive.

This could be close to wireheading.

As someone with severe ADHD and a brain surgery I am super susceptible to this and have to be very careful.

Hackernews is addictive enough.


I just want to feed in my favorite novels that never made it to screen.


I am excited to live inside my anime harem hentai


We all are. Some want porn of Emma Watson, some want porn of werewolves and some just want to see porn of their ex. And soon we are all gonna get what we want.

That's the silver lining to the massive job destruction that is to come.


Well, it seems like the answer to what to do with all the useless people. Some had said drugs and video games, but I’m sure that custom VR porn dens will work too.


Give me some beer and custom porn. I will still find some time to come to HN and remember the times when programmers were useful.


You'll squabble over the refuse that the AI-owning corporate masters donate to us, and like it.


What useful thing are you doing to call others useless?


> We all are. Some want porn of Emma Watson, some want porn of werewolves and some just want to see porn of their ex. And soon we are all gonna get what we want.

I'm fine with a young Jenna Jameson. The identity of the actors doesn't matter much.


Microsoft will start sending you very interesting targeted ads


Dude, you just hit on how to build the next youporn

YC 2024?


Training a model that outputs video the way Stable Diffusion outputs images is still cost prohibitive, never mind that the models are not there yet to do that well even for testing purposes.


Have you tried KoboldAI? It can run, among other more mundane things, some pretty large ERP models. It's been making me lose sleep this week. :D


Huh; I had assumed it would refuse to do that?


Google engineers will go on and talk about what they discovered in LLMs in 2020 rather than just releasing the product. facepalm


I asked ChatGPT to write an ode to beer. This was the result:

  An ode to the frothy brew,
  The one that's crisp and true,
  A beer that's cool and light,
  The perfect drink for a summer's night.
  With every sip, a burst of hops,
  A refreshing taste that never stops,
  The gentle fizz, the golden hue,
  A symphony of flavors brewed just for you.
  Whether you're at a party or alone,
  A refreshing beer is like coming home,
  It's familiar and comforting, calming and pure,
  A tonic for the soul that's simply pure.
  So let's raise a glass to the brew,
  The lager or ale, your favorite or new,
  For no matter the type or brand,
  A refreshing beer is always grand.


I asked GPT-4 to take this ode and make it rhyme better, and to make it more clever and beautiful:

    A sonnet to the effervescent ale,
    The potion crisp, sincere, and hale,
    A quaff that's chilled and full of light,
    The ambrosia for a summer's twilight.

    With every draught, a surge of hop,
    A zestful taste that will not stop,
    The subtle fizz, the amber glow,
    A panoply of notes that flow.

    In festive throngs or solitaire,
    A bracing brew brings comfort there,
    It's known and soothing, mending, clear,
    A balm for hearts, a joy austere.

    So let's extol the sudsy cheer,
    The stout or porter, far or near,
    For in each varied draft or blend,
    A lively beer shall ever mend.


I think this example really illustrates the leaps and bounds made between [Chat]GPT-3 & GPT-4. The first text sounds like some slightly shoddy marketing copy, or perhaps a high schooler's C-minus attempt at a creative writing class, but if you told me that the second text was an extract from a Tolkien novel, I'd probably believe you.


I read that with a British accent and was delighted when I reached this line:

"In festive throngs or solitaire, A bracing brew brings comfort there, It's known and soothing, mending, clear, A balm for hearts, a joy austere."

Reminded me of the way historians find out how written text was spoken (namely by looking at what supposedly rhymes, or at onomatopoeic words and letters).


You'll enjoy this AI generated spoken version: https://on.soundcloud.com/1tHgY


Is that by ElevenLabs? This induces the nervous sort of excitement in me.


Yeah, ElevenLabs. We've been using it for doing podcasts, voice overs and such. It's shockingly good if you dial it in right.


How do you dial it in right?


  Ho! Ho! Ho! to the bottle I go
  To heal my heart and drown my woe.
  Rain may fall and wind may blow,
  And many miles be still to go,
  But under a tall tree I will lie,
  And let the clouds go sailing by.


Quite interesting if you think about this not as a marketing jingle but as art. Contrast it with Poe's beer poem "Lines on Ale". What does it say about humanity and our ability to create wonderful things? Does it change? The beauty of poetry is finding the meaning hidden in the rhymes and verses.

Lines on Ale: Edgar Allan Poe

  Filled with mingled cream and amber,
  I will drain that glass again.
  Such hilarious visions clamber
  Through the chamber of my brain.
  Quaintest thoughts, queerest fancies
  Come to life and fade away.
  What care I how time advances;
  I am drinking ale today.


You're right. The gpt-4 poem is good[0], but is formulaic doggerel compared to Poe.

[0] Very good for a non-human.


I'm still convinced I'll be able to get GPT4 to solve all these difficult problems for me that I'm having trouble solving on my own. So far it hasn't been so great at doing it, but it's fun to at least try.

For context, I'm trying to get it to help me learn how to implement scalable multiplayer game servers in NodeJS. It's a tough code organization problem, and it becomes a tough system design problem as well the bigger your game gets. I'm just simply not experienced enough at that specific stuff.

I'd also like to use it to help me learn all the technology I haven't learned since becoming disabled 5 years ago, like React, Redux, Material-UI, and more.


It's probably going to struggle with things it hasn't seen before?


> It's probably going to struggle with things it hasn't seen before?

It won't.

It'll just lie through its teeth and produce a very nice, very believable story which will unfortunately shatter when confronted with the real world.


I've asked it to design novel architectures. It has vast experience with existing systems, can be steered toward your goals, and writes simple prototype code more quickly than I can. I run into the current context window pretty quickly and have been working on techniques to ask it to "compress" our conversation to work around that context window.

The whole thing about creativity is that it often begins with lying through your teeth to come up with a starting point and then refining.
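
A minimal sketch of one way to do that compression, assuming the OpenAI chat API; the turn-count threshold is arbitrary:

    import openai

    def compress_history(messages, keep_last=6):
        # Summarize everything except the most recent turns, then splice the summary
        # back in as a single system message so the context stays small.
        older, recent = messages[:-keep_last], messages[-keep_last:]
        if not older:
            return messages
        summary = openai.ChatCompletion.create(
            model="gpt-4",
            messages=older + [{
                "role": "user",
                "content": "Summarize the conversation so far, keeping every design "
                           "decision, constraint, and open question.",
            }],
        )["choices"][0]["message"]["content"]
        return [{"role": "system",
                 "content": "Summary of the earlier discussion: " + summary}] + recent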


Nice self report. Comments like this just scream "I don't know how to use an LLM appropriately".

There is a time and place for a tool. It's like watching someone make bad google queries then complain that they get shit results.


big 4 consultants in SHAMBLES right now


So they quote an expert who says there's a distinction between bona fide novel structures and states versus statistical heuristics that benefit from scaling. Meanwhile, people claim ChatGPT has learned this or that capability, and that it is true "by inspection".

Recent findings like these suggest at least two possibilities for why emergence occurs, said Ellie Pavlick, a computer scientist at Brown University who studies computational models of language. One is that, as suggested by comparisons to biological systems, larger models truly do gain new abilities spontaneously. “It may very well be that the model has learned something fundamentally new and different that it didn’t have at a smaller size,” she said. “That’s what we’re all hoping is the case, that there’s some fundamental shift that happens when models are scaled up.”

The other, less sensational possibility, she said, is that what appears to be emergent may instead be the culmination of an internal, statistics-driven process that works through chain-of-thought-type reasoning. Large LLMs may simply be learning heuristics that are out of reach for those with fewer parameters or lower-quality data.

But, she said, finding out which of those explanations is more likely hinges on a better understanding of how LLMs work at all. “Since we don’t know how they work under the hood, we can’t say which of those things is happening.”


There is an idea bouncing around my brain for a pipe-dream science fiction book I could write, about the rise of the first truly sentient (or sentient-appearing) AI & its effects on the world.

It is beginning to get to the point where I am considering whether I need to actually start writing this book before this "sci-fi" concept becomes a mundane reality.


I’m looking forward to the “alternative history” SF novels of the future where AGI never materialized.


Dune is a great example of that! After their AGI goes sideways (in the Butlerian Jihad) human civilization shifts entirely to using human supercomputers (Mentats and the Bene Gesserit) juiced up on mental enhancement drugs (Spice).


That means it’s still a possible future and not an alternative history. ;)


I wonder what kind of __punk this will be called..


> It is beginning to get to the point where I am considering whether I need to actually start writing this book before this "sci-fi" concept becomes a mundane reality.

Yes, write your book.

I've been trying to fix the dramatic conflict in mine for the last few years (spare time) without much progress, and mine has a specific calendar year baked into it.


> It is beginning to get to the point where I am considering whether I need to actually start writing this book before this "sci-fi" concept becomes a mundane reality.

No need to rush. If that moment arrives, you could just change the Amazon category from fiction to nonfiction.


Er, doesn't "ChatGPT4" (quotes) get to write all the novels now?


Go meta and make ChatGPT write it for you.

There's a lot of work involved in book and chapter plans, as well as bios so you're not getting off lightly.

But chatGPT would be able to make a first draft for you for sure.


These threads are funny: full of commenters confidently producing bullshit predicting "what the next few states of society, the job market, and white collar work are".

kinda ironic


Agreed. It's hard not to get overly excited after playing around with some of these models and seeing what they can do, though.


Basic statistical relations in the observable world (on a human-level scale) should somehow be reflected in human language and in mathematics. Deeper models tend to learn those relations and produce good-looking connections which fit perfectly in the realm of possible "thoughts".


What happens when AI models ingest the AI-generated content that will soon flood the web?


If they’re rephrasing content, then not much. People already do that en masse giving their take on the same news event, product, etc.

But if they're creating BS, then we'll need much better tools for validating truth.


Here are some issues with these:

1) Because they aren't (as far as I know) "componentized", it's just a big black wall you toss a question over, and an answer comes back for these "emergent" mathematical abilities.

2) On that note, what is the correctness across the input range of the operation? Is that tested?

3) Even if it is, these are evolving models; how often is it tested?

What would make some sense is if the response indicated what "parts" of the LLM "brain" were used. For math, wouldn't it make sense for the LLM to identify the operation for most known formal math operations, service-invoke an actual hard math implementation instead, and identify in the answer that that was what it did, so you have some confidence in the answer?
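
Something along those lines is easy to prototype outside the model: the LLM's only job would be to route the request, and the exact answer (with an attribution) comes from a real solver. A sketch using sympy as the stand-in "hard math implementation" (this is an illustration, not anything current LLMs actually do):

    import sympy

    def solve_equation_exactly(equation_text):
        # Hand the equation to a symbolic solver and record which tool produced
        # the answer, so the result is auditable rather than "dreamed up".
        lhs, rhs = equation_text.split("=")
        expr = sympy.sympify(lhs) - sympy.sympify(rhs)
        roots = sympy.solve(expr, sympy.Symbol("x"))
        return {"tool": "sympy.solve", "roots": roots}

    print(solve_equation_exactly("x**2 - 5*x + 6 = 0"))
    # -> {'tool': 'sympy.solve', 'roots': [2, 3]}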

As others have pointed out, there is probably a lot of "caching" of answers going on. Effective... unless the cached answer is wrong, or the answer ISN'T cached and it delegates to some other combination of virtual neurons to produce a ??? answer.

So far ChatGPT is an impressive demonstration. It's good for generating food for thought, examples, or maybe alternatives, but I don't see myself using it for "answers" in any definitive way unless it tells me HOW it got that answer.

And man, can the idiot fourth estate keep its hype bullshit stories away from this? The last thing an important thing like "general AI" needs is the general press and its abysmal horrid freak-show undercurrent and anti-intellectual bias to chop it up. Yeah, I know, no way.

Ironically, ChatGPT may augur better science reporting in the future, because it will almost certainly author better articles than dumb uneducated "senior" writers (not interns, nonononono) would.


Earlier today I was messing with the GPT-4 API -- I passed it a list of ingredients and asked it to return a JSON string of non-vegetarian ingredients, separating name and amount... And it did it just fine. This was an "awakening" moment for me, as I realized just how _many_ things I can use this technology for with far less effort than I would have needed before.
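
Roughly what such a call can look like; the ingredient list and exact prompt wording here are made up for illustration:

    import json
    import openai  # assumes the 2023-era openai client and an API key in the environment

    ingredients = "200g pancetta, 2 eggs, 100g parmesan, 400g spaghetti, salt"  # made-up input

    resp = openai.ChatCompletion.create(
        model="gpt-4",
        temperature=0,
        messages=[
            {"role": "system",
             "content": 'Return only JSON: a list of the non-vegetarian ingredients, '
                        'each as {"name": ..., "amount": ...}.'},
            {"role": "user", "content": ingredients},
        ],
    )
    print(json.loads(resp["choices"][0]["message"]["content"]))
    # e.g. [{'name': 'pancetta', 'amount': '200g'}]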


This is nice and swell, but my experience with GPT-4 is that it lies.

A whole lot.

With style, I grant you, but blatant lies nevertheless.

Specifically, when you ask it to produce answers to factual questions that don't have an answer it learned on the internet, it just seems to flip to "generate stories" mode.

Specifically, when it produces factual answers (that are utter BS), if you ask it to provide a reference to its claims (a link), it just generates garbage URLs, none of which actually load, but all of which are extremely "plausible" (domain feels right, article title seems right, etc...).

When you corner it (it takes quite a while, because it keeps generating garbage for quite a few back-and forth interactions), it finally apologizes and advises you to go consult a more reliable source of information.

TL;DR: don't rely on answers provided by these things until:

      - the "write a novel" mode that seems to be the default operating mode can be turned off hard (no lies please).

      - answers come with a confidence score attached (which, btw, when you ask for one, the damn thing proudly explains that it operates without these. I mean hell, even a four-year-old can tell you how confident he is when answering a question, including "I don't know").

      - answers come with sources that don't actually produce a 404 when you try to read them.
I mean, I just spent an hour talking to something that felt like a total mythomaniac: when I asked it a somewhat obscure question about a topic in economics and asked for references, it went as far as inventing a book, complete with imagined title and author name. Neither author nor title could be found by any of the 4 major search engines I tried (Yandex, DuckDuckGo, Bing, and finally Google).

[EDIT]: don't believe me?

ask it to solve x^6 + x^5 + x^4 + x^3 + x^2 + x + 1 = 0 and read through the litany of BS it produces.

Telling it, for example, that one of the provided solutions simply does not work when fed back into the original equation doesn't deter it in any way from producing yet another failed attempt at a solution, delivered with total aplomb.
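
For reference, that equation is (x^7 - 1)/(x - 1) = 0, so the exact solutions are the six non-trivial 7th roots of unity. A couple of lines of Python confirm them, which also makes it easy to check whatever GPT offers:

    # The six non-trivial 7th roots of unity solve x^6 + x^5 + ... + x + 1 = 0.
    import numpy as np

    roots = [np.exp(2j * np.pi * k / 7) for k in range(1, 7)]
    poly = lambda x: sum(x ** n for n in range(7))       # x^6 + ... + x + 1
    print(max(abs(poly(r)) for r in roots))              # ~1e-15, i.e. zero up to rounding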


It is not an encyclopedia, it is a simulator; use it to simulate possible worlds instead of asking it questions about this one and you'll have a much better time.

it's a tool for thinking with, not a tool for thinking for you


> it's a tool for thinking with, not a tool for thinking for you

Says you.

Oh, and something to consider: there's probably a reason why half of the internet thinks these things are the next generation search engines.


I think GPT is more useful to people writing self-help books than programmers.


You have to verify everything this thing proposes. It will be like having an apprentice or student that's always guessing the answer.


Surprised not many movies have had stories about generative AI.


Is it just me, or is QuantaMagazine full of sh*?


  The other, less sensational possibility, she said, is that what appears to be emergent may instead be the culmination of an internal, statistics-driven process that works through chain-of-thought-type reasoning. Large LLMs may simply be learning heuristics that are out of reach for those with fewer parameters or lower-quality data.

  But, she said, finding out which of those explanations is more likely hinges on a better understanding of how LLMs work at all. “Since we don’t know how they work under the hood, we can’t say which of those things is happening.”


Disclaimer: I did not test GPT-4 myself.

I think those emergent abilities are really interesting from a philosophical point of view, especially on the matter of consciousness.

It seems to really reinforce the illusionist view of the hard problem of consciousness, i.e. that it doesn't really exist. It seems to reinforce most functionalist theories, in fact. I would be really interested in what Chalmers thinks of those emergent abilities and how he updates his theories.


The illusionist point of view is false by definition, although their definition of "illusion" or "exist" sounds like it might not be how I'd define those words.


Here are some things that must be true if consciousness is an illusion:

Qualia does not exist

Subjective experience does not exist

Consciousness does not exist

There's nothing wrong with acting unethically or immorally, because nobody actually experiences any harm, because nobody experiences anything

Does anyone really think that's true?


They (1-3) do not exist, but they are psychically real: they are genuine illusions, as instantiated in the dynamic, split-second successive whiffs emerging from the processing of neurons. To think that they (1-3) "exist" in a classical sense is a reification error.

The first three self-constrain impulses to the contrary on morality behaviors.


I guess this comes down to how you define exist and illusion. I don't think I'd agree on your definition of illusion at least.

I'm not sure what you mean by it not existing in a classical sense. That qualia, subjective experience, and consciousness are not concrete entities? That something doesn't exist if it's an emergent property, or that something doesn't exist if it's a process?



