OpenAI's GPT-3 may be the biggest thing since Bitcoin

meredydd · on July 19, 2020

I am deeply enjoying this comment thread - it's a bit of a Barium Meal [0] for determining how many people read (a) the headline, (b) the first paragraph, or (c) the whole thing before jumping straight into the compose box.

Having read to the bottom, the quality of text generation there absolutely blew me away. GPT-2 texts have a somewhat disconnected quality - "it only makes sense if you're not really paying attention" - that this article lacks entirely. Adjacent sentences and even paragraphs are plausible neighbours. Even on re-reading more closely, it doesn't feel like the world's best writing, but I don't notice major loss of coherence until the last couple of paragraphs. I am now really curious about the other 9 attempts that were thrown away. Are they always this good?!

[0] https://en.wikipedia.org/wiki/Canary_trap#Barium_meal_test

thewarrior · on July 19, 2020

I've started working on a version of GPT-2 which generates English text. The purpose of this is to improve its ability to predict the next character in a text, by having it learn 'grammatical rules' for English. It already works well for predicting the next character when it has seen only a small amount of text, but becomes less accurate as the amount of training text increases. I have managed to improve this by having it generate text. That is, it creates an 'original' piece of text about 'topic x', then a slightly altered version of this text where one sentence has a single word changed, and this process is repeated many times (about a million). It seems to quickly learn how to vary sentences in a way that seems natural and realistic. I think the reason this works is because it reduces the chance that the grammar it has learned for one specific topic (e.g. snow) will accidentally be transferred to another topic (e.g. dogs). Of course, this all means nothing unless it actually learns something from the process of generating text. I haven't tried this yet, but the plan is to have it generate text about a topic, then have a second GPT-2 system try to guess what that topic is. If the resulting system is noticeably better at this task, then we know the process has increased its ability to generalize.

One potential issue with this approach is that the text it generates is 'nonsensical', in that it is almost like a word-salad. Although this is a standard problem with neural nets (and other machine learning algorithms), in this case the text actually is a word-salad. It seems that it has learned the rules of grammar, but not the meaning of words. It is able to string words together in a way that sounds right, but the words don't actually mean anything.

Plot twist: This comment was generated by GPT-3 prompted with some of the comments in this thread.

cs702 · on July 19, 2020

The thing that kills me is that to the vast majority of human beings the nonsensical technobabble above is probably indistinguishable from real, honest, logically consistent technobabble.[a]

Soon enough, someone will replicate the Sokal hoax[b] with GPT-3 or another state-of-the-art language-generation model. It's not hard to imagine GPT-3 writing a fake paper that gets published in certain academic journals in the social sciences.

[a] https://en.wikipedia.org/wiki/Technobabble

[b] https://en.wikipedia.org/wiki/Sokal_affair -- here's a copy of Sokal's hoax paper, "Transgressing the Boundaries: Towards a Transformative Hermeneutics of Quantum Gravity:" https://physics.nyu.edu/faculty/sokal/transgress_v2/transgre...

thewarrior · on July 19, 2020

It's not hard to imagine GPT-3 writing a fake paper that gets published in certain academic journals in the social sciences. And then, it'll be all over for us. We won't have any more funding and our jobs will disappear. I can already hear the protests: "But we're not just scientists! We're also philosophers!" Well, yes and no. Philosophers are supposed to think about things philosophically, but they don't actually do anything about them; they're just entertainers. Scientists do something about them. They make things happen. And when those things happen, people take notice. If science and technology have a weakness, it's that they work too well. This was probably a strength at one point, but not anymore. In the not-too-distant future, there probably won't be any more philosophy professors; there will just be philosophers. But only in the same sense that there are lions and mushrooms.

This comment was also written by GPT-3.

6nf · on July 19, 2020

How many attempts did it take or did you just choose the first one?

I have to admit, this is passing my turing test...

geofft · on July 19, 2020

Really? I got about halfway through and realized that the comment had no point. If you tried to summarize what it was arguing, beyond the first sentence, I don't think you could make a coherent summary.

Maybe the real lesson is we don't expect human-written comments on discussion fora to be particularly coherent....

ascar · on July 19, 2020

Also both comments made me suspicious half way through and I scrolled to the bottom to check for a GPT-3 note. Without that note I would definitely have regarded it as incoherent rambling by a human.

Especially the second comment can be coherently interpreted with some good will and a cynical view of the humanities and philosophy. The "author" could say that once GPT-3 can write humanities papers it will quickly make humanity scientists redundant and that humanities scientists are philosophers is not important and doesn't warrant a job alone ("they don't actually do anything"). Eventually it shifts that this is the fault of science working too well (GPT-3 being a product of science)

It's not a consistent argument, but without the context of these comments being GPT-3 it would have totally passed my turing test, just not my sanity test.

thom · on July 19, 2020

I think (slash worry) that this is going to be a simple upgrade in future iterations. Obviously there are powerful attention mechanisms at work to keep the subject matter coherent, but we’re not too far off the model being able to generate a core thesis, and ensure that all the generated text supports that in some way.

tsimionescu · on July 19, 2020

I think that if that worked it would prove that either language is a much more powerful tool than we realize, or our cognitive capacities are much more trivial than we realize.

The model fundamentally has no understanding of the world, so if it can successfully argue about a central thesis without simply selecting pre-existing fragments, then it would suggest that the statistical relations between words capture directly our reasoning about the world.

EGreg · on July 19, 2020

Who here thinks some Donald Trump's answers were written by an early version of GPT3, designed to produce more bombastic and rambling rhetoric than usual?

rusk · on July 19, 2020

In principle it’s not too far fetched ... there’s almost certainly some kind of data-driven algorithmic processing going into a lot of speech writing these days; some of the drops are so implausible they’d almost certainly have to have been suggested by a machine!

everdrive · on July 19, 2020

Not being sarcastic, but I know some people with less coherent writing than this. A lot of people struggle to make a point, use vague language, or wander in and out of topics quite easily.

rusk · on July 19, 2020

Yeah they’re typically mimicking a style of speaking that they’ve heard other people use but don’t really understand the subject matter themselves ...

vidarh · on July 19, 2020

It felt like it was making a slightly ranty observation that scientists are already trying to much to be philosophers than to actually do science that changes the world, yet science has brought us far enough that it acts as an enabler for all kinds of pop-philosphers.

The final bit doesn't quite connect, but overall I've seen far less coherent comments written by humans on subject with far more logical flaws.

riffraff · on July 19, 2020

I would not have imagined it was automatically written. Rambling and there's little connection between the first part and the latter, but absolutely something that might appear on a random internet forum.

I am genuinely awed.

2038AD · on July 19, 2020

> I got about halfway through and realized that the comment had no point

Pretty average for HN then ;)

pmiller2 · on July 19, 2020

Given that I know this stuff is generated text, it looks pretty good. But, if I’m judging it assuming that it was written by a human, it has a very uncanny valley sort of feel. That’s actually a good thing compared to previous models that would generate a lot of jarring non sequitors, because the GPT-3 text is very good if you look at it in 2-3 sentence chunks.

hombre_fatal · on July 19, 2020

You say it like the bot wouldn't fit right in alongside most human comments because it meanders and doesn't seem to actually be responding to anyone, rather listening to itself talk.

ALittleLight · on July 19, 2020

Unfortunately I saw the sentence at the end before reading the whole comment, so I don't know how my detector would've done, but I thought this line:

>In the not-too-distant future, there probably won't be any more philosophy professors; there will just be philosophers

Was quite clever and I'm still trying to figure out what it means.

jlokier · on July 19, 2020

Maybe the real lesson is it was trained on human-written comments in discussion fora, so it perfectly mimics the average lack of point, weak arguments, rambling and incoherence in fora?

It would be interesting to see if the output has a similar quality when trained only on highly regarded texts.

ci5er · on July 19, 2020

> Maybe the real lesson is we don't expect human-written comments on discussion fora to be particularly coherent....

How could we expect it? After 35+ years (BBS and Usenet onward), we've learned that they are often not.

tgflynn · on July 19, 2020

Yep, it looks like GPT-3 might not be too far from achieving artificial schizophrenia.

amelius · on July 19, 2020

I don't think these gpt3 comments will get many upvotes on HN anyway. I downvoted the first one for being incoherent, but then realized it was meant as an example so I undowned it.

regularfry · on July 19, 2020

In possibly an unwise move, I'm actually going to respond to the initial point here.

There's a totally valid discipline in taking concepts from different areas and smushing them together to make a new idea. That's what a lot of creativity is, fundamentally. So a bot that's been trained across a wide variety of texts, spitting out an amalgam in response to a prompt that causes a connection to be made, is not only possible, but likely a very good way of generating papers (or at least abstracts) for humans to check. And if the raw output is readable, why not publish it?

rsync · on July 19, 2020

"This comment was also written by GPT-3"

Would you please show us the input text, or rules, you gave to GPT-3 to create this comment ?

jjoonathan · on July 19, 2020

Somebody get this thing onto scp-wiki, it will be right at home.

Not gonna lie, I went poking around to see if I could get my hands on it, but it seems like the answer is no, for now.

stavros · on July 19, 2020

That's a good question, how do we get access? I signed up on the list, but it must be thousands of people long by now. Does anyone here know anyone who can get people access?

jimkleiber · on July 19, 2020

I'm starting to sense that, in most scenarios, I will no longer want to engage in text-based conversations unless I know that I'm talking with a human. I already don't like spending a lot of time arguing with a bot on Twitter, this just makes it much more likely I'll also argue with a bot on medium-length text areas (e.g., HN, FB, Whatsapp, SMS, etc.) and maybe even on long-length text areas (e.g., Medium, blogs, NYTimes or things pretending to be real newspapers, etc.)

Second, I'm curious/terrified at how future iterations of GPT-3 may impact our ability to express ourselves and form bonds with other humans. First it's text messages and comments. Then it's essays. Then it's speeches. Then it's love letters. Then it's pitches. Then it's books. Then it's movie scripts. Then it's...

TLDR; Fascinated by the technology behind making something like this work and quite worried about the implications of the technology.

jacobedawson · on July 19, 2020

So it looks like we're about 2 years away from the 'Her' relationship model.

schoen · on July 19, 2020

There was an article recently about people pursuing romantic relationships with chatbots. I thought there was a big HN discussion about it, but the only thing I've been able to find is this WSJ piece

https://news.ycombinator.com/item?id=22833407

(So I think it was some other story on the same topic.)

EGreg · on July 19, 2020

I am seeing it being used in tons of academic settings, especially with distance learning haha

randomsearch · on July 19, 2020

Was it really? I don’t believe you. It makes sense.

thewarrior · on July 19, 2020

The thing is that this thing has now crossed into the uncanny valley. Earlier it would have great trouble making a single sentence that makes sense. You only ever remember whether the last two sentences made sense and go together. And with GPT-3 any pair of sentences always makes almost perfect sense. By the time you're four sentences down you go wait a minute ...

_54qb · on July 19, 2020

This was very apparent when reading the generated stories [1].

Especially the shoggoth cat dialogue, I found that one really creepy. The fragment below comes straight out from the uncanny valley:

Human: Those memes sound funny. But you didn’t include any puns. So tell me, what is your favorite cat pun?

AI: Well, the best pun for me was the one he searched for the third time: “You didn’t eat all my fish, did you?” You see, the word “fish” can be replaced with the word “cats” to make the sentence read “Did you eat all my cats?”

[1]: https://www.gwern.net/GPT-3

spion · on July 19, 2020

An instance of a Voight-Kampff test.

techdragon · on July 19, 2020

Yeah, GPT-3 never really gives any kind of answer to “why”. It rambles on like Abraham Simpson talking about how an onion on his belt was the style at the time. Devoid of purpose it fills the void with meaningless words until told to stop. It’s subtle gibberish... and fucking irritating as soon as you catch it.

p1esk · on July 19, 2020

If he didn't put in the last line ("plot twist ...") I'm pretty sure no one here on HN would have guessed it.

In fact, while reading that comment I started to wonder why no one has tried to use GPT to generate text one character at a time. Or if someone has, what are the advantages and disadvantages over the BPE approach.

the_other · on July 19, 2020

I’ve not studied neural nets and ML since undergrad level in 1998. So I am almost as knowledgeable as a random person on the street.

The quality of writing was very high, so I was convinced I was reading something put together by a human with agency... except it didn’t pass my gut-feeling “how IT works”. It made me suspect that either the algorithm (the described one, not the AI responsible) was off, or that I just didn’t understand AI any more. As I know I don’t have up to date AI knowledge, the algorithm appeared more believable. I hiked deep down the uncanny valley with that one.

p1esk · on July 19, 2020

This comment didn’t pass my gpt filter.

foobarbecue · on July 19, 2020

Your second paragraph is GPT-3?

O_H_E · on July 19, 2020

Ok, now this is getting deep and I don't like it.

Scarblac · on July 19, 2020

I assume both previous comments, and this one, are also GPT-3?

Edit: it is amusing to think that soon the way to distinguish them will be that human comments have weird errors caused by smartphone keyboard "spell checking" in them...

p1esk · on July 19, 2020

You really think my comment above was GPT-3 generated? Wow. Did I really make so little sense?

foobarbecue · on July 20, 2020

On a re-read, I'm not sure why I thought that, sorry. Context: I don't know much about machine learning and when I was scanning through comments, doing text generation one character at a time seemed silly and I must have been in the grips of "everything could be GPT!" hysteria. My robot detector needs work, clearly. Need to get educated.

croddin · on July 19, 2020

But then if the models are trained on that dataset they might make the same errors to better approximate humans on the forum.

more-coffee · on July 19, 2020

This entire thread is surrealistic and giving me Blade Runner vibes.. I love it.

che_shirecat · on July 19, 2020

What if the plot twist line is also GPT-3?

killerstorm · on July 19, 2020

Well, you'd need to train it from scratch to operate on character level. And it would have smaller context, thus lower quality. So if you want same quality, you need much bigger context.

Still, would be an interesting experiment. Gwern swears it would improve stuff, so worth trying and comparing, I guess

g_p · on July 19, 2020

There's actually precedent for this - sci-gen got papers accepted in IEEE and Springer publications, through peer review, and they had to investigate their back catalogue to look for others.

Given the propensity for academic writing to often favour the strategy of confusing the author through obfuscation (to make a minor advance sound more significant than it is), I suspect tools like this could, as you say, actually get published papers in some fields like social sciences. In an engineering or science paper you can check equations match conclusions, and that graphs match data etc.

In a more qualitative field of work, reviewed in a publish-or-perish system that doesn't incentivise time spent on detailed reviewing, I think there's a very real risk babble like this just comes across like every other paper they "review".

I think it takes a certain level of confidence to dismiss others' work as nonsensical waffle, but sadly this is a confidence many lack, and they assume there must be some sense located therein. Marketing text is a great place to train yourself to recognise much of what is written is meaningless hocum.

Sci-Gen - https://pdos.csail.mit.edu/archive/scigen/

Reporting on withdrawals of papers - https://www.researchgate.net/publication/278619529_Detection...

29athrowaway · on July 19, 2020

Dr. Herbert Schlangemann (fake alias for Sci-gen) not only got papers accepted in journals, it was invited to participate as session chair at conference co-sponsored by IEEE.

https://en.wikipedia.org/wiki/SCIgen#Schlangemann

barrkel · on July 19, 2020

In a similar way to how image detection networks appear to key largely on visual textures, GPT-3 seems to key on jargon, tone and pacing, the texture of speech within specific milieus.

yaj54 · on July 19, 2020

The thing that kills me is that soon enough, a "fake" paper written by GPT-3 will get published in an academic journal because it has actually contributed a new insight.

It's easy to consider text generation models as "just mimicking grammar". But isn't grammar also just a model of human cognition?

Is GPT modeling grammar or is it modeling human cognition? Since GPT can ingest radically more text (aka ideas) won't it soon be able to generate texts (aka ideas) that are a more accurate collation of current knowledge than any individual human could generate?

--

[Was this comment written by GPT-3?]

lostmsu · on July 31, 2020

It was not - there's a point :-D

I am impressed though nobody dared to guess in 2 weeks.

Udik · on July 19, 2020

My impression is that these models are already doing far more than what the language production machinery in our brain does. We are able to produce language according to grammar and semantics, but we also have independent mental representations to guide the generation of language and to provide context.

I don't really understand why we're trying so hard to build models that can generate coherent texts based on having predigested only other texts, without any other experience of reality. Their capabilities appear already superhuman in their ability to imitate styles and patterns of any kind (including code generation, images, etc.). It feels like we're overshooting our target by trying to solve an unsolvable problem, that of deriving the semantics of reality from pure text, without any other type of input.

pests · on July 19, 2020

One of my favorite conspiracists, Miles Mathis, has this quality. He can strong together entire pages of very real and consistent nonsense that is totally logical and makes enough sense to be real. I have to remember I'm not reading a legit theory and now really do confuse myself with his version of science vs reality.

pasabagi · on July 19, 2020

Kind of annoying 'hoax', though. Obviously you can publish garbage in fringe journals if you leverage pre-existing prestige and position like Sokal did. Doesn't really say anything about the social sciences.

You can also publish a lot of nonsense in certain chinese journals that optimize for quantity in quality, in whatever field you want.

CamperBob2 · on July 19, 2020

Worse, Sokal's Revenge is probably inevitable, in which someone will generate a nightmarishly complex mathematical proof that takes the whole field in unexplored directions. Some of the most respected professors and most promising students will then be distracted for years trying, and ultimately failing, to make sense of it.

Some say this has already happened. Nobody has ever seen the Social Text editors and Mochizuki in the same room together, have they?

pjc50 · on July 19, 2020

> Plot twist: This comment was generated by GPT-3 prompted with some of the comments in this thread.

This kills the forum.

Seriously, once this is weaponised, discussion of politics on the internet with strangers becomes completely pointless instead of just mostly pointless. You could potentially convince a human; you can't convince a neural net that isn't in learning mode.

amelius · on July 19, 2020

Perhaps it will destroy anonymity. Because the only way to be sure a human wrote something is if you somehow know who the comment came from.

We might end up with reputation based conversations.

pjc50 · on July 19, 2020

Of course, the human can simply lend their name to the robot. And as previously discussed, ending anonymity entrenches the existing power structure.

amelius · on July 19, 2020

> Of course, the human can simply lend their name to the robot.

That could have consequences for their reputation, though.

pjc50 · on July 19, 2020

Are you suggesting that people found to have misled the public should be .. cancelled?

(Reputation is a lot more controversial and complicated than it sounds)

heavenlyblue · on July 19, 2020

You can still be anonymous with this. You just need to be pseudonymous.

amelius · on July 19, 2020

Robots can also have a pseudonym.

kilotaras · on July 19, 2020

Go to URL and tell me what is written there.

regularfry · on July 19, 2020

It's more insidious than that. You can think you've convinced a human whereas you've just spent your energy on a bot. Assuming "political arguments on social media" has any relevance to voted cast, that's a vote for your side which doesn't happen.

jlokier · on July 19, 2020

> Seriously, once this is weaponised, discussion of politics on the internet with strangers becomes completely pointless

Quite the opposite, I suspect.

Eventually, to engage in the most persuasive conversations, the AIs will develop a real-time learning mode.

Once that is weaponised, the AIs will be on track to be in charge of running things, or at least greatly influencing how things are run.

What the AIs "think" will matter, if only because people will be listening to them.

Then it will be really important to discuss politics with the AIs.

LdSGSgvupDV · on July 21, 2020

Could you potentially convince a human for politics issue? It is extremely hard to convince stranger in the forum when there are some priors in mind.

randomsearch · on July 19, 2020

That’s interesting, as within a sentence I had dismissed your comment as rambling and moved on to the next one, without thinking it had been generated... but maybe you’re double bluffing me.

woko · on July 19, 2020

Same, except that I skipped straight to the last line to check whether it was a generated text after I noticed the first sentence made no sense (GPT-2 already generates grammatically correct English sentences).

_mlxl · on July 19, 2020

This comment reminds me of https://www.youtube.com/watch?v=RXJKdh1KZ0w

c3534l · on July 19, 2020

I read the beginning, went "what the fuck is this guy on about? Get to the point" and then came to check the comments to if this was anything interesting or worth reading, saw your comment, and skimmed the end bit. Overall I'm pleased with my process as its an efficient way to find out which articles are worth reading. But it was also clear to me that the author had difficulty making a clear point or had a goal in his writing. I skipped through it for a reason and I suspect many other people did as well.

crucialfelix · on July 19, 2020

Yes, we have poorly written babble from humans too. Now we will have weapons grade babbling from machines.

The result is that worthwhile public discussion is dying. We have to transition now to secure verified communication.

Either that or the bots fork off a new cultural discourse and we treat them like a new form of entertainment.

ALittleLight · on July 19, 2020

From your comment it's not clear to me if you realize the author of the article is GPT-3.

cryptoz · on July 19, 2020

At this point I'm not even sure if that particular comment was written by GPT-3 or not.

c3534l · on July 19, 2020

I swear I'm not a robot, I pass Google captchas and everything!

thom · on July 19, 2020

I’m human and regularly don’t.

stavros · on July 19, 2020

I'm sorry to be the one to break it to you, but that means you aren't human.

more-coffee · on July 19, 2020

You say you're human, but how can we know for sure?

thom · on July 19, 2020

How does it make you feel that I say I’m human?

c3534l · on July 19, 2020

I do, but I see how that wasn't clear.

mdoms · on July 19, 2020

It's very clear to me that he does not. But he does an excellent job of making GP's point.

jjoonathan · on July 19, 2020

Err, it seems very clear to me that he does realize GPT-3 is the author, and that it was easily caught by his bullshit filter. Which was my experience too -- but I am less dismissive. I regularly see human-produced bullshit get very far with less coherence than these examples from GPT-3.

GPT-3 isn't AGI, but it's weapons-grade in a way that GPT-2 wasn't.

AnimalMuppet · on July 19, 2020

OK, but a weapons-grade BS generator is not what the world needs right now...

jjoonathan · on July 19, 2020

Ready or not, here GPT-3 comes.

nashadelic · on Aug 1, 2020

I feel bad for realizing I give authors benefits of doubts and let them ramble for me to learn the point. I guess I need to drop my bar when I'm reading but a LOT of people ramble a lot. Almost every blog post I ready, I skip the first 2 paragraphs because they're just an intro/context which you already know from the headline or prior knowledge.

Alex3917 · on July 19, 2020

> Even on re-reading more closely, it doesn't feel like the world's best writing, but I don't notice major loss of coherence until the last couple of paragraphs.

I guessed it was fake before getting to the end, not from the content, but from the fact that all the sentences are roughly the same length and follow the same basic grammatical patterns. Real people purposely mix up their sentence structure in order to keep the readers engaged, whereas this wasn't doing that at all. Still very impressive though; if not for the fact that the post was about computer generated content I probably wouldn't have noticed.

bhntr3 · on July 19, 2020

Besides predictable sentence structure, GPT-3 writes like George R. R. Martin: Interesting premises, solid setup but then it devolves into rambling tangents and never quite delivers the concluding action that ties everything together.

Lots of examples I've seen have phrases like "see table below". Of course there's no table and it's hard to imagine how there could be.

But GPT is trained on internet content and the internet is full of terrible writing that never gets to the point. I doubt there's any way to know how much is "not actually understanding the subject matter" vs. "learning bad writing from bad writers". I'm inclined to believe the majority is the former but there's got to be a little of the latter sprinkled in.

niklasd · on July 19, 2020

I am really curious how the model would be if you would train it with a decent amount of really good literature. Kazuo Ishiguro et al. instead of Reddit.

kmill · on July 19, 2020

I was playing with AI Dungeon tonight to get access to GPT-3, and one of my many experiments ended up with me meeting a character called the Narrator who believed they were in control of all characters in the game, including me. Eventually, through my predicting what they were about to say by checking and undoing, they seemed convinced I wasn't another character and started asking about whether certain authors were still alive and which I liked to read. It didn't recognize Ishiguro. Later it gave me a truly bizarre (and amusing) summary of Infinite Jest, clearly having never read it. Anyway, the entire experience was uncanny and surreal.

One thing I learned was it has detailed knowledge of the world of Avatar: The Last Airbender, seemingly through fanfics. It was fun having it to teach me the lost arts of pizzabending ("form your hands into the shape of a letter 'P'" and so on, and needing to practice by juggling rubber pizzas) and beetlebending ("always remember that to beetle bend it helps to like beetles," my wise uncle suggested). Each of these tended to precipitate a narrative collapse.

The writing style was surprisingly homogeneous, and it reminded me of young adult novels. It would definitely be interesting to see it with other writing styles, beyond the occasional old poetry.

derangedHorse · on July 19, 2020

I've never heard of AI Dungeon before reading your post but even after playing for 2 minutes, I can tell it's going to be huge.

hnick · on July 19, 2020

How about full adult? Taking it for a test run and this happened after I told a man to stop copying me. Before this he kept talking about clothes for some reason.

> The man walks away and starts undressing. You shrug and keep following him. Soon, you find yourself naked.

Reelin · on July 19, 2020

Library Genesis contains lots (millions?) of fiction ebooks (among other things). It's available in torrent form. Not that I would ever condone piracy or anything.

gpderetta · on July 19, 2020

Well now we know what is going to finish asoiaf in case of author existence failure.

gomoboo · on July 19, 2020

I did not guess that it was fake but skipped to the bottom because the article did not seem to be worth reading. It felt like the author was not moving anywhere with their words. I laughed out loud at the reveal that it was written programmatically.

pyb · on July 19, 2020

You raise a good point. The Internet has trained us to skim any text that seems pointless or just unsufficiently insightful. So it turns out we have already built up some "mental defences" against GPT-3.

gbrown · on July 19, 2020

Now we need to make human augmented generative adversarial eyetracking networks that are trained on getting people not to skim.

javajosh · on July 19, 2020

There are a few concrete things the (fake) article says, primarily about what software to try (OpenAI's GPT-3), and where to try it (bitcointalk forum). Personally, I actually resent being mislead like this, at least on that second point, even with the full disclosure. The output is very high quality, but it is making at least one falsifiable assertion (no test was ever done in that forum).

nullc · on July 19, 2020

If you've ever spent much time with a toddler you might have noticed that they spout a lot of fantasy. Learning to not make up untrue claims takes years of additional training for humans.

selestify · on July 19, 2020

I’ve never spent much time with toddlers. What do they make up things about? Their own actions, other’s actions, claims about the environment?

Scarblac · on July 19, 2020

My six year old will just continue as long as he has people's attention, and if that means he has to make things up, so be it. Freely stealing phrases from other recent conversations.

So this morning he heard about an animal, it was kind of a lion. But with bat's ears, it lives in Africa. It looks like it's a rock, but it's actually not, it's rock shaped but has tiny legs. And it's gray and hard. Its face... It doesn't really have a face. It lives up in trees where it eats bamboo and apples. It has these huge fangs like sabertooth tigers, you know?

It's glorious.

riffraff · on July 19, 2020

All of those, in my experience.

My smallest kid has a habit of telling stories about himself that actually come from whatever he heard recently, e.g. "once I was Godzilla..", or claims about things in reality that come from stories or misunderstandings all mixed up "did you know, there are three pigs, but they are not pigs, they are wolves and a hunter came and killed them but they weren't wolves they were dragons..."

It's actually very GPT-3-ish now that I think of it.

IIAOPSW · on July 19, 2020

Some of them never learn it at all!

dannyw · on July 19, 2020

If that’s the only thing that separates this from human writing; I’m sure it can be influenced easily.

Alex3917 · on July 19, 2020

> I'm sure it can be influenced easily.

Maybe. Right now this reads like a glorified shopping list. It's coherent, but actually sounding human also requires a theory of mind.

E.g. I explain here why it's possible for written statements to be objectively insightful, informative, interesting, or funny, but objectively in a way that's relational to other information or beliefs. The implication being that statements are only going to seem subjectively funny or insightful (or whatever) to others who have that knowledge or those beliefs, which means that you can't reliably create those subjective experiences in a reader without having some sort of theory of mind for them.

I guess you can create content that's funny or insightful relative to that content itself, but that's not especially useful. It's entertaining at the time, but the experience is more like seeing a movie that you laugh a lot during but then leave and are kind of like what was the point? It's an empty experience because it wasn't transformative.

I definitely don't think it's impossible, but I also don't think it's a matter of just adding a couple more if-else statements.

https://alexkrupp.typepad.com/sensemaking/2010/06/how-writin...

roenxi · on July 19, 2020

> Maybe. Right now this reads like a glorified shopping list. It's coherent, but actually sounding human also requires a theory of mind.

I'm going to call this goalpost shifting. This article is better writing than some % of humans, theory of mind or otherwise. The AI has comfortably surpassed Timecube-level writing and is entering the pool of passes-for-a-human.

'Sounds human' is a spectrum that starts with the mentally ill and goes up to the best writers in human history.

Alex3917 · on July 19, 2020

> I'm going to call this goalpost shifting.

That's completely fair. On the other hand, without a theory of mind it can't really educate or inspire people, the only thing it can do is maybe trick them about the authorship of something. But once people learn the techniques for identifying this kind of writing, it can't even do that anymore. To me this is like the front end of something, but it still needs a back end.

Don't get me wrong, it's super cool research and seems like a huge step forward, and I'm excited to see where it goes. But I also don't see this AI running a successful presidential campaign or whatever, at least within the next couple years.

Scarblac · on July 19, 2020

It made me consider: - The existence of this model I hadn't heard of - Bitcoin (sigh) - Testing it out on a forum, trying to become a well known poster - picking a forum with many different types of posters - some of which you dislike

And that got me thinking about what I could do with this thing, whether I should, what I wanted to try out...

So the BS random ideas were still inspiring a bit.

CamperBob2 · on July 19, 2020

On the other hand, without a theory of mind it can't really educate or inspire people

I wouldn't agree with that, either. How often have we heard of someone gaining useful insights by considering ideas that were misapplied or just plain wrong? Entire branches of physics have evolved that way. As far as successful presidential campaigns are concerned... well, let's not even go there.

If there's such a thing as a 'theory of mind', it applies to the reader, not the writer.

AnimalMuppet · on July 19, 2020

I think I disagree.

For example, I delayed in writing this comment because the cat was on my lap, and I couldn't fit the laptop and the cat both. You get that. I know you do, even if you don't own a cat, and even if you're reading this on a phone or a desktop.

GPT-3 does not understand about the cat. To GPT-3, they're just words and phrases that occur in the vicinity of each other with certain probability distributions. As a result, it can't write something and know that there's something there in your mind for it to connect to.

Cyc would handle the bit about the cat differently. It would ask "Did you mean cat-the-mammal, cat-the-Caterpillar-stock-symbol, or cat-the-ethernet-cable-classification?" It has categories, and some notion of which words fit in each category, but it still doesn't understand what's going on.

But you the human understand, because you have a lap, and you've at least seen a cat on a lap.

lowdose · on July 19, 2020

> But you the human understand, because you have a lap, and you've at least seen a cat on a lap.

You really think GPT-3 never came across a comment about a cat in lap? 50% of all the pictures on the internet are cats sitting on people. GPT-3 doesn't need to understand it to echo this common knowledge.

Airplanes don't look like birds at all but they do fly.

dTal · on July 19, 2020

I don't see how any number of comments about cats in laps can allow it to synthesize the following logical chain: cat-on-lap -> pauli exclusion principle -> laptop NOT on lap -> laptop awkward to reach -> delayed comment

vladTheInhaler · on July 19, 2020

I think the issue is that text doesn't exist in a vacuum, but the corpus that the model is learning from does. A piece of human writing exists for a particular reason - to persuade, to inform, to ask a question, etc - and its value is judged on its ability to perform that task. But that's not a quality that is evident from the text itself, only from looking at the world outside the text. This suggests to me some limits on this kind of passive self-supervised approach. Perhaps it could be improved by augmenting the text with other forms of data? For instance predicting video from text and vice versa. But I think that to learn a true "theory of mind", it needs to use text like an agent - to influence its environment, not merely predict it.

dannyw · on July 19, 2020

It could also be a result of training data. If every page is weighted equally, you'd expect SEO spam and even autogenerated content to far surpass high quality content in volume.

I would like to see a GPT model where training data is weighted by credibility / authority (e.g. using Pagerank).

vladTheInhaler · on July 20, 2020

My understanding is that GPT-2 was actually trained on a dataset that was designed to avoid those pitfalls. They followed all the links posted to Reddit that had more than a couple karma, under the theory that the content was at least slightly interesting to some actual humans, as opposed to a giant blob of search keywords or what have you.

IIAOPSW · on July 19, 2020

>Right now this reads like a glorified shopping list. It's coherent, but actually sounding human also requires a theory of mind.

What if the ultimate theory of mind turns out to be that consciousness is an illusion and nothing separates us from a sufficiently sophisticated markov process.

throwing947383 · on July 19, 2020

>What if the ultimate theory of mind turns out to be that consciousness is an illusion and nothing separates us from a sufficiently sophisticated markov process.

Conscious experience would still exist (see cogito ergo sum, Chalmers, etc). If we were to be shown we're just Markov processes, that wouldn't disprove the existence of conscious experience. Just like confabulation, a misleading experience is still an experience.

What it would disprove is any sense of agency.

chrisweekly · on July 19, 2020

"objectively ...funny" strikes me as a contradiction in terms; concepts like humor, insight, and interest are fundamentally subjective, dependent by definition on a subject's consciousness and expectations

488643689 · on July 19, 2020

Aww c'mon. You guessed it was fake, because it's an article about computer generated articles. Who would read that and not question the content in front? You are not analysing everything you read for oddities in writing style.

antonvs · on July 19, 2020

> You are not analysing everything you read for oddities in writing style.

I certainly do, don't you? When I read a blog post and it's full of poorly-integrated buzzwords that make it seem like it was churned out by a non-English speaker being paid very poorly per word, I stop reading and move on.

I recently read a few pages of a book someone had recommended to me and stopped reading because of the writing style.

Heck, you can read a few pages of, say, a Dan Brown novel, and based on the writing style might choose not to read it, since the style tells you a lot about the kind of book it is.

yyyk · on July 19, 2020

I'm not a very good test case. I briefly skimmed (not expecting very much from a Bitcoin-themed article), read the end, and only then read more carefully. So my first read was brief and biased, and my second was very biased.

That said, the content of the computer-generated parts doesn't make much sense even for a Bitcoin-influenced article (what would be the point of paraphrasing your previous post in a forum on a regular basis, and how does this not get one very quickly banned?), but the grammar is far far better than previous attempts - it reads like Simple English wiki.

woko · on July 19, 2020

> I briefly skimmed (not expecting very much from a Bitcoin-themed article), read the end, and only then read more carefully.

It sounds to me like you must be an academic, or someone with good habits for being efficient at reading articles.

zamadatix · on July 19, 2020

Or maybe we're all bots too and you're the only real HN user!

I agree, responses are almost as interesting as GPT-3. And this place has always felt like one of the better when it comes to people reading past the titles!

JBiserkov · on July 19, 2020

"Every account on reddit is a bot except you."

https://www.reddit.com/r/AskReddit/comments/348vlx/what_bot_...

jcahill · on July 19, 2020

GPT-3 is a neat party trick. But the things that'll be done with web archives* in the next 20y will make it look like the PDP-8. ~love, a web archivist

* GPT-3 is trained on one

ypcx · on July 19, 2020

The transformer model as presented in GPT-3 may be a few tweaks away from a human-acceptable reasoning, at which point we may realize that human brain is just a neat party trick as well. This may come difficult for some people to internalize, especially those who understand the technology in depth. Because it means that the medium of our reality is the consciousness.

walleeee · on July 19, 2020

Was this comment generated by GPT-3?

Naracion · on July 19, 2020

I doubted that as well, but I don't think it is--at least it's not a simple copy paste. There's an emphasis on _is_ in the last sentence which I don't think the algorithm could have generated.

However that makes one wonder if it can also learn to generate emphases, and if so, how would it format? With voice generation it can simply change its tonality but with text generation it has to demarcate it in some way--does the human say "format the output for html", for instance?

sytelus · on July 19, 2020

You are confusing pattern matching with reasoning. If your brain was replaced by GPT-3 model and you were cast away on a distant island, I highly doubt you will be able to perceive, plan and prosper during your survival against all the calamity nature would through at you.

mamon · on July 19, 2020

To be honest, most city-raised humans wouldn't be able to survive on a distant island as well.

Tarean · on July 19, 2020

The transformer model in GPT-3 has a short context window and no recurrence. Without some significant architecture changes that is a fundamental limit on the problems GPT-3 can solve.

lucidrains · on July 19, 2020

https://arxiv.org/abs/1807.03819

visarga · on July 19, 2020

> Because it means that the medium of our reality is the consciousness.

I agree. The environment - as the source of learning and forming concepts, is the key ingredient of consciousness, not the brain.

yomly · on July 19, 2020

I don't fully understand what you're getting at here...

Basically the brain and "consciousness" isn't as fancy as we think?

lucidrains · on July 19, 2020

Exactly.

h0p3 · on July 19, 2020

No pressure: feel free to ignore me, please. Would you mind elaborating? I'm interested in what you have to say (and, of course, feel free to say it privately if you prefer). I would like to even hear your dreams, wild speculations, or gut feelings about the matter.

jcahill · on July 19, 2020

Sure, what do you want to know?

I currently work on synbio × web archival.

Some of us are cooking up futuretech aimed at storing all of IA (archive.org) in a shoebox. Others are working on putting archival tools in more normal web users' hands, and making those tools do things that people tend to value more in the short-term, like help them understand what they're researching, rather than merely stash pages.

My ambitions for web archives are outsized compared to other archivists, but I'm fine with that. I'm looking beyond web archives as we currently understand them toward web archives as something else that doesn't quite exist yet: everyday artefacts, colocated and integrated with other web technology to an extent that they serve in essential sensemaking, workflow, and maybe security roles.

Right now, some obvious, pressing priorities are (a) preserving vastly more content and (b) doing more with the archives themselves.

A: The overwhelming majority of born-digital content is lost within a far narrower time-slice than would admit preservation at current rates, and data growth is accelerating beyond the reach of conventional storage media. So, for me, the world's current largest x is never the true object of my desire. I'm after a way to hold the world that is and the world to come.

Ideally, that world to come is one where lifelong data stewardship of everything from your own genome to your digital footprint is ubiquitously available and loss of information has been largely rendered optional.

This, of course, requires magic storage density that simply defies fundamental limitations of conventional storage media. I'm strongly confident that we're getting early glimpses of the first real Magic contenders. All lie outside, or on the far periphery of, the evolutionary tree that got us the storage media we have today. For instance, I'm running an art exhibition that involves encoding all the works on DNA.

B: Distributed archival that comes almost as naturally as browsing is well within reach, and with that comes some very new potential for distributed computation on archives. One hand washes the other.

One important thing to realize here is that, in many cases, you can name a very small handful of individuals as the reason why current archival resources exist. GPT-3 is cracking the surface by training on data produced by one guy named Sebastian, for instance.

…i'm sorta tired and have to respond to something about every twitter snapshot since June being broken, though, so I'll pick this back up later.

greyface- · on July 19, 2020

This is an interesting thought. GPT-3 used 45TB of raw CommonCrawl data (which was filtered down to 570GB prior to training). The Internet Archive has 48PB of raw data.

GreenHeuristics · on July 19, 2020

That 48PB is mostly just old video game roms and isos though

scoot_718 · on July 19, 2020

Hopefully in a way that secures some funding for those making archives of the web.

jcahill · on July 19, 2020

I'm running the Coronavirus Archive. Largest thematic archive on the pandemic, since January. I'm also teaching community biolab techniques to people in parts of the world without ready access to commercial COVID-19 test kits, on all but zero resources at this point.

I could use… what's the word? I think it's more funding.

m3kw9 · on July 19, 2020

Problem was I lost interest half way because it lost my interest after the 2nd paragraph. For those that say it was good till the last few is really pretending to understand what it said. It really did not made much sense.

blueblimp · on July 19, 2020

I'm finding with secret GPT-3 output that I often find it boring before realizing it's GPT-3. I might even be getting to recognize its wordy, dull, cliche-ridden, borderline-nonsensical style. It's remarkably good at passing as human writing of no value whatsoever.

x0 · on July 19, 2020

Well, it is trained on web content. Often I read an article and it's obvious they're either stretching it out to hit a word count, or trying to get as many google-able phrases in as they can. Some sites are worse than others, with the more no-name ones the worst offenders.

visarga · on July 19, 2020

The fast jump in quality from GPT2 to 3 is more important than the current level of GPT3. Maybe next year it will be not-boring.

wcoenen · on July 19, 2020

It's ironic that you're critiquing the algorithm for not making sense, while contradicting yourself in your first sentence. Did you lose interest "half way", or "after the second paragraph"? It can't be both.

necovek · on July 19, 2020

Wouldn't "after the second paragraph" be "half-way" for a four paragraph piece? :D

But you are right, it can't be both in the context of this article :)

ricksharp · on July 19, 2020

Those types of contradictions are what made me suspect the article was generated.

Now, I’m not so sure :)

sidcool · on July 19, 2020

This is so true. I had a high regard for HN comments till quite recently.

sangnoir · on July 19, 2020

I take it you've never had your comment - which happens to be on a subject of your expertise or lived experienced - downvoted because simply because it doesn't feel "truthy" enough for HNs demography. I have long since adopted a healthy disregard of HN comments in other areas that I'm no expert in. I still haven't found a way to monetize that though; that is my holy grail.

dTal · on July 19, 2020

It is a bit unfortunate that your comment is now at the top - it spoils the test :)

Saying that, I briefly saw the first sentence of your comment and went to read the article with the idea that trickery was afoot, specifically guessing correctly the nature of the article. And yet, even then, on the back foot... it fooled me. Incredible.

OOPMan · on July 19, 2020

Thanks to this comment I actually read the blog post.

It was relatively good, although I began to suspect it was GPT3 generated about halfway through (partially because the style felt a bit stiff but also just out of a shayamalan-what-a-twist 6th sense of mine that was tingling)

classified · on July 22, 2020

Eliza could do this better. Or, just use a Markov chain that has read enough corporate PR bullshit. It's just sad how many people use this "AI" meme to fulfill their need to worship something.

DonHopkins · on July 19, 2020

It reminds me of the incoherent demented ramblings we've all been been hearing (but hopefully not following as medical advice) for the past several years.

simonebrunozzi · on July 20, 2020

Ah, but you are almost spoiling the end with your second paragraph! :)

I agree with you. I suspect few people have read until the end to realize that, in fact, ...

m3kw9 · on July 19, 2020

Is exactly the issue. You still need humans to check it before releasing an output. It can only be what the author says “Bitcoin” level of implication if it can get things probably needing at least “99%” quality and correctness.

joe_the_user · on July 19, 2020

OK, I read the first sentence and it sounded like a typical poor-quality marketing-blather article so I came here and read your post.

I then reread it and it indeed read like a weird, rambling, incoherent article. Looking at it closely, it had a good many contradictory, meaningless and incoherent sentences.("It is a popular forum with many types of posts and posters.")

The headline, however, seemed about right.

It's true the nonsense in this article is a bit different than the nonsense of a GPT-2 article. But the thing GPT-2 paragraphs sound pretty coherent 'till they suddenly go off the rail. This is more like an article that was never quite on the rails and so it's slightly more internally cohesive. But not "better".

Maybe the article just reflects the author's style. Anyone have a GPT-3 test site link?

minimaxir · on July 19, 2020

I published a response today to the sudden hype urging people to temper their expectations for GPT-3 a bit: https://minimaxir.com/2020/07/gpt3-expectations/

GPT-3 is objectively a step forward in the field of AI text-generation, but the current hype on VC Twitter misrepresents the model's current capabilities. GPT-3 isn't magic.

nbardy · on July 19, 2020

One of the biggest issues is with cherry-picking. Generative ML results benefit greatly from humans sampling the best results. They are capable of producing astonishing results but don’t do this consistently this has a huge impact on any effort to productize. For example I’ve seen quite a few examples of text->design, text->code, with GPT-3 you could build a demo in a day, but the product will probably be useless if it’s not delivering results 50%+ of the time

teruakohatu · on July 19, 2020

I don't know about GPT-3 but playing around with GPT-2 I often got the impression that it was regurgitating learned knowledge (reddit comments) rather than actually coming up with something novel.

With so many weights, it practically encodes a massive Internet text database.

stavros · on July 19, 2020

I had that thought too, and my immediate next thought was that the value isn't in knowing the sentences, but in being able to put them together usefully.

Kinrany · on July 19, 2020

Having a better alternative to search engines would be great.

quotemstr · on July 19, 2020

I think too few people are taking into account how inscrutable and inconsistent human creative-output results can be. We critique GPT-3 on the basis of it sometimes producing bad results --- but don't we all? Take poetry, for example. "The Complete Works of X", for any X, will probably contain a majority of forgettable or just bad works. What we remember from any author X is his cherry-picked best output. Likewise for ML systems.

yyyk · on July 19, 2020

The hype/scare re GPT-2/3 (etc.) is not for their poetry output, but rather for its potential for mass propaganda, telemarketing and so on. We can already get humans to do this stuff, all GPT could give is scale (that's no small deal).

However, if the output needs to be curated and edited by humans, the scale and automation is gone - we just get a different manual process, with a modest improvement to speed at cost of some decline in quality, and that's not very impactful.

threeseed · on July 19, 2020

The truly scary part is SEO where GPT-3 could ruin search engines overnight.

Google at this point favours long form content for many search intents. Being able to generate thousands of these pages in one-click is a real problem. Not just because of popular topics e.g. "covid-19 symptoms" but more so for the long tail e.g. "should I drink coffee to cure covid-19".

yyyk · on July 19, 2020

Quite a lot of SEO already uses simple word generation techniques. It isn't clear GPT-3 is an improvement there - human text recognition might not be whatever Google does.

It may be that Google's algorithms don't care at all how human-like the text is, or that their own recognition algorithm/NN (whatever they use) isn't fooled. Even if it is affected, Google has the money and corpus to build its own competing NN to recognize GPT-3 text.

mr__y · on July 19, 2020

While I have no doubts that they could build NN capable of recognizing GPT-3 text I believe that this would still pose a problem given the amount of content to be analyzed at the scale that Google deals with

yyyk · on July 19, 2020

I'm sure Google out of all entities could handle scale.

That said, there might be a different threat to Google. GPT-3 seems really useful as a search engine of sorts (with the first answer implementing the 'I'm Feeling Lucky' button). Tune it for a query syntax, and for getting the 'top X' results somehow, then we just need the web corpus and a basic filter over the results. We could have a very interesting Google competitor.

joe_the_user · on July 19, 2020

If OP article was cherry-picked, the tree must not be very productive.

More than cherry-picking, there's the Eliza Effect - it's pretty easy to make people think generated text is intelligent. That text can seem intelligent for a while isn't necessarily impressive at all.

https://en.wikipedia.org/wiki/ELIZA_effect

bemmu · on July 19, 2020

To be honest, personally I had no idea it was generated until the author said so at the end.

Makes me worry about my own reading comprehension, but I think what happened was that since it was posted on HN and got upvoted a lot, I simply assumed that anything that I didn't understand was not the writer's fault, but mine.

For instance, it was unclear from the post what the bitcoinforum experiment was about, but I just dismissed it as me not being attentive enough while reading.

At one point GPT-3 writes: "The forum also has many people I don’t like. I expect them to be disproportionately excited by the possibility of having a new poster that appears to be intelligent and relevant." Why would people he doesn't like be paricularly excited about a new intelligent poster? Again I just assumed that I missed the author's point, not that it was nonsensical.

Twice it refers to tables or screenshots that are not included, but it seemed like an innocent mistake. "When I post to the forum as myself, people frequently mention that they think I must be a bot to be able to post so quickly" seemed like another simple mistake, meaning to say that when he posted as GPT-3, people thought he was being too quick.

This is like a written Rorschach test, when I'm convinced that what I'm reading must make sense, then I'll guess at the author's intent and force it to make sense, forgiving a lot of mistakes or inconsistencies.

kunfuu · on July 20, 2020

The second one doesn't sound like a mistake to me. Someone being able to consistently post so quickly is actually a valid sign of being a bot.

MetalGuru · on July 28, 2020

Really interesting. Looking back, I did the exact same thing at many of the same points.

rapnie · on July 19, 2020

It will be very annoying for e.g. forum moderators to determine whether first user posts are just a bit incoherent, or generated spam garbage.

luckylion · on July 19, 2020

That used to be a pretty annoying thing back in the days of IRC as a kind of DOS: run a bunch of bots that just replay a conversation from another channel. Engaging them fails, but is that because they are bots, or because they're just ignoring you?

rapnie · on July 19, 2020

The new kind can be more targeted for specific purposes. They could be excellent tools for trolling a forum, inciting flame wars and such.

luckylion · on July 19, 2020

That would require some more advanced tech though. I don't think GPT-3 can target divisiveness yet, especially since it would heavily depend on the community you're writing for, e.g. driving a wedge into the general population is very different than driving a wedge niches. The Linux vs Windows debate might get you engagement in a tech forum, but it'll fall short with social housing activists, and whatever issues they split on will probably not get you anywhere with the tech crowd.

regularfry · on July 19, 2020

I don't think it needs to understand what a divisive issue is to have an effect. If you've got a human operator who can pick a divisive enough prompt, this can dramatically increase their inflamations-per-hour because they don't need to compose the body text.

joe_the_user · on July 19, 2020

It's true that distinguishing these articles from ordinary jointed ramblings of poor writers would be hard. But I'm not sure what the benefit of filling forums with babble has to those running these models.

Bots offering idiocy and idiocy generally has done lots of damage. But by idiocy here I would quite carefully calculated cleverly polarized positions and I don't think just bot-rot would be enough (to maybe coin a phrase).

drcode · on July 19, 2020

I agree, but on the other hand one has to be careful not to be blind to the obvious power of a new technology, simply because it cannot be immediately turned into $$$.

Vinnl · on July 19, 2020

Hmm, I don't know. If you're the IRA [1], it sounds like it could be more efficient to have your trolls select plausible-looking comments from the auto-generated ones rather than having them write them themselves all the time.

[1] https://en.wikipedia.org/wiki/Internet_Research_Agency

ricksharp · on July 19, 2020

Yeah, I saw a text => UI generator.

It’s cool, but it looked like very basic stuff - the type of UI that is very easy to create in a few minutes. (And really with what was setup behind the scenes - maybe just as fast to just write the code.)

The hard part about software development is not those bits which are common, but the parts that are unique to our specific solution.

fluffernutter · on July 19, 2020

> but the parts that are unique to our specific solution

Search terms tweaked for your unique interests, and not a commercial entity's, for example.

bjourne · on July 19, 2020

True! That means that whoever can come up with a system that takes 10 texts written by GPT-3 and always selects the best one (as judged by humans) will become rich and famous. This sampling problem is one of the few major hurdles before generative ai:s become really useful.

throwawaygh · on July 19, 2020

> rich

Is reddit gold really that valuable?

> famous

Surely there are easier ways.

> really useful

We already have enough 2020 reddit commenters regurgitating 2010 hn threads regurgitating 2000 slashdot threads, thanks.

tgsovlerkhgsel · on July 19, 2020

It seems like with minor improvements, you could use this to significantly accelerate mundane parts of programming or writing. Human writes bulletpoints, neural net turns it into a program or letter, human corrects. There already was a pretty smart looking AI-based autocomplete shown on HN a couple weeks ago.

This will accelerate development. Is the current version there? Probably not. But GPT-4 might, and would then accelerate the development of future versions.

Even though this is not "magic", it sounds like it will turn into a practically usable and extremely valuable tool soon.

GrantZvolsky · on July 19, 2020

Being a speaker of Czech and English without a single dominant language, I use Google Translate to improve my writing. I will write a draft in the target language and feed Translate the other. It often comes up with improved style and more accurate expressions, especially in Czech. So as far as writing goes, we're already there.

_the_inflator · on July 19, 2020

Yes, it is more like happy path testing.

However I like spirit of optimism and first looks at encouraging and very promising results.

Exciting times!

m3kw9 · on July 19, 2020

I saw a demo of a gpt3 designing an app that looked just like Instagram home feed skeleton. While it seem impressive, but until you show me something more obscure, that was nothing to brag about.

jbhouse · on July 19, 2020

Please please please post a link to that video. It sounds super interesting

Zetaphor · on July 19, 2020

I assume they're referring to this tweet, where someone created a Figma plugin using the API

https://twitter.com/jsngr/status/1284511080715362304

threeseed · on July 19, 2020

It was posted here: https://twitter.com/jsngr/status/1284511080715362304

Honestly not that impressive since you can get comparable results with a series of regex rules given that there are limited ways to describe your intent e.g. "create a button of colour <colour> at the <location of button>"

emteycz · on July 19, 2020

What are your thoughts on why nobody made the set of rules yet?

luckylion · on July 19, 2020

If designers wanted to write texts to create visual designs, they'd be using some form of DSL and learn to code, wouldn't they?

I believe the hype is that people think they can replace the designer by "just telling the computer" what they want. I don't believe that will work, as they already have trouble telling a human what they want, and a computer won't really know what to do with "I want it to kind of feel like it's from that movie with the blue people that Cameron did, you know?"

In my experience, people have a hard time writing their ideas about designs & features down, because they don't know what they want. They want to talk about it abstractly with somebody who has a better understanding of the field so that person can help them develop the idea. I don't think ML will cover that part any time soon.

Reelin · on July 19, 2020

> people have a hard time writing their ideas about designs & features down, because they don't know what they want

From an academic standpoint, writing is part of the thinking process. If you haven't written it down, you haven't fully thought it through. If it feels difficult, that's probably because your understanding isn't as complete as you thought it was.

From a software development standpoint, implementing something is part of the thinking process. Ever notice how the requirements have a tendency to break as soon as you actually try to implement them? If a spot seems difficult it just means you hadn't really figured it out yet.

luckylion · on July 19, 2020

> From an academic standpoint, writing is part of the thinking process. If you haven't written it down, you haven't fully thought it through.

I 100% agree. I noticed a giant shift in tasks when I made one client write tickets instead of making phone calls. Writing it down forces you to think it through.

And I agree about software development as well, yes. Though I think it's even rare to have somebody describe all the features they want unless it's an experienced software developer who basically writes a textual representation of the application.

But for most PMs (that I've worked with at least), they have vague ideas about what they want, and bringing them into focus is a back and forth with developers and designers. I don't see them getting anywhere with an NLP automaton, but maybe with an Eliza-style system: "Give me a big yellow button saying 'Sign up'" - "Why do you want a big yellow button saying 'Sign up'?" - "You're right, that's too on the nose... give me a link saying 'Sign up'"...

nutanc · on July 19, 2020

GPT-3 isn't magic. That's the most important thing. I got so amused with the hagiographical tweets that I coded myself a non GPT-3 demo :)

https://twitter.com/nutanc/status/1284446411438710784

davidgerard · on July 19, 2020

What this means is that GPT-3 is good enough to fool a crypto VC.

@balajis being generated by GPT-3 would make a lot of sense, though.

coderintherye · on July 19, 2020

I don't know, this seemed pretty close to magic to me:

https://twitter.com/jsngr/status/1284511080715362304

Granted, it seems like there was a lot of behind the scenes work to make that happen.

ekianjo · on July 19, 2020

this is something you could do with NLP already before.

jcims · on July 19, 2020

Have you spent any time interacting with GPT-3?

It's qualitatively different than GPT-2. I was on a discord with someone that has access to it and a bunch of us were throwing ideas out for prompts. One of them was to provide an anonymized bio of someone and see if it could guess who it was. The format was 'this person...they..and then they...\nQ: Who is this person?\nA: '

At the first pass it didn't guess correctly. But we erased its response and tried again and it got the answer correct. We then asked it to screenwrite some porn and tell jokes. Yes there were some misses, but it got things right so frequently that you can start to see the future.

Having all of this capability in one package is pretty remarkable and nothing has approached it to date.