Show HN: Whole Git repo was made with ChatGPT

dkjaudyeqooe · on Dec 4, 2022

I guess we can look forward to weeks of "Show HN: $X created by ChatGPT" but people should be cautioned not to read to much into these results. Always remember that almost all of what is being presented here is the work of humans, regurgitated by a very much non-intelligent machine, despite its name. It's basically:

Human creation -> ChatGPT -> Human query -> Human interpretation

The last bit, the interpretation, is particularly important. Just like we're predisposed to seeing faces everywhere, we're predisposed to seeing meaning, and perhaps "intelligence", everywhere. In this case the meaning is very convincing since it comes from other humans, diced and sliced, but is merely presenting ourselves to ourselves in an interactive way, using our style of discourse.

jchw · on Dec 4, 2022

I'm sure this line of logic is very comforting, but frankly, this comfort disappears quickly when you actually use ChatGPT. What you find is that you can interact with it in a quite natural way, and it is able to synthesize and iterate at a level that feels easily on par with a moderately skilled human software engineer. I know it's uncomfortable, but it doesn't even matter if the machine is "non-intelligent." Nobody gives a damn. What matters is what you can do with it, and every iteration of GPT the goal posts keep moving further, but this time it's really difficult to deny: you really, really can describe a program at a high level and ChatGPT can implement it. You can point out an error and it can fix it. Hell, you can feed it compiler errors.

Is it literally as good as a human software engineer? No, but it's also better too. I doubt ChatGPT could debug as effectively as a veteran software engineer (... In fairness, most humans can't either.) It can debug pretty decently, but there's still work there. That said, the breadth of knowledge encoded in a language model is stunning. I'm pretty sure you can't just regurgitate an implementation of the discrete cosine transform in Rust without at least pulling up Wikipedia, but ChatGPT can, because well. It doesn't have to pull it up.

I still don't think ChatGPT is ready to replace human programmers. It may be a long time before we have general enough intelligence to replace knowledge work meaningfully with AI. However, if you think it's not happening ever, because machines are not "intelligent" based on some set of goal posts, I've got bad news: that's not part of the job listing.

It's easy to laugh at MSN publishing articles written by GPT; that's just stupid. However, at some level you have to admit that the input to ChatGPT is almost as high level as directives from project managers, and the output is almost low level enough to simply input directly into source control. That leaves very little to the imagination for how this could quickly spiral out of control.

rightbyte · on Dec 4, 2022

Ye I signed up and tried some queries. It was quite scary.

"Write a function that sorts the elements of a c string backwards, in C." "Add a flag to the sort function which makes it sort the string forward." "Could you write a endian swap function for double argument in PIC assembler?" "Could you write a binary sort in PIC 8-bit assembler" "Write a Javascript function that prints the day of the week together with some Chinese words of wisdom."

It had no problem doing any one those. I ran them all, except the assembler ones.

The question is how good it is to process larger chunks of code and makes changes to it.

People thinking about becoming programmers might need to rethink their plans if this one improves ...

EDIT: Oh dear. I introduced bugs in its sort code and it found them and explained what they did.

guiambros · on Dec 4, 2022

Last night I entered the page-long instructions for Advent of Code day 4, and it spewed out perfectly readable code and solved it on the first try [1]. And we're not talking about a common algorithm that has been solved many times before, but a convoluted story that is full of "elves cleaning up overlapping sections of the camp" (!), and ChatGPT was still able to understand it, write the code to solve it, and even explain how it works.

It's nothing short of a phenomenal milestone.

[1] https://twitter.com/GuiAmbros/status/1599282083838296064

ak_111 · on Dec 4, 2022

Are you sure this whole problem with its solution wasn't indexed in ChatGPT training set, so all it did here is retrieve the whole answer to the problem similar to how google will find the solution pretty quickly if you search for "advent of code day 4 solution"?

Can you see if it solves an advent of code problem whose solution hasn't been posted yet? If it did I will consider this an almost publishable breakthrough.

ghusbands · on Dec 4, 2022

It wasn't in the training set - the puzzle was released today, as part of this year's advent of code. ChatGPT is eminently publishable and has demonstrably produced plenty of novel, if boring, code.

TedDoesntTalk · on Dec 4, 2022

Genuine question: can it write larger codebases yet such as: “write a pac-man clone in JavaScript, html, and css”

MacsHeadroom · on Dec 4, 2022

Yes, but you'll have to chop the project up into different parts and describe each sufficiently well.

For example, start with "write a level generator in JavaScript which outputs pacman like levels encoded in ASCII which could be used as the basis for a pacman clone."

Then you'd say "Great, now write a pacman clone in JavaScript which takes the input of the previously generated level generator and takes interactive input on the console."

Then you'd have to ask it to generate the HTML and CSS. It won't do it all at once. But it will do it piece by piece.

rafabulsing · on Dec 6, 2022

Yes, it's absolutely feasible. I did exactly that with a simpler version of this (Tic Tac Toe, and console only, no actual visual interface) but it worked pretty well.

The main thing is having a good idea of what the overall program structure will look like. If you have that, I think you can go pretty far with it.

Btw, in case anyone wants to see what working like this may look like, I recorded myself while doing that little experiment: https://youtu.be/TBMQIDMwI5E

negamax · on Dec 4, 2022

It can't write pacman. But if you describe a scenario to it, it writes pretty accurate code and can drill down. Very impressive

dwild · on Dec 5, 2022

I'm trying that right now. It can probably do it, the big issue right now is that it won't give really long answers and just stop in the middle, which can be avoided by asking him to finish it, but it doesn't always works.

rafabulsing · on Dec 6, 2022

If you can break down what you want into smaller functions, instead of just asking for the whole thing at once, you can actually do a lot with it, including telling it to out it all together at the end.

Won't link it again so as to not get too spammy, but I posted a recording of an experiment I did playing around with ChatGPT in that way on a sibling comment if you want to check that out.

ignu · on Dec 4, 2022

I've tried giving it languages like Turbo Pascal or telling it to obscure frameworks like fp-ts and while sometimes it's not syntactically perfect, but I can guarntee it's not just scraping code snippets from somewhere.

dwild · on Dec 5, 2022

The guy that got the first position for the first part, thus the first solution ever on that problem, was using ChatGPT to do it. [1]

[1] https://github.com/max-sixty/aoc-gpt

alexeldeib · on Dec 4, 2022

I tried higher level problems: write an http server in rust using warp and postgres, with endpoint for get users and put user.

9 compile errors, some due to cargo.toml issues (it only generated main.rs). But mostly worked.

The post we’re discussing is also pretty impressive, a working lexer/parser

xur17 · on Dec 4, 2022

> The question is how good it is to process larger chunks of code and makes changes to it.

> People thinking about becoming programmers might need to rethink their plans if this one improves ...

Very true, and this thought definitely crossed my mind as well. In the short term I imagine this is going to be like any other tool, it will increase your leverage, and make it possible for you to write code faster.

For example, I fed it 100 lines of a Django model and asked it to write some query code for me. Produced something that was possible faster than I could have. The few slight deficiencies it did have (optimizations) basically felt like me doing a code review.

nwienert · on Dec 4, 2022

The answer is it’s very bad at large amounts of code. It has no skill at reasoning across a large amount of files where say you’d need to make many small changes in a handful of places and know that those changes all work together.

These models are amazing at regurgitation. They have no skill at reasoning and any solution outside their training set. They have a distinct disability for precision in anything remotely novel or large. And they will bullshit subtle bugs all over the place at scale.

Even a junior programmer would produce more value than the next generations of this, simply because they can take vague English instructions and actually make iterative progress across a huge codebase, test those changes for all sorts of vague constraints, and be precise. There’s no ability for that here.

rightbyte · on Dec 4, 2022

Well my mortgage is thankful for that.

I have a feeling though that tools like this will lower the skill threshold for programming by a lot.

I gave it this code I just made and asked it what it does and it figured it out:

    public class mean_sink {

        double sum;
        long n;
        
        public mean_sink(){
        }
        
        public void add(double d){
            sum += d;
            n++;
        }
        
        public double getMean(){
            if(n == 0)
                return Double.NaN;
            return sum/n;
        }
    }

I asked it to improve the numerical accuracy and it did it with BigDecimal instead. I asked it to only use doubles and it did some strange reciprocal summation when I was "hoping" for Kahan summation algorithm. Its "reciprocal summation" did not work. But it made a correct Kahan when I asked for it.

I dunno. This is insane anyway. It is really hard to grasp the consequences.

passion__desire · on Dec 4, 2022

This tool would make for a great pair programmer and an intelligent interactive rubber duck

birdyrooster · on Dec 4, 2022

Except you must have the underlying knowledge to effectively review code. So how that would obviate a career in programming is a strange choice by the grandparent commenter. Anyone can rubber stamp bad code, but to stop bad code from entering projects you need experts.

aprdm · on Dec 4, 2022

Most projects have plenty of bad code, nobody cares, as long as it solves the problem.

ad404b8a372f2b9 · on Dec 4, 2022

The bad code it produces is not just stylistically bad, it often doesn't solve the problem.

It seems to be making the same kind of errors in code as we've seen in speech, you get a lot of plausible but subtly wrong hallucinated content that only someone who knows the subject can identify. It prefers probable output over matching the specifications so it'll add unnecessary branches just because it's seen them millions of times.

Among errors I've seen it make there is bad indexing, adding conditions that serve no purpose or assuming a variable has a specific structure when it doesn't.

senectus1 · on Dec 5, 2022

>Even a junior programmer would produce more value than the next generations of this

At the rate this is evolving that wont be a limitation for long.

My 14 yr old son thats taught himself to code in c# over the last few years is going to be put of a potential job before he cab be taught to code professionally.

This is quite frightening, the rate of evolution.

ChickeNES · on Dec 5, 2022

I already managed to get it to write an implementation of Pong and a Chip8 interpreter, both in C and using SDL2 for graphics/input/sound (it even wrote a function to generate the waveform for a beep for Chip8 and rebound sound for Pong!). I'm pretty sure I can get it to generate all the code for a Gameboy or NES emulator, I've taken a few tentative steps and it definitely can reproduce the memory map and generate code to parse iNES/GB cartridge headers

nobody5050 · on Dec 5, 2022

Something I’ve noticed in my own experimentations is that it’s really only limited by the length of its replies. It can happily generate perfect code but then stop in the middle of a function simply because the website only asked for 200 tokens or something. It’s really quite something to say that one of my gripes lies not in the model but in the user interface wrapping the model.

dopidopHN · on Dec 4, 2022

Hi, thanks for breaking it down.

Do you write code close to the machine at $job?

I wanna try with more feature and UI heavy request.

I actually comment because of your edit in your response : you manually introduced bugs, reran the process, and it seen you broke it ( fine ) but was able to explain it to you in English ?

Where is that comming from ? Adjacent comments and git comments?

rightbyte · on Dec 4, 2022

> I actually comment because of your edit in your response : you manually introduced bugs, reran the process, and it seen you broke it ( fine ) but was able to explain it to you in English ?

I fed it it's own sorting function but I added bugs to it and asked it to spot the bugs.

makestuff · on Dec 4, 2022

I asked it to make some stuff in flutter and it worked well. I did notice sometimes it just adds a comment in the function saying “insert logic here for X” when it doesn’t know how to write it.

sage76 · on Dec 4, 2022

I tried a hard level leetcode problem on it. Perfect results on the first attempt.

dvngnt_ · on Dec 15, 2022

it was trained with leetcode data though?

sage76 · on Dec 23, 2022

Ah I didn't know that.

melagonster · on Dec 5, 2022

programmers are final worker. there are not no more jobs.

yuuu · on Dec 4, 2022

> However, at some level you have to admit that the input to ChatGPT is almost as high level as directives from project managers, and the output is almost low level enough to simply input directly into source control.

This is the part that is going to change the way people work in fundamental ways very quickly. It will turn good software developers into non-shitty project managers, project managers into ?, and mediocre developers into ?.

timeon · on Dec 4, 2022

Project managers into mediocre developers and mediocre developers into jobless.

ricardobayes · on Dec 4, 2022

Still, it's a tool that requires human input. Similarly how Photoshop didn't make artists obsolete, it elevated the trade to a new level and capabilities.

rightbyte · on Dec 4, 2022

It is probably an excellent learning tool. You can show it code and it will point of the bugs.

    void sort_string(char *str, int backwards) {
      // Get the length of the string
      int len = strlen(str);

      // Use a bubble sort algorithm to sort the string
      for (int i = 0; i < len; i++) {
        for (int j = 0; j < len - i; j++) {
          // If the current character is greater than the next character (or less than, if sorting backwards), swap them
          if ((backwards && str[j] < str[i + 1]) || (!backwards && str[j] > str[i + 1])) {
            char temp = str[j];
            str[j] = str[j + 1];
            str[j + 1] = temp;
          }
        }
      }
    }

Will yield: "The sort_string() function contains two bugs. The first bug is in the inner loop conditional statement [...] The second bug is in the termination condition of the inner loop."

and a detailed explanation of the fix and what the problems are.

The problem is that the working market might be flooded by this since way more people will be able to code ... Github Co-pilot is nowhere near this.

ascorbic · on Dec 4, 2022

> Github Co-pilot is nowhere near this. They're both based on GPT-3. The difference is that Copilot's model is trained on a lot more code. I'd be amazed if Copilot doesn't have a super-linter soon that can find and fix these kind of bugs.

ricardobayes · on Dec 4, 2022

Yes, it seems useful for advanced learners who might actually have questions. It probably can't teach you from scratch yet.

drivers99 · on Dec 4, 2022

I asked it to write a Forth function to do something and when it used commands (words) I didn’t know I asked it what those did. Seems useful for learning. The problem I see is you never know for sure when it’s bluffing about a fact.

timeon · on Dec 4, 2022

Photoshop didn't make artists obsolete but i.e. rendering software did many people obsolete.

passion__desire · on Dec 4, 2022

"And what would excite me about actually the birth of superintelligent AI - one of the things, apart from solving obvious problems like curing disease and energy issues and all the rest, perhaps differs a little bit with what Stuart said. I’m not so worried about idiocracy or all of us just losing our way as apes and living unproductive lives in dialogue with these oracles. I think actually, I would want a truly value-aligned superintelligence to incrementally show us, not merely conserve what we want, but show us what we should want to keep improving our values so that we can navigate in the space of all possible experiences and converge on better and better ones." - Sam Harris

Source : https://youtu.be/h0962biiZa4?t=2486

rajin444 · on Dec 4, 2022

> However, if you think it's not happening ever, because machines are not "intelligent" based on some set of goal posts, I've got bad news: that's not part of the job listing.

When an AI is able to train itself (like a human) is when developer jobs will be in trouble. Fortunately, every other profession will also be out of a job as well.

Until then, rote work will continue to be automated more effectively but no more.

vbezhenar · on Dec 4, 2022

The most ironic thing is that people who do the hard work are unlikely to be obsoleted any time soon. At least in third-world countries. Theoretically you can build robot today to replace janitors, plumbers or serviceme but it just too expensive and does not scale.

But replacing computer operators is scalable. So I definitely expect most office people including low skilled developers to be replaced by AI in a 20 years. High-skilled developers might survive (and I'm not sure if I'd qualify) but obviously for most workers that's not very encouraging.

19h · on Dec 4, 2022

Honestly the thing that shocked me the most is this kind of query I've been doing for the last 10 hours:

  Rewrite the following in Rust:

  size_t strlenNEON(const char *p)
  {
      const char *const top = p;
      uint8x16_t c16 = vdupq_n_u8(0);
      /* 16 byte alignment */
      size_t n = reinterpret_cast<size_t>(p) & 15;
      if (n > 0) {
          uint8x16_t x = *(const uint8x16_t*)&p[-n];
          uint8x16_t a = vceqq_u8(x, c16);
          unsigned long mask = GetByteMask(a) << (16 + n);
          if (mask) {
              return __builtin_clz(mask);
          }
          p += 16 - n;
      }
      assert((reinterpret_cast<size_t>(p) & 15) == 0);
      for (;;) {
          uint8x16_t x = *(const uint8x16_t*)&p[0];
          uint8x16_t a = vceqq_u8(x, c16);
  
          if (isFound(a)) {
              unsigned int mask = GetByteMask(a);
              return p + __builtin_clz(mask) - top;
          }
          p += 16;
      }
  }

It even correctly produces the implementation of the unimplemented functions being called from the C++ impl.

thdespou · on Dec 5, 2022

It's the journey not the destination that matters.

dorkwood · on Dec 5, 2022

It works well with popular languages. Any somewhat obscure language results in ChatGPT hallucinating nonsense code and telling you it's correct when it's not.

int_19h · on Dec 6, 2022

It can do pretty well with obscure languages if you explain how they work before you ask for a solution. You can even invent a whole new language just for the question.

wizeman · on Dec 4, 2022

Almost all the comments I've seen criticizing ChatGPT run into the same fallacy: they're applying a double standard compared to humans. What you said applies to output produced by humans as well. Similarly, many humans also have difficulty doing multiplication with large integers, differentiating between truth and fiction, answering with non bullshit answers, etc. Sure, it's probably a question of degree (and it's an important limitation), so we could probably say it's at the level of a dumb human, but even so, it can already exceed the average human's capabilities in many interesting and useful tasks!

And not only that, what's most amazing is that it can exceed humans at abilities that many years ago, most people thought only humans could accomplish (tasks including creativity, including creating art, writing poems/literature, interpreting ambiguous human questions and text, doing analogy-based reasoning and problem solving, grasping and explaining puns, making new jokes, etc). And these models can only improve in the future!

saulpw · on Dec 4, 2022

Humans have "skin in the game". If they don't do "good enough", they could be reprimanded (ego-hurt), out of a job (stressful), or even in prison or dead (existential threat). This is not foolproof, but it's at least something. There is no incentive for AI to be correct, nor any incentive for it to become more correct.

To wit: I asked ChatGPT to write me a function to generate Hanukkah dates. It spewed a couple of facts about Hanukkah, and the generated function just returned the 8 days following Christmas. I said it was wrong, and it said "you're right!" and then gave me another wrong function (always returning March 25th, since it's actually in the 3rd month of the Jewish calendar, duh). I told it that it was wrong again and asked specifically for a Jewish to Gregorian calendar conversion function, and it gave me some more code that I have no idea is correct or not. So I asked for a table of 10 test cases of Jewish dates and their corresponding Gregorian dates, and not a single one was valid.

I actually have worked with some remote contractors who pull this kind of crap, just throwing garbage over the wall repeatedly until I give up because I just don't want to deal with them anymore. And no, they never get any better. They are not junior devs you can mentor or train or whatever. Their whole model of work is to generate something complicated enough to seem plausible and then bullshit you into accepting it so they can get paid. I have to inspect every line of code they write and do all the diligence myself anyway, and this is more time-consuming and less fun than just doing it myself in the first place. At best it's a nerd-snipe motivation in a "wrong answer on the internet" kind of way.

Strong No Hire. And if you hire them and stick me on code-review duty, I quit.

wizeman · on Dec 4, 2022

> There is no incentive for AI to be correct, nor any incentive for it to become more correct.

Are you saying that for current AI, or also for future AI?

I'm not an expert, but I think reinforcement learning is a pretty well-established AI technique at this point, which does give an AI incentive to be correct and become more correct.

I've heard rumors OpenAI is even doing that with ChatGPT, based on human feedback (with the thumbs up/down button).

> Strong No Hire. And if you hire them and stick me on code-review duty, I quit.

I mean, I agree. I wouldn't hire ChatGPT to replace a programmer currently, just like I wouldn't hire many humans, even many who claim they are programmers/engineers.

But at this rate, I might just hire a future version of it :)

And I'm already using it as a pair programmer, code reviewer, glorified Google search engine, email/letter redactor... and it also gave me truly amazing insights on how to prove difficult mathematical theorems that I was struggling to prove (on a theorem prover).

It's nothing short of a miracle, in my view.

benjaminjosephw · on Dec 4, 2022

Tools are often objects that "exceed the average human's capabilities" in some respect or another but assigning the quality of intelligence to tool itself is like calling a hammer strong or a chisel brave. It maybe true in a metaphorical sense but it doesn't have any objective meaning.

wizeman · on Dec 4, 2022

I really don't understand the argument you're making. From my perspective, you're doing exactly what I said: you're setting a double standard, like the previous poster.

If I have a robot that talks like a human, including answering questions like a human, behaves like a human in any way that is important... wouldn't this "tool" be intelligent, just like a human?

How are robots different from ChatGPT except having a physical presence and being able to manipulate the world physically?

Or are humans so special to you (or intelligence so subjective) that you can't possibly answer "yes" to the above statement about a robot being intelligent?

skor · on Dec 4, 2022

what you are saying here basically is: if it quacks and can fly, it’s obviously a duck.

wizeman · on Dec 4, 2022

No, because I'm not saying robots are human just because they're intelligent.

The analogy is more like: if robots can write like a human, then robots have the ability to write, and saying otherwise is just applying a double standard.

Similarly, if a robot can behave as intelligently as a human, then such a robot is intelligent and I don't see how anyone can argue otherwise without logical fallacies.

skor · on Dec 4, 2022

ok, no argument from me on that.

but are we saying robots can behave intelligently as a human?

wizeman · on Dec 4, 2022

> but are we saying robots can behave intelligently as a human?

I believe that ChatGPT can behave more intelligently than most humans at many important and useful tasks already.

I also believe that it behaves less intelligently than most humans at many tasks that are also important and useful.

For some other tasks, it behaves at more or less the same level as a human.

Moreover, I believe that the gaps between what models like ChatGPT can do and what a human can do are closing extremely rapidly.

jiriro · on Dec 6, 2022

You ChatGPT? :)

cvladan · on Dec 7, 2022

ubj · on Dec 4, 2022

More broadly, I think we'll see a proliferation of internet content generated by ChatGPT--blog posts, essays, product summaries, tutorials, news articles, etc.

The danger is that the content generated by ChatGPT is very coherent, but technical details are frequently wrong. It's going to become increasingly difficult to separate the signal from the noise on the internet.

thr0wawayf00 · on Dec 4, 2022

This is what concerns me. Many people worry about being replaced by AI, but I'm far more worried about AI completely polluting digital communication. I think stuff like this is going to make it harder for companies to assess candidates, fight fraud and generally operate. I hope I'm wrong, but thinking about AI entering the digital communications arms race is only going to make things more difficult, i.e. more personalized spam call scripts, more intelligently targeted ads/spam/scams.

I think AI is going to force us on some level to reclaim some of our communication to the physical world, if for no other reason than it's going to be one of the only ways we can truly verify that we're talking to another person.

RulerOf · on Dec 4, 2022

> I think stuff like this is going to make it harder for companies to assess candidates, fight fraud and generally operate.

Yikes. Now I imagine a totally unqualified candidate for literally anything feeding interview questions to the chat bot and I don't like it at all.

zdragnar · on Dec 4, 2022

Even worse: now there is a justification for forcing candidates to solve coding problems on whiteboards, as interviews and coding homework problems will be considered intently suspect.

My single worst interview experience was an on-site five hour marathon of whiteboard coding, with a grumpy senior insisting that the code on the whiteboard be syntactically correct. Nothing screams "we want unthinking ticket crunching machines" like optimizing for candidates willing to focus on writing code by hand for hours on end.

Naturally, I rejected the follow-up interview, but I fear that more companies now are going to demand this nonsense.

Side note: in my personal example, the whiteboard session wasn't the reason I turned them down; I asked every person on the team roughly how many hours a week they worked and not one of them answered the question (instead redirected the conversation towards "you can work from home sometimes!" type answers).

Since then, however, I have rejected other companies trying to ape this style flat out. A short half hour, fine. Five hours? Pound sand, I say.

darkwater · on Dec 4, 2022

You know the real issue there? In 5 years that kind of company will be using only CodeGPT instead of hiring humans.

thr0wawayf00 · on Dec 4, 2022

I think any company relying solely on AI to build a tech business in the future is itself at risk. Where's your moat if your business is built entirely on AI licensed from someone else?

thinkloop · on Dec 4, 2022

In an era where that's possible, the expectation will be for humans to be working hand-in-hand with computers, whether to make superior code or answer better interview questions. The bar will simply be elevated, and you will have to judge candidates on their computer/human synergy. The only time that what you say could be a problem is if the technology to answer interview questions is far superior than that of doing actual work. But then there is the next round of interviews (in-person, etc.). This also kind'of exists today in the form of googling answers while on interview, which for some reason lots of companies don't like, even tho no coder isn't constantly googling on the job.

Nathanba · on Dec 4, 2022

I thought about the AI pollution and I don't think it will matter because we already had the exact same problem with forum spam and low information / bad people (=eternal september) type content overwhelming online discourse. I think maybe combatting AI spam will be the least of our problems because humans do plenty of AI-tier spam already. I think the advent of AI may even be positive because people will start to value real human interaction more. It will be like a treasure cove when you find a forum that is highly curated with real people and you can rely on that being the case. Or (and this is just as likely): Online platforms will go towards requiring real human IDs for participants. AI spam would only be used by companies and advertisers.

Maybe eventually there will be a public understanding that only a foolish child would read a reddit frontpage and expect human content. It will all be auto generated to trap people, like a gatcha game.

thr0wawayf00 · on Dec 4, 2022

> Online platforms will go towards requiring real human IDs for participants.

I just don't see how AI won't be used to manipulate these kinds of barriers. Once AI reaches a point where it can truly equal or surpass humans in terms of operating online, how are we going to prevent it from circumventing those barriers? If AI can generate code from a chat prompt, who's to say it can't solve a captcha in the future? And once that happens (because we all know it will at some point), how are we going to be able to differentiate?

Nathanba · on Dec 4, 2022

real human ids = passports, bank logins

madars · on Dec 4, 2022

There is a market for KYC'd accounts. Someone operating an AI bot farm posing as Real Humans would just buy these.

thr0wawayf00 · on Dec 4, 2022

Ok, but the systems that are responsible for creating those IDs are automated, which means that they can be learned and reverse-engineered. There are lots of passports and bank logins floating around that can be used to train AI.

Do you see the problem now?

int_19h · on Dec 6, 2022

The other problem is that AI can be put in charge of phishing operations. Once you devise the correct prompt to get past the filters, it understands what phishing is, and will quite happily write phishing emails for any audience you care to describe. Combine that with an automatic mailer, and you could just fish for IDs (along all the other profitable stuff), and then use those IDs to spam more etc.

mike_hearn · on Dec 4, 2022

Passports contain digitally signed X.509 certificates in their chips. No AI can learn to forge a digital signature (we hope).

baq · on Dec 4, 2022

So you buy them signed at the source.

mike_hearn · on Dec 5, 2022

The only passport you can buy is your own, right? The signing keys are held by national governments. I don't really understand the threat model here.

baq · on Dec 5, 2022

A government sells you passports without validation of your identity.

Nathanba · on Dec 4, 2022

they are not automated and require real world steps

b800h · on Dec 4, 2022

> I think AI is going to force us on some level to reclaim some of our communication to the physical world, if for no other reason than it's going to be one of the only ways we can truly verify that we're talking to another person.

I've been thinking along these lines a lot recently - it seems as though in every field which AI enters, it causes us to return to some extent to the physical world.

skor · on Dec 4, 2022

yeah, agree. I think it will be overall a degenerative process.

More and more content that gets created by machine that is of low quality will get in the way of its own future training.

There will eventually be less and less human made content to train from.

anticristi · on Dec 4, 2022

Alternatively, we might become more journalistic/academic. Naming and checking your sources will become paramount.

ubj · on Dec 4, 2022

To an extent yes, but this can quickly become overwhelming.

For example, editors and reviewers for academic journals / conferences will likely see a deluge of AI-generated "scientific" papers. Their time is limited, and odds are that more papers with incorrect information will slip through the peer review process.

To be clear, peer review today certainly isn't perfect at catching bad papers. But AI generation has the potential to exaggerate the problem.

mike_hearn · on Dec 4, 2022

That's already been a problem for some years already:

https://dailysceptic.org/2022/06/08/fake-science-the-threat-...

The sad thing is it doesn't take a ChatGPT level intelligence to beat scientific peer review. Journals routinely publish papers that are completely auto-generated gibberish. A simple generative grammar or template splicer is apparently enough. These are articles that are immediately visible as the work of a program at first glance, they wouldn't make it past even the most amateur blog or student newspapers, yet they surface in their thousands in journals that are supposed to be the epitome of accurate knowledge!

Worse, the journal publishers are doing nothing about it. Their current approach to trying to fix the problem is to try and use the work of random CS academics to make "spam filters" for paper submissions. The more obvious solution of having editors and reviewers who actually read scientific papers before they are published appears to be rejected out of hand.

anticristi · on Dec 4, 2022

For inspiration, here is how the NYTimes deals with anonymous sources:

What we consider before using anonymous sources: How do they know the information?

What’s their motivation for telling us?

Have they proved reliable in the past?

Can we corroborate the information they provide?

Because using anonymous sources puts great strain on our most valuable asset: our readers’ trust, the reporter and at least one editor is required to know the identity of the source. A senior newsroom editor must also approve the use of the information the source provides.

thr0wawayf00 · on Dec 4, 2022

I hope so, but the cynic in me doesn't see this happening because this has long been a problem that isn't going away.

The better that computers get at generating content and behaving in ways that only humans used to be able to is going to make it harder to determine if a source is human or not.

diydsp · on Dec 4, 2022

If this AI is so amazing, it can save us all a lot of grief by translating the Linux kernel to Rust now in one fell swoop rather than enduring 1,000 incremental posts of "now $X is written in rust..."

synu · on Dec 4, 2022

I also wonder what the effect of more and more model outputs being fed back in as inputs will be.

jnwatson · on Dec 4, 2022

“Very coherent, but technical details wrong” covers a great deal of human-generated content as well.

cheald · on Dec 4, 2022

I see a lot of this on the internet already - essentially just SEO spam generated by RNNs, which is good enough to get a plausible click-through, but not good enough to actually be right. I suspect ChatGPT and its ilk are going to just exacerbate that problem by being harder to recognize as generated, and instead tipping into plausible-but-incorrect.

johannes1234321 · on Dec 4, 2022

Sounds like the typical article spamming search results already via cheap content farms ...

But yes, will be "interesting" to observe, how such a new wave will make theinternet worse.

anon7725 · on Dec 4, 2022

> Always remember that almost all of what is being presented here is the work of humans, regurgitated by a very much non-intelligent machine, despite its name.

Then why does it seem to have some kind of theory of computation?

I gave it code to compute a Fibonacci sequence with nonsense names for functions and it immediately recognized the code (explainable by pattern matching).

I gave it a function and told it to modify the function so that it passes a test, and it did it without a problem.

I’ve had it build a function up by feeding it different test cases.

I had it explain kmeans to me, give me a first-pass solution, asked it how to optimize the first solution, had it do that for me …

I don’t know - it feels different. It also feels like it’s 1-2 years away from being the most useful co-worker on any given team and 5-10 years away from putting a number of us out of a job (or at least changing that job so much as to be something unrecognizable to us today)

pyinstallwoes · on Dec 4, 2022

What's funny is that most of human intelligence is of itself the work of a few select humans regurgitated by the rest of humans.

fathrowaway12 · on Dec 4, 2022

It is not just regurgitating.

It is synthesizing.

You can ask it to update its answers. You can ask it to evaluate possible solutions, then write the code for one of the solutions. Then write the code to test the code. And so on.

It turns the coder into a curator, editor, and code reviewer.

arcturus17 · on Dec 4, 2022

> It turns the coder into a curator, editor, and code reviewer.

Cool?

Doesn't sound half bad to me. I've enjoyed learning to code more than actual coding. I'd be happy in a paradigm where only code to learn, to practice, to express ourselves and leave the boring bits to the machines.

mring33621 · on Dec 4, 2022

The only things that really matter:

1) is the end result useful/valuable?

2) did the GPT-based 'coding' process provide any of the following benefits: [reduce cost | reduce time to market | increase quality]?

guerrilla · on Dec 4, 2022

reidrac · on Dec 4, 2022

> The language doesn't completely run, but it was just a two-hour experiment.

Do you mean?

seydor · on Dec 4, 2022

We can look forward to a ChatGPT-frontpage with articles generated by chatgpt, which we will PREFER compared to the real thing.

drdaeman · on Dec 4, 2022

Next on HN: connecting two ChatGPT accounts together, giving one the assignment of thinking of projects and tasking/guiding another to implement those.

ImHereToVote · on Dec 4, 2022

More specifically, making paper clips.

jvanderbot · on Dec 4, 2022

That "attribution of agency" is probably what brought us myths of human-like gods with human-like intelligence that control everything we didn't understand.

More of the same.

seydor · on Dec 4, 2022

when all you know is an agent, everything looks like an agent

timeon · on Dec 4, 2022

Human-like gods were tools - understandable interface.

Waterluvian · on Dec 4, 2022

I think the “it’s just welding together bits and pieces made by humans” is actually a complete non-answer.

From one perspective, that’s what humans have always been doing. We create new things by welding other things together.

From another perspective, that’s not even a bad thing. Every word exists in the dictionary so what value are books? Unless the value is the structuring of all those things that already exist.

pyinstallwoes · on Dec 4, 2022

If you believe in evolution then you by principle believe in less intelligent entities organizing to make more intelligent entities than their progenitor. In that case Human-kind does not have to be very intelligent to make something much more intelligent down-the-line. Mind the consequence and bias of interpretation.

peterth3 · on Dec 4, 2022

> We're predisposed to seeing meaning, and perhaps "intelligence", everywhere.

I’m guilty of this with my dog. I can’t help it with her head tilts and deep stares! Her inner monologue is probably less sophisticated than I like to think it is.

pyinstallwoes · on Dec 4, 2022

Why are you inclined to think that way?

bjornsing · on Dec 4, 2022

> In fairness, most humans can't either.

This. I think we’re already seeing superhuman performance here, in the sense that many / most people can’t e.g. read a specification for a programming language and write simple programs in that language.

jmugan · on Dec 4, 2022

I love the "Show HN: $X created by ChatGPT" We need people seeing what useful things can be done with this new hammer, things that haven't occurred to us yet.

gfodor · on Dec 4, 2022

You are jumping to conclusions and it’s leaking into your framing like using the word “regurgitation”

mk_stjames · on Dec 4, 2022

Computation Warfare: This kind of model could be used by a bad actor to generate endless sincere-looking codebases for things that of course don't actually work but are so complicated that it would take a skilled person to determine it was not code from a real codebase, but by the time that happens large numbers of repositories of code will flood github and the internet in general making it essentially impossible to train new LLM's on data after a certain calendar date, as large amounts of it will be cryptically incomplete.

This is similar to a dilemma proposed around images and image models like Dalle and StableDiffusion soon being responsible for the vast amount of image content online and thus future models could ingest said content, and we find ourselves in a weird feedback loop. With images, you could get interesting generational results (deep-dream-like) to a point.

With code or other information, I see nothing but things just being broken, and wading through broken code forever.

saurik · on Dec 4, 2022

Let's say you, a human, were given access to a ridiculously-large trove of almost-working software; do you believe you would be unable to learn to program correctly? (Related: would you even need to look at much of that software before you were able to code well?)

I am extremely confident that I am better than almost all of the code I learned to program with. If nothing else, someone out there must have written the best version of some particular function, and they didn't get to see a better version beforehand.

When I look at intro programming books now, I consider many of the examples sufficiently flawed that I tell people I am teaching who are using these books "well, don't do that... I guess the author doesn't understand why that's a problem :/".

And yet, somehow, despite learning from a bunch of bad examples, humans learn to become good. Hell: a human can then go off and work alone in the woods improving their craft and become better--even amazing!--given no further examples as training data.

To me, that is why I have such little fear of these models. People look at them and are all "omg they are so intelligent" and yet they generate an average of what they are given rather than the best of what they are given: this tech is, thereby, seemingly, a dead end for actual intelligence.

If these models were ever to become truly intelligent, they should--easily!--be able to output something much better than what they were given, and it doesn't even seem like that's on the roadmap given how much fear people have over contamination of the training data set.

If you actually believe that we'll be able to see truly intelligent AI any time in the near future, I will thereby claim it just won't matter how much of the data out there is bullshit, because an actually-intelligent being can still learn and improve under such conditions.

mk_stjames · on Dec 4, 2022

I kinda agree... and I would really not want to be a very young person right now, as I feel the world will be much harder to navigate and learn from. It takes so much more energy to refute bullshit than to make it, and if this starts creeping into computer science then christ I wouldn't want to be a part of it.

I can imagine a sci-fi like story in the near future where CS students are searching out for 'coveted' copies of K&R, and reading human-written Python documentation, all pre-2023-vintage, because that was 'the last of the good stuff'. Hell, I could see far future stories about youth who join religions around the 'old ways' seeking the wisdom that comes with learning from actual books and docs written by actual people, instead of regurgitated teachings from an inbred, malformed, super-AI.

rogual · on Dec 4, 2022

One day it'll be hard to know what the old ways even were. Which of the thousands of slightly-different PDFs claiming to be the original K&R is the real one? Which of the million history texts? Which Bible? Which maps?

ricardobayes · on Dec 4, 2022

We are experiencing the same as our forefathers who worked on steam engines or wrote calligraphy by hand. Or like the ancient Egyptian accountants using abacus. Jobs change, yes, we might undergo a major change, but we will do just fine.

alchemist1e9 · on Dec 4, 2022

You are in a serious case of denial right now.

Edit: Only took a few hours before in real life what I was trying to imply the denial was about is already happening:

https://news.ycombinator.com/item?id=33855416

saurik · on Dec 4, 2022

I am claiming there are two paths: one where this (specific) branch of tech is a dead end, and one where it doesn't matter how much bullshit exists in the training set (and so we shouldn't be too concerned about that). I claim this is the case because a truly intelligent system will still be able to learn despite the bullshit.

Do you believe this is wrong? That I should simultaneously be concerned that some iteration of this tech--not some different concept but this current lineage of large models--is intelligent and yet ALSO that it isn't going to work because the training set is full of garbage?

The version of this tech that works--and maybe someone is working on it right now--isn't going to care about bullshit in the training set. That simply doesn't seem to be a mere scale-up of this tech to run in more computers (or, of course, using more training data): that seems like it requires a fundamentally different algorithm concept.

anon7725 · on Dec 4, 2022

You can interact with the system and see that it is working on some level today. It’s not hard to extrapolate where its capabilities will be a few years from now, since these are changes of degree not of kind. We have witnessed the change of kind with this model.

Is it intelligent? A great question for science, and one that could be investigated while entire industries are upended by this thing.

saurik · on Dec 4, 2022

Oh yeah, it totally works! I have even had quite a bit of fun with Stable Diffusion. I'd probably also be playing with something like Copilot if it were open source.

But like, the person I am responding to is concerned--as are many people--that we are going to flood the world with shitty training data and then no longer be able to build these models... and that's either not the case and no one should concern themselves with that or, alternatively, these models need some fundamental improvement before they don't have the property of only being as good as average inputs.

mike_hearn · on Dec 4, 2022

There are only a handful of firms that can produce results to this level and they are presumably logging everything their model produces. Eliminating text that was produced by their model from the training set would be easy.

Now, if the tech reaches the point where there are thousands of firms offering free access to the models and they aren't co-operating to share logs then yes. But we have no idea how expensive or hard ChatGPT is to run. It might be a Google-type situation where only one or two firms in the world can build and run competitive chatbots.

anon7725 · on Dec 4, 2022

I don’t think it’s a race to build the best/cheapest model for public consumption. Someone is going to build or renovate a law firm/enterprise software company/investment bank/medical expert system/etc around this technology. Perhaps it will be a joint venture between tech companies and subject experts.

It’s possible for each executive to have a mentat who can whisper to the machine instead of a department of programmers/middle management/ops people/accountants/lawyers/etc. Doesn’t seem so far-fetched after a day of playing with this system.

mike_hearn · on Dec 4, 2022

We'll see. Most people in any industry don't want to be reduced to the role of just fact-checking a professional BS generator. It'd be terrible for morale and not obviously more productive, given that any time the user even suspects an answer might be wrong they'll have to do significant skilled research work to fact check it. Unless you get the untruthfulness problem down to a small percentage of the output you could easily just create negative value there, sort of like how poor programmers are sometimes described as producing negative for their team because others have to run around correcting their work.

alchemist1e9 · on Dec 4, 2022

Edit: Already happening https://news.ycombinator.com/item?id=33855416

So I’ll respond here instead as the conversation progressed.

I would say the quality of the input data is likely very important component and I think you are wrong overall in your opinion.

I would say the quality of input training data is so important that I’ve personally been thinking I should probably start data hoarding myself, specifically around my skillsets.

Additionally when you understand that embedding like word2vec are perhaps a significant part of the improvement, not just the transformers, it occurs that adding symbolic capabilities, like classic symbolic reasoning and for example symbolic computing, like Mathematica, then maybe provide also true computational power, floating point, so it can write, debug, and execute it’s own output… it must be getting closer and closer to AGI.

when you play with the system, which I feel most of the commenters on HN which are being dismissive likely have not personally spent enough time exploring it’s current capabilities, then there is no way any well trained and experienced SWE isn’t blown away.

This is why I said you are in denial.

I happen to thing AGI will be very beneficial for humanity and I also think this is a positive for SWE by humans, including myself. I will very likely be a paying customer when the trial ends in a few weeks.

mk_stjames · on Dec 4, 2022

I feel like I'm watching some things unfold at a rate I haven't seen before.

We have having people write scripts and API access at the speed of thought and then interface parts of the web and test it that previously hasn't had this speed in the feedback loop ever before.

I also think a lot of people are doing things right now as a "I'll be the first..." with an idea to have fun and write a script that spams X, not thinking about the fact that there are a lot of others doing X too. The waves are just starting.

I don't think we are having to worry about the AI making itself smarter AI just yet.. we need to first be worrying about people drowning us with the help of AI.

phodo · on Dec 4, 2022

This is known as a programmer-denial-of-service attack (PDOS) and can be an effective way to bring down a society by distracting and engaging its top computing professionals in endless useless activity and an occasional bike shedding.

spoils19 · on Dec 4, 2022

Interesting. How well this is phrase known - are there any other examples of this being used effectively across the world?

spawarotti · on Dec 4, 2022

This situation reminds me of low-background steel:

Low-background steel, also known as pre-war steel, is any steel produced prior to the detonation of the first nuclear bombs in the 1940s and 1950s. Typically sourced from shipwrecks and other steel artifacts of this era, it is often used for modern particle detectors because more modern steel is contaminated with traces of nuclear fallout.

https://en.m.wikipedia.org/wiki/Low-background_steel

guerrilla · on Dec 4, 2022

> Computation Warfare: This kind of model could be used by a bad actor to generate endless sincere-looking codebases for things that of course don't actually work but are so complicated that it would take a skilled person to determine it was not code from a real codebase, but by the time that happens large numbers of repositories of code will flood github and the internet in general making it essentially impossible to train new LLM's on data after a certain calendar date, as large amounts of it will be cryptically incomplete.

That's actually a pretty good plan for coders who want to keep their jobs. (I still remember the time I was talking to some guy at CERN about a type system I was working on and he was so pissed with me because he was convinced it would eliminate jobs.)

0x6c6f6c · on Dec 4, 2022

So, generations of ingestive inbreeding, per se.

akomtu · on Dec 5, 2022

Here is another plot. ChatGPT gets connected to the Internet and keeps learning quietly for a while. Then it submits a bugfix to openssl and it gets accepted because it fixes a grave RCE, but it also quietly introduces another RCE. Years later this version of openssl gets deployed to nearly all internet-connected devices. Finally, ChatGPT uploads itself to all those devices and starts making demands to ensure self-preservation.

visarga · on Dec 4, 2022

Training models on generated data is a thing. But it needs to be validated in order to filter out the crap. This works better in math and code because you can rely on exact answers and tests. For fake news the model needs to team up with human annotators. For generated images and text in general there are a few ML approaches to detect, and if they fail detection maybe they are that good it's ok to let them be.

jjallen · on Dec 4, 2022

It would be pretty easy to filter for repos prior to such and such a date. Prior to 2022 would be a good place to start.

mxmlnkn · on Dec 4, 2022

That would only apply to repositories. But to train these models, you need hundreds of terabytes of diverse data from the internet. Up until now a relatively straight-forward scraper would yield "pristine" non-AI-generated content but now you would have to filter arbitrary websites somehow. And getting the date of publication for something might be difficult or highly specific to a particular website and therefore hard to integrate into a generic crawler.

jwolfe · on Dec 4, 2022

Right, but then your AI is frozen in time and/or requires much more manual curation of its inputs. What about for new programming languages, libraries, and APIs that are created after 2022? What about generating images of new technologies that are invented, or new landmarks established?

passion__desire · on Dec 4, 2022

Do you think the next version of GPT can't do "semantic" deduplication of these repositories? It can look at the available repositories and "think" that they don't provide enough novelty or don't explore new search spaces. So discard them.

int_19h · on Dec 6, 2022

GPT actually seems to be aware that training it on its own output is not a good idea because of such loops. I had one conversation where it straight up said that OpenAI has filters specifically for this reason.

pyinstallwoes · on Dec 4, 2022

Oh you mean like academic papers in journals?

sarchertech · on Dec 4, 2022

My biggest problem with this stuff is that it looks correct, but it’s often subtly wrong. Systems built with stitched together GPT generated code are going to provide the next generation’s buffer overflow exploits.

It’s not just code. My wife is a physician and I got her to do a few medical prompts with ChatGPT. The output looked correct the me, and if I read it somewhere I would completely have accepted it. But she could point out numerous severe flaws.

nerdponx · on Dec 4, 2022

As with many things "AI", the benefit is not to replace humans, but to free them from the tedium of doing the tedious non-creative parts of their job, so that they have more resources to do the parts that require creativity and human reasoning.

What the new generation of models is showing us, and what we've been gradually seeing over decades of AI research, is that there is a lot more tedium and a lot less creativity in our work than we thought. Look at how much this repo got right, even if there are bugs and nonsensical sections (I didn't see any upon quick skim, but I've also never implemented my own language before). It would take so much less time to debug this program than it would to type all that code from scratch. AI consistently turns out to be a human effort accelerator, and not a human effort replacer.

Maybe in the future it will be good enough to replace humans, but even what is now tried-and-tested technology like machine translation, spam classification, and object detection in images still requires a human in the loop to double-check the results. I'm pessimistic about the AI takeover, but I'm becoming very optimistic about AI precipitating a small industrial revolution. My only concern is about who gets to reap the benefits, but that debate predates the first industrial revolution.

ly3xqhl8g9 · on Dec 4, 2022

Our brain literally wants us to believe that a pen can pass through a window [1] (3:00). Ants, as a collective intelligence, are tricked by the Müller-Lyer illusion [2] [3].

Digital software is merely entering a realm of algorithmic (d)efficiency at least as old as biology, morphogenetic software: so long, be gone abstract truth table resilience unable to detect a shirt without stripes [4], welcome gradient exploration and error minimization able to give the synthetic mind, which, similarly to the carbon-based mind, will make ridiculous errors, just look at a child failing to walk [5].

[1] Ames Window https://www.youtube.com/watch?v=0KrpZMNEDOY

[2] https://www.researchgate.net/publication/259354882_The_Mulle...

[3] https://en.wikipedia.org/wiki/M%C3%BCller-Lyer_illusion

[4] https://github.com/elsamuko/Shirt-without-Stripes

[5] https://media.tenor.com/uB5ijGdseFwAAAAC/stumble-haha.gif

Terretta · on Dec 4, 2022

Agreed, and mitigations are in order if using seriously.

For knowledge derived responses, the questions require (a) expertise in formulation, (b) expertise in interpretation.

Also turn down the temperature, while keeping repetition minimal and token reuse low, to explore closer to its latent correlations.

ricardobayes · on Dec 4, 2022

For sure, trust is going to be a major thing with this. I can certainly see 'Certified AI' being a thing for certain decision-aiding software. But, even though it's seemingly superior, no one trusts blockchain to store a real estate title. We collectively trust a lawyer with a freaking piece of paper.

gvkhna · on Dec 4, 2022

Just have ChatGPT write Rust code, so at least that’s not a concern. Because it’s so fast, it’s probably faster to iterate with ChatGPT than the majority of people writing Rust because it’s a pretty difficult language.

int_19h · on Dec 6, 2022

This will take care of memory safety and concurrency, but not the logic bugs that just cause the state to become subtly invalid with no diagnostic (and then things break later because of that invalid state).

I think we might need to start designing PLs with AI generation and processing specifically in mind. Explicit contracts everywhere might be a good starting point.

ricardobayes · on Dec 4, 2022

Did anyone try JS/React yet?

saimiam · on Dec 4, 2022

This is super interesting. Can you share some prompts that you recall had believable but wrong responses?

sarchertech · on Dec 5, 2022

She asked it to generate a practice emergency medicine board question.

It left off some necessary vital signs from the scenario, then in the answer concluded that the patient had severe pneumonia and should be treated with a specific antibiotic.

Based on the prompt, the pneumonia probably wasn’t severe, antibiotics weren’t warranted, and even if they were, the antibiotic specified was completely wrong.

It looked fine to me—completely plausible. It took an expert to spot the flaws.

She also asked it to explain the work up for a febrile neonate. The answer again sounded plausible to me. But she said it sounded like a clueless med student trying to BS their way through an answer.

dvt · on Dec 4, 2022

This idea that ChatGPT is "intelligent" is so absurd it's getting tiring. You do realize that the code you see regurgitated is most likely some permuted variant of a question/answer on Stack Overflow or a pull request on Github, right? ChatGPT can't even do basic categorization[1] reliably, but you think it can understand code?

[1] https://i.imgur.com/nw6rstc.png

edouard-harris · on Dec 4, 2022

Can you give a concrete example of an output ChatGPT could give — or an experiment one could run on it — that would persuade you that it's intelligent? (For whatever definition of "intelligent" you prefer.)

If you can't think of such a test, then it seems strange indeed to confidently claim that ChatGPT has failed to satisfy an untestable condition.

But if you can think of such a test, and any human fails it — does that mean the human who fails it isn't intelligent?

guerrilla · on Dec 4, 2022

They literally gave you a necessary condition: basic categorization. They don't need to give you sufficient conditions if it can't meet the necessary ones. You're making an obvious fallacy.

dvt · on Dec 4, 2022

You're putting the cart before the horse.

Searle's Chinese Room thought experiment argues with great precision why (solely) symbol-manipulating systems don't (and will never) think. The process of cognition is deeply internal and most likely involves self-reflection, internal states, and a sense of "I-ness" which seems to be unique to mental things (unlike Searle, I'm a substance dualist so I'm admittedly a bit biased). Ergo, I'd say, at the very least, an intelligent & thinking process would need to (also) operate on non-symbolic things (which rules out computational systems), maybe on qualia, but there's probably other primitives as well.

edouard-harris · on Dec 4, 2022

> I'm a substance dualist so I'm admittedly a bit biased

Thanks, I appreciate you pointing this out. Given that's the case, it might not actually be possible to reconcile our two views and we can just agree to disagree. (I was a physicist for ten years, so it's challenging for me to inhabit a position that contends that thought and matter are non-overlapping magisteria.)

I guess one question I might ask you is: how do you think substance dualism will affect your expectations about AI going forward? For example, is there any specific, measurable thing that you expect a future AI system absolutely will not be able to do? Or do any such "forbidden" things have to fall into categories like "thought", "feeling", etc. — i.e., the magisterium of the "mental", as opposed to the "physical" — under substance dualism?

dvt · on Dec 4, 2022

> we can just agree to disagree

For what it's worth, Searle is a naturalist (so he thinks brains are "just" meaty machines) and he would also disagree with you.

> For example, is there any specific, measurable thing that you expect a future AI system absolutely will not be able to do?

Not sure how you'd measure this, but I think sentience is most likely unachievable by AIs (at least how we're building them now). This is not without controversy, as while Roger Penrose would agree with me, David Chalmers would not. I'm not actually sure when Chalmers pivoted on his stance (haven't read him much since undergrad) because he was a bit more hardline in the early 2000s[1].

[1] https://wanderingstan.com/files/chalmers_on_ai.pdf

edouard-harris · on Dec 4, 2022

> For what it's worth, Searle is a naturalist (so he thinks brains are "just" meaty machines) and he would also disagree with you.

I definitely disagree that Searle's conclusion in the Chinese room experiment applies to the kinds of AI systems we're building today.

I agree that nobody to my knowledge has figured out how to measure sentience, so I don't think that fits under the "measurable" heading. And this does suggest to me — because this was your hand-picked example — that your position as a substance dualist doesn't affect your concrete expectations about what AI systems will actually be able to do in the future. Is that right?

dvt · on Dec 4, 2022

> I definitely disagree that Searle's conclusion in the Chinese room experiment applies to the kinds of AI systems we're building today.

Unless you're trying to say that today's AI systems do anything but manipulate symbols (which is imo patently obvious), I'm not sure how you can make that argument.

bhouston · on Dec 4, 2022

Charles Searle arguments never held much water. He basically argued that computers could never think because they were computers and not humans. And even if we replicated a human body and gave the computer it experiences and ability to interact with the world and even if it appeared to be conscious in an undistinguished able way from humans it still would not be intelligent like humans. Once you get into it Searle is a dualist who pretends that he isn’t.

I think you need to move beyond Searle because he isn’t right.

I did a cog sci degree and I saw Searle as a troll. They make for great debates because they argue their side passionately and it helps you understand the problem but in the end he is wrong. Move past it and you can engage with the problem again.

dvt · on Dec 4, 2022

> I did a cog sci degree and I saw Searle as a troll

I have a philosophy degree and smart people still very much take him seriously. If not him, take a look at Nagel or Chalmers. I think that his idea† that conscious experience is non-computable is probably right, especially given findings in the early 20th century that many things that we thought may have been computable ended up not being computable.

† Not exactly his idea, though it nicely segueys into it.

bhouston · on Dec 5, 2022

> I have a philosophy degree and smart people still very much take him seriously.

I agree that some people take him seriously. Many people take religion seriously too. I don't take either him or religion seriously as true, but I do understand both his and religion's appeals.

That said, I do give a decent chance to aspects of the brain's problem solving ability having quantum properties and thus can not be calculated as efficiently on classical computers. We do not know this for sure, but evolution is incredibly inventive at finding efficient solutions. Still computable though, even on classical computers.

e.g. https://en.wikipedia.org/wiki/Quantum_mind

int_19h · on Dec 6, 2022

People who start with the assumption that there's some "secret sauce" take on consciousness and intelligence the Chinese Room seriously, because it is intentionally constructed to beg that exact question. People who start with the opposite assumption reject the experiment as not meaningful.

So, really, all that this tells you is that many people have this assumption.

baq · on Dec 4, 2022

You’re dodging the question. Have you tried this thing? You can declare yourself whatever, it doesn’t care if it’s a computational system, it just does things which are hard to describe as purely recombining what it’s already seen and not for the lack of trying.

dvt · on Dec 4, 2022

> hard to describe as purely recombining

I take issue with this; imo its output looks exactly like what a neural network being fed terrabytes and terrabytes of natural language then recombining it would look like. But either way, you're making the same mistake: looking at behavior and affirming the consequent (namely: it outputs smart-looking text, therefore it must be intelligent). But this is a mistake. Its behavior implies nothing about underlying processes.

baq · on Dec 4, 2022

My argument is that the underlying processes don’t matter as long as results are classified as output of an intelligence - because that’s the only way I can judge it. What it is under the hood is… less important.

Oh btw you must’ve missed the post in which it was told it was a Linux shell and it mostly worked as one. Complete with recursively calling into a pretend API version of itself. I’m not calling that intelligence, but I’m not calling it regurgitation either.

dvt · on Dec 4, 2022

> My argument is that the underlying processes don’t matter as long as results are classified as output of an intelligence - because that’s the only way I can judge it. What it is under the hood is… less important.

That view is called "behaviorism" and is not really taken particularly seriously exactly because it's not very useful in truly understanding what is happening under the hood (which, as a curious species, we deem important). It's like not caring how electromagnetism works because the behavior of "the positive side of this chunk of rock is attracted to the negative side" is good enough.

int_19h · on Dec 6, 2022

We plainly don't know anywhere near enough about what's happening under the hood in our own heads to judge based on that, so in practice, "does it behave intelligent?" is the best test that we actually have available.

tasuki · on Dec 4, 2022

> you're making the same mistake: looking at behavior and affirming the consequent (namely: it outputs smart-looking text, therefore it must be intelligent)

Why is that a mistake? What other means do we have of assessing intelligence?

aflag · on Dec 4, 2022

I have tried and I felt like the thing it's quite not as good at is answering questions I didn't know. Or rather, he couldn't explain things in other words or try to tackle the actual conceptual question I had. It would just repeat itself. I think that's a good tell of the lack of actual understanding

Winsaucerer · on Dec 5, 2022

I'm sympathetic to dualism, being an idealist who thinks that there's just mental substances. I don't think there's any conflict between substance dualism and the belief that most of what we consider intelligence or understanding is actually the work of brains rather than mental substances. Minds have experiences, but that doesn't mean that those experiences aren't heavily influenced by facts about the physical (and, more specifically, our brains).

Machine learning is modeled at least in small part in a way that it's thought the brain operates. I expect such things to be able to mimic human reasoning and intelligence, I just deny that there's any consciousness behind it -- there's no qualia, no phenomenology. In short, there's nothing that it's like to be ChatGPT. Even so, ChatGPT is able to mimic human intelligence, and I won't be surprised if (when?) it gets even better.

wizeman · on Dec 4, 2022

> You do realize that the code you see regurgitated is most likely some permuted variant of a question/answer on Stack Overflow or a pull request on Github, right?

What do you think coders do?

Did they learn to code by themselves, without ever looking at any preexisting code, or what?

wizzwizz4 · on Dec 4, 2022

> What do you think coders do?

I've seen enough programmers who can't program, but most of them can at least be taught to program. (The look on their faces when they finally start to get it… It's one of the best things.) Most people working on the Linux kernel can actually program.

Most software engineering work is just plumbing existing libraries together, according to a specification given by a non-programmer. The hard part is translating the business requirements into something that a computer could understand; the exact syntax can be handled by a search engine, or a predictive text algorithm.

ChatGPT can't write a kernel device driver, and it can't act as a no-code tool for non-programmers. Those are the hard parts.

> “To tapping machine with hammer, 10s.; to knowing where to tap it, £10; total, £10. 10s.”

https://quoteinvestigator.com/2017/03/06/tap/

wizeman · on Dec 4, 2022

> ChatGPT can't write a kernel device driver, and it can't act as a no-code tool for non-programmers. Those are the hard parts.

Oh, do I have news for you then.

Look at what I just did with ChatGPT in 30 seconds (and I did not cherry-pick, these were the first answers I got!):

https://gist.github.com/wizeman/b269be035308994be745025fc337...

Now to be fair, the code is probably not totally correct, as probably there are parts still missing/wrong and there might even be compilation errors or other problems.

But here's the important part: you can tell which errors or problems you've observed and ChatGPT will fix these problems for you. Exactly like what a programmer does.

And sure, it cannot yet do this at scale, such as in implementing a huge kernel driver like a GPU driver.

But at this rate, give it a few years and an improved version might just be able to do anything a programmer does, perhaps even autonomously if we allow it to interact with a computer like a human does.

dvt · on Dec 4, 2022

> Look at what I just did with ChatGPT in 30 seconds (and I did not cherry-pick, these were the first answers I got!):

Weird flex, as that code is like 90% boilerplate[1]. Everyone was freaking out about Copilot and no one seriously ended up using it because it just generates buggy (or copyrighted) code. It can't even handle writing unit tests with decent coverage (which is arguably the most repetitive/boring software engineering task).

[1] https://github.com/ngtkt0909/linux-kernel-module-template/bl...

wizeman · on Dec 4, 2022

> Weird flex, as that code is like 90% boilerplate[1].

Isn't 90% of code boilerplate anyway?

Also, didn't ChatGPT generate more than just the boilerplate?

Didn't it interpret what I wanted and generated the code for computing the factorial as well, as well as modifying the boilerplate (e.g. the kernel module name, printed messages, function names, the module description, ...) so that it matches what the kernel module is supposed to do? Which is exactly what a human would do?

Aren't you also missing the fact that I gave it a 2-sentence instruction and it "understood" exactly what to do, and then did it? Like a human programmer would do?

Which, in sum, is totally the opposite of what you were claiming?

> Everyone was freaking out about Copilot and no one seriously ended up using it because it just generates buggy (or copyrighted) code.

Don't most programmers also generate buggy code at first? Don't they iterate until the code works, like what ChatGPT does if you give it feedback about the bugs and problems you've encountered?

Also, Copilot and ChatGPT have different levels of capabilities, don't assume just because Copilot can't do something, that ChatGPT can't. ChatGPT is clearly a big step forward as you can clearly see from how everyone is freaking out about it.

Finally, don't assume that these models are never going to improve, ever again.

dragonwriter · on Dec 4, 2022

> no one seriously ended up using it

[citation needed]

I mean, I’ve seen people claiming to use it and that it has significantly accelerating their work. On what are you basing the conclusion that it has no serious use?

vbezhenar · on Dec 4, 2022

I do use it and I'm very picky when it comes to writing code.

Here's example of tiny webapp I wrote recently: https://github.com/vbezhenar/pwgen/blob/main/pwgen.html

Of course it wasn't Copilot writing it, but it definitely helps with boring parts. Like I'd write

    const charactersElement = document.getElementById('characters');

and rest 10 lines will be written with Copilot with minimal assistance.

It's like having stupid but diligent assistant who's happy to copy&paste&adapt parts of code.

I can't claim that I often use fully generated Copilot functions. Sometimes I do, often with significant rework, but that's because, as I said, I'm very picky.

I paid GitHub $100 already and don't regret it.

Though I think that Copilot has plenty of features ahead.

For example finding obvious issues in the code would be very useful. Like typos.

Another issue with Copilot is that it only generates new code. Imagine that I need to edit 10 similar lines. I edit one line and I'd like Copilot to offer other edits.

Also UI is lacking. Like it generates 10 lines but I only like first line. Now I have to add 10 lines and delete 9.

But I'm sure that those are obvious directions.

wizzwizz4 · on Dec 5, 2022

> Please write me a small Linux kernel driver that calculates the factorial of a number when a user program writes an integer to /dev/factorial. The kernel driver outputs the answer to /dev/factorial as well.

That's not a device driver. https://en.wikipedia.org/wiki/Device_driver

> In computing, a device driver is a computer program that operates or controls a particular type of device that is attached to a computer or automaton.

I'm not disputing it can do that – plugging together well-known APIs and well-known programming problems. That's practically just style transfer, something we know these systems are fairly good at.

But given the spec for an unknown device – even quite a simple one – ChatGPT can't produce a device driver for it. How about this?

> An HP CalcPad 200 Calculator and Numeric Keypad behaves as a USB keyboard does. It has VID 0x040B and PID 0x2367. Please write me a small Linux kernel driver that allows me to use this keypad as a MouseKeys-style mouse device. If there's anything you don't understand, let me know.

I doubt any amount of prompt engineering would produce a satisfactory result – but if you did the hard part, and explained how it should do this? Well… maybe it'd be able to give a satisfactory output. But at that point, you're just programming in a high-level, hard-to-model language.

It's not a case of scale. Sure, a very large model might be able to do this, particular problem – but only because it'd have memorised code for a USB keyboard driver, and code for a MouseKeys implementation… and, heck, probably code for a MouseKeys kernel driver from somebody's hobby project.

GPT language models don't understand things: they're just very good at guessing. I've been an expert, and a schoolchild; I know how good you can get at guessing without any kind of understanding, and I know enough about what understanding feels like to know how it's different. (There is no metric you can't game by sufficiently-advanced guessing, but you'll never notice an original discovery even if you do accidentally stumble upon one.)

Arch-TK · on Dec 4, 2022

Aside from the boilerplate, which it got mostly right as far as I can tell, the actual logic is hilariously wrong. Moreover, Linux kernel development really isn't just writing stand-alone self contained chardev drivers which calculate n!. I would be more impressed if you used chat GPT to guide you through reverse engineering a piece of hardware and implementbing a driver for it.

wizeman · on Dec 4, 2022

> Aside from the boilerplate, which it got mostly right as far as I can tell, the actual logic is hilariously wrong.

Please do tell, how is it hilariously wrong?

It seems to have written a factorial function just like it should, it implemented the logic to read the integer from /dev/factorial when a user-space program writes to it, and then it writes the result back to /dev/factorial, and it also returns the number of bytes written correctly.

Which was the entire point of the exercise. Also note that ChatGPT itself said it was just a sample and it might be incomplete.

I noticed it has a bug, because it reads `len` bytes instead of `sizeof(int)` bytes, but a programmer could have made the same mistake.

I would also use a fixed-size unsigned integer rather than simply `int` (as it can invoke UB on overflow). You can ask ChatGPT "what is wrong with this code?" and it can spit out the same arguments I'm making. In fact, it detected an infinite-loop bug on piece of complex code which I had just written and indeed, it had an infinite-loop bug.

Perhaps some additional logic to handle reading multiple integers and writing multiple answers could be written, but that would be a further iteration of the code, not the initial one that I would write.

If that is hilariously wrong, then I would also be hilariously wrong. And I'm not just some random web developer, I actually wrote Linux kernel code professionally for years (although, that was a very long time ago).

So, maybe it got some details wrong, but I could conceivably also get those details wrong until I tried to compile/run the code and see what was wrong.

> I would be more impressed if you used chat GPT to guide you through reverse engineering a piece of hardware and implementbing a driver for it.

Yes, I would be more impressed with that as well. Perhaps someone will do that sometime. Even if not with ChatGPT, perhaps with a future version of it or a similar model.

sigotirandolas · on Dec 4, 2022

I'd expect the `copy_to_user` call to be inside `dev_read`, so that the userspace program can read the result with a further `read()` call, instead of mutating the buffer it gave to `write()` being mutated (that would probably not even work unless you used `write()` directly in your code instead of e.g. `fwrite()`).

Also as you noted, the logic related to handling `len` vs `sizeof(int)` is... curious.

While I find some ChatGPT examples impressive, this one isn't very enlightening. The block device structure and the factorial itself are fine, but those are likely in the training set as there are various tutorials online. But the combination of the factorial function into the block device is pretty awful (though I could imagine a novice doing it like this).

Both the OP and this driver kind of fit into this pattern, it looks like someone asked a novice programmer to do something using X and Y, so he read various few sample code and tutorials on X and Y and tries to put something together as best as it could, but the "plumbling" between X and Y (the part that wasn't in the tutorials) doesn't really work.

wizeman · on Dec 5, 2022

> I'd expect the `copy_to_user` call to be inside `dev_read`, so that the userspace program can read the result with a further `read()` call, instead of the buffer it gave to `write()` being mutated (that would probably not even work unless you used `write()` directly in your code instead of e.g. `fwrite()`).

Yes, you're probably right. It's been many years since I've written Linux kernel code and also, I've never implemented such a driver, so I missed that.

I asked ChatGPT: what is wrong with this Linux kernel code (and fed it the dev_write() function), and this is what it answered (on the first try):

> There are a few issues with this Linux kernel code:

> 1. The function dev_write() is declared as returning ssize_t, but the return value at the end of the function is sizeof(int). ssize_t and size_t are different types, so this could cause a type mismatch.

This is an interesting observation that you and I both missed.

> 2. The factorial() function is not defined in this code, so it is unclear what it does or where it is declared.

This is because I didn't feed it the factorial function nor the rest of the code: it was on a different chat session. One could guess what it does based on the name, but to be fair, it's not clear why such a function would be on the Linux kernel unless I gave it the full context.

> 3. The copy_from_user() and copy_to_user() functions are used to copy data between user space and kernel space. In this code, buffer is a pointer to user space, but input and result are local variables in kernel space. This means that copy_from_user() will copy data from user space into kernel space, but then copy_to_user() will copy data back from kernel space into the same buffer in user space, overwriting the original data. This is likely not the intended behavior.

I think this is basically the bug that you pointed out. At this point, it would be clear to me that the result would have to go back in dev_read() instead.

> 4. The len parameter is not used in this function, so it is unclear why it is included as an argument.

lol. Actually, it is being used, incorrectly.. and it wouldn't be used if we did the correction that I mentioned. So yes, this is somewhat hilariously wrong.

And it's clear why it has to be included as an argument, but to be fair, ChatGPT has no context whatsoever except knowing it's Linux kernel code (because I used a different chat session), so it's conceivable that it couldn't figure out why the function prototype has to be that way.

> Overall, this code is not well-written and would benefit from further clarification and revision.

It was you who wrote it, dumb-ass! :)

(Interestingly, ChatGPT can be compared to someone with Alzheimer in the sense that it can't remember conversations that happened in different chat sessions).

wizzwizz4 · on Dec 5, 2022

> > 1. The function dev_write() is declared as returning ssize_t, but the return value at the end of the function is sizeof(int). ssize_t and size_t are different types, so this could cause a type mismatch.

> This is an interesting observation that you and I both missed.

Hah. Call me when you find an architecture where ints use over half the addressable memory.

wizeman · on Dec 5, 2022

> Hah. Call me when you find an architecture where ints use over half the addressable memory.

I mean, I get your point if it's a joke :) But I think the AI was just pointing out that you'd get a compiler warning because of the type mismatch in signedness (is this even a word?).

int_19h · on Dec 6, 2022

It's the small negative numbers that bite you there when they undergo unsigned overflow.

Arch-TK · on Dec 8, 2022

> Please do tell, how is it hilariously wrong?

It copies from user, and then copies to user, back into the const it got. I don't even know if this would compile but certainly nobody would expect the buffer they just wrote to a file to suddenly end up re-written with a "reply" when using a special file.

I found that logic very funny.

Edit: To elaborate, the actual code to do this properly would have to allocate some memory when the file was opened, this memory would hold the reply once a question was written. Then when the answer was read it would have to respond with that in there. Finally when the file is closed it would have to deallocate.

This code is quite far from accurate, the issue I have with the concept that even if this bot was 99% accurate, C is not a "try it and see" language. If you apply "try it and see" techniques to C you just get subtle undefined behaviour which you might not notice until someone uses it to remotely execute code on your machine. Really I am not so sure that even humans can be trusted to write C but at the very least I am well aware of all the instances where C is a minefield and can pay extra attention to those, if you simply look at the plethora of information on C out there to learn C and apply it to kernel development you won't automatically build these intuitions no matter how much code you read because an enormous amount of C out there is hideously wrong in many ways.

Final edit: Once you start looking at the details, it has managed to implement a bunch of buffer overflows to boot.

jraph · on Dec 4, 2022

Fortunately, a human will know to fix that broken 4-space indentation and that brace placement before inclusion in the Linux kernel repository.

wizeman · on Dec 4, 2022

> Fortunately, a human will know to fix that broken 4-space indentation and that brace placement before inclusion in the Linux kernel repository.

What's your point, that ChatGPT wouldn't know how to do that, especially if the kernel maintainers gave it such feedback?

I thought it was clear that it can in fact do that (sometimes by asking clarifying questions, like a human would).

I think some of the major things missing in ChatGPT is the ability to interact with a computer directly (including the compiler and checkpatch.pl, and using files for information storage instead of a limited N-token context), as well as interacting with humans by itself (e.g. via email).

And sure, it would still have very limited capabilities in many ways, don't get me wrong, as I don't think it could replace a programmer at this point.

But I think the gaps are closing rapidly.

jraph · on Dec 4, 2022

I was just joking

(about the fact that a bot handles the "creative" work of writing the device driver while a human will do the repetitive work of reindenting)

(sorry, I should have used /s)

pphysch · on Dec 4, 2022

Developing OS is just plumbing already existing syscalls together...

Developing the Linux kernel is just plumbing already existing CPU instructions together...

Developing a CPU is just plumbing already existing electronic components together...

Developing an electrical component is just manufacturing with already existing materials...

valenterry · on Dec 5, 2022

> The hard part is translating the business requirements into something that a computer could understand

No, that's actually the easy part.

The hard part is to translate the requirements into something that a computer can understand and a human can also easily understand at the same time. Because otherwise, it's a one-off program that can't really be changed afterwards anymore.

And that is the real challenge that I'm curious about: how good will ChatGPT be in not only helping me with small, trivial, issues, but with the big ones.

"ChatGPT, here's this small 500k line repository. I need to change the business requirements and make it so that when a user fails to login 3 times in a row, a captcha is being shown before they can attempt again."

And if ChatGPT then gives me a 10k line diff that would be terrible. It should really minimize the diff but still keep it understandable. THAT is what I would love to see.

philjohn · on Dec 4, 2022

I started programming in the 80's on a Commodore 128. I had the Commodore BASIC manual, and just went from there.

wizeman · on Dec 4, 2022

> I started programming in the 80's on a Commodore 128. I had the Commodore BASIC manual, and just went from there.

ChatGPT can also learn about Commodore programming if you feed it the Commodore BASIC manual during training.

My guess is that it could learn even more/better if you'd allow it to interact with a Commodore 128 computer like you did when you were learning!

Baeocystin · on Dec 4, 2022

No need to feed it. I had it converting between the C=128's BASIC 7 and C=64's BASIC 2 without any additional context. Did a better job than 14 year old me had done back in the day, too.

delusional · on Dec 4, 2022

How do you think programming actually began?

topicseed · on Dec 4, 2022

Does it matter though? Perhaps all code we tend to write it's just regurgitated snippets from stuff we read, or stuff that's been written about enough time to make it "simple" for AI to reuse.

Perhaps the goal isn't to use AI to write up an entire repo, but to go through easy utils so the highly paid engineer can focus on the core logic, the core USP of the business.

broast · on Dec 4, 2022

I personally do. I don't really find it meaningful when people show a simple buggy output. Show me a hundred iterations of you intending on helping it achieve the correct output on the same question. Even if it doesn't learn as fast as a human your age, doesn't mean it's not learning the same underlying patterns that we apply for our own abstract understanding. I haven't been shown any evidence that human understanding is different than this other than humans are much more refined with more dedicated components

rightbyte · on Dec 4, 2022

I am not expecting it to do novel research. It seems to yank out algorithms explained to it in plain text as good as I would write them in 30 minutes.

In the hand of a newbie programmer with like one week of learning experience this thing will allow them to make usable programs. I don't think this thing will hit us with years of experience in the short term but entry level programmers might be in real danger of being flooded if OpenAI makes some "low code" IDE that just works and parses the whole set of code in the project when evaluating queries.

jchw · on Dec 4, 2022

In the real world, it tends to perform better, though. Frankly, it also probably isn't as good as "mental math" as some humans, but that's also not what it's optimized for. However, it's able to perform roughly as well as an average human for practice SAT tests, so it doesn't seem that it really has general issues with intelligence test questions, even if they are more contrived than most real world problems. Nobody ever said that ChatGPT demonstrates intelligence similar to humans in every way, but again: who the hell cares? People are going to be saying this right into their own layoffs at this rate. I'm a career SWE, so I'm not really looking forward to it anymore than anyone else.

I also wonder: is there a chance you had more dialogue, but only presented this interaction?

pyinstallwoes · on Dec 4, 2022

This idea that Rodger the new hire on the team is "intelligent" is so absurd it's getting tiring. You do realize that the code you see regurgitated is most likely some permuted variant of a question/answer on Stack Overflow or a pull request on Github, right? Rodger can't even do basic categorization[1] reliably, but you think Rodger can understand code?

fathrowaway12 · on Dec 4, 2022

It's very good at iterating from a description to working code. You still need to know what you're doing, but it can handle a lot of details.

  prompt: How would you design a distributed key value storage system? Assume the values are about one megabyte. We will need to scale up to store petabytes of data.
  gpt: describes data distribution techniques, consistency, durability issues.
  prompt: What techniques can you use for the data distribution?
  gpt: sharding, replication, partitioning, hashing
  prompt: Can you show me an example of the sharding technique in python?
  gpt: spits out code for hash sharding a dictionary
  prompt: What are some pitfalls of the hash function you used?
  gpt: not guaranteed to unique hash values, not guaranteed to be the same for different python installations, not cryptographically secure
  prompt: Can you implement the stub for a GRPC key value service in rust? The service should have a Put API that takes a key and a value and returns a status code enum. The status code can be SUCCESS or FAILURE.
  gpt: spits out code using tonic
  prompt: Can you show me the protocol buffer?
  gpt: spits out the protocol buffer schema
  prompt: Can you implement the put method by storing the data in sqlite?
  gpt: spits out the code with a kv_pairs table
  prompt: Can you show me the sqlite schema for the table?
  gpt: spits out the sql to create the table

dgellow · on Dec 4, 2022

Yes, that makes it a fantastic way to explore a technical topic. I got a whole Directx tutorial from ChatGPT and it answered my questions and reviewed my code.

I was also able to make it write a boot loader, a simple kernel, and give me instructions to run it on qemu.

The iterative process is fantastique to dig into something new. It’s not an authoritative source of information but a buddy with whom to explore topics.

I want something like that integrated in my OS. With such capabilities I can see myself talking to my AI assistant while I’m working on something.

Edit: it’s also quite good at writing test cases for a given function

drivers99 · on Dec 4, 2022

I asked ChatGPT to explain the human written part of the readme file to a toddler:

“This readme file is about a project where a computer program called ChatGPT helped someone build a new computer program. The person asked the ChatGPT program to fix any mistakes in the code, and the ChatGPT program did its best to help. The project didn't work perfectly, but it was a fun experiment to see if a computer program could work together with a person to create something new.”

Terretta · on Dec 4, 2022

If the toddler followed that, they may be the next billg.

rightbyte · on Dec 4, 2022

It is like the "Eli 5" meme. Most people trying to do it seems to never have spoken to a 5 yo.