I don't know about GPT4, but I'd bet GPT3.5 is pretty traditional and boring. Its power comes from a really good, properly curated dataset (including the RLHF).
GPT3.5 turbo is probably much more interesting, because they seem to have figured out how to make it much more efficient (some kind of distillation?).
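For anyone unfamiliar, distillation here would mean training a smaller "student" model to imitate the output distribution of the big "teacher" model. A minimal sketch in PyTorch, purely illustrative (nothing about turbo's training is public; the temperature and shapes are made-up placeholders):

    import torch
    import torch.nn.functional as F

    T = 2.0  # softening temperature (hypothetical value)

    def distill_loss(student_logits, teacher_logits):
        # Student learns to match the teacher's softened output distribution.
        soft_targets = F.softmax(teacher_logits / T, dim=-1)
        log_probs = F.log_softmax(student_logits / T, dim=-1)
        return F.kl_div(log_probs, soft_targets, reduction="batchmean") * (T * T)

    # Toy check with random logits (batch of 4, vocab of 10):
    print(distill_loss(torch.randn(4, 10), torch.randn(4, 10)))

    # In a real training loop the teacher would be frozen:
    #   with torch.no_grad():
    #       teacher_logits = teacher(batch)
    #   loss = distill_loss(student(batch), teacher_logits)
    #   loss.backward(); optimizer.step()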
GPT4, if I had to make a very rough guess: probably flash attention, 100% of the (useful) internet/books for its dataset, and highly optimized hyperparameters.
I'd say with GPT4 they probably reached the limit of how big the dataset can be, because they are already using all the data that exists. Thus for GPT5 they'll have to scale in other ways.
To be fair, if the opposite were true, it might not be wise to admit it. Saturating the available high-quality training data is one of the few ways anyone can see OpenAI slowing down.
1. They would already be using everything they can get.
2. They would easily be able to explain what they're not using, without giving away sensitive secrets.
I wonder if we saw the same video - or maybe it is just ChatGPT being "great" in the wild? I see one guy asking another guy simple questions and getting weasel words for an answer.
Right, that's totally it. I came away thinking the interviewer seemed way sharper than the interviewee, which is pretty rare. The sheer throughput and speed of interesting questions was incredible. Too bad many of the answers were not.
> I'd say with GPT4 they probably reached the limit of how big the dataset can be
I'm curious about this too, not just about the dataset size but also the model size. My hunch is that the rapid improvements from making the underlying model bigger/giving it more data will slow, and there'll be more focus on shrinking the models/other optimisations.
I don't think we're anywhere close to the limit of sheer hardware scalability on this. Returns are diminishing, but if GPT-4 (with its 8k+ context window) is any indication, even those diminishing returns are still very worthwhile.
If anything, I wonder if the actual limit that'll be hit first will be the global manufacturing capacity for relevant hardware. Check out the stock price of NVDA since last October.
According to financial reports, they are building a $225 million supercomputer for AI. What we can probably expect is the same dataset with even more compute run on it.
There is a soft limit due to the computation required; the currently used model architectures are quadratic with respect to context size, so if you want a ten times larger context, that's going to need a hundred times more effort.
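Back-of-the-envelope: vanilla self-attention scores every token against every other token, so the work grows with the square of the sequence length. A toy sketch (plain PyTorch, arbitrary numbers; GPT-4's actual architecture isn't public):

    import torch

    def attention_cost(seq_len, d_model=128):
        # Naive attention materializes a seq_len x seq_len score matrix.
        q = torch.randn(seq_len, d_model)
        k = torch.randn(seq_len, d_model)
        scores = q @ k.T                # shape: (seq_len, seq_len)
        return scores.numel()           # entries computed ~ seq_len^2

    print(attention_cost(1_000))    #   1,000,000 score entries
    print(attention_cost(10_000))   # 100,000,000 entries (~400 MB of floats)
                                    # 10x the context -> 100x the work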