AI's $600B Question (sequoiacap.com)
371 points by fh973 3 days ago | 521 comments





According to Jensen it takes about 8000 H100s running for 90 days to train a 1.8 Trillion param MoE GPT-4 scale model.

Meta has about 350,000 of these GPUs and a whole bunch of A100s. This means the ability to train 50 GPT-4 scale models every 90 days or 200 such models per year.

This level of overkill suggests to me that the core models will be commoditized to oblivion, making the actual profit margins from AI-centric companies close to 0, especially if Microsoft and Meta keep giving away these models for free.

This is actually terrible for investors, but amazing for builders (ironically).

The real value methinks is actually over the control of proprietary data used for training which is the single most important factor for model output quality. And this is actually as much an issue for copyright lawyers as for software engineers once the big regulatory hammers start dropping to protect American workers.


> This means the ability to train 50 GPT-4 scale models every 90 days or 200 such models per year.

Not anywhere close to that.

Those 350k GPUs you talk about aren't linked together. They also definitely aren't all H100s.

To train a GPT-4 scale model you need a single cluster, where all the GPUs are tightly linked together. At the scale of 20k+ GPUs, the price you pay in networking to link those GPUs is basically the same as the price of the GPUs themselves. It's really hard and expensive to do.

FB has maybe 2 such clusters, not more than that. And I'm somewhat confident one of those clusters is an A100 cluster.

So they can train maybe 6 GPT-4s every 90 days.
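
Rough back-of-envelope in Python, using only the figures quoted in this thread (8,000 H100s for 90 days per GPT-4-scale run, ~350k GPUs total, and the guess above of two ~24k-GPU tightly-networked clusters; all of these are assumptions, not confirmed Meta numbers):

    # All inputs are the rough figures quoted in this thread - treat them as assumptions.
    gpus_per_run = 8_000        # H100s Jensen cites for one GPT-4-scale (1.8T MoE) run
    days_per_run = 90

    # Naive view: treat the whole fleet as one trainable pool.
    total_gpus = 350_000
    naive_runs_per_90d = total_gpus / gpus_per_run                 # ~44
    naive_runs_per_year = naive_runs_per_90d * 365 / days_per_run  # ~177

    # Constrained view: only tightly-networked clusters can host a single run,
    # and networking at that scale roughly doubles the effective cost per GPU.
    clusters = 2                # guess for Meta, per the comment above
    gpus_per_cluster = 24_000   # assumed "20k+" cluster size
    runs_per_90d = clusters * gpus_per_cluster / gpus_per_run      # 6

    print(naive_runs_per_90d, naive_runs_per_year, runs_per_90d)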


I had to take a second look at this: https://www.datacenterdynamics.com/en/news/meta-to-operate-6...

350,000 H100s, and 600,000 H100 equivalents in total (perhaps the rest are AMD Instinct cards?), on top of the hundreds of thousands of legacy A100s.

And I'm certain the order for B100s will be big. Very big.

Even the philanthropic Chan Zuckerberg Initiative currently rocks 1000 H100s, probably none used for inference.

They are going ALL OUT


> They are going ALL OUT

Just like they did for their metaverse play, and that didn't work out very well.


I honestly don't think we've seen the end of AR/VR yet. The tech continues to improve year over year. There are rumors that the prototype Zuck plans to show at Meta Connect this year is mindblowing.

Better VR tech won't make people buy VR. You could literally offer them a Star Trek holodeck and they still wouldn't buy in. People don't buy it because they don't see the point.

This was even true in Star Trek. People could do literally anything on a holodeck and the writers still had them going to Risa for a holiday.

There is no chance of VR going mainstream until someone solves the fundamental human problem of people preferring to do things in real life.


> This was even true in Star Trek. People could do literally anything on a holodeck and the writers still had them going to Risa for a holiday.

If anything, that was a failure of imagination on writers' part, somewhat rectified over time and subsequent shows. Even in the core shows (TNG, DS9, VOY), we've seen the holodeck used for recreation, dating, study, simulation, brainstorming, physical training, hand-to-hand combat training, marksmanship training, gaming out scenarios for dangerous missions, extreme sports, engineering, accident investigation, crime scene reconstruction, and a bunch of other things. Still, the show was about boldly going where no one has gone before - not about boldly staying glued to a virtual reality display - so this affected what was or wasn't shown.

Plus, it's not either/or. People went to Risa to have real sex (er, jamaharon) with real people, and lots of it (and of them). This wasn't shown on-screen, just heavily implied, as this is where Roddenberry's vision of liberated humanity clashed with what could be shown on daytime TV. Holo-sex was a thing too, but it was shown to be treated more like pornography today - people do it, don't talk much about it, but if you try to do it too often and/or with facsimiles of people you know, everyone gets seriously creeped out.


In TNG we see Barclay get addicted to the holodeck and use it to play out fantasies with female members of the crew. Through the episode we end up learning that Barclay escaped to the holodeck because he was having problems and not being fulfilled in his real life.

There was a similar episode of DS9 where Nog gets addicted to the holodeck due to war trauma.

The central take of the show is that real life is better for these people in this future communist space utopia, and the only reason why you'd go to the holodeck is light entertainment, physical training, or if there's something wrong with your life that needs fixing.



so much human potential, natural resources, and anxiety wasted on obsessive pursuit of diddling a few special nerve endings, heaped in a mountain of self-serving social pecking order mythology and ritualistic mystery.

Once we can produce offspring in sci-fi vats, then we can remove the then unnecessary organs from our DNA and not have those worries. We can be just like human ants where the queen is now vats and we just work and maybe think a little.

What a brave new world it would be.

> There is no chance of VR going mainstream until someone solves the fundamental human problem of people preferring to do things in real life.

I don't think that's much of a problem? People already watch TV and play computers games and read novels, instead of real life.

I agree that VR has _some_ problem, but I don't think it's that people prefer real life.


Totally agreed. It's like the hype around "social media" or "streaming services" or "video games". There's no chance of any of them going mainstream because of the fundamental human problem of people preferring to do things in real life.

> There is no chance of VR going mainstream until someone solves the fundamental human problem of people preferring to do things in real life.

Ready Player One had a pretty good answer to this: dystopia. Once real life is miserable enough, VR's time will have arrived.


That, or another year of lockdown could also do it.

VR requires too much setup. I have a PS4, bought a used PSVR set and realized I needed a camera that I did not have. Realized instead of buying a camera, I could upgrade to a PS5 and buy the new headset that did not require a camera, because I prefer not to have my living room look like a lab. Then there is the social aspect of it.

You can't interact with people around you the same way you do if you play with, say, a console controller. VR is an all-encompassing activity that you have to dedicate time to, instead of having it casually just exist around you. Then we have the cost. Only some people can have it, so it will be a lonely activity most of the time when it could be so much more.

I can afford it, but every time I am in front of a new set, I consider my life with it and say "maybe next time". Finally, I have not really explored them, but I have a feeling the experience is limited by the content that exists.

I dream of a VR experience where suddenly all content I currently enjoy on flat screens will automagically be VRified. But I am pretty sure that will not be the case. Only a very limited collection will be VR native.

But I want it all to be, or almost all, before I go all in.


> VR requires too much setup. I have a PS4, bought a used PSVR set and realized I needed a camera that I did not have. Realized instead of buying a camera, I could upgrade to a PS5 and buy the new headset that did not require a camera, because I prefer not to have my living room look like a lab. Then there is the social aspect of it.

I bought the somewhat dated Quest 2 sometime in the last year, because I could get it for a good price. There's a mobile app I think I used for setup, and you can also connect to a PC for Oculus Link, SteamVR, Virtual Desktop or any number of other OpenXR apps or games, but as for the device itself... there was basically nothing to it aside from logging in and downloading what I wanted from the store, if I wanted to run things directly on the headset. The controllers and tracking just work, and you define the play area whose borders you want to be warned about by just drawing it in the room around you.

Actually, the only problems I've had have been in a PCVR use case after I got an Intel Arc - Oculus Link and SteamVR both don't support it natively (an allowlist in the case of the former and support only for NVENC I think in the case of the latter), whereas Virtual Desktop worked with AV1 and Intel QSV out of the box, while also allowing me to launch SteamVR through it.

There are still warts (especially software like Immersed removing support for physical monitors - what were they thinking?), but in general the hardware and everything around it, even hand tracking, is pretty well streamlined, surprisingly so.

> VR is an all-encompassing activity that you have to dedicate time to, instead of having it casually just exist around you.

This kind of killed it for me, to be honest. There's more friction than just launching a game on the PC directly (in the case of PCVR: putting the headset on, connecting to the PC, launching the game on the PC, then finally accessing it on the headset). Occasionally needing the keyboard is especially annoying, since the on-screen keyboard is clumsy to use, and finding your regular keyboard takes a step or two if you're standing instead of sitting while playing.

That said, VR in general still feels cool, even if it's a bit early.


The appropriate response to VR is that we all get a VR/storage/etc room in addition to the existing paradigms of bedroom, living room, kitchen, etc. At the high end we've grown houses to the point that, to remain boxes, they would need interior rooms without windows, and so far we have alternated between refusing to build these rooms because "natural light" and outright banning them for safety reasons, creating sprawling, complicated floorplans with lots of surface area per volume instead.

It would be a bit better suited to a civilization that wasn't undergoing a catastrophic urban housing shortage crisis with demographic & economic effects for upcoming generations that are comparable to a world war or the Black Death. We are building huge exurban houses which nonetheless do not have VR-appropriate rooms, and tiny 1-bedroom apartments, and not much else. https://www.youtube.com/watch?v=4ZxzBcxB7Zc

The question is whether this is a chicken/egg problem that prevents us from launching next-generation VR plays.


Vision Pro is already that today, FYI. It’s honestly amazing.

But it’s too expensive and still too heavy on your face.


You should try the Quest 3. Virtually no setup.

I think the key there (the “killer app” as it were) is shared experiences. I love co-op gaming and keep in touch with faraway friends by playing those games while Discording. It would be a game-changer (literally and figuratively) if we could game or watch something in the same AR or VR space, with some kind of persona or avatar representing ourselves, and spatially reflecting our audio/voice.

We’ve joked multiple times that whenever the “co-op Skyrim VR” of gaming comes out, we will never be heard from again lol.

Apple is SO CLOSE to this with its Vision Pro hardware, and yet so far… (no “co-op space” implementation, too expensive, too heavy on face)

Imagine seeing a live soccer match in 3D from incredible camera angles like just above the goals, but your buddy who is 3000 miles away is actually also sitting right next to you in that space, and you can see and hear each other…


> There is no chance of VR going mainstream until someone solves the fundamental human problem of people preferring to do things in real life.

Three counterpoints: Online gaming, social media, smartphones. All of these favor "virtual" over "real life", and have become massively popular over the last decades. Especially among the young, so the trend is likely to continue.


I think you're off base here; the issue is comfort. People spend all day on their computers and phones; VR just needs to make some breakthroughs in comfort (maybe built-in fans? Literally include ginger tablets with the headset?) to get people over the initial nausea hump. This plus higher resolution for AR purposes will do a ton.

Now, there may also be a physical laziness factor to overcome, but there are enough people that enjoy moving their bodies to really explode the industry even if all the lazy folks stay 2D.


AR might be a different story, if the tech gets small/good enough.

> rumors... prototype... mindblowing

Sounds like more unsubstantiated hype from a company desperate to sell a product that was very expensive to build. I guess we'll see, but I'm not optimistic for them.


Oh, I'm sure it'll be mind-blowing to see the emperor without clothes again.

> Even the philanthropic Chan Zuckerberg Initiative currently rocks 1000 H100s, probably none used for inference.

What do they use them for?


tax writeoffs

Ok, I might have misread some rumored ballpark figures. And most of the GPUs will be used for inference rather than training. Still, 6 GPT-4s every 90 days is pretty amazing.

It's like someone thinking that they are SOOOO smart, they are going to get rich selling shovels in the gold rush. So they overpay for the land, they overpay for the factory, they overpay for their sales staff.

And then someone else starts giving away shovels for free.


> And then someone else starts giving away shovels for free.

Ah, I see -- it's more like a "level 2 gold rush".

So a level 1 gold rush is: There's some gold in the ground, nobody knows where it is, so loads of people buy random bits of land for the chance to get rich. Most people lose, a handful of people win big. But the retailers buying shovels at wholesale and selling them at a premium make a safe, tidy profit.

But now that so many people know the maxim, "In a gold rush, sell shovels", there's now a level 2 gold rush: A rush to serve the miners rushing to find the gold. So loads of retailers buy loads and loads of shovels and set up shop in various places, hoping the miners will come. Probably some miners will come, and perhaps those retailers will make a profit; but not nearly as much as they expect, because there's guaranteed to be competition. But the company making the shovels and selling them at a premium makes a tidy profit.

So NVIDIA in this story is the manufacturer selling shovels to retailers; and all the companies building out massive GPU clouds are the retailers rushing to serve miners. NVIDIA is guaranteed to make a healthy profit off the GPU cloud rush as long as they play their cards right (and they've always done a pretty decent job of that in the past); but the vast majority of those rushing to build GPU clouds are going to lose their shirts.


And basically one AI company making all the money. Weird symbiosis.

> And then someone else starts giving away shovels for free.

And their business model is shovel-fleet logistics and maintenance... :p


The platform for shovel fleet logistics startups

SaaS (shoveling as a service)

And/or exploiting the legal infrastructure around intellectual property rights to make sure only hobbyists and geologists can use the shovels without paying through the nose or getting sued into oblivion.

If your company grows to 700 million monthly active users, then most probably you can make your own AI department and train your own models. I guess people's aspirations are very high in this space, but let's be realistic.

Their business model is of course tracking all the shovels and then selling the locations of all the gold.

It's almost like you can't actually control the demand side.

> once the big regulatory hammers start dropping to protect American workers

Have we been living in the same universe the last 10 years? I don't see this ever happening. Related recent news (literally posted yesterday) https://www.axios.com/2024/07/02/chevron-scotus-biden-cyber-...


I think people wildly underestimate how protectionist people - particularly educated software engineers and PhDs - will get once an AI model directly impacts their source of wealth.

Red state blue collar workers got their candidate to pass tariffs. What happens when both blue state white collar workers and red state blue collar workers need to contend with AI? Perhaps not within the next 10 years, but certainly within 20 years!

And if you think 20 years is a long time... 2004 was when Halo 2 came out


> I think people wildly underestimate how protectionist people - particularly educated software engineers and PhDs - will get once an AI model directly impacts their source of wealth.

I don't know what power you imagine SWEs and PhDs possess, but the last time their employers flexed their power by firing them in droves (despite record profits), the employees sure seemed powerless, and society shrugged it off and/or expressed barely-concealed schadenfreude.


They were sued for collusion and the lawyers got a massive payout and the employees got a fraction of lost wages. I was one of them. (Employees not lawyers.)

Hopefully by that time AI will be working for us in our homes, stores and farms so we don't need to work as much, and this will be ok.

People will still need purpose, which for better or worse is often provided by their job.

It's not going to stop it even if they try, though. You can't stop technical progress like this any more than you can stop piracy.

But agreed, between the unions with political pull and "AI safety" grifters I suspect there could be some level of regulatory risk, particularly for the megacorps in California. I doubt it will be some national thing in the US absent a major political upheaval. Definitely possible in the EU which will probably just be a price passed on to customers or reduced access, but that's nothing new for them.


I keep seeing people say you can't stop progress (social, technical, etc.), but has this really been tested? There seems to be a lot of political upheaval at least being threatened in the near future, and depending on the forces that come into power, I imagine they may be willing to do a lot to protect that power.

Tucker Carlson at one point said if FSD was going to take away trucking jobs we should stop that with regulation.


The only upside is state-level minimum wage increases. The federal minimum wage is still a complete joke at $7.25 an hour.

But there's bigger fish to fry for American politics and worker obsolescence is not really top of mind for anyone.


> making the actual profit margins from AI-centric companies close to 0

The same thinking stopped many legacy tech companies from becoming a “cloud” company ~20 years ago.

Fast forward to today and the margin for cloud compute is still obscene. And they all wish in hindsight they got into the cloud business way sooner than they ultimately did.


That's not the way I remember the cloud transition at all. My company adopted an internal cloud. VMWare had some great quarters. Openstack got really big for a while and everyone was trying to hire for it. All the hosting companies started cloud offerings.

What ended up happening was Amazon was better at scale and lockin than everyone else. They gave Netflix a sweet deal and used it as a massive advertisement. It ended up being a rock rolling down a hill and all the competitors except ones with very deep pockets and the ability to cross-subsidize from other businesses (MSFT and Google) got crushed.


It still blows my mind that Microsoft is the most valuable company on the planet because of the cloud and Ballmer's long-term vision. I thought they would have gone the way of IBM.

For comparison, IBM booked about $62 billion in revenue for 2023.

I thought Nvidia took that crown recently, though.


Agree. Ballmer seems to have done his job well.

While I agree, he also admits his biggest miss was phone/hardware (which is what catapulted Apple).

https://youtu.be/v9d3wp2sGPI?feature=shared


It’s not just the software, it’s the hardware too. Too many companies got good at speeding up VM deployments but ignored theory of constraints and still had a 4-6 month hardware procurement process that gave everyone and their dog veto power.

And then you come to companies that managed to streamline both and ran out of floor space in their data center because they had to hold onto assets for 3-5 years. At one previous employer, the smallest orderable unit of compute was a 44U rack. They eventually filled the primary data center and then it took them 2 years to Tetris their way out of it.


Are the second-tier cloud companies really seeing big margins? Why is it not competed away to zero like airlines?

There is essentially zero cost for a user to switch airlines. The cost to switch clouds is astronomical for any decent sized org.

Those poor little AI clouds will never keep people reeled in unless they invent something like CUDA.

I like the sentiment of your post. I mostly agree. If you use OpenShift, doesn't that help to reduce the cost of switching cloud providers?

You're not switching cloud providers. Amazon's not going to suddenly decide to jack up rates for EC2 instances on you. So the extra complexity just isn't worth it.

There is a hypothetical "but what if we honestly actually really really do", but that's such a waste of engineering time when there are so many other problems to be solved that it's implausible. The only time multi-cloud makes sense is when you have to meet customers where they're at, and have resources in whichever cloud your customers are using. Or if you're running arbitrage between the clouds and are reselling compute.


Not really. What happens when you run a cloud on top of your cloud is that you don’t get to use any of their differentiating features and that winds up costing you money. Plus you have to pay for your own control plane when that’s already baked into the cloud provider’s charge model.

    > Plus you have to pay for your own control plane when that’s already baked into the cloud provider’s charge model.
When you say "control plane" does this mean Kubernetes?

There will probably be 2 huge winners, and everyone else will fail. Similar to the solar boom.

Who are the winners in solar?

per Zuckerberg[0], ~half of their H100s were for Reels content recommendation:

> I think it was because we were working on Reels. We always want to have enough capacity to build something that we can't quite see on the horizon yet. ... So let's order enough GPUs to do what we need to do on Reels and ranking content and feed. But let's also double that.

So there's an immense capacity inside Meta, but the _whole_ fleet isn't available for LLM training.

[0]: https://www.dwarkeshpatel.com/p/mark-zuckerberg?open=false#§...


Surely they’re using some of that hardware to overcome Apple’s attempts to deprive them of targeted advertising data.

In my opinion Elsevier and others charging for access to academic publications has held back the advancement of humanity by a considerable factor, slowing our exponential acceleration into the future. Think of how cancer could have been cured a decade ago if information was allowed to flow freely from the 50's forward - if anyone could have read scientific publications for free. I have no respect for people that want to protect the moat around information that could be used to advance humanity.

Have to disagree. Almost all researchers have essentially unfettered access to all of biomedical literature. Access to papers is therefore a tertiary annoyance wrt progress in science and the cures for cancers.

What IS a huge problem is the almost complete lack of systematically acquired quantitative data on human health (and diseases) for a very large number (1 million subjects) of diverse humans WITH multiple deep-tissue biopsies (yes, essentially impossible) that are suitable for multiomics at many ages/stages and across many environments. (Note, we can do this using mice.)

Some specific examples/questions to drive this point home: What is the largest study of mRNA expression in humans? ANSWER: The small but very expensive NIH GTEx study (n max of about 1000 Americans). This study acquired postmortem biopsies for just over 50 tissues. And what is the largest study of protein expression in humans across tissues? Oh sorry, this has never been done, although we know proteins are the work-horses of life. What about lipids, metabolites, metagenomics, epigenomics? Sorry again, there is no systematically acquired data at all.

What we have instead is a very large cottage-industry of lab-level studies that are structurally incoherent.

Some brag about the massive biomedical data we have, but it is truly a ghost, and most real data evaporates within a few years.

Here is my rant on fundamental data design flaws and fundamental data integration flaws in biomedical research:

Herding Cats: The Sociology of Data Integration https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2751652/


I also think the bottleneck isn't access to the papers today and data access and silos are more important.

But I also think the GP's claim and yours are not incompatible. I wonder how much survivorship bias this has, since it only considers those that are able to do research, and not those who would have but ended up continuing with another STEM job instead. We could be asking the counterfactual that I think the GP is implying: would more people have been interested in becoming cancer researchers if publications were open?

We can sort of see the effect because we have Sci-Hub now, which basically unlocks journal access for those that are comfortable with it, and I consider it plausibly having a significant effect on the population that has a research background without an academic affiliation. I've met a few biotech startup founders who switched from tech to bio and did self-study plus Sci-Hub outside of a university. The impetus for change I've heard a few times is: "a loved one got X disease, I studied it, and quit my less impactful tech job to work on bio stuff."


As much as I'd love open access to academic publications and don't think the current model is great:

> Think of how cancer could have been cured a decade ago if information was allowed to flow freely from the 50's forward

might be a bit fanciful? Unless you're referring to something particular I'm unaware of.

The people best equipped and trained to deliver a cure for cancer (and then some, since it tends not to be particularly field-restricted) do have access.

I think the loss is more likely in engineering (as opposed to the publication's science): cheaper methods, more reliably manufacturable versions of lab prototypes, etc.

I doubt there are many people capable of cancer research breakthroughs who don't have access to cancer research, personally.

(And to be clear: I'm not capable of it.)


I’ll add that even if the papers we all wanted were more freely accessible, the replication and completeness of their described methods would be another source of slowdown.

Main problem is still just getting good quantitative data and metadata. Most biomedical researchers are motivated to “tell stories”. Few of us care about generating huge mineable data sets.

All of the engineering companies I’ve worked for have not paid for IEEE or any journals. I have to go to the library and maintain membership for IEEE myself then request reimbursement.

The schools I’ve worked with have access to everything I’ve needed. They didn’t advertise it but it’s also free for students.


Not to mention there is no singular “cancer” - there are many types, and they’re all sufficiently different to make the problem much more challenging.

1) First, most researchers at universities or other institutions have always had unfettered access thanks to a site license. It would be pretty hard to find a real example of a university researcher who couldn't see something.

2) There may be a few researchers who don't have unfettered access. Perhaps they paid $40 for a copy of a paper. Given the high cost of other parts of research labs, I find it hard to believe that any real possibility of curing cancer was halted because someone had to pay $40.

3) It's possible to imagine the opposite being the case. Perhaps someone had a key insight in a clever paper and decided to distribute it for free out of some info-anarchistic impulses. There it would sit in some FTP directory uncurated, unindexed and uncared for. Perhaps the right eyes would find it. Perhaps they wouldn't. Perhaps the cancer researcher would be able to handle all of the LaTeX and FTP chores without slowing down research. Perhaps they would be distracted by sys admin headaches and never make a crucial follow up discovery.

The copyrighted journal system provides curation and organization. Is it wonderful? Nah. Is it better than some ad hoc collection of FTP directories? Yes!

Your opinion may be that this scenario would never happen. In my opinion, this is more likely than your vision.


99%+ of the people doing scientific work in curing cancer have access to all the relevant medical and scientific journals.

You have to not latch on to causes such as "advance humanity" and then justify making people do work for free. We decided a while ago[0] that making people work for free was a bad thing. There is high demand for curing cancer. Every company that tries it will hire scientists and lab techs and large laboratories, and have subscriptions to journals. Do you think all of those people should work for free, in the cause of advancing humanity?

[0] https://en.wikipedia.org/wiki/Slavery_Abolition_Act_1833


But anyone who needs to see these publications can reach them through libraries. One of the reasons why Elsevier can charge so much is that their customers have been institutions.

This depends a lot on the country and the library.

Eh, most published research papers are wrong anyway.

A lot of those GPUs are for their 3B users to run inferencing, no?

It’s been a very long time since I had any inside baseball, but I very much doubt that Hopper gear is in the hot inference path.

The precisions and mantissa/exponent ratios you want for inference are just different from those of a mixed-precision, fault-tolerant, model- and data-parallel training pipeline.

Hopper is for training mega-huge attention decoders: TF32, bfloat16, hot paths to the SRAM end of the cache hierarchy with cache coherency semantics that you can reason about. Parity gear for fault tolerance, it’s just a different game.
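
To make the contrast concrete, here's a toy PyTorch sketch of that split - TF32/bfloat16 mixed precision plus full autograd on the training side, a one-time cast to lower precision with no gradient bookkeeping on the serving side. The model and sizes are made up for illustration; this isn't anyone's actual pipeline.

    import torch
    import torch.nn as nn

    model = nn.Linear(4096, 4096).cuda()
    opt = torch.optim.AdamW(model.parameters(), lr=1e-4)
    x = torch.randn(8, 4096, device="cuda")

    # Training side: TF32 matmuls and bfloat16 autocast, plus optimizer/gradient state.
    torch.backends.cuda.matmul.allow_tf32 = True
    with torch.autocast(device_type="cuda", dtype=torch.bfloat16):
        loss = model(x).pow(2).mean()
    loss.backward()
    opt.step()

    # Inference side: weights cast down once, no autograd or optimizer state at all.
    infer_model = model.half().eval()
    with torch.inference_mode():
        y = infer_model(x.half())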


True that, but I think in a very short amount of time, using dedicated general purpose GPUs just for inferencing is going to be mega overkill.

If there's dedicated inferencing silicon (like say the thing created by Groq), all those GPUs will be power sucking liabilities, and then the REAL singularity superintelligence level training can begin.


Etched is another dedicated inference hardware company that recently announced their product. It only works for transformer-based models, but is ~20x faster than an H100.

> The real value methinks is actually over the control of proprietary data used for training which is the single most important factor for model output quality.

Maybe. But we've barely scratched the surface of being more economical with data.

I remember back in the old days, there was lots of work on e.g. dropout and data augmentation. We haven't seen too much of that with the likes of ChatGPT yet.
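
For reference, the kind of tricks meant here look roughly like this - a generic PyTorch/torchvision sketch of input augmentation plus dropout, not tied to any particular model:

    import torch.nn as nn
    from torchvision import transforms

    # Augmentation: squeeze more effective training examples out of the same images.
    augment = transforms.Compose([
        transforms.RandomResizedCrop(224),
        transforms.RandomHorizontalFlip(),
        transforms.ColorJitter(brightness=0.2, contrast=0.2),
        transforms.ToTensor(),
    ])

    # Dropout: randomly zero activations during training to reduce overfitting.
    classifier = nn.Sequential(
        nn.Linear(2048, 512),
        nn.ReLU(),
        nn.Dropout(p=0.5),  # active in train() mode, a no-op in eval() mode
        nn.Linear(512, 10),
    )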

I'm also curious to see what the future of multimodal models holds: you can create almost arbitrary amounts of extra data by pointing a webcam at the world, especially when combined with a robot, or by letting your models play StarCraft or Diplomacy against each other.


There are more kinds of AI-centric companies than just foundation models. Making that equivalency is akin to equating internet companies during the dotcom bubble with just websites like pets.com. Now one semi-skilled person in a couple days can make websites that entire teams back then would have taken months to build, but that doesn't mean google.com and thefacebook.com are easily commoditized or bad businesses just because they're websites.

> this means the ability to train 50 GPT-4 scale models every 90 days or 200 such models per year.

What it actually means is that they are training next gen models that are 50X larger.

And, considering MS and OpenAI are planning to build a $100 billion AI training computer, these 350K GPUs are just a tiny portion of what they are planning.

This isn't overkill. This is the current plan: throw as much compute as possible at it and hope intelligence scales with compute.


> but amazing for builders (ironically).

Could you expand on this? Who are "the builders" here? You mean the model developers? I don't see how this situation can be "amazing" for the builders - developers will just get a wage out of their work.


> once the big regulatory hammers start dropping to protect American workers.

The US Supreme Court seems determined to make sure that big regulatory hammers are not going to be dropping, from what I can tell.


Proprietary data is not necessary for training intelligence. Wikipedia, PubMed, arXiv, Reddit and GitHub are probably sufficient. And babies don’t even use that.

I agree though that the returns on hardware rapidly diminish.


Llama isn't even in the same stratosphere as the big models when it comes to coding, logic, and other interesting tasks that I think are commercially viable.

I thought AI is supposed to put all the lawyers out of work.

Nothing will put lawyers or doctors out of work. They are powerful cartels that can easily protect themselves. Realtors are already technologically irrelevant, but they have a huge entrenched social and legal system that makes it impractical to compete.

Weren't lots of realtors recently put out of work in the US, at least?

When NAR settled the price collusion charge? So, cartel or not, times do change.


My friend is a real estate agent; they play a major part in the psychology of buyers and sellers. Selling your dead parents' home that you grew up in (for example) isn't something everyone just signs up to some website and does using a credit card without a second thought.

A good real estate agent can guide people through this process while advising them on selling at the right price and avoiding the most stress, often during an extremely difficult time in their life, such as going through a divorce or breakup. They of course also help keep buyers interested while the seller is making up their mind about the correct offer to take.

I find your comment ignorant in so many ways. Maybe have some respect?


Are you not just explaining "a huge entrenched social system" as OP said?

It takes a long time for cultures to shift and for people to start to trust information systems to entirely replace high touch stuff like that. And at some level there will always be some white glove service on top for special cases.


How is hiring a professional to help you sell a property a "huge entrenched social system", sorry? No one is forced to hire a real estate agent. I bought my house through private sale.

> No one is forced to hire a real estate agent.

but for a long time in the US you were "forced" to hire a real estate agent if you wanted to get the market price.

Refer to the NAR settlement that pretty much admits to this.

https://www.realestatecommissionlitigation.com/

This is not to say that real estate agents cannot add value to a process; it is just that they were a cartel with anticompetitive practices.

The mandated and fixed 6% on each sale was and is ridiculous; with a median sale price of $400K in the US ... that is a $24K commission


That really does say something about how unrealistic house prices are nowadays, doesn’t it?

It doesn't say anything about house prices IMHO.

Simply put, the cost of selling a home should not be linearly related to the cost of the house,

and especially should not be a fixed percentage across the entire country.


Lexis+ AI and Ask Practical Law AI systems produced incorrect information more than 17% of the time, while Westlaw’s AI-Assisted Research hallucinated more than 34% of the time:

https://hai.stanford.edu/news/ai-trial-legal-models-hallucin...


Just out of curiosity, what's the human lawyer baseline on that?

The failures are different from my experience in this.

Human lawyers fail by not being very zealous and most of them being very average, not having enough time to spend on any filings, and not having sufficient research skills. So really, depth-of-knowledge and talent. They generally won't get things wrong per se, but just won't find a good answer.

AI gets it wrong by just making up whole cases that it wishes existed to match the arguments it came up with, or that you are hinting that you want, perhaps subconsciously. AI just wants to "please you" and creates something to fit. Its depth-of-knowledge is unreal, its "talent" is unreal, but it has to be checked over.

It's the same arguments with AI computer code. I had AI create some amazing functions last night but it kept hallucinating the name of a method call that didn't exist. Luckily with code it's more obvious to spot an error like that because it simply won't compile, and in this case I got luckier than usual, in that the correct function did exist under another name.


> Just out of curiosity, what's the human lawyer baseline on that?

Largely depends on how much money the client has.


it's the self-driving car problem. Humans aren't perfect either but people like to ignore that.

True, they're similar... But what's also similar is that people make the mistake of focusing on differences in failure rates while glossing over failure modes.

Human imperfections are a family of failure-modes which we have a gajillion years of experience in detecting, analyzing, preventing, and repairing. Quirks in ML models... not so much.

A quick thought-experiment to illustrate the difference: Imagine there's a self-driving car that is exactly half as likely to cause death or injury as a human driver. That's a good failure rate. The twist is that its major failure mode is totally alien, where units inexplicably attempt to chase-murder random pedestrians. It would be difficult to get people to accept that tradeoff.


No, people have the correct intuition that human errors at human speeds are very different in nature from human-rate errors at machine speeds.

It's one thing if a human makes a wrong financial decision or a wrong driving decision, it's another thing if a model distributed to ten million computers in the world makes that decision five million times in one second before you can notice it's happening.

It's why if your coworker makes a weird noise you ask what's wrong, if the industrial furnace you stand next to makes a weird noise you take a few steps back.


I'm sure it's nowhere near good enough yet, but a legal model getting the answer right 83% of the time is still quite impressive imo.

Everything hinges on whether scaling keeps making the models better. If it does, you don't train 50 GPT-4s; you train the one best model.

Why are we assuming we're topping out at a GPT-4 scale model?

What percentage of GPUs are being used for training versus inference?

The infinitely expanding AI-generated metaverse isn't going to render itself; at least in the case of Meta, I think that might be one of the only missing pieces.

I think this is the correct take. My understanding of the article is that huge investments in hardware, mostly to NVIDIA, and spending by major tech companies is currently defining the market, even if we include OpenAI, Anthropic, etc. It is FAANG money they are running on.

I put this as equivalent to investing in Sun Microsystems and Netscape in the late 90s. We knew the internet was going to change the world, and we were right, but we were completely wrong as to how, and where the money would flow.


The better analogy is the massive investment in fiber optic cable in the late 90s. All the companies in that line (Global Crossing, Worldcom, etc.) went bust after investing tens of billions, but the capacity was useful (after a >90% drop in price) for future internet services. Around 2000, when the bubble was bursting, only 5% of the capacity was being used, but it proved useful in getting internet-first companies like Google, Amazon, and Netflix going.

I initially was agreeing with you, but I don't see NVIDIA, AWS, Microsoft, etc going to zero (and Worldcom was unraveled by accounting fraud).

Sun Microsystems sold to Oracle for $7B, and Netscape was acquired by AOL for $10B.



Yeah, Cisco didn't go to zero in 2000 and Nvidia won't go to zero. It will merely go down 90%.

Even at the bottom of the crash, Cisco's stock price was still 150% higher than before the boom.

So will Nvidia be worth $5 trillion AFTER the AI bust?


Good old JDS Uniphase was one of the first individual stocks I bought. I mean it had to go up right? Fiber and dark fiber and the continual threat of the internet collapsing due to load… better buy that stock!

Worldcom here. Ah, the Enrons of the internet.

I wonder how the analogy holds up given computational advances. Will a bunch of H100s be as useful a decade later as fiber ended up being?

I might be wrong, but my understanding is that we're on a decelerating slope of perf/transistor and have been for quite a while - I just looked up the OpenCL benchmark results of the 1080 Ti vs the 4090, and the perf/W went up by 2.8x despite going from 16nm to 5nm; with perfect power scaling, we would've seen more than a 10x increase.
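
The implied arithmetic (node names are really marketing labels at this point, so treat this as a rough illustration only):

    # Ideal Dennard-style scaling: power per transistor tracks its area,
    # so perf/W would improve roughly with the square of the linear shrink.
    old_nm, new_nm = 16, 5
    ideal_gain = (old_nm / new_nm) ** 2            # ~10.2x
    observed_gain = 2.8                            # 1080 Ti -> 4090, per the OpenCL numbers above
    print(ideal_gain, observed_gain / ideal_gain)  # ~10.24, i.e. we got roughly a quarter of the ideal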

Probably not. There will be better GPUs. We did use all those Kepler K10s and K80s a decade or so ago; they were OK for models with a few million parameters. Then Pascal and Volta arrived a few years later with a massive speedup and larger memory, allowing same-size models to be trained 2-4 times faster, so you simply had to replace all the Keplers. Then Turing happened, making the P100s and V100s obsolete. Then A100, and now H100. The next L100 or whatever, with just more on-board memory, will make the H100 obsolete quickly.

One thing that is missing is that we have massively improved the performance of the algorithms lately to require less compute power, so an H100 will still be performant several years from now. The problem will be that it's going to use up more power and physical space than an outperforming future version, and so will need to be scrapped.

Same applies to the railroads analogy used in the original article.

I think the FAANGs are doing part moat-defending and part value-add. For example, an AI-powered spreadsheet might dethrone Excel, so Excel has to be AI-powered. MS will probably get some additional revenue from it, but I don't think it will be a revolutionary amount.

> I put this as equivalent to investing in Sun Microsystems and Netscape in the late 90s.

Cisco too.


> Founders and company builders will continue to build in AI—and they will be more likely to succeed, because they will benefit both from lower costs and from learnings accrued during this period of experimentation

Highly debatable.

When we look back at the internet and mobile waves, it is overwhelmingly the companies that came in after the hype cycle had died down that have endured.


There is an old study that supports your point. The abstract reads:

"Several studies have shown that pioneers have long-lived market share advantages and are likely to be market leaders in their product categories. However, that research has potential limitations: the reliance on a few established databases, the exclusion of nonsurvivors, and the use of single-informant self-reports for data collection. The authors of this study use an alternate method, historical analysis, to avoid these limitations. Approximately 500 brands in 50 product categories are analyzed. The results show that almost half of market pioneers fail and their mean market share is much lower than that found in other studies. Also, early market leaders have much greater long-term success and enter an average of 13 years after pioneers."

PDF available here:

https://people.duke.edu/~moorman/Marketing-Strategy-Seminar-...


Yes the "first mover advantage" is mostly just a common myth in business that refuses to die, but if we look at the original statement:

> Founders and company builders will continue to build in AI—and they will be more likely to succeed, because they will benefit both from lower costs and from learnings accrued during this period of experimentation

This still lines up with the 2nd wave benefiting more. The first movers helped establish the large-scale AI hardware industry, got a bunch of smart kids trained on how to make AI, a bunch of people will fail and learn, etc., and this experimentation stage sets the groundwork for OpenAI 2.0.

We could very well just be in the AltaVista-vs-Yahoo days of AI, with an upstart taking over in 5 years.


Let's see: Microsoft Windows: wasn't close to the first OS

Microsoft Office: wasn't close to the first office editing suite

Google: Wasn't close to the first search engine

Facebook: Wasn't close to the first social media website

Apple: ~~First "smart phone"~~ but not the first personal computer. Comments reminded me that it wasn't the first smartphone

Netflix: Wasn't close to the first video rental service.

Amazon: Wasn't close to the first web store

None of the big five were first in their dominant categories. They were first to offer some gimmick (e.g., Google was fast, Netflix was by mail with no late fees), but not first categorically.

Though they certainly did benefit from learnings of those that came before them.


> Apple: First "smart phone" but not the first personal computer

Was it the first smartphone? I would call phones like the Palm Treo and later BlackBerries smartphones. There were even apps, but everything was lot more locked down and a lot more expensive.


> I would call phones like the Palm Treo and later BlackBerries smartphones.

It's not just you; at the time these products were available, _everyone_ called them smartphones. Emphatically, Apple did not bring the first smartphone to market, not even close. They were, however, the first to popularize it beyond the field of nerds into the general public.


I had a Nokia N95, which was basically a smartphone and came out a year before the iPhone. And Wikipedia says

>it became a huge sales success for Nokia ... It managed to outsell rivals such as LG Viewty and iPhone.

However the iPhone got better.


First modern smartphone (capacitive touch screen/multi-touch/form factor), but not first smartphone.

> There were even apps, but everything was lot more locked down and a lot more expensive.

And just plain... bad. The entire experience didn't have that "feel" that Apple turned into reality. It's comparable to today's AI landscape—the technology is pretty neat, but using it is a complete slog.


I actually have pretty fond memories of PalmOS PDAs. The hardware was very nice, but they were held back by the resistive touchscreen and dependence on a stylus for input. I never used a Treo but it felt like this was Palm trying to copy BlackBerry by adding a physical keyboard.

Edit: There were also the limitations of that era that held devices back in general. WAP internet[1] was awful, but most mobile services were too slow for much else.

[1] https://en.wikipedia.org/wiki/Wireless_Application_Protocol


Nokias were very open. You had a terminal with apt-get.

The entire device was a regular Linux machine.


In general, they were not. You're probably thinking of the very niche and unsuccessful Maemo/MeeGo project - e.g. the Nokia N900 - which was indeed Linux-based. But everything else smartphone-ish from Nokia before Lumia (Windows Phone) was Symbian, which predates Linux and has nothing to do with it.

I am of course referring to Maemo, as per my previous post.

There were Nokias running Maemo ahead of the iPhone. Note these were not Symbian.

The 770 was released in Q4 '05.

They definitely fell within the smartphone category, but oddly the first few iterations lacked GSM radio.


I would classify them as tablets. At least that's what I thought of my N810 as.

I'm a complete idiot. I almost bought an HTC fuse too

> e.g., Google was fast,

Just to quibble with this - that was not even close to the reason Google got popular. It was because Google was much, much better at finding what you actually wanted. It was just a far better product.

You can debate why this is exactly, Joel Spolsky pointed out many years ago that it was because Google got that what matters to users most isn't "finding all pages related to X" but rather "ranking" those pages, a take I agree with.


I have a pet peeve with this common piece of wisdom. You can always find a "predecessor" for about anything. The corollary being that there is never a "first". And therefore, stating that "none of the big companies were the first in their categories" is just stating a tautology.

> some gimmick

"key differentiator" and not necessarily easy to pull off or pay for


“Pioneers get the arrows, and settlers get the land”?

"The early bird gets the worm -- but the second mouse gets the cheese."

Maybe because the revenue isn't directly attributable to AI itself, but is realized in the cost savings and productivity improvements in already existing revenue streams? That's where AI has been useful to me. I can't put a number on how much AI has made me exactly, but it has certainly helped all aspects of my bootstrapped startup.

Any product should bring benefits to both the producer and the consumer.

For the case where a company is using their own AI for their own cost reduction and productivity improvements, they can keep doing that but not offer to another party.

If they offer to another party, and that party is having benefits (like you have said), the price should be such that a part of the consumer benefit is shared with the producer resulting in benefits for the producer.

The real challenge here is price wars, i.e., there is already too much competition, with producers willing to take a hit on profitability in anticipation that they will be able to make it up later, after creating a moat above and beyond competitors. Or they think that it will strengthen their overall bigger offering by adding an otherwise loss-making feature.

In a nutshell, even if there's a lot of value for the consumers, it must result in a win-win for a new product to be sustainable in the market.


> but it has certainly helped all aspects of my bootstrapped startup.

Well, if there's value to you, then how much did you pay for it, and would that realistically cover operating costs once the VC cash dries up? That's the only question.


Others are saying this article is bearish, but then...

> A huge amount of economic value is going to be created by AI. Company builders focused on delivering value to end users will be rewarded handsomely.

Such strong speculative predictions about the future, with no evidence. How can anyone be so certain about this? Do they have some kind of crystal ball? Later in the article they even admit that this is another one of tech's all-too-familiar "Speculative frenzies."

The whole AI thing just continues to baffle me. It's like everyone is in the same trance and simply assuming and chanting over and over that This Will Change Everything, just like previous technology hype cycles were surely going to Change Everything. I mean, we're seeing huge companies' entire product strategies changing overnight because We Must All Believe.

How can anyone speak definitively about what AI will do at this stage of the cycle?


How can anyone not see just how impactful it's going to be? Or already is? I can't think of a single recent technology that was so widely adopted by tech and non-tech people alike, immediately integrated into day-to-day experience. The rise of mobile phones and e-commerce in the 90s would be the last time I've seen this happen (I'm not counting smartphones, as those are more of an iteration). Or social media, in purely software space.

I've just had GPT-4o write me a full-featured 2048 clone in ~6 hours of casual chat, in between work, making dinner, and playing with kids; it cost me some $4 in OpenAI bills, and I didn't write a single line of code. I see non-tech people around me using ChatGPT for anything from comparison shopping to recipe adjustments. One person recently said to me that their dietitian is afraid for their career prospects because ChatGPT is already doing this job better than she is. This is a small fraction of cases in my family&friends circle; anyone who hasn't lived under a rock, or wasn't blinded by the memetic equivalent of looking at a nuclear weapon detonation, likely has a lot of similar things to say. And all of that is not will, it's is, right now.


Okay I guess I've just had a different experience entirely. Maybe I'm jaded by hallucinations.

The code ChatGPT generates is often bad in ways that are hard to detect. If you are not an experienced software engineer, the defects could be impossible to detect until you/ChatGPT have gone and exposed all your customers to bad actors, crashed at runtime, or done something terribly incorrect.

As far as other thought work goes, I am not consulting ChatGPT over, say, a dietician or a doctor. The hallucination risk is too high. Producing an answer is the not the same as producing a correct answer.


My experience actually agrees with you. It's just that the set of use cases that either:

- Are hard (or boring) to do, but easy to evaluate - for me, e.g. writing code, OCR, ideation; or

- Don't require a perfectly correct answer, but more of a starting point or map of the problem space; or

- Are very subjective, or creative, with there being no single correct answer,

is surprisingly large. It covers pretty much everything, but not everything for everyone at the same time.


I agree. I've just seen it hallucinate too many things that on the surface seem very plausible but are complete fabrications. Basically my trust is near 0 for anything chatgpt, etc. spits out.

My latest challenge is dealing with people who trust ChatGPT to be infallible, and just quote the garbage to make themselves look like they know what they are talking about.


> things that on the surface seem very plausible but are complete fabrications

LLMs are language models; it's crazy that people expect them to be correct in anything beyond surface-level language.


Yeah, I was probably being a bit too harsh in my original comment. I do find them useful, you just have to be wary of the output.

> Okay I guess I've just had a different experience entirely.

I've seen both the good and the bad. I really like the good parts. Most recently, Claude Sonnet 3.5 fixed a math error in my code (I prompted it to check for it from a well-written bug report, and it did it fix it ever so perfectly).

These days, it is pretty much second nature for me to pull up a new file & prompt Copilot to complete writing the entire code from my comment trails. I don't think I've seen as much change in my coding behaviour since Borland Turbo C -> NetBeans.


If your process is asking it to "write me all this code" and then slapping it in production, you're going to have a bad time. But there's an intermediate ground.

>I am not consulting ChatGPT over, say, a dietician or doctor

Do you know any doctors, by chance? You have way more faith in experts than I do.


ChatGPT is just statistically associating what it’s observed online. I wouldn’t take dietary advice from the mean output of Reddit with more trust than an expert.

Doctors can be associating what they’ve learned, often with heavy biases from hypochondriacs and not enough time per patient to really consider the options.

I’ve had multiple friends get seriously ill before a doctor took their symptoms seriously, and this is a country with decent healthcare by all accounts.

Human biases are bad too.


> Doctors can be associating what they’ve learned, often with heavy biases from hypochondriacs

So true. And it's hard to question a doctor's advice, because of their aura of authority, whereas it's easy to do further validation of an LLM's diagnosis.

I had to change doctor recently when moving towns. It was only when chancing on a good doctor that I realised how bad my old doctor was - a nice guy but cruising to retirement. And my experience with cardiologists has been the same.

Happy to get medical advice from an LLM though I'd certainly want prescriptions and action plans vetted by a human.


    > It was only when chancing on a good doctor that I realised how bad my old doctor was
How did you determine the new doctor is "good"?

By the time a doctor paid me enough attention to realise something was wrong, I had suffered a spinal cord injury whose damage can never be reversed. I'm not falling all over myself to trust ChatGPT, but I've got practically zero trust for doctors either. Nobody moved until I threatened to start suing.

Will be cool once we have active agents tho. Surely the learning/research process isn't that difficult even for current LLMs/similar architectures. If it can teach itself, or it can collate new (never seen) data for other models then that's the cool part.

I sometimes use ChatGPT to prepare for a doctor's visit so I can have a more intelligent conversation even if I may have more trust overall in my doctor than in AI.

You realize that "online" doesn't just mean Reddit, but also Wikipedia and arXiv and PubMed and other sources perused by actual experts? ChatGPT read more academic publications in any field than any human.

Yes, but because ChatGPT doesn’t think, it doesn’t know which arxiv papers are absolute garbage and which ones are legit.

Wikipedia does not have dietary advice. It’s an encyclopedia.


I've seen so many doctors advertising or recommending homeopathic "medicines" or GE-132 [1] that I would be more confident in an LLM plus my own verification from reliable sources. I'm no doctor, but I know more than enough to recognize bullshit, so I wouldn't recommend this approach to just anyone.

[1] https://pubmed.ncbi.nlm.nih.gov/1726409/


I recently needed to help a downstream team with a problem with an Android app. I never did mobile app dev before, but I was able to spin up a POC (having not coded in Java for 22 years) and solve the problem with the help of ChatGPT 4.0.

Sure I probably would have been able to do it without ChatGPT, but it was so much easier to have something to bounce ideas off-of. A safety net, if you will.

The hallucination risk was irrelevant: it did hallucinate a little early on. I told it it was hallucinating, and we moved on to a different way of solving the problem. It was easy enough to verify it was working as expected.


Seems to me this is the equivalent of fast retrieval and piecing together from a huge amount of examples in the data. This might take far more time if you were to do this yourself. That's a plus for the tools. In other words, a massively expensive (for the service provider) auto-complete.

But try to do something much simpler that has far fewer examples in the data (a typical case is something with bad documentation), and it falls apart. I even tried to use Perplexity to create a dead simple CLI command, and it hallucinated an answer (looking at the docs, it misused the parameter, and may have picked up on someone who gave an incorrect answer in the data).


LLMs have already gotten significantly better and faster in a few years. Maybe they will hit a wall in the next five, but even if they do, they're still extremely useful, and there are always other ways to optimize the current technology. This is already a major development for society.

>The code ChatGPT generates is often bad in ways that are hard to detect. If you are not an experienced software engineer, the defects could be impossible to detect, until you/ChatGPT has gone and exposed all your customers to bad actors, or crash at runtime, or do something terribly incorrect.

I wonder about this a lot, because there's a future here where a decent amount of software engineering is offloaded to these AIs and we reach a point, in the near future, where no one really knows or understands what's going on. That seems bad. Put another way, suppose that your primary care doctor is really just using MedAI to diagnose and recommend treatment for whatever it is you went in to see him about. Over time, these sorts of shortcuts metastasize and the doctor ends up not really knowing anything about you, or the other patients, or what he's really doing as a doctor ... it's just MedAI (with whatever wrongness rate is tolerable for the insurance adjusters). Again, seems bad. There's a palpable loss of human knowledge here that's enabled by a "tool" that's allegedly going to make us all better off.


The closest analogy here is that airplane autopilots aren't as full-featured as they could be, because past a point they would reduce safety.

Right, good point. Maybe I'm making an argument that some features, or scope of features, should be highly regulated along the same lines.

>The code ChatGPT generates is often bad in ways that are hard to detect. If you are not an experienced software engineer, the defects could be impossible to detect

I keep hearing this, but it's incorrect. While I only know R, which is obviously a simple language, I would never type out all my code and go without testing to ensure it does what I intended before using it regularly.

So I can't imagine someone who knows a more complex language just typing it all out and integrating it into business systems at their work, or anything else, before testing it.

Why would AI be any different?

Why the hell are AI skeptics acting like getting help from an LLM would involve not testing anything? Of course I test it! Why on earth wouldn't I? Just as I tested code made by freelancers I hired on commission before using the code I bought from them. Do AI skeptics really not test their own code? Are you all insane?


> While I only know R, which is obviously a simple language

Take it from someone who started with R: R is 100% not a simple language. If you can write good R, you're probably a surprisingly good potential SE, as R is kinda insane and inconsistent due to almost 50 years of history (from S to R, etc.).


Hmmm.. I'm trying to imagine interviewing for SE and telling them I got wealthy from a crypto market-making algorithm I coded in R during Covid and the interviewer responding with anything but laughter or with silence as they ponder legal ways to question my mental health.

It's an excellent language, I think, for many reasons. One is that you can work with data within hours: even before learning what packages or classes are, you get native objects for data storage, wrangling, and analysis. I could import my Excel data and pick up the native function cheat sheet so fast that I was excited to learn what packages are, because I couldn't wait to see what else I could do.

That was my experience in like 2010, maybe, and after having C++ and Python go in and out my head during college multiple times. I view R as simple only because I actually felt more helpless to keep learning it than helpless to ever learn coding at all. Worth noting that I was a Stat/Probability tutor with a Finance degree and much Excel experience.


> That was my experience in like 2010, maybe, and after having C++ and Python go in and out my head during college multiple times. I view R as simple only because I actually felt more helpless to keep learning it than helpless to ever learn coding at all. Worth noting that I was a Stat/Probability tutor with a Finance degree and much Excel experience.

Ah yeah, makes sense. That's the happy path for learning R (know enough stats etc to decode the help pages).

That being said, R is an interesting language with lots of similarities to both C-based languages and also Lisp (R was originally a Scheme interpreter), so it's surprisingly good at lots of things (except string manipulation, it's terrible at that).


Easy answer. Ask ChatGPT to write testable code, and tests for the code, then just verify the tests. If the tests don't work, have ChatGPT use the test output to rewrite the code until they pass.
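
A minimal sketch of what "just verify the tests" means in practice - the function and the 20% business rule below are made-up examples, not from any real project:

    # verify_generated.py - sketch of "read and run the tests, not every line of the code"
    # apply_discount stands in for code pasted from ChatGPT (hypothetical example)
    def apply_discount(price: float, percent: float) -> float:
        percent = min(percent, 20.0)                  # assumed business rule: discounts cap at 20%
        return round(price * (1 - percent / 100), 2)

    # the tests are the part a human actually reads and signs off on
    def test_discount_is_capped_at_20_percent():
        assert apply_discount(100.0, 50.0) == 80.0

    def test_discount_never_goes_negative():
        assert apply_discount(10.0, 100.0) >= 0.0

    if __name__ == "__main__":
        test_discount_is_capped_at_20_percent()
        test_discount_never_goes_negative()
        print("ok")

If a test fails, paste the failure output back into the chat and ask for a revised implementation - that's the whole loop.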

If you can't have ChatGPT write testable code because of your architecture, you have other problems. People with bad process and bad architecture saying AI is bad because it doesn't work well with their dumpster fire systems, 100% facepalm.


> If you can't have ChatGPT write testable code because of your architecture, you have other problems.

There are lots of reasons why code is hard to test automatically that have nothing to do with the architecture of the code, and everything to do with the domain the code is written for and runs in.


> The code ChatGPT generates is often bad in ways that are hard to detect.

Does it work, though? Yes, it does. There are many human coders who write bad code, and life goes on.


>I can't think of a single recent technology that was so widely adopted by tech and non-tech people alike, immediately integrated into day-to-day experience.

This is not meant to be an offense, but you are in a bubble. The vast, vast majority of people do not use LLMs in their day-to-day life. That’s ok, we’re all in our own bubbles.

You should also post the 2048 clone as proof. Lots of people say they built X in Y minutes with AI, but when it's inspected, it's revealed it very obviously doesn't work right and needs more development.


I hand-wrote perhaps 10-20 lines of this project:

https://github.com/williamcotton/guish

The rest is Claude 3.5 (with a dash of GPT-4o) with a LOT of supervision!

I'd say I'm about 8 hours deep and that this would have taken me at least 30+ hours to get it to the current state of polish.

I used it to make some graphs at work today!


Quite interesting — but how is it fundamentally more productive than being in VS code in R or python? You don’t get any of the benefits of an IDE here. I often find myself doing very similar workflows but default to either VS Code or the shell. Trying to imagine this truly making workflows faster/easier/more efficient, but can’t figure it.

Maybe it isn’t? I am just experimenting with new UX! Maybe it could be integrated into an editor of… the fuuuturrre!

But seriously, do you have any thoughts or suggestions?


> You should also post the 2048 clone as proof.

I posted it twice already in this thread, but I guess third time's the charm: http://jacek.zlydach.pl/v/2048/ (code: https://git.sr.ht/~temporal/aider-2048).

It's definitely not 100% correct (I just spotted a syntactic issue in HTML, for example), and I bet a lot of people will find some visual issue on their browser/device configuration. I don't care. It works on my desktop, it works on my phone, it's even better than the now-enshittified web and Android versions I used to play. I'm content :).


It is too large for my phone display (iphone SE). Do you think chatgpt can fix it?

Yes. It does so trivially, but in the process it breaks the CSS for larger screens. I couldn't get it to get both to work at the same time in 5 minutes of trying. My modern CSS skills aren't good enough to quickly notice what the problem is, so it's beyond my horizon of caring (but I do encourage hints I could pass on to the AI).

>Yes. It does so trivially

>but in the process it breaks the CSS for larger screens.

So, no, it doesn't fix it trivially. It also isn't correctly sized on iPhone 11 Safari.


It does fix it trivially, just in a way that causes regression on larger screens :).

As mentioned above, I don't care. It's sized correctly for the devices I use to play it, and I'm not going to put any more work into this. I mean, even junior web devs get paid stupidly high salaries for doing Responsive Web Design; I ain't gonna work on this for free.

(But I will accept AI-generated patches, or human-generated advice I could paste into the prompt to get a correct solution :P.)


Being able to create 2048 in 6 hours has basically zero economic value.

Can ChatGPT materially and positively impact the code written by big companies? Can it do meaningful work in excel? Can it do meaningful PowerPoint work? Can it give effective advice on management?

Right now we don’t know the answer to those questions. LLM apps can still improve in many ways - better base models, better integration with common enterprise applications, agentic processes, verifiability and so on - so there is definitely hope that there will be significant value created. Companies and people are excited because there’s huge potential. But it is really just potential right now … current systems aren’t creating real enterprise value at this moment in time


> Can ChatGPT materially and positively impact the code written by big companies? Can it do meaningful work in excel? Can it do meaningful PowerPoint work? Can it give effective advice on management?

> Right now we don’t know the answer to those questions.

I know the answer to the first three. Yes, yes, and yes. I've done them all, including all of them in the past few weeks.

(Which is how I learned that it's much better to ask ChatGPT to use Python evaluation mode and Pandoc and make you a PPTX, than trying to do anything with "Office 365 Copilot" in PowerPoint...)

As for the fourth question - well, ChatGPT can give you better advice than most advice on management/leadership articles, so I presume the answer here is "Yes" too - but I didn't verify it in practice.

> current systems aren’t creating real enterprise value at this moment in time

Yes, they are. They would be creating even more value if not for the copyright and exports uncertainty, which significantly slows enterprise adoption.


> I know the answer to the first three. Yes, yes, and yes.

You say this but from a management perspective at a large enterprise software company I have not seen it.

Some of our developers use copilot and gpt and some don't and it is incredibly difficult to see any performance difference between the groups.

We aren't seeing higher overall levels of productivity.

We aren't seeing the developers who start using copilot/gpt rush ahead of their peers.

We aren't seeing any ability to cut back on developer spend.

We aren't seeing anything positive yet and many developers have been using copilot/gpt for >1 year.

In my opinion we are just regaining some of the economic value we lost when Google Search started degrading 5-10 years ago.


> We aren't seeing higher overall levels of productivity.

You can't measure productivity for shit, otherwise companies would look entirely different. Starting from me not having to do my own finances or event planning or a hundred other things that are not in my job description, not my specialty, and which were done by dedicated staff just a few decades ago, before tech "improved office productivity".

> We aren't seeing the developers who start using copilot/gpt rush ahead of their peers.

That's because individual productivity is usually constrained by team productivity. Devs rushing ahead of their teammates makes the team dysfunctional.

> We aren't seeing any ability to cut back on developer spend.

Devs aren't stupid. They're not going to give you an opportunity if they can avoid it.

> We aren't seeing anything positive yet and many developers have been using copilot/gpt for >1 year.

My belief is that's because you aren't measuring the right things. But then, no one is. This is a problem well-known to be unsolved.


> You can't measure productivity for shit

We can't measure small changes and we aren't great at comparing across orgs.

However, at the director level we can certainly see a 50% or 100% productivity improvement in our teams and with individuals in our teams.

We aren't seeing changes of this magnitude because they don't exist.


There are other potential explanations.

Perhaps developers are now slacking off.

Perhaps we have added more meetings because developers have more free time.

Or perhaps developers were never the bottleneck.

We can see large productivity improvements when we make simple changes like having product managers join the developers daily standup meetings. We can even measure productivity improvements from Slacks/Zooms auto-summary features. Yet gpt/copilot doesn't even register.


> We can even measure productivity improvements from Slacks/Zooms auto-summary features.

While not code generation, this auto-summary is powered by the same tech. I think using it to sift through and surface relevant information, as opposed to generation of new things, will have the biggest impact.

By far the greatest value I get out of LLMs is asking them to help me understand code written by others. I feel like this is an under-appreciated use. How long has this feature been in Copilot? Since February or so? Are people using it? I do not use Copilot.


> Or perhaps developers were never the bottleneck.

Now that's dangerous thinking, but I think you are onto something.


I use ChatGPT copilot etc to reduce my cognitive load and get a lot of things done quicker so I also have more time to fuck around. You're out of your goddamn mind if you think I'm going to increase my output for the mere chance that maybe I'll get an above inflation raise in a year. "We gave our devs a magic 10% productivity boost machine, but their output hasn't increased? I guess the machine doesn't work..." It's amusing how out of touch you are.

There is an ethical question in here that I don’t have an answer for. As an employee, I find a way to do my job more efficiently. Do I hand those efficiencies to my employer so I can get a pat on the head, or do I keep them to myself to make my own life less stressful? If I give them to the boss, do they even have the ability to increase my pay? Using the extra time to slack off rather than enriching the employer might be the best choice.

Edit: and now I see chillfox made the same point.


Out of curiosity, what are you using to measure developer productivity, platform- or metrics-wise (if beyond typical sprint metrics)?

Passing on personal productivity gains to management is always a HUGE L for the individual worker.

As a dev, you can use the saved time to slow down and not be stressed, spend more time chatting with colleagues, learn new skills, maybe improve the quality of the code, etc. Or you can pass it on to management which will result in your workload being increased back to where you are stressed again and your slower colleagues will be let go, so now you get to feel bad about that and they won't be around to chat with.

I have never in my life seen workers actually get rewarded with pay raises for improved productivity, that is just a myth the foolish chase, like the pot of gold at the end of the rainbow.

I have also tried being the top performer on a team before (using automation tools to achieve it), and all I got was praise from management. That's nice, but I can't pay for my holidays with praise, so not worth it.


Writing code is just one part of the process. Other bottlenecks might prevent you from seeing overall productivity improvements.

For example:

- time between PRs being created and being picked up for review and merged

- time spent on releasing at end of sprint cycles

- time spent waiting for QA to review and approve

- extreme scrum practices like "you can only work on things in the sprint, even if all work is done"

How are you measuring developer productivity? Were those that adopted copilot and chatgpt now enabled to finally keep up with their faster peers (as opposed to outstrip them)? Is developer satisfaction improved, and therefore retention?


Yes, other bottlenecks might be preventing us from seeing overall productivity improvements. We might require large organisational changes across the industry in order to take advantage of the improvements.

I guess we will see if smaller startups without many of our bottlenecks are suddenly able to be much more competitive.

> How are you measuring developer productivity?

We use a host of quantitative and qualitative measures. None of them show any positive improvements. These include the basics like roadmap reviews, demo sessions, feature cycle time, etc as well as fairly comprehensive business metrics.

In some teams every developer is using copilot and yet we can't see any correlation with it and improved business metrics.

At the same time we can measure the impact from changing the label on a button on our UI on these business metrics.

> Were those that adopted copilot and chatgpt now enabled to finally keep up with their faster peers

No.

> Is developer satisfaction improved, and therefore retention?

No.


> We use a host of quantitative and qualitative measures. None of them show any positive improvements. These include the basics like roadmap reviews, demo sessions, feature cycle time, etc as well as fairly comprehensive business metrics.

Those are very high level. If there's no movement on those, I'd guess there are other things bottlenecking the teams. They can code as fast as possible and things still move at the same pace overall. Nice thing to know.

If you want to really test the hypothesis that Copilot and ChatGPT have no impact on coding speed, look at more granular metrics to do with just coding. The average time from the moment a developer picks up a work item to the time it gets merged (assuming code reviews happen in a timely fashion). Hopefully you have historical pre-AI data on that metric to compare to.

Edit: and average number of defects discovered from that work after merge
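
If you already export work-item data, that's a cheap thing to compute - a rough sketch, with a made-up file and made-up column names:

    # rough sketch: average pickup-to-merge time per quarter (file and columns are hypothetical)
    import pandas as pd

    items = pd.read_csv("work_items.csv", parse_dates=["picked_up_at", "merged_at"])
    items["cycle_hours"] = (items["merged_at"] - items["picked_up_at"]).dt.total_seconds() / 3600
    print(items.groupby(items["merged_at"].dt.to_period("Q"))["cycle_hours"].mean())

Comparing pre-Copilot quarters against post-Copilot ones (and splitting by who actually uses it) gets at the question more directly than org-level roadmap metrics.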


> look at more granular metrics to do with just coding. The average time from the moment a developer picks up a work item to the time it gets merged (assuming code reviews happen in a timely fashion)

We do collect this data.

I personally don't put a lot of stock in these kinds of metrics because they depend far too much on the way specific teams operate.

For example perhaps Copilot helps developers understand the codebase better so they don't need to break up the tasks into such small units. Time to PR merge goes up but total coding time could easily go down.

Or perhaps Copilot works well with very small problem sizes (IMO it does) so developers start breaking the work into tiny chunks Copilot works well with. Time to PR merge goes way down but total code time for a feature stays the same.

For what it is worth I do not believe there have been any significant changes with these code level metrics either at the org level.


Have a chat with the developers and see if having Copilot / ChatGPT available has influenced how they break their PRs down first.

> We aren't seeing higher overall levels of productivity.

> We aren't seeing the developers who start using copilot/gpt rush ahead of their peers.

You think we are antsy worker bees, hastily rushing forwards to please the decision maker with his fancy car?

You are leadership. It's not hard. Cui bono, follow the money, etc. The incentives are clear.

If me and my peers were to receive a magic "do all my work for me" device I can assure you exactly zero percent of that knowledge will reach your position. Why would it? The company will give me a pat on the back. I cannot pay with pats on the back. Your Tesla cannot be financed with pats on the back. Surely you understand the nature of this issue.


If you write a spaghetti system where collecting the context for the AI is a big time sink, and there are so many service/language barriers that AI get confused, of course AI is going to suck. Of course, if you give your programmers a game pad and tell them to use it to program with a virtual keyboard, they're gonna suck ass too, so you should consider where the fault really lies.

Is it the superstars or the line holders that have been the first adopters? I could speculate, but I am actually curious what you are seeing in practice.

The first adopters seem to be the same group / personality type that is always first to adopt new technologies.

Very few of these are the superstars. But plenty are good solid senior developers.


I think you're thinking about things very locally. Of course ChatGPT can help with some coding - I use it for regexes quite often because I never really learned them well.

The problem is that at the average medium-sized company, code looks like this: you have 1 million lines of code written over a decade by a few hundred people. A big portion of the code is redundant, some of it is incomplete, much of it is undocumented. Different companies have different coding styles, different testing approaches, different development dynamics. ChatGPT does not appreciate this context.

Excel has some similar problems. First of all, Excel is two-dimensional, and LLMs really don't think in two dimensions well. So you need to flatten the Excel file for the LLM. A common approach is to read it with pandas and then use the column and row names to index into the sheet.

Unfortunately, Excel files at companies cannot be easily read using pandas. They are illogically structured, have tons of hardcoding, reference between sheets in weird circular ways, and so on. I spent some time in finance, and sell-side equity research models, written by highly trained financial analysts, are substantially better organized than the average Excel model at a company. Even this subset of real-world models is far from suitable for a direct pandas interpretation. Parsing sell-side models requires delicate and complex interpretation before being fed into an LLM.
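
The flattening step itself is only a few lines when the sheet is well-behaved - something like the sketch below, with made-up file and sheet names - but real corporate workbooks break every assumption it makes:

    # naive flattening of a tidy sheet into LLM-friendly text (hypothetical file/sheet names)
    import pandas as pd

    df = pd.read_excel("model.xlsx", sheet_name="Revenue", index_col=0)
    lines = [f"{row} / {col}: {df.at[row, col]}" for row in df.index for col in df.columns]
    prompt_context = "\n".join(lines)  # row and column labels become the index the LLM can refer to
    print(prompt_context[:500])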


>Which is how I learned that it's much better to ask ChatGPT to use Python evaluation mode and Pandoc and make you a PPTX, than trying to do anything with "Office 365 Copilot" in PowerPoint...

Can you elaborate on what this saved over just making the ppt the old fashioned way?


"I have this set of notes attached below; would you kindly group them by X and tabulate, and then use Python with Pandoc to make me a PowerPoint with that table in it, plus an extra slide with commentary from the notes?"

Attach notes, paste, press Enter, wait half a minute, get back a PPTX you can build on, or just restyle[0].

Sure, it's faster to build the presentation yourself than to make ChatGPT make the whole thing for you. But the more time-consuming and boring parts, like making tables and diagrams and summaries from external data or notes, is something ChatGPT can do in a fraction of time, and can output directly into PPTX via Pandoc.
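
Roughly, the local equivalent of that last step looks like this (assuming pandoc is installed; the outline content is just a made-up example):

    # sketch: Markdown outline -> PPTX via pandoc (pandoc must be on PATH)
    import subprocess

    outline = "# Status update\n\n## Grouped notes\n\n- item one\n- item two\n\n## Commentary\n\nShort summary here.\n"
    with open("slides.md", "w") as f:
        f.write(outline)

    # pandoc maps the headings to slides and emits a PPTX you can restyle in PowerPoint
    subprocess.run(["pandoc", "slides.md", "-o", "slides.pptx"], check=True)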

(There's a lot of fun things you can do with official ChatGPT and Python integration. The other day I made it design, write and train a multi-layer perceptron for playing tic-tac-toe, because why waste my own GPU-seconds :).)

--

[0] - In contrast, if you make the same request in PowerPoint's O365 Copilot, it'll barf. Last time I tried, it argued it has no capability to edit the document; the time before that, it made a new slide with text saying literally "data from the previous message".


> Can ChatGPT materially and positively impact the code written by big companies?

It already has at a Fortune 100 company I contract with currently.

> Can it do meaningful work in excel?

We can quibble about what "meaningful" means, but it satisfactorily answered questions for two friends about how to build formulas for their datasets and is currently being used to summarize data insights from a database at a different large client (Excel =/= database, but the point stands).

> Can it do meaningful PowerPoint work?

I've used Midjourney multiple times a month to generate base imagery for various things in PowerPoint (usually requires modification in Photoshop, but saves me several hours each time compared to digital painting or photobashing from scratch).

> Can it give effective advice on management?

Again, what does "effective" mean in the context of management? I've seen VP-level individuals with hundreds of people in their orgs using AI tools for different things.

It really feels like a significant chunk of the HN crowd is living in a bubble with respect to AI in the real world right now. It's absolutely invading everything. As for how much revenue that will translate into long-term vs. the investment dollars being poured into it, that's a more interesting question to discuss.


> Can it do meaningful work in excel?

Yes it can.

But more importantly have you tried ChatGPT Data Analyst?: https://openai.com/index/improvements-to-data-analysis-in-ch...

It drops the barrier for "pretty good data analysis" to effectively zero.

> Can it do meaningful PowerPoint work?

Canva and Figma are both building this and they are pretty decent right now. Better than most PowerPoints I've seen.

The aforementioned Data Analyst does good presentations in a different way, too.

> Can it give effective advice on management?

Yes. Unfortunately can't talk about this except it is mindblowingly good.


> The aforementioned Data Analyst does good presentations in a different way, too.

And on top of that, it can do PowerPoint presentations too - magic keywords are "use Python and Pandoc".


> Can it give effective advice on management?

My friends at McKinsey say that while it can’t fine-tune reports and presentations with quite enough nuance, it does a good job sifting through lots of shit to pick out important parts they should pay more attention to, highlighting data/talking points that contradict a working hypothesis, assisting in writing emails, and other time-consuming or very nit-picky tasks.

That said, no one I know has fed it real customer data, that would be a career-ending event. But self-hosted models like Gemma2 open up the possibility for using LLMs against real customer info.


Hard to tell if that says more about the value of LLMs or the lack of value of McKinsey...

    > the lack of value of McKinsey
Leaving McKinsey's specific brand value aside, people always miss the value of "hiring (business) consultants". You basically get insider knowledge about how competitors' businesses and systems work. So if you ask for advice about how to build a healthcare app for smartphones, you hire McKinsey (or whoever) to tell you "about the market". But really, they are just telling you about what they saw at other competitors. For some business decisions, it is very valuable.

> Being able to create 2048 in 6 hours has basically zero economic value.

That's actually a really good point. In the realm of programming, things that were previously not done because they were too expensive can now be done. Prior to ChatGPT, GP could have a) done it themselves, but the cost was too high / it wasn't worth their time, b) found enough time to write a spec, found someone on Upwork or the like, and paid them to make it, except that costs money they didn't want to spend, or c) just not done it. Now, GP can code this thing up while watching Netflix with the kids or whatever. What programs that previously didn't have the economic value to exist can now exist, thanks to programming time getting cheaper?

Now apply that to fields outside of programming. LLMs' ability to program is front and center here, since many of us can program, but they do other things as well.


> Can it do meaningful PowerPoint work?

Yes, it absolutely can. I threw together a PowerPoint presentation with a script for a low-value, high visibility meeting a couple of weeks ago with ChatGPT 4.0 and a PowerPoint plugin. Everyone loved it.


    > low-value, high visibility meeting
This is such a gem. Can you tell us more about the meeting? A senior manager kicking the tyres, or what? Any funny bike-shedding stories to tell?

Corporate values. Drew the short straw, but had to present something.

> I can't think of a single recent technology that was so widely adopted by tech and non-tech people alike, immediately integrated into day-to-day experience.

I've heard this asserted sometimes, and I just don't think it's true. ChatGPT's use cases as consumer software were discovered basically immediately after GPT-3 came out, and nothing new has really emerged since then. It's great for automating high school/undergrad B-quality writing and the occasional administrative email. Beyond that, it sometimes does better than 2024 Google (though probably still worse than 2019 Google) on knowledge questions.

ChatGPT is software. The barrier to entry is almost zero, and the tech industry has had decades of practice in enticing people into walled gardens and making sure they can never leave. If it's not completely taken over the world in the time it's had, I wouldn't bet on it doing so without a massive jump in capability or accuracy.


Scientific computing is going to be overhauled with this. We have essentially standardized a way to approximately solve optimization problems. And people are now going to design methods to fit this kind of solution, just like people did with linear solvers.

Not to mention all the proof-writing that will become simpler with this optimization/searcher now.


Sure, AI represents a substantial improvement over other heuristic methods in some areas. But that's a long way down from the "AI is going to be a permanent fixture in most people's day to day lives" claim that the tech industry is betting the farm on right now.

I think you’re missing that “every” K-12 student is using ChatGPT for all their work right now. Yes, the state of education is in peril (with ChatGPT actually being pretty far down the pareto chart), but the generation coming up after us is absolutely growing up using LLMs the way we used calculators, and using it for everything.

We may not see universal adoption in people who are currently >30 but I think we will in the generations that are <25 now.


But are they using it for anything else? Even GPT-2 was "good enough" for high school writing assignments. The fact that this power user class of young people hasn't found any other use cases for LLMs despite vast improvements in LLM capability doesn't reflect well on the merits of LLMs as consumer software.

I think you may be having a failure of imagination. That generation will be creating additional tooling to use this wherever possible, and taking advantage of any interoperability they or their peers can hack onto any interface. Our generation will be hesitant but they will not be, and many of the upcoming generation will have a deep understanding of where LLM’s have appropriate vs. awkward application.

> The fact that this power user class of young people hasn't found any other use cases for LLMs

Why are you assuming they haven't? High school writing assignments and homework are the majority of the problems teenagers face daily, but they're also having fun with it, and why wouldn't they try it on new problems as they come along?


Of course students are using ChatGPT. It helps them to write assignments.

But it isn't translating into better across the board test results and at least in Australia we would be able to tell because we have yearly standardised testing.

And so schools are looking at it as more of a form of cheating and simply moving back to in-person, hand-written tests.


With what? How? There's already a lot of bad Python/R code out there; how will more of it "overhaul" scientific computing?

> We have essentially standardized a way to approximately solve optimization problems.

.. what does this mean? we had simplex solvers before. do you mean things like protein folding prediction?


> How can anyone not see just how impactful it's going to be? Or already is? I can't think of a single recent technology that was so widely adopted by tech and non-tech people alike, immediately integrated into day-to-day experience. The rise of mobile phones and e-commerce in the 90s would be the last time I've seen this happen (I'm not counting smartphones, as those are more of an iteration). Or social media, in purely software space.

You can't know this for certain until you look back on it in retrospect. We did not know mobile phones and e-commerce were going to be huge back in the 90s. We know now, of course, looking back, and the ones who guessed right back then can pat themselves on the back now.

Everyone is guessing. I'll admit it's totally possible LLMs and AI are going to be as earth shattering as its boosters claim it will be, but nobody can know this now with as much certainty as is being written.


> We did not know mobile phones and e-commerce were going to be huge back in the 90s.

Eh? We did. The whole dot-com boom was predicated on that assumption. And it wasn't wrong. But most of the dot-com investments went sideways. In fact, they imploded hard enough to cause a recession.

In the same vein, even if we all agree that AI is fundamentally transformative, it doesn't mean that it's wise to invest money into it right now. It's possible that most or all of these early products and companies will go bust.


I think this is the right sentiment. I know a handful of AI startups that have raised tens to hundreds of millions. All of them were crushed by GPT-3 and subsequent models. None of them have any real revenue, they have crazy burn rates with their compute spend, and they generally haven't proven any value with their AI platforms. Most seem to be working on tech in search of a problem rather than the inverse. Funds are throwing money at ideas that haven't panned out for years now. I worked with one, and they were spending tens of millions per researcher on AI compute... which makes sense if it's directed, but most of the researchers were just running off on their own and the company hoped one would figure something out. Very disorganized for the stacks of cash being spent. Similar things have happened in the Cyber Security field, just at a lesser scale.

But hey, Nvidia is investing in companies.. to spend money on Nvidia .. infinite money glitch!


> I'll admit it's totally possible LLMs and AI are going to be as earth shattering

You don't need earth shattering though. The PC revolution was huge because every company got a bit more productive with things like word processors and printing and email.

The internet (and then later mobile) was big because every company got a revenue boost, from a small one for those with just an online presence, to a huge one for e-commerce, to transformative for Netflix and streaming services.

Ignoring the more sci-fi claims of AGI or anything, if you just believe that AI is going to make every office worker 10% more productive, surely each company is going to have to invest in AI, no? Anytime you have an industry that can appeal to every other company, it's going to be big.


I wouldn't be surprised if in large companies (say >500 office workers) 10% of all office work becomes redundant. Not in the form of each worker getting 10% more productive, but in form of some roles getting eliminated completely and others losing 80% of their workload.

That's been true even for traditionally programmed replacements though; there are plenty of offices out there with a bunch of people banging on Excel when everything they do could be automated.

Why? You could make the same argument about PCs or mobile or the Internet?

> You can't know this for certain until you look back on it in retrospect.

Correct, but the thing is, AI has blown up much faster than phones - pretty much a decade in a single year, in comparison. Mobile phones weren't that useful early on, outside of niche cases. Generative AI is already spreading to every facet of people's lives, and has even greater bottom-up adoption among regular people than top-down adoption in business.


> Correct, but the thing is, AI blown up much faster than phones

What do you base that on though? Two years into the iPhone, Apple reported $6.75b in iPhone-related revenue. ChatGPT may reach or surpass that this year, considering they're currently at $3.4b. That's not exactly what I would call growing faster than phones, however, and according to this article, very few people outside of Nvidia and OpenAI are actually making big money on LLMs.

I do think it's silly to see this wave of AI referred to as the next blockchain, but I also think you may be hyping it a little beyond its current value. It being a fun and useful tool for a lot of things isn't necessarily the same thing as it being something that's actually worth the money investors are hoping it will be.


>> Correct, but the thing is, AI blown up much faster than phones

>What do you base that on though? Two years into the iPhone, Apple reported a $6.75b revenue on iPhone related sales. ChatGPT may reach or surpass that this year considering they're currently at $3.4b.

But the iPhone launched more than 10 years after mobile phones became widespread (in fact, more than 20 years after they first appeared, but that's stretching it). There were more than 1B mobile phones shipped in 2006, the year before the iPhone launched.


> What do you base that on though?

My childhood? I was a teen when mobile phones started to become widely used, and soon after pretty much necessary, in my part of the world. But, to reiterate:

> Two years into the iPhone, Apple reported a $6.75b revenue on iPhone related sales.

That's just an iteration, and not what I'm talking about. Smartphones were just different mobile phones. I'm talking about the adoption of a mobile phone as a personal device by general population.

> It being a fun and useful tool for a lot of things isn't necessarily the same thing at it being something that's actually worth the money investors are hoping it will be.

That's probably something which needs to be disentangled in these conversations. I personally don't care what investors think and do. AI may be hype for the VCs. It's not hype for regular Janes and Joes, who either already integrated ChatGPT into their daily lives, or see their friends doing so.


It's a lot easier to use AI when it's basically given away for free than when it cost $399 for a Palm Pilot in the 90s.

For a $399 device, Palm Pilot did well and had an excellent reputation for the time. Phones really took over the PDA market as a personal pocket-computer more-so than being used as ... a phone...

Really, I consider the modern smartphone a successor to the humble PDA. I grew up in that time too, and I remember the early Palm adopters having to explain why PDAs (and later Blackberries) were useful. That was already all figured out by the time iPhone took over.


Calling the iPhone an iteration is pure nonsense. Mobile phones had tiny utility compared to smartphones.

A phone on the go didn’t fundamentally alter anything except for making coordination while traveling easier. I went through both the cell phone adoption curve and the smartphone curve.

The latter was the massive impact that brought computing to the remaining 85% of the planet and upended targeting desktop operating systems for consumers by default.

Calling smartphones an iteration on cellphones is like calling ChatGPT an iteration on the neural networks we had 10 years ago.


> Mobile phones had tiny utility compared to smartphones

They had tiny utility compared to modern smartphones but the first iPhone was a glorified iPod with a touchscreen and a cellular radio. It didn’t have an app store and the only thing it really did better than other mobile phones was web browsing, thanks to the touchscreen keyboard.

It wasn’t as revolutionary as hindsight now makes it seem. It was just an iteration on PalmPilots and Blackberries.


I remember getting it. Having a real browser was revolutionary. And having the maps app (backed by google maps at the time) was a huge deal.

I had a blackberry before and it was just a glorified email and texting device.

It was immediately obvious how revolutionary the iPhone was. That’s why Android immediately pivoted hard to replicate the experience.


Another thing that is forgotten about the first iPhone: I think Apple negotiated with AT&T (?) to change the voice mail system so you could select from your iPhone (voicemail app?) which message you wanted to listen to. Previously, you always needed to call the mobile provider's voice mail system, then listen to the messages in order (or skip them). That was a huge early selling point for the iPhone. I know -- no one cares about voice mail in 2024, but it used to be very important.

> I personally don't care what investors think and do.

Isn't this an odd take when you're discussing things on a VC website? In any case, if you like LLMs you probably should care, considering it's the $10b Microsoft poured into OpenAI that's made the current landscape possible. Sure, most of that money was funneled directly into Azure because that's where OpenAI does all its compute, but still.

> It's not hype for regular Janes and Joes, who either already integrated ChatGPT into their daily lives, or see their friends doing so.

Are they paying for it? If they aren't then will they pay for it? I think it's also interesting to view the numbers. ChatGPT had 1.6 billion visitors in January 2024, but it had 637 million in May 2024.

Again. I don't think it's all hype; I think it'll change the world, but maybe not as radically as some people expect. The way I currently view it is as another tool in the automation tool-set. It's useful, but it's not decision-making, and because of the way it functions (which is essentially by being very good at being lucky) it can't be used for anything important. You really, really wouldn't want your medical software to be written by an LLM programmer. Which doesn't necessarily change the world too much, because you really, really didn't want it to be written by a search-engine programmer either. On the flip side, you can actually use ChatGPT to make a lot of things and be just fine. Because 90% (and this is a number I've pulled out of my ass, but from my anecdotal experience it's fairly accurate) of software doesn't actually require quality, fault tolerance or efficiency.


This is all just meaningless anecdotes.

And regular Janes and Joes are not using ChatGPT. Revenues would be 10-100x if that were the case.


> And regular Janes and Joes are not using ChatGPT. Revenues would be 10-100x if that were the case.

3/4 of the people I know who are actively using it are on the free tier. And based on all the HN conversations in the last year, plenty of HNers commenting here are also using the free tier. I'd never go back to GPT-3.5, but apparently most people find the free tier useful enough that they're reluctant to pay that $20/month.

As for the rest, OpenAI is apparently the fastest-growing service of all time ever, so that says something.


>>apparently most people find it useful enough to the point they're reluctant to pay that $20/month.

Or they find it useless enough that they're unwilling to pay for the upgrade.


I'm one of the free tier people.

A while back I used 3.5 to make a chat web page so I could get the better models as PAYG rather than subscription… and then OpenAI made it mostly pointless because they gave sufficient 4o access to the free tier to meet my needs.


The iPhone was not the first cell phone. Initial adoption of cell phones was much slower than ChatGPT. Think 1980’s/1990’s.

Even when cell phones started getting popular, often only one or two family members would get one. The transition time between “started becoming popular” and “everyone has one” was >5 years and even then it was relatively normal that people would just turn off their cell phone for a few days (to mixed reactions from friends and family).


Any talk of "regular people" inside the HN bubble is fraught with bias. Commenters here will sometimes tell you that "regular people" work at FAANG, make $400K/yr and have vacation homes in Tahoe. Actual "regular people" use Facebook occasionally, shop at the grocery store, watch Sportsball on TV, and plan their kids' birthday parties or their next vacation. They're not sitting there augmenting their daily lives with ChatGPT.

You're a long time HN contributor and I admit when I see your username, I stop and read the comment because it's always insightful, polite, and often makes me think about things in ways I never have before! But this discussion borders on religious fervor. "Every facet of peoples' lives?" Come on, man!


My dad used AI to generate artwork, and the pictures are now hung up across the house.

He's intersted in meditation and mindfulness. He's not a native English speaker, so he's using AI to help him write content. He's then using AI text to voice to turn his scripts into YouTube videos. The videos have AI generated artwork too.

My dad is a retired welder in his late 60s. He's as "regular people" as it gets.

I'm a high school teacher and GPT has completely changed teaching. We're using it to help with report writing, lesson planning, resource creation, even for ourselves to get up to speed on new topics quickly.

I'm working on a tool for teachers that's only possible with GPT.

It's by far the single, most transformative technology I've ever encountered.


And how much of an influence have you had on him to encourage or assist with this behaviour? What about the average person that doesn't know anybody (at least closely) working in tech.

> Actual "regular people" use Facebook occasionally, shop at the grocery store, watch Sportsball on TV, and plan their kids' birthday parties or their next vacation. They're not sitting there augmenting their daily lives with ChatGPT.

I'm aware of the HN bias, but in this case, I'm talking regular, non-tech, sportsball or TikTok watching crowd. Just within my closest circles, one person is using ChatGPT for recipes, and they're proficient at cooking so I was surprised when they told me the results are almost always good enough, even with dietary restrictions in place (such as modifying recipes without exceeding nutrient limits). Another person used it for comparison shopping of kids entertainment supplies. Another actually posted a car sale ad and used gen-AI to swap out background to something representative (no comment on ethics of that). Another is evaluating it for use in their medical practice.

(And I'm excluding a hundred random uses I have for it, like e.g. making colorbooks for my kids when they have very specific requests, like "dancing air conditioners" or whatever.)


Without data, we're trading anecdotes. Your circle is in the bubble, mine isn't. This user[1] shared a Pew survey, which looks like the best we're going to get. The survey asked what % of people ever used ChatGPT, which I'd interpret as "at least one time ever" and the number is less than 1 in 4. I'd love to see actual data on what percentage of people use it at least once daily, which is the bar I'd accept for "integrated with their daily lives."

1: https://news.ycombinator.com/item?id=40870205


Fair enough. My circle may be in a bubble, and come to think of it, selection bias is a factor - I can think of all the people close to me who use it, but there are more equally close friends and relatives who (as far as I know) don't. I do think the survey is showing quite a big result, given we're barely a year into the whole ChatGPT craze - but we can revisit this topic when more data becomes available. I'm sure it'll come up again :).

    > work at FAANG, make $400K/yr
Or: work at an unnamed high frequency trading hedge fund, make $800K/yr. (The number of people on HN claiming to be this person surely exceeds the number in the Real World by ... multiples.)

Any evidence backing up these claims about adoption?

I thought the same about adoption (across multiple audiences, not just tech workers and/or young people), but was faced with surprisingly poor knowledge about GenAI when running surveys about it in my company. Maybe investors are asking the same questions right now.


Pew Research asked Americans this March:

https://www.pewresearch.org/short-reads/2024/03/26/americans...

23% said they'd used ChatGPT, 43% said they hadn't, 34% didn't know what it was.


the article states

> The Information recently reported that OpenAI’s revenue is now $3.4B, up from $1.6B in late 2023.

and links to

https://www.theinformation.com/articles/openais-annualized-r...

That's a lot of $20/month subscriptions. It's not all that, but that's a lot of money regardless.


OpenAI’s revenue is not exclusively subscriptions.

There are a lot of companies building private GPTs and using their API.


IIRC openai was the fastest growing service by subscriptions of all time.

> Mobile phones weren't that useful early on, outside of niche cases.

Truck drivers, construction crew, couriers, lawyers, sales people of all kinds, stock brokers, ... Most of the economy that isn't at a desk 996. Pretty big niche, bigger than the non-niche possibly.

You are in a bubble.


Huh? Mobile phones were available commercially in the 1980s, and started to proliferate in the 1990s. They were definitely not popular in those groups early on. Any of them.

It was in Europe. I remember that people started to get them in the mid 90's and they were everywhere by the end of that decade.

You are in a bubble, hyping this up way too far

That's his point, we _know_ this. People can already use OpenAI as a replacement for Google search and people are already doing this. You might not think this is a good thing yadda yadda go to the library, but we already know that chat bots are here to stay.

There is a huge spectrum between "here to stay" and "changing everything". On another note, I think that if the people arguing here worked out quantitative predictions, they would find that a not-insignificant part of the "disagreement" about how big we should expect this to be is really in the framing.

> You can't know this for certain

Except AI is already being used by people (like myself) every day as part of their usual work flow - and it's a huge boost in productivity.

It's not IF it will make an impact - it IS currently making an impact. We're only just moving past early adopters and we're still in the early stages in terms of tooling.

I'm not saying that AI will become sentient and take over humanity, but to think that AI isn't making an impact is to really have your head in the sand at this point.


I personally attribute this FOMO to so-called AI influencers who love "shilling" AGI as something that's as true as 1 + 1 = 2.

I don't get why people insist this is agi any more than a ship is artificial general swimming.

It doesn't matter if it's general, what matters is that its useful. And if you don't find it useful just remember a lot of people in the 00s didn't find google useful either since they already had the yellow pages.

I strongly suggest paying for a subscription to either OpenAI or Anthropic and learning quickly.


> learning quickly.

Learning what quickly exactly?


You don't even have to do that, just go to http://ChatGPT.com and type at it. You don't even need to make an account.

You get what you pay for; despite what everyone is saying, the GPT-4o model is really bad for long-form reasoning.

Buy the subscription and use the GPT-4 Turbo model.

After that, get API credits so you have access to the Playground and can change the system prompt. It makes a huge difference if you don't want to chat for 10 minutes before you get the result you want.


Swimming and 'intelligenting' are definitely not in the same category.

One difference between AI and mobile is this.

The mobile revolution needs three kinds of investment:

(A) The carrier has to build out a network

(B) You need to buy a handset

(C) Businesses need to invest in a mobile app.

The returns that anybody gets from investing in A, B or C depend on the investments that other people have made. For instance, why should I buy a handset if the network and the apps aren't there? Why should a business develop an app if the network and users aren't there? These concerns suppress the growth of mobile phones in the early phase.

ChatGPT depends on the existing network and existing clients for delivery, so ChatGPT can make 100% of the investment required to bring their product to market, which means they can avoid the two decades of waiting for the network and handsets to be there in order to motivate (C).

---

Note another thing that younger people might never have noticed was that the US was far behind the rest of the world in mobile adoption from maybe 1990 to 2005. When I changed apartments in the US in the 1990s I could get landline service turned on almost immediately by picking up the phone. When I was in Germany later I had no idea I could go into a store in most countries other than the US and walk out with a "handy" and be talking right away so I ended up waiting a month for DT to hook up my phone line.


Meanwhile I have the opposite experience.

I have used ChatGPT less and less, and bar Copilot, which is a useful autocomplete, I just don't have much use for AI.

I know I'm not alone, and even though I've seen many people super excited by DALL-E first and ChatGPT later, they now use both of them very rarely.


This is where I am with it now. I got bored with the image generators and tired of their plastic looking output.

I still use GPT or Claude occasionally but I find switching over to prompting breaks my mental flow so it’s only a net win for certain kinds of tasks and even there it’s not a huge step up from searching Stack Overflow.


For me it's been great for things like generating a LaTeX version of my CV, or CSS for a web app. It's worth an OpenAI subscription for me, but I do wonder if it's worth all the energy and resources that have gone into making those GPU clusters and powering them.

The huge up-front energy costs of GPU clusters, both of making and operating them, is amortized over all subsequent uses of the models, i.e. inference. Inference itself is cheap per query. Your use cases aren't consuming that much energy. I feel the amount is in the same ballpark as what you'd use doing those things yourself.

As for whether it's worth it, I argue this is the single most useful application of GPUs right now, both economically and in terms of non-monetary value delivered to users.

(And training them is, IMO, by far the most valuable and interesting use of almost all creative works available online, but that's another discussion.)


Just because it's impactful doesn't mean it's going to make much money. Take audiobooks for example: the market is valued at around $5 billion. In the very near future audiobooks will be created with AI. Does that mean anybody is making billions with those AI audiobooks? I don't think so. The value of audiobooks will go towards $0 and audiobooks as a product category will completely disappear. Audiobooks will simply be the text2speech feature of your eBook reader.

Similar stuff will happen with a lot of other content, things that used to be costly will become very cheap. And then what? The amount of books people can consume doesn't scale into infinity. Their entertainment needs will be served by auto-generated AI content. Even the books themselves will be written by AI sooner or later.

The advertising industry might also start hurting badly, as while they will certainly try getting ads into AI content, users will have AI at home to filter it out. A lot of classic tricks and dark patterns to manipulate user behavior will no longer work, since the user has a little AI helper to protect them from those tricks.

I don't doubt that the impact of AI will be gigantic, but a lot of AI produced content won't be worth anything, since it's so easy to create for everybody. And there isn't much of a moat either, since new models with better capabilities pop up all the time from different companies. Classic lock-in is also not really usable anymore, as AI can effortlessly translate between different APIs and user-interfaces.


Why would I watch someone else's AI content when I can watch my own with about the same work? It probably takes longer to pick something on Netflix than to shout at my TV to show me some auto-generated on the fly show about whatever I'm feeling like watching right now. Can even be building upon itself "show me yesterday's show but in space and give it some new twists that I don't expect".

You can share opinions about a common show the next day with another person and bond over it. For example, about a football match.

The conversation then becomes more about the platform than about specific content.

Similar to a game; for many games players aren't intended to have exactly the same experience but there are still common things about the game platform to discuss.

I think it would be pretty cool to have a partially AI generated plot, it would be exciting to discuss what you got in the AI lottery with someone else who had also watched the show. Something like: [set plot point] [random catastrophic event] [set plot point] [etc]. "1 character must die in some meaningful way" turns to "omg, which one was killed when you watched? Adam got eaten by the giant spiders in mine" "oh man, Sarah tried to set a spider on fire in mine, but she ended up getting herself". A la https://en.wikipedia.org/wiki/Until_Dawn


While I agree that most audiobooks will be priced near zero, there will still be a healthy market for highly paid actors to read, like Morgan Freeman. That said, it will be a tiny segment in the market, and many famous people will need to fight/sue AI providers to remove impersonations. I expect those markets will be flooded by fan-created favourites that read like Morgan Freeman.

The article explains a $600B gap in AI revenue expectations vs actuals. Rather than addressing that, you're posting about some dietician you know who worries their job will get replaced, as if it has any relevance whatsoever to the big picture.

We should have seen massive revenue growth and raises in future quarter revenue forecasts in the most recent round of SaaS company earnings reports. I think tons of companies have hyped up AI as if it's just on the cusp of AGI. The lack of massive top line growth has proven we're not even close, but the enormous investor speculation these companies triggered is the main reason for this $600B gap.

I'm not at all saying AI won't be transformational because it definitely does bring revolutionary capabilities that weren't possible before.


I don't think the uptake of LLMs by non-technical people has been that dramatic. Nobody in my familial or social circles (which span doctors, lawyers, artists, waitresses, etc.) has really mentioned them outside of asking me what I think about them.

As for what I think about them: I've been impressed with some aspects of code generation, but nothing else has really "wowed" me. Prose written with the various GPT models has an insincere quality that's impossible to overlook; AI-generated art tends to look glossy and overproduced in the same way that makes CGI-heavy movies hard to watch. I have not found that my Google Search experience was made better by their AI experiments; it made it harder, not easier, for me to find things.


> ... makes CGI-heavy movies hard to watch

While I absolutely agree that many movies over-use CGI, even with the relative decline in superhero movies, CGI-heavy movies still top the box office. Going over the list of highest-grossing movies each year [0], you have to go back about three decades to find a movie that isn't CGI-heavy, so apparently they're not that difficult for the general public to watch.

[0] https://en.wikipedia.org/wiki/List_of_highest-grossing_films


True. It's also a rude reality that much of the US uses WordArt and Comic Sans to advertise things, so I might just be a snob. Then again, impressing the snobs is a relevant part of the mass adoption curve :-)

I actually don't think it's going to be that impactful. It often gets everything wrong. Today I asked ChatGPT questions about a game I used to play. It got it all wrong. I've recently asked questions about coding. A majority of it is also wrong. I've tried using it in a professional setting and it's also hit or miss.

I'm not saying AI is useless but it's certainly not the panacea that some say it is.


> I've just had GPT-4o write me a full-featured 2048 clone in ~6 hours of casual chat, in between of work, making dinner, and playing with kids; it cost me some $4 in OpenAI bills, and I didn't write a single line of code.

This kind of example always confuses me. I don't see the value vs reading an article like this: https://www.freecodecamp.org/news/how-to-make-2048-game-in-r...

If I said I built a 2048 clone by following this tutorial, no one would be impressed. I just don't see how reading a similar tutorial via a chat interface is some groundbreaking advancement.


The difference is that with a chat interface you can ask it questions, you can brainstorm with it, you can drill down into specific concepts and you can do it any language/framework, without waiting for a tutorial to be written for it. And it would help you do the "homework" at the end.

For the tutorial you linked to, there's a lot of prior knowledge assumed (the author alludes to this in the summary), and a chat interface would help with that:

> This time I decided to focus on the essence of the topic rather than building basic React and CSS, so I skipped those basic parts. I believe it makes this article easier to digest.


I canceled my GPT-4 subscription recently because it just wasn't that impactful for me. I found myself using it less and less, especially for things that matter (because I can't trust the results). The things it's good at: boilerplate text, lightweight trivia, some remixing/brainstorming. Oh and "write me a clone". Yes, it can write clones of things, because it's already seen them. I've spent WAY more time trying to get anything useful out of it for a non-clone project, than it took me when I just buckled down and did it.

Yes, "many things are clones", but that just speaks to how uncreative we are all being. A 2048 clone, seriously? It was a mildly interesting game for about 3 minutes in 2014, and it only took the original author a weekend to build in the first place. Like how was that impactful that you were able to make another one yourself for $4?


> Like how was that impactful that you were able to make another one yourself for $4?

It's been my "concentration ritual", an equivalent of doodling, for a few years in the 2010s, so I have a soft spot for it. Tried getting back to it the other day; all my usual web and Android versions went through full enshittification. So that $4 and a couple of hours bought me a 2048 version that's lightweight, works on my phone, and doesn't surveil or monetize me. Scratched my own itch.

Of course, that's on top of gaining a lot of experience using aider-chat, by setting myself a goal of making a small, feature-complete app in a language I'm only moderately good at (and an environment - the modern web - which I both hate and suck at), with the extra constraint of not being allowed to write even a single line of code myself. I.e. a thing too boring for me to do, but easy enough to evaluate.

And no, the clone aspect wasn't really that important in this project. I could've asked it for something unique, and I expect it would have worked more or less the same way. In fact, this is what I'm trying right now: I just added persistent state to the 2048 game (to work around Firefox Mobile aggressively unloading tabs you're not looking at, incidentally making PWAs mostly unusable), and now I have my perfect distraction completely done.
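(I haven't dug into how the generated code actually does it, but in the browser that kind of persistence usually boils down to a few lines of localStorage; a sketch with made-up names, not the actual generated code:)

    // Illustrative sketch of browser-side persistence.
    // Save the board and score after every move, restore them when the page loads.
    interface GameState {
      board: number[][];
      score: number;
    }

    const STORAGE_KEY = "game-2048-state"; // hypothetical key name

    function saveState(state: GameState): void {
      localStorage.setItem(STORAGE_KEY, JSON.stringify(state));
    }

    function loadState(): GameState | null {
      const raw = localStorage.getItem(STORAGE_KEY);
      return raw ? (JSON.parse(raw) as GameState) : null;
    }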

EDIT:

BTW. did I ever tell you about the best voice assistant ever made, which is Home Assistant's voice assistant integrated with GPT-4o? I have a near-Star Trek experience at my home right now, being able to operate climate control and creature comforts by talking completely casually to my watch.


(Also a Chat GPT4o,x etc user)

Try asking it something actually technologically hard or novel and see what answers you get.

In my experience, it repeatedly bails out with "this is hard and requires a lot of careful planning" regardless of how much I try to "convince" the model to live the life of a distributed systems engineering expert. Sure, it spits out some sample/toy code... that often doesn't compile or has obvious flaws in it.


Very few people are working on technologically hard or novel things. Those people have always had an outsized effect on society, and will continue to be special going forward - LLMs aren't going to prevent them from delivering real value. HN has an absurdly rich concentration of these special people, which is why I like it. And many of them are surrounded by other similarly special people in real life. I only regularly talk with maybe 1-2 people in real life who are even close to that type of special. Even when I was a chemical/electrical/petroleum engineer working closely with other engineers, usually only 1-2 people at each workplace were doing really smart work.

That said, the majority of my friends are doing relatively manual work (technician, restaurants, event gigs, sex work) and are neither threatened by LLMs nor find much use for them.


You are likely not wrong.

Even though I do think that almost any profession can potentially find use for LLMs in some shape or form. My opinion is LLMs can increase productivity and be a net positive the way the Internet/search engines are if used correctly.

To expand on my original comment: All that being said, I think the hype/media cycle overestimates the magnitude of the potential positive LLM effect. You’ll see numbers like 5x, 10x, 100x increase in productivity thrown around. If I have to bet, I would say the likely increase is going to be in the 1x-1.5x range but not much greater.

Most things in the world are not infinitely exponential, even if they initially seem to be.

(Not sure why the downvotes.)


> (Not sure why the downvotes.)

Appreciated! But if we fret about a few downvotes, we're using the forum wrong. Some unpopular views need to be discussed, either because they hold some valuable truth that people are ignorant of, or because discussing them can shine a light on why the unpopular views are misguided.

I suspect the downvotes are related to "Very few people are working on technologically hard or novel things" - many HN users have been surrounded since elementary school by tons of people who do currently work on hard or novel problems, so they understandably think that >5-10% of people do that, when in fact it's closer to maybe 1-in-200. I've been part of social groups who went to high schools with absurd numbers of Rhodes Scholars, and peer groups where everyone can trivially get through medical school with top marks, receive faculty positions at top-3 universities, or found incredible startups through insane technical competence - and who still all think they're stupid, because they compare themselves to the true 1-in-a-million geniuses they grew up with, whose research is so advanced that it's far beyond their most remote chance of ever having even surface-level comprehension of it. Their extended social group likely comprises >1% of all Americans working on "hard or novel problems", but since 75% of them are doing it, they have no idea that the real base rate is closer to 1-in-200, generously. They grossly underestimate their relative intelligence vs. the median and grossly overestimate the ability of average people (and explain away differences in outcome as personality issues like "laziness").

There are a surprising number of people from these peer groups on HN. These are the people who will never be threatened by LLMs - they are capable of adapting to use any new tools and transcending any future paradigm, save war/disease/famine.

> To expand on my original comment: All that being said, I think the hype/media cycle overestimates the magnitude of the potential positive LLM effect. You’ll see numbers like 5x, 10x, 100x increase in productivity thrown around. If I have to bet, I would say the likely increase is going to be in the 1x-1.5x range but not much greater. Most things in the world are not infinitely exponential, even if they initially seem to be.

Yours is a very reasonable take that I wouldn't argue against. I also think it's reasonable that some people think it will be 5x-100x -- for the work some individuals are familiar with it very well might be already, or they might be more bullish on future advances in reinforcement learning / goal-seeking / "search" (iterative re-search to yield deep solutions).

> Even though I do think that almost any profession can potentially find use for LLMs in some shape or form.

I reactively feel this is stretching it for people who travel around just to load/unload boxes of equipment at events/concerts/etc. But the way you worded this is definitely not wrong - even manual laborers may find LLMs useful for determining whether they, their peers, and their bosses are following proper safety/health/HR regulations. More obviously, sex workers will absolutely be using LLMs to screen potential customers for best mutual fit and maintain engagement (as with lawyers who own small practices, a large number of non-billable hours goes towards client acquisition, as well as retention). LLMs are not "there" yet for transparent personalized client engagement which maintains the personality of the provider, but likely will be soon with some clever UX and RAG.


> I can't think of a single recent technology that was so widely adopted by tech and non-tech people alike, immediately integrated into day-to-day experience.

Which technology are you talking about? ;-)

What I can clearly say is that I know no one from my social circles and extended social circles who has used these AI chatbots for anything other than "simply trying out what is possible" (and ridiculing the results). A quote from a work colleague of mine when I analyzed the output of some latest-generation AI chatbot: "You shouldn't ask such complicated questions to the AI chatbots [with a huge smile on his face]. :-)"


I agree with this take.

The perspective I take is the 15-year view: the iPhone 1 objectively sucked, but by the iPhone 3 the trajectory was clear, and 15 years later the world is a very different place.

You hear people very focused on specific shortfalls: "I asked it to write code and look it made a mistake". But there are very clear routes to fixing these and there are lots of people finding it useful despite these bugs.

I think AI is bigger than mobile. I'm nearly 50, and I remember the PC boom, the Internet boom, Social Networking boom, Mobile boom, SaaS boom - probably more that I forget.

I think the PC and Internet booms are the only ones that are as impactful as AI will be in 15 years.

Maybe mobile is as big, maybe not - depends if someone can build AI devices that replace the UX of phones sometime in the next 15 years.


Interesting takeaway if this holds true - following the analogy, people will be excited about GPT-16 as much as they are excited about iPhone 16 today, compared to the excitement about first few generations of iPhone. That may mean that LLMs may become so ubiquitous that we won't even notice. Which means there will be no AGI.

If GPT-16 is pretty much GPT-4 but a bit faster and a bit smarter, then sure. If subsequent generations of GPT continue the trend of qualitatively improving on the predecessors, then we'll likely hit AGI somewhere around GPT-10 at the latest.

GPT-4 is an upgrade over a search engine (on the 2010 internet, which was much easier to search than the internet today) and there is certainly opportunity in using it to chain together complex programming tasks fully programmatically, but we are stuck right on the cliff of a truly reliable and generalizable AI, and it isn't clear that we can just train an AI with the next generation of GPU's and more data and bridge that gap. Most of the truly high value add activities (fully autonomous programs and AGI that creates novel inventions) rely on a model more intelligent and more consistent than the current state of the art. So I think most of the valuation is in speculative potential.

And we seem to be in an exponential upswing of hardware capacity, so we expect extreme improvements from an already impressive base. >90% of the compute that can be used to train models doesn't exist yet.

I think that comes down to the kind of people you hang out with.

While I use AI quite often, none of my friends or family does. A few of them will use an image gen once or twice a year. And at work, only a few of my colleagues use AI.

So my impression is that current gen AI is too hard to use correctly, has too many rough edges and is not useful enough for most people.

Progress also seems to have stalled around the GPT-4 quality. Everything after GPT-4 (GPT-4 Turbo, GPT-4o, Claude 3 Opus, Claude 3.5 Sonnet) seems to be pretty much producing the same quality output, I have been using Open WebUI to bounce around between the different models and I can't really tell a difference in the quality of them, they are all roughly the same for my use case (programming/sysadmin stuff).

So the question of whether a plateau has been hit, or whether scale can still improve quality, is real to me.


I think it’s reasonably obvious that the tech could have a lot of potential, but that’s yet to be realised. Chatbot interfaces are so primitive and clearly not the final form for LLMs, but people have to invent that.

But, tech being impactful doesn’t mean it will create and deliver value for others.


Can you come up with an example of tech that was impactful without delivering value to others?

The closest I can think of would be the atom bomb, but even that arguably brought significant value in terms of relative geopolitical stability.


Sorry, I phrased it poorly.

Very often technology advancements redistribute value, taking it away from many people and allowing companies to capture more profit and/or lower costs. Such as self-serve checkouts at supermarkets leading to fewer cashier jobs.

> A huge amount of economic value is going to be created by AI. Company builders focused on delivering value to end users will be rewarded handsomely.

Companies will accrue value, but 'common folk' will lose it.


Self-serve checkout is an interesting example. Even though it can have its issues, I personally quite like being able to move through checkout more quickly, without having to interact with the cashier and the people in line in front of me (assuming enough self-serve stations).

I haven't analyzed its economics, but I would assume that by reducing the cashier jobs (e.g. allowing 1 operator to manage 8 self-serve checkout stations), supermarkets reduce their overall operating costs, and then use that to reduce their prices (and if they won't their competitors will), leading to a benefit to us 'common folk'.

As for cashier jobs, I don't think there's anything inherently good about them or inherently bad about them disappearing from the economy. As another example, I don't know many people who miss the Elevator Operators[0], and it makes perfect sense for us to press the buttons ourselves.

[0] https://en.wikipedia.org/wiki/Elevator_operator


> I've just had GPT-4o write me a full-featured 2048 clone in ~6 hours of casual chat

May we see it?


http://jacek.zlydach.pl/v/2048/

https://git.sr.ht/~temporal/aider-2048

There's a full transcript of the interactions with Aider in that repo (which I started doing manually before realizing Aider saves one of its own...).

Before anyone judges quality of the code - in my defense, I literally wrote 0 lines of it :).


That works surprisingly well! I mean that I am genuinely surprised.

No offense, but your group is just very early on the adoption curve and is not representative at all.

I have a bunch of peers that haven’t used chatgpt at all and they are software developers. A bunch more tried it once, realized how terrible it was for anything you aren’t already knowledgeable in and then haven’t gone back to it.

Recipe adjustments have to be a joke, unless they are really basic things like "cut it in half". ChatGPT is terrible at changing recipes without fundamentally changing them, and will offer multiple "substitutions" for an ingredient that have extremely different outcomes it doesn't warn about or know about.


> I've just had GPT-4o write me a full-featured 2048 clone in ~6 hours of casual chat, in between of work, making dinner, and playing with kids;

As cool as this might be, what is the actual economic value of this? 2048 is free, you didn't even have to spend a dollar to get it.


I don't give a damn about the economic value of the game - I just wanted a 2048 that's lean and not powering the adtech economy. At the same time, I care about both the economic and non-monetary value of me leveraging AI tools in creating software, so this game was a perfect little project to evaluate aider-chat as a tool.

The question stands though: Is building all this infra, so you (random internet guy) can generate a 2048 clone, worth the increased emissions and the $600B of investment?

Keep in mind, the "hole" takes ChatGPT and Copilot/Gemini/etc. into account at a rate of $5-10B per product. The remaining ~$500B is about the size of the global cell phone market. Where will it come from?

> I see non-tech people around me using ChatGPT for anything from comparison shopping to recipe adjustments.

This is baffling to me, because these are two use cases I have tried and in which ChatGPT completely fails to produce any useful information.


This is the type of thing that I’ve only ever read about on the internet and have never heard of a normal person actually doing.

can you show the 2048 clone?

EDIT : My bad, I see you posted the link elsewhere (link for posterity http://jacek.zlydach.pl/v/2048/ )

TBH 6 hours seems a lot longer than I would have expected.


6 hours of on-and-off casual chatting while working, making dinner, playing with kids, etc. Total time spent in front of keyboard was at most half of that. And would be half of that still, if I had more experience guiding GPT-4o/Aider diffs out of stupid ruts - experience which I gained through this short project.

Also: 6 hours is a lot if you sit down to it and know exactly what to write. Not when you half-remember how the game works, don't have uninterrupted focus time for it, and deal with executive function issues to boot.


What’s a 2048 clone?


This version seems to be using slightly different rules: my recollection is that the original 2048 prevented a move if it wouldn't cause any blocks to shift or collapse, while this one spawns a block unconditionally.
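In code terms, my recollection of the original rule is roughly this (an illustrative sketch with made-up helper names, not the clone's actual code):

    // Sketch of the original 2048 rule: a move only counts (and a new tile only
    // spawns) if sliding/merging actually changed the board.
    type Board = number[][];

    function boardsEqual(a: Board, b: Board): boolean {
      return a.every((row, i) => row.every((cell, j) => cell === b[i][j]));
    }

    // `slide` and `spawnTile` are hypothetical helpers: `slide` returns a new,
    // slid-and-merged board without mutating the input; `spawnTile` adds a random tile.
    function tryMove(board: Board, slide: (b: Board) => Board, spawnTile: (b: Board) => Board): Board {
      const moved = slide(board);
      if (boardsEqual(board, moved)) {
        return board;            // nothing shifted or collapsed: reject the move, spawn nothing
      }
      return spawnTile(moved);   // the board changed: accept the move and spawn a new tile
    }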

It doesn't anymore.

You're right. I pasted your comment to aider and it fixed it on the spot :).

EDIT: see https://git.sr.ht/~temporal/aider-2048/commit/9e24c20fc7145c....

A bit of a lazy approach, but also quite obvious. Pretty much what you'd get from a junior dev.

(Also if you're wondering about "// end of function ..." comments, I asked the AI to add those at some point, to serve as anchors, as the diffs generated by GPT-4o started becoming ambiguous and would add code in wrong places.)


I think it's also not progressing the block size with score: IIRC the original game also begins spawning 8s and 16s once you get above your first 512 block. But I could be misremembering.

(This kind of feedback driven generation is one of the things I do find very impressive about LLMs. But it's currently more or less the only thing.)


I don't remember it doing progressive block sizing - I only vaguely remember being mildly annoyed by getting to 2048 taking >2x the effort it took to get to 1024, which itself took >2x the effort of getting to 512, etc. - a frustration which my version accurately replicates :).

> I've just had GPT-4o write me a full-featured 2048 clone in ~6 hours of casual chat

Honestly I'm totally in the AI camp but 6 hours to make a 2048 clone?! And that's a good result? Come on.


The vast majority of devs would take much longer than 6 hours if this task were assigned to them. Yes, the vast majority of devs might suck, but it's actually a shockingly low percentage who are genuinely skilled. You are probably surrounded by rare skilled devs.

Of casual chatting in between making dinner, doing chores, playing with kids, and work. The actual time spent strictly on this was more like 2 hours. And that's with me writing zero lines of code, and using the technology stack (modern web) I hate and suck at (but I'm proficient enough to read the basics).

Also, to be honest, it would've been much faster if GPT-4o didn't occasionally get confused by the braces, forcing me to figure out ways to coerce it into adding code in the right place. This is to say, there's still plenty of low-hanging fruits for improvement here.


you come on. it's not six hours of focused work, it sounds like six hours while watching Netflix and puttering around.

IDK. Blockchains have been super hyped. The tech is undeniably cool. But I have yet to see an example of them solving a real problem for anyone who's not a criminal.

In contrast, I've put in very little effort to use AI, but I'm noticing things.

I see high quality AI-generated images in blog posts. They look awesome.

I look over my coworker's shoulder and see vscode predict exactly the CSS properties and values he's looking for.

Another coworker uses AI to generate a working example of FPGA code that compiles and runs on a Xilinx datacenter device.

An AI assistant pops up in Facebook messenger. My girlfriend and I are immediately able to start sending each other high quality, ultra-specific inside joke AI generated memes. This has added real value to my life.

I'm starting to feel FOMO, a bit worried that if I don't go hard on learning this new tool I'm going to be left in the dust. To me at least, AI feels different.


I have followed one site that has its own AI generator for a certain type of content... and after a while it just started to feel samey and soulless. I've noticed similar stylistic patterns in generated text I have seen posted elsewhere. Different styles can be asked for, but I wonder how soon the AI output stops feeling worth seeing or reading.

> Blockchains have been super hyped.

Were regular people using it like ChatGpt?


Yeah up until Mt. Gox exploded and maybe a bit afterwards

After that it started really getting its scammy, investor-only reputation


Every normie I know who has touched crypto ONLY got into it after it gained its scammy investor status. Prior to that it was basically only people like me…dudes running Linux etc, you know the type.

I'll admit blockchains (while they may have potential for the future) don't currently have much use in the real world, but saying the only people they help are criminals is just that old flawed argument that undermines privacy, which in turn only benefits oppressors.

I didn't say the only people it helps are criminals. I said that's the only example I've seen in the real world. If you have more, I'd be happy to hear about them.

Sending money to family in another country with crap financial infrastructure

If, like me, you're using LLMs on a daily basis and getting real personal value out of them, it's hard not to conclude that they're going to have a big impact.

I don't need a crystal ball for this. The impact is already evident for us early-adopters, it's just not evenly distributed yet.

That's not to say they're not OVER hyped - changing the entire company roadmap doesn't feel like a sensible path to me for most companies.


Early Adopters are always True Believers. That's why they are early adopters. Every single one of them is going to say "The impact is clear! Look around you! I use XYZ every day!" You really don't know what the adoption curve will look like until you get into the Late Majority.

I'm not that much of a believer, but what is clear is that "AI" still has a plug incompatible with your regular wall socket, if you get the analogy. It's too early to draw a circle around the adopter count.

We'll talk counts when my grandma is able to "hey Siri" / "okay Google" something like a local hospital appointment, or a search for radish prices around her. It already is possible, just not integrated enough.

Coincidentally, I'm working on a tool at my job (unrelated to AI) that enables computer device automation at a much higher level than Playwright etc. These two things combined will do miracles, for models good enough to use them.


We're already entering the Late Majority stage. The Early Majority is like a good chunk of the Western population, which should already tell you something - the "technology adoption model" might not make much sense when the total addressable market is literally everyone on the planet, and the tech spreads itself organically with zero marketing effort.

And/or, it's neither hard nor shameful to be True Believers, if what you believe in is plain fact.


Otoh early adopters and true believers were often different people for cryptocurrency.

>Early Adopters are always True Believers.

Early adopters were using GPT-2 and telling us it was amazing.

I used it and it was completely shit, and it put me off OpenAI for a good four years.

GPT-3 was nearly not shit, and 3.5 was the same, just a bit faster.

It wasn't until GPT-4 came out that I noticed that this AI thing should now be called AI, because it was doing things that I didn't think I'd see in decades.


I tried GPT-2 and thought it was interesting but not very useful yet.

I started using GPT-3 via the playground UI for things like writing regular expressions. That's when this stuff began to get useful.

I've been using GPT-4 on an almost daily basis since it came out.


The hype around GPT-2 was ridiculous. It made me firmly put OpenAI into 'grifters, idiots, and probably both' territory.

Turns out they were just grifters, as the hilarious mess around Sam Altman's coup/counter-coup/coup showed us.


I don't know what your operating definition of "grifter" is, but for me, a grifter is not a company that delivers a product which gains large adoption and mindshare (ChatGPT) and essentially sets the world on fire. (Not an opinion on Altman specifically, but on OpenAI at large.)

My definition is someone outright lying that GPT-2 was AGI that should be regulated, just so they could raise more money for the next round of training.

Who said GPT-2 was AGI?

Transformers clearly represent a significant advancement in software design; this is undeniable. For example, LLMs are so good at language translation that every approach to machine translation ever conceived of is now obsolete by a very large margin. There are a few other examples, and certainly some new ones will emerge, so the tech is good and it's here to stay.

The question of "how do we make money from it" is much harder to answer. Using every available computer to run quadratic-time brute force on everything you can scrape from the internet is an unbounded resource sink that offers little practical return for almost everybody, but leveraging modest and practical use of generative machine learning where it works well will absolutely create some real value.


When you think about the promise (or hype) of crypto/bitcoin/blockchain 10 years ago, in some sense it augured change/disruption that was equally transformative as AI, if not more so.

Crypto portended drastic and fundamental changes: programmable money, disintermediation, and the decentralization of the very foundations of our society (i.e. money, banking, commerce). Suffice to say that nothing close to this has happened, and probably will never happen.

So I can see how many people are equally skeptical that AI, as the next hyped transformative technology, will achieve anything near the many lofty predictions.


Why is there such an effort to hitch current AI to cryptocurrency? They have nothing in common, not the tech, and not the way people interact with it.

People have been comparing and contrasting successive technology innovation cycles for a long time. OP mentions railroads, but that doesn't mean there's an "effort to hitch" AI to railroads. It's just a comparison, which may or may not be useful in understanding how the latest innovation cycle may impact the world.

This whole AI thing reminds me of the early '90s and people talking about how computers would change the world. Of course they were right, but probably not in the ways they expected.

If you go back and look at the overhyped ads back then, you'd see them describing the future that we are currently living, except tainted by some institutions that were too-ingrained; things like "Using the World Wide Web, you will be able to view your checking account balance and make a payment to your utility provider with just a few clicks!" right after someone brings the milk in. i.e., They could predict online banking but not the rise of DoorDash.

Given the accelerated invention/deployment cycles we're in, it's not hard to extrapolate GPT4o to $0 token cost and 0ms latency. Even assuming stagnation in context lengths and cognition, the extreme scope of impact on every computerized industry becomes self-evident.


I think many of the changes actually were predicted. What seems to have been the most unexpected was how it would monetize and where market power would gather.

Online banking is great, and people knew it was coming since the dawn of the internet, but they mostly didn’t predict Stripe.


I can't see any news out of Sequoia without remembering how massively bullish they were on FTX.

they're human and were taken in by con artists. it's a reminder that none of us are infallible.

It's one thing to be taken in by con artists, it's another to publicly boast about your limitless credulity and your complete and utter lack of judgment. Sequoia aren't innocent senile grandmas who got conned by high-pressure scam calls, they're gullible trend-chasing morons who were victims of their own starry-eyed greed.

Whenever I hear the AI hype cycle, I'm always reminded of expert systems and how they were going to revolutionize the world.

This really is different, and I say that as someone who spent a lot of time on expert systems, not as someone who is overly bought into AI hype.

The problem with expert systems is that even if the tooling was perfect, the people using them needed a rather nuanced and sophisticated understanding of ontologies. That just wasn't going to happen. There is not enough of that kind of expertise to go around. Efforts to train people largely failed. I think the intentional undermining of developer salaries pushed a lot of smart people out of the software industry, making the problem even worse.

That’s what makes AI special, the ability to deliver value even when used by unsophisticated operators. Many workflows can largely stay the same and AI can be sprinkled in where it makes the most sense. I use it for documentation writing and UI asset production and it’s better in that role than the people I used to pay.


You should also be reminded about the internet. After the dotcom bubble it was extremely common to hear it outright dismissed as never being useful for anything.

Sometimes the future just gets here before we're ready for it.


> After the dotcom bubble it was extremely common to hear it outright dismissed as never being useful for anything

eBay, Amazon, Google, Yahoo etc were all around at the time and making serious money.

Not sure who those people were but it was very obvious to most that the internet was here to stay.


Repeat after me: An LLM is not AI. The internet enabled a whole new world of possible applications. It's unlikely this or even the next upgrade to ML will get there. If we get to AGI, sure, that's ground-breaking, but we're still a few steps removed from that.

> An LLM is not AI

Good luck with that genie


Repeat after me: "AI hype" is not like Bitcoin hype, it's like Dot-com boom.

Generative models are already changing how people live and work. Ignore the grifters, and ignore the entrepreneurs. Look at civilians, regular folks, and watch how it impacts them.


I see you saying this all over this topic and yet I don't know anyone who uses AI for anything real. I've never used it once for anything, and I work in tech. The only people I know who use AI are kids who use it to do their homework for them. I'm sure they love it, but it's hardly a good thing.

It's like hiring someone to go to the gym for you, as far as I can see.


Oh the belief is so strong! Wouldn't it be great if this were true.

It's not hard to have strong beliefs about something that's real.

To be fair to Bitcoin, the world's largest economic bloc is actively building an alternative to USD/Western banks for international settlements, with blockchain technology.

It doesn't have anything to do with the libertarian vision of BTC but it's the same technical concept.


>Repeat after me: An LLM is not AI.

They are more intelligent than the average person I deal with on a daily basis.

The one thing us meat bags have going for us is that we have bodies and can do things.


> They are more intelligent than the average person I deal with on a daily basis.

No they aren't. They are differently intelligent. Which includes being more "intelligent" in some ways, and vastly less in others.

It is an all too common mistake to infer an LLM's capability based on what it would mean if a human could produce the same output. But it's not the same thing at all. A human that could produce the quality of many LLM outputs I've seen would be a person with superintelligence (and supercreativity, for that matter). But a human who is constrained to only the output that an LLM is capable of would be an untrustworthy idiot.

What's interesting to me is looking at the human+LLM combos that have recently entered the world. (As in, everyone who is regularly using good quality LLMs.) What are they capable of? Hopefully they'll be at least as intelligent as a regular human, though some of the articles on people blindly accepting LLM hallucinations do make me wonder.


And with the advances in robotics recently, who knows how long we're going to hold on to this monopoly.

there were probably people who doubted electricity, vaccines and indoor plumbing.

There were also people who doubted the Segway, Magic Leap, Theranos, 3D TVs, Windows Phone, and Google Glass.

I think doubt is OK, at least it is before any particular technology or product has actually proven itself.


The gap between participants in this conversation is that some have seen it prove itself for themselves and for others around them, and others have not seen that same proof.

Thing is, that comment would have worked in a thread three years ago about ‘metaverses’. Sometimes, the early adopters are the only adopters. Not saying that’s definitely the case here, but it is looking like it’s going in that direction - huge hype, absurd levels of VC spending, a bunch of ultra-enthusiastic early adopters, but real mainstream application still absent.

The ratio of hyped technologies that turned out to be overhyped, versus ones that turned out to be as impactful as electricity, is... I don't even know how many orders of magnitude, but it's a lot.

This is false equivalence and you know better. Electricity is foundational technology. What we call AI are LLMs, which are great and useful, but not in the same league as something foundational like electricity.

Now, the question of "Are LLMs intelligent" is debatable, but intelligence is foundational itself.

>> A huge amount of economic value is going to be created by AI. Company builders focused on delivering value to end users will be rewarded handsomely.

> Such strong speculative predictions about the future, with no evidence.

The speculation makes sense from a VC's perspective, but perhaps not from the perspective of society at large (i.e. human workers).

From the revenue-generating use cases of LLMs (== AI in the article) that I've seen so far, most seem to be about replacing human mental labor with LLMs.

The replacement of workers with AI-based machines will likely happen in mature industries whose market growth is basically capped. Productivity will stay mostly the same, but the returns will increase dramatically as the human workforce is hollowed out.

To the extent that AI instead empowers some workers to multiply their productivity with the same amount of effort, then it can create more economic value overall, and this may happen in industries with a long growth runway ahead.

On balance, it's not clear to me whether the growth (in productivity and employment) that comes from the latter will be enough to offset the employment losses from the former.

But in either scenario, the VCs investing in AI win, either from efficiency gains, or from accelerating growth in new industries.


Remember that this is written by Venture Capital investors, and they make high risk, high reward bets.

I don't know the exact numbers, but I guess only maybe 5% of all investments in a given batch make any impact on the total return.

So for a VC, if there's a 10% chance that this whole AI thing will be a financial success, its chance of success is already twice as high as average, so a pretty good bet.


My rule of thumb is that if someone was super bullish on crypto 3 years ago and is now super bullish on AI then their opinion is probably not worth that much. But if they've been consistently optimistic about AI progress over the last 5-10 years then they probably know what they're talking about.

> just like previous technology hype cycles were surely going to Change Everything. I mean, we're seeing huge companies' entire product strategies changing overnight because We Must All Believe.

this didn’t really happen the way you want it to. Fortune 50 companies never spent billions of dollars on crypto or NFTs like they are doing for AI. No NASDAQ listed companies got trillion-dollar valuations out of crypto.

There is buy-in happening this time, unlike previous times, because this time actually is different.

> The whole AI thing just continues to baffle me. It's like everyone is in the same trance and simply assuming and chanting over and over that This Will Change Everything

I mean, some people see a broad consensus forming and reactively assume everyone else must be stupid (not like ME!). That’s a reflection of your own personal contrarianism.

Instead, try to realize that a broad consensus forming means you actually hold heterodox opinions, and if you think you have a good basis for them, that's fine. But if the foundation for your point is that everyone in the world is too stupid to see what's REALLY going on, then maybe your opinions aren't as reasoned as you think they are. You need to at least understand the values differences that are leading you down the road to different conclusions before you just dismiss the whole thing as "everyone else is just too wrapped up in the cult to see straight".

Bitcoin was actually rebuttable on some easily-explicable grounds as to why nobody really needed it. Why do you think semantic embeddings, semantic indexes/generation, multimodal interfaces, and computationally-tractable optimization/approximation generators are not commercially useful ideas?


> Instead, try to realize that a broad consensus forming means you actually hold heterodox opinions, and if you think you have a good basis for them, that's fine. But if the foundation for your point is that everyone in the world is too stupid to see what's REALLY going on, then maybe your opinions aren't as reasoned as you think they are.

I haven't even formed much of an opinion either way, yet. Sure, I have doubt, but that's more of a default than something I reasoned myself into. I'm saying it's just way too early to make statements either way about the future of LLMs and AI that are anything beyond wild guesses. "This time it's different, it's fundamentally transformative and will obviously change the world" is a religious statement when made this early.


Early would have been with GPT-2 writing bad poems. ChatGPT was released 1 year and 7 months ago, so it's still in diapers, but at that age it's already providing value to its users.

> this didn’t really happen the way you want it to. Fortune 50 companies never spent billions of dollars on crypto or NFTs like they are doing for AI.

Got to imagine that IBM’s spending on their weird blockchain hobby was at least in the hundreds of millions.

And Facebook spent tens of billions of dollars on metaverse stuff, of course.


> this didn’t really happen the way you want it to. Fortune 50 companies never spent billions of dollars on crypto or NFTs like they are doing for AI. No NASDAQ listed companies got trillion-dollar valuations out of crypto.

nvidia did very well out of crypto.


I can't summarize it better than saying that AI hype is deserved, but the excessive euphoria and FOMO are not.

You know I hear ChatGPT is pretty good at summarizing.

Don’t assume “widespread deployment of the same”. Think about next-gen models running faster and at a low cost.

They already understand spoken English and can respond in kind.

This is Siri or Alexa on steroids. Just that alone is a “killer app” for everyone with a mobile phone or a home assistant device!

What’s the addressable market for that right now? Five billion customers? Six or seven maybe?

Computer RPG games are about to become completely different. You’ll be able to actually converse with characters.

Etc…

I’m a short-term pessimist but a long-term optimist.

This all reminds me of 3D graphics in the early 1990s. The nerds were impressed, but nobody else thought it was interesting. Now we have Pixar and a computer games industry bigger than Hollywood.


Siri and Alexa are good examples - they turned out to be incredibly expensive to develop and had extremely limited productivity benefits for users, while being almost impossible to monetize.

That's... my point. You're agreeing with me.

Siri and Alexa are bad. Very, very bad.

They pretend to understand spoken English, but they don't, because they're just a huge set of hard-coded rules written out one-at-a-time by enormous numbers of very expensive developers.

This is the 1990s approach to AI: Fuzzy logic, heuristics, solvers, dynamic programming, etc...

That approach has been thoroughly blown out of the water by Transformers, which do all of that and much more with a thousand lines of code that can be banged out in a couple of hours by one guy while talking in a YouTube video: https://www.youtube.com/watch?v=kCc8FmEb1nY
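To give a sense of how small the core really is, here's a naive sketch of scaled dot-product attention, the central operation in a Transformer block (illustrative only - not the code from the video, and without the batching, masking, or performance tricks a real implementation needs):

    // Naive scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V.
    // Matrices are plain number[][]; this is for illustration, not performance.
    function matmul(a: number[][], b: number[][]): number[][] {
      return a.map(row =>
        b[0].map((_, j) => row.reduce((sum, v, k) => sum + v * b[k][j], 0))
      );
    }

    function transpose(m: number[][]): number[][] {
      return m[0].map((_, j) => m.map(row => row[j]));
    }

    function softmaxRows(m: number[][]): number[][] {
      return m.map(row => {
        const max = Math.max(...row);
        const exps = row.map(v => Math.exp(v - max));
        const total = exps.reduce((s, v) => s + v, 0);
        return exps.map(v => v / total);
      });
    }

    function attention(q: number[][], k: number[][], v: number[][]): number[][] {
      const dk = k[0].length;
      const scores = matmul(q, transpose(k)).map(row => row.map(s => s / Math.sqrt(dk)));
      return matmul(softmaxRows(scores), v);
    }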

Transformers will revolutionise this entire space, and more that don't even exist yet as a market.

Take for example humanoid robots: Boston Dynamics has had the hardware working well enough for a decade, but not the software. You can't walk up to one of their robots, point at something, and tell the robot to complete a task. It can't understand what it is seeing, and can't understand English instructions. Programming that in with traditional AI methods would take man-millenia of effort, and might never work well enough.

If we could speed up GPT-4o (the one with vision) to just 10x or 50x its current speed, with some light fine-tuning it could control a humanoid robot right now with a level of understanding comparable to C3P0 from Star Wars!


There's (at least) two meanings for AI: (1) Software systems based on extensions of the LLM paradigm; and (2) Software systems capable of all human cognitive tasks, and then some.

It's not yet clear what (1) has to do with (2). Maybe it turns out that LLMs or similar can do (2). And maybe not.

I can understand being skeptical about the economic value of (1). But the economic value of (2) seems obviously enormous, almost certainly far more than all value created by humanity to date.


Imagine looking back on the earth's history from a billion years in the future. There will be a divide around now: the few billion years before, when biological life was the most advanced thing going on, and the time after, when AI life was. It's in a different category to the other tech hypes. And I don't really see any possible way that it won't happen(?)

I don’t think anyone doubts that.

The question is when exactly do we get to human-level intelligence on all tasks (AGI)? Is it going to be GPT-5? GPT-6? Or have the performance improvements saturated already? It makes a huge difference in terms of your investment and even career decisions whether you expect it to happen next year or 10 years from now.


Fair enough, although some people do doubt it. I guess, zooming in, AI will gradually exceed human abilities in different areas, like it's already much better at chess and still rubbish at going down to the shop to get a pint of milk. I'm not sure how that plays into ten-year investment and career plans. There's probably a lot of opportunity at the moment for startups to take ChatGPT's abilities and apply them to human tasks involving written language.

Think about all the high-volume processes in the enterprise world with an expensive human fallback. LLMs are great after the process and before the human, to reduce the number of tasks routed to the fallback.

Think about all the humans involved in producing unstructured documents that people have to read: public councils, courts of law, teachers, or other evaluators. LLMs can be given a relevancy metric and flag content so that those in need of, or waiting for, certain information aren't drowned in noise.
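A rough sketch of what that kind of flagging can look like in a pipeline (the model name, prompt, and helper are all placeholders, and a real system would need evaluation and guardrails around it):

    import OpenAI from "openai";

    const client = new OpenAI(); // reads OPENAI_API_KEY from the environment

    // Hypothetical helper: decide whether a document should be flagged for someone
    // waiting on a given topic, so only flagged items reach the human reader.
    async function isRelevant(document: string, topic: string): Promise<boolean> {
      const res = await client.chat.completions.create({
        model: "gpt-4o", // placeholder model choice
        messages: [
          { role: "system", content: "You are a relevance filter. Reply with only 'yes' or 'no'." },
          { role: "user", content: `Topic: ${topic}\n\nDocument:\n${document}\n\nIs this document relevant to the topic?` },
        ],
      });
      return (res.choices[0].message.content ?? "").trim().toLowerCase().startsWith("yes");
    }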

LLMs are unlocking digital transformation in sectors that have been historically resistant to it, and because they are goal-driven, they do it with minimal programming: just a few good prompts and a data pipeline.

And it's just the tip. They don't get tired, they don't forget or omit things, they are absurdly good at finding relevant products and services given a database, a need, and a set of preferences, and they will transform how the next generation makes purchasing decisions, travel decisions, and all these opinion-based choices.

And while LLMs are likely not the path to AGI and they will cap out in capabilities at some point, they are coming down in price super fast, which will propel adoption even for those cases where other options would be more sensible, just because of the sheer convenience of asking them to do stuff.


I think they've tapped into something fundamental about knowledge, the mind, and information and this is only the beginning. How many different ways are there to train, wire these up, and integrate with other systems? Then the silicon built for it...I just don't know where the horizon is.

It's because it has to. We're so far behind the 8-ball at this point that if it doesn't work, we are all screwed.

All of society is so freaking leveraged at this point, something has to give.


> society is so freaking leveraged

can you elaborate on this? in what sense?


I think they're implying AI has too much investment behind it to implode (i.e., it's "too big to fail").

Not just AI. AI is just the latest in hopeful technologies to get us out of this debt rut we are in. Every time something fails to get us out it drags us deeper in.

$35T in national debt for one. At some point that house of cards is going to collapse.

They’re a _VC_. If they don’t believe in it, what are they even doing? You can’t really expect them to be particularly objective about it.

> How can anyone be so certain about this?

Oh, it's really simple, you see if they don't get rewarded handsomely, that proves they didn't focus on delivering true [Scotsman] value. /s


> Such strong speculative predictions about the future, with no evidence. How can anyone be so certain about this?

The evidence is all around you. For anyone who has made any serious attempt to add AI to your current life and work process, you will fairly quickly notice that your productivity has doubled.

Now, do I as a random software engineer who is now producing higher quality code, twice as fast, know how to personally capture that value with a company? No. But the value is out there, for someone to capture.

> It's like everyone is in the same trance and simply assuming and repeating over and over that This Will Change Everything

It already is changing everything, in multiple fields. Go look up what happened to the online art commission market. It got obliterated over a year ago and has been replaced by people getting images from Midjourney etc.

Furthermore, if you are a software engineer and you haven't included tools like github copilot, or cursor AI into your workflow yet, I simply don't consider you to be a serious engineer anymore. You've fallen behind.

And these facts are almost immediately obvious to anyone who has been paying attention in the startup space, at least.


> Furthermore, if you are a software engineer and you haven't included tools like github copilot, or cursor AI into your workflow yet, I simply don't consider you to be a serious engineer anymore. You've fallen behind.

That sounds like you're fresh out of college. Copilot is great at scaffolding but doesn't do shit for bug fixing, design, or maintenance. How much scaffolding do you think a senior engineer does per week?


I started teaching myself programming 40 years ago and I believe that Copilot and other AI programming tools are now an essential part of programming. I have my own agent framework which I am using to help complete some tasks automatically.

Maybe take a look at tools like aider-chat with Claude 3.5 Sonnet. Or just have a discussion with gpt-4o about any programming area that you aren't particularly familiar with already.

Unless you literally decided you learned everything you need and don't try to solve new types of problems or use new (to you) platforms ever..


40+ years of coding here. I've been using LLMs all day and getting a large boost from them. The last thing I did was figure out how to change our web server to have more worker processes. It took a half dozen questions to cure a lot of ignorance and drill down to the right answer. It would have taken a lot longer with just a search engine. If you're not seeing the large economic advantage of these systems, you're not using them like I am.

> If you're not seeing the large economic advantage of these systems you're not using them like I am

I just read the manual.


Do you flip to the index at the back of the book to find the pages that reference a topic, or do you use Ctrl-F?

I also use the table of contents.

I think one of the reasons there's a surprising amount of pushback is that a lot of developers don't like the sort of collaborative, unstructured workflow that chat-oriented tools push onto you.

I once worked with someone who was brilliant but fell apart when we tried to do pair programming (an actuarial major who had moved into coding). The verbal communication overhead was too much for him.


This is a really interesting observation that makes a lot of sense to me. I can relate to this and it really helps to explain my own skepticism about LLMs "helping" with programming tasks.

I've always thought of software development as an inherently solo endeavor that happens entirely inside of one's own mind. When I'm faced with a software problem, I map out the data structures, data flows, algorithms and so on in my mind, and connect them together up there. Maybe I take some notes on a sheet of paper for very complex interactions. But I would not really think of sitting down with someone to "chat" about it. The act of articulating a question like "What should this data structure look like and be composed of?" would take longer than simply building it and reasoning about it in my own brain. This idea that software is something we do in a group socially, with one or more people talking back and forth, is just not the way I operate.

Sure, when your software calls some other person's API, or when your system talks to someone else's system, or in general you are working on a team to build a large system, then you need to write documents and collaborate with them, and have this back-and-forth, but that's always kind of felt like a special case of programming to me.

The idea of asking ChatGPT to "write a method that performs a CRC32 on a block of data" seems silly to me, because it's just not how I would do it. I know how to write a CRC function, so I would just write it. The idea of asking ChatGPT to help write a program that shuffles a deck of cards and deals out hands of Poker is equally silly because before I even finished writing this sentence, I'm visualizing the proper data structures that will be used to represent the cards, the deck, and players' hands. I don't need someone (human or AI) to bounce ideas off of.
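(For concreteness, here's roughly what "just writing it" means in the CRC case. A bit-at-a-time CRC-32 with the standard reflected 0xEDB88320 polynomial is only a few lines once the shape is in your head; this is just a sketch, and a real codebase would use a table-driven version or zlib.crc32 instead.)

    # Bit-at-a-time CRC-32 sketch (reflected polynomial 0xEDB88320).
    # In practice, prefer a table-driven version or zlib.crc32.
    def crc32(data: bytes) -> int:
        crc = 0xFFFFFFFF
        for byte in data:
            crc ^= byte
            for _ in range(8):
                if crc & 1:
                    crc = (crc >> 1) ^ 0xEDB88320
                else:
                    crc >>= 1
        return crc ^ 0xFFFFFFFF

    # Standard check value for the "123456789" test vector.
    assert crc32(b"123456789") == 0xCBF43926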

There's probably room for AI assistance for very, very junior programmers, who have not yet built up the capability of internally visualizing large systems. But for senior developers with more experience and capability, I'd expect the utility to go down because we have already built out that skill.


I consider myself to be fairly senior, and use it all the time for learning new things. I work with some brilliant senior developers who lean on it heavily, but I do think it doesn't mesh with the cognitive styles of many.

There might be something to this; however (N=1), I'm very much the kind of developer who hates pair programming and falls apart when forced to do it. But it's not the conversation that's the problem - it's other people. Or more specifically, my fight-or-flight response, which triggers when I am watched and have to keep up with someone (and extreme boredom if the other person can't keep up with me). LLMs aren't people, and they aren't judging me, so they do not trigger this response.

The chat interface is annoying, though. Because it's natural language, I have to type a lot more, which is frustrating - but on the other hand, because it's natural language, I can just type my stream of thought and the LLM understands it. The two aspects cancel each other out, so in terms of efficiency, it's a wash.


Depends on what you're working on. I'm a senior engineer currently doing a lot of scaffolding for startups, and my copilot saves me a ton of time. Life's good.

It's getting better, and new UIs for it are being tested, like Claude with Artifacts.

Senior engineers adopted Copilot and sang its praises a lot faster than the junior engineers did, especially when working on codebases in less familiar languages.


Nope. 10 years experience working at startups and FAANG.

And yes cursor AI/copilot helps with bugs as well.

It works because when you have a bug or error message, instead of spending a bunch of time on Google or searching Stack Overflow for the exact right answer, you can now do this:

"Hey AI. Here is my error message and stack trace. What part of the code could be causing it, and how should I fix it".

Even for debugging this is a massive speed up.
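To make that concrete, here is a minimal sketch of what that loop can look like when scripted. It assumes the OpenAI Python client; the model name, prompt wording, and the risky_operation stand-in are illustrative assumptions, not anyone's actual setup:

    # Minimal sketch: hand a caught traceback plus some code to an LLM
    # and print its suggested diagnosis. Assumes the OpenAI Python client
    # is installed and OPENAI_API_KEY is set in the environment.
    import traceback
    from openai import OpenAI

    client = OpenAI()

    def risky_operation():
        return {}["missing_key"]  # stand-in bug: raises KeyError

    def ask_llm_about_error(tb_text: str, code_snippet: str) -> str:
        # Same shape of prompt as described above, just assembled in code.
        prompt = (
            "Here is my error message and stack trace. "
            "What part of the code could be causing it, and how should I fix it?\n\n"
            f"Traceback:\n{tb_text}\n\nRelevant code:\n{code_snippet}"
        )
        resp = client.chat.completions.create(
            model="gpt-4o",  # illustrative model choice
            messages=[{"role": "user", "content": prompt}],
        )
        return resp.choices[0].message.content

    try:
        risky_operation()
    except Exception:
        print(ask_llm_about_error(traceback.format_exc(), open(__file__).read()))

In practice you'd paste the trace into a chat window or let your editor plugin do this for you; the point is just how little glue the "here's my stack trace, what's wrong?" loop needs.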

You can also ask the AI to just evaluate your code. Or explain it when you are trying to understand a new code base. Or lint it or format it. Or you can ask how it can be simplified or refactored or improved.

And every hour that you save not having to track down crazy bugs that might just be immediately solvable, is an hour that you can spend doing something else.

And that is without even getting into agents. I haven't figured out how to use those effectively yet, and even that makes me nervous/worried that I am missing some huge possible gains.

But sure, I'll agree that if all you are doing is making scaffolding, that is a fairly simple use case.


> It works because when you have a bug/error message, instead of spending a bunch of time on Google/searching on stack overflow for the exact right answer, you can now do this:

That's not how I've worked since I stopped being a junior dev. I might google an error message/library combination if I don't understand it, but in most cases I just read the stack trace and the docs, or maybe the code.

I don't doubt that LLMs can be quite useful when working with large, especially unfamiliar, codebases. But I have yet to see the level of "if you don't use it you're not an engineer" some people like to throw around. On the contrary, I'd argue that if you rely on an LLM to tell you what you should be doing, you aren't an engineer, you are a drone.


Sure, if you are one of the rare engineers who wasn't using Google search, or any sort of discussion or collaboration with other engineers, in their day-to-day engineering workflow, then I can fully understand why a super-powered version of the same thing wouldn't be useful to you.

I've added "How have you incorporated generative AI into your workflow?" as an interview question, and I don't know if it is stigma or actual low adoption, but I have not had a single enthusiastic response across 10+ interviews for senior engineer positions.

Meanwhile, I have ChatGPT open in the background and go from unaware to informed on every new keyword I hear around me, all day, every day. Not to mention annotating code, generating utility functions, and tracing errors.


I think it sort of depends on the work you do. If you've been working in a single language for a while, then I imagine much of the value LLMs might give you already lives in your existing automation workflows.

I personally like Copilot, but I work across several languages and codebases where I seriously can't remember how to do basic stuff. In those cases the automatic code generation from Copilot speeds me up, but it still can't do anything actually useful aside from making me more productive.

I fully expect the tools to become "necessary" for making sure things like JSDoc and other documentation are auto-updated when programmers alter something. Hell, if they become good enough at maintaining tests, that would be amazing. So far there hasn't been much improvement over the year we've used the tools, though. Productivity isn't even up across teams, because too many developers put too much trust in what the LLMs tell them, which means we have far more cleanup to do than we did in the previous couple of years. I think we will handle this once we get our change management good enough at teaching people that LLMs aren't necessarily more trustworthy than SO answers.


Would you consider hiring on a contract basis? I use AI tools like Copilot in vim, and have my own agent framework to ask questions or even edit files for me directly, which I have been trying to use more. And I could use a new contract. You can see my email in my HN profile.

What, in your mind, is the right answer to that question?

Good question.

The best answer is for someone who has found ways to boost their own productivity, but also understands the caveats such as hallucinations and not pasting proprietary information into text boxes on the internet.


Twice as fast greatly exaggerates the reality. In certain cases, sure, but more generally you are looking at a 10-50% productivity increase, most likely on the lower end. I say this as someone who has access to ChatGPT and AI code-completion tools and uses them every day, and the numbers are backed up by Google's study. https://research.google/blog/ai-in-software-engineering-at-g...

Very true! From where I sit, most of the hype cycles were overestimated in the short term & underestimated in the long term: the World Wide Web, mobile, big data, autonomous cars, AI, quantum, biotech, fintech, clean tech, space, and even crypto.

It's a bit weird to claim that "quantum", autonomous cars, and crypto are "underestimated". If anything they've been overhyped and totally failed to deliver actual value.

Underestimated in the long term. It's become clear that the time constant for deep-tech innovation & adoption doesn't match the classic VC-backed SaaS adoption curve.

We're nearing general quantum supremacy for practical jobs within the next 5 years. Autonomous cars are now rolling out in select geographies without safety drivers. And crypto is literally being pushed as an alternative to SWIFT among the BRICS, IMF, and BIS as a multinational CBDC via the mBridge program.


Yeah, some "expert, I work in the field" type YouTuber I remember from well over 7 months ago kept saying that we're going to have AGI within 7 months. It was the big prediction he was hinging practically his whole channel on... Sorry, I don't have the name, but there's a little anecdote.
