More

xtreme · 2024-11-18T21:19:08 1731964748

This actually tells you why AI doesn't have to be better than all human experts, just the ones you can afford to get together.

simion314 · 2024-11-19T12:25:31 1732019131

>This actually tells you why AI doesn't have to be better than all human experts, just the ones you can afford to get together.

And I agree, and a script might do a better job on some taks and you will not claim my script has reached singularity, right ?

xtreme · 2024-11-12T23:32:23 1731454343

Chegg wasn't a victim — it was a middleman profiting from locked-up educational content and exploiting students’ needs. ChatGPT didn't "lure" users; it provided a superior, accessible alternative, democratizing learning rather than hiding it behind paywalls. The argument against AI due to resource usage is selectively blind to the inefficiencies of legacy systems like Chegg. Calling this hype is like dismissing the internet as a fad — it’s a failure of imagination. Disruption always displaces incumbents, but clinging to outdated, exploitative models is far worse than embracing a tool that genuinely empowers users.

StableAlkyne · 2024-11-12T23:50:20 1731455420

Chegg felt predatory.

They had no real original content of their own, just worked solutions to homework problems they pulled from textbooks. They were good at SEO and would appear at the top. You clicked on it because it lied to you: showing you part of the content you wanted. Just enough for the search engine preview. That probably boosted them further, wasting more time by others tricked by the same fake results.

To see the rest of the answer, they wanted you to pay money and hope it was what you wanted. Who would subscribe to that other than students desperate for homework answers?

Then ChatGPT comes in without any of the scammy tactics. Sure, it's often wrong, but so are Chegg and Quora.

demaga · 2024-11-12T23:41:42 1731454902

The difference here is that Chegg was actually profitable. It is still unknown if OpenAI will be.

MBCook · 2024-11-12T23:53:23 1731455603

Right. It was an easy to understand example on hand of a business that was DESTROYED.

I’m sure there are plenty of good places that were destroyed. I just don’t have verifiable anecdotes for any others at hand.

xtreme · 2024-11-02T07:46:14 1730533574

200 mm/year gets you roughly 1000 engineers at 200k salary. Is that not enough to make rocm experience equal to cuda?

anon291 · 2024-11-02T14:29:09 1730557749

Considering that each kernel / kernel size is usually custom tuned on NVIDIA, I'd say no. Working in this field at several different companies, there are likely thousands of hand-tuned variations of a simple GEMM kernel. Each one required an engineer to look at specifically, even if they're all variations on a common theme.

As far as I know (and again, I work in the field of AI compilers), we're still a ways off from complete end-to-end generation of highly optimized kernels. If you want it to go fast, you need to write it by hand [1], and then test and validate.

Moreover, chip makers are constantly adding new features (Tensor Cores in NVIDIA for example), so the compiler is always playing catch up and at some point an engineer has to sit down (likely a team of them) and think 'what's the best way to exploit this hardware functionality for software performance?'. Then they have to test and validate that, and then either write a kernel, or attempt to put that know-how into a compiler.

Multiply this times the number of kernels in a typical suite, and... yeah.

And that was my point about herculean effort on modern chips. Assembly language isn't just the old 'Add register 1 and 2 and dump in R3' anymore. It's 'Use this instruction to access memory in this way, so that it's in a compatible format for the next instruction' and 'oh yeah, make sure your memory synchronization primitives are such that the whole thing is coherent'. Good luck!

Even going one step up into a higher-level language, you have to know how the kernel gets compiled to make it worthwhile. Again, it is trivial to write a correct opencl matrix multiply, but that's never going to be the highest performance. You have to know the hardware intimately. This is where having the software co-designed with hardware is very important. Basically, every AI chipmaker of any importance does this, including the startups, like Groq and Cerebras.

[1] A lot of kernels share basic patterns, so its not as hard as it sounds, but definitely requires engineering effort to get the design right.

almostgotcaught · 2024-11-02T16:03:45 1730563425

> Considering that each kernel / kernel size is usually custom tuned on NVIDIA, I'd say no. Working in this field at several different companies, there are likely thousands of hand-tuned variations of a simple GEMM kernel. Each one required an engineer to look at specifically, even if they're all variations on a common theme.

Lol that's absolutely not true. What you're describing is literally impossible for any company that has more than one product family on the market since each product has different scratch sizes, number of vector registers, data types supported/emulated etc.

Outside of trade show demos, kernels are codegened. What is true is there are recurring "themes/patterns" that are handled by engineers for a class of products. Lately this is flash attention...

> Again, it is trivial to write a correct opencl matrix multiply, but that's never going to be the highest performance.

I guess you work at AMD. The reason AMD ships a whole bunch of binary kernels is not because someone tuned/designed each one but because AMD doesn't have a PTX/SASS equivalent. So each kernel has to be compiled at build time for each device (it's also why they can't have LTS support for architectures).

anon291 · 2024-11-02T16:56:39 1730566599

> outside of trade show demos, kernels are condegened. What is true is there are recurring "themes/patterns" that are handled by engineers for a class of products. Lately this is flash attention

I never said they weren't using code generation. I said that each one requires a manual tune. You will set various parameters, determine if the generated code does well enough and then if there's performance to squeeze out, you modify the code generator.

> I guess you work at AMD.

Close but not quite

almostgotcaught · 2024-11-02T16:58:41 1730566721

> that each one requires a manual tune

Ya definitely not - everyone uses grid search or whatever latest BPO tuning strategy.

anon291 · 2024-11-02T17:17:26 1730567846

Oh right. Those require no engineering effort because I said so.

almostgotcaught · 2024-11-02T17:19:38 1730567978

They require one person or team to engineer and then a whole bunch of people to use...? That doesn't resemble in the least what you were describing where each kernel is hand-tuned for each shape and device. But please do continue to insist you're still somehow right

anon291 · 2024-11-02T22:15:36 1730585736

Sigh. I use to be an engineering manager for a kernels team. I think I know what I'm talking about. Yes, each kernel is paid individual attention to even if many are basically the same and require little rework. It's a lot of work. Now I work as an ic in the same field . I don't need to insist I'm right because it's what I do all day

almostgotcaught · 2024-11-03T01:40:43 1730598043

> I think I know what I'm talking about

I've refuted your claims point by point and all you can say is "trust me bro"? Cool but I think you do not in fact know what you're talking about.

lumost · 2024-11-04T22:26:04 1730759164

I work in DB kernels, everything gets hand tuned as there is economic reason for hand tuning it. The expectation in many of these systems is that there are no wasted cycles. You can codegen a decent kernel, but then someone will find a better approach - do you want the slower version of the product, or the faster one?

You can see this in action with matrix math libraries, folks have been hand tuning those for decades at this point.

anon291 · 2024-11-03T04:59:32 1730609972

Your claim is that there are automated methods (which I mentioned in my original post) to manage the complexity. My claim is that it requires a large team of engineers working on it. I'm not really sure what you think you've refuted.

binary132 · 2024-11-02T08:52:10 1730537530

1000 engineers don’t automatically crank out 50x more code than 20 engineers. But GP is just saying there are a lot of subcomponents involved that each need major engineering effort dedicated to them.

I see it less as an engineering problem and more as a market problem. AMD stuff has existed, it’s the market that doesn’t see a point in it, and at this point, even feature parity or CUDA compatibility for that matter won’t make a huge dent. People will just keep using what they know and are recommended.

It’s more amazing to me that NVDA is so intensely inflated by this LLM hype wave. I find it genuinely scary to think about what’s going to happen when 95+% of AI slopware startups fold. Nvidia won’t be the only company financially impacted. Our entire economy runs on fads.

xtreme · 2024-10-22T18:09:38 1729620578

Your conclusion does not follow from your premise. AI moderation should easily catch the worst offenders and the most obvious ones. The examples you gave easily stand out from non-offensive content and are easy to catch with high confidence. So, human moderators will have to look at where AI has low confidence classifying the content. In fact, AI will reduce the likelihood of human moderators ever seeing traumatic content.

xtreme · 2024-10-02T13:55:51 1727877351

So what? Maybe there would have been multiple island nations, maybe they all would have seen other countries and joined hands together. Why should the Dutch get any say in what modern Indonesia should look like?

Same goes for the parent comment about India. The decline of the Mughal empire and the possibility of multiple states arising instead of an unified India doesn't justify the British colonialism and in no way absolves them of their atrocities.

rayiner · 2024-10-02T18:51:13 1727895073

Why get worked up about one foreign empire coming in and replacing another foreign empire?

xtreme · 2024-09-24T01:27:39 1727141259

As someone who loves Typescript, the first chapter of your book deeply resonated with me. Good data representations and data types that encode meaning can eliminate entire classes of bugs by making invalid states unrepresentable, and I wish more languages emphasized and supported these principles.

xtreme · 2024-08-23T16:58:05 1724432285

I agree with what you are saying, but I find it interesting that physics can also predict properties of planets, stars, and galaxies interacting over cosmic distances. At that scale, you can zoom out and reduce the complexity again back to a handful of rules.

mensetmanusman · 2024-08-23T17:00:33 1724432433

This depends on how many digits of accuracy you are looking for. With living things, we care more than with hot rocks, because the entropy gradients are so much higher.

BobaFloutist · 2024-08-23T17:18:56 1724433536

Right, and a big enough crowd of people behaves like a fluid.

mensetmanusman · 2024-08-24T15:14:36 1724512476

Depends on the crowd, assassination attempts induce stampedes, but not always.

xtreme · 2024-04-14T03:05:54 1713063954

It's not doable because your current location needs to be in the center of the screen in north up mode but can be placed lower in regular mode. Also the non square aspect ratio of the phone screen means that you don't get the same field of view in all four directions.

Dylan16807 · 2024-04-14T04:46:17 1713069977

> your current location needs to be in the center of the screen in north up mode

No it doesn't, so you just identified an easy opportunity for improvement.

boneitis · 2024-04-14T11:15:26 1713093326

Thank. You.

xtreme · 2024-03-12T18:24:13 1710267853

Buying handcrafted artisan stuff is a luxury few can afford. I come from a poor family and I was always grateful for mass produced mediocre stuff that we could actually afford.

MSFT_Edging · 2024-03-12T19:28:11 1710271691

Sure but the same optimizations that bring the costs down, bring down the average laborers value.

I'm not saying that things were sunshine and rainbows pre-industrialization, but there's some level of analysis to be done on the durability and value of a handcrafted piece of clothing, the care that goes into maintaining it, the value of a local economy, and the other side where you're forced to buy cheap items that degrade at a far faster rate.

If a town's local businesses are put out by a new walmart's ability to carry low prices, does the town truly come out ahead with those low prices? Or does Walmart simply extract more money from the town than it returns, leaving the town worse off?

MacsHeadroom · 2024-03-13T00:56:05 1710291365

> bring down the average laborers value.

Labor has never been to afford so much luxury as the modern day.

esoterica · 2024-03-13T01:21:08 1710292868

Those optimizations increase the value of labor by greatly increasing the amount of output per unit of labor. Do you understand how much wealthier the modern worker is than the pre Industrial Revolution peasant?

MSFT_Edging · 2024-03-13T13:32:50 1710336770

Pre-industrial societies may not have had flat screen tvs, but they had time. The only thing you can't buy.

esoterica · 2024-03-13T16:07:59 1710346079

They did not actually have time because they mostly died in infancy.

That said if you are satisfied with a pre industrial quality of life no one is stopping you from doing zero hours of work and just becoming homeless.

MSFT_Edging · 2024-03-14T12:38:21 1710419901

It's always interesting to see the default argument of "u dont like it? just be homeless".

Where does that come from? I see it all the time. It's like there's zero ability to step outside the black and white. Either you must be totally onboard with the current system and not question the negative aspects, or you must live without dignity in the streets.

What I'm saying is we really don't need to be working 40+hrs/week in order to enjoy these advancements. People don't need to be made homeless in order for scientists who research these medicines to be paid peanuts.

QuizzicalCarbon · 2024-03-18T14:23:30 1710771810

“They had time” is such an amusing, and inaccurate, assertion. Toil was a very real, everyday thing. The only way to avoid toil was to be on one’s deathbed.

robbbed · 2024-03-15T14:08:46 1710511726

That’s a good point. What’s always been strange to me is why we as a community allow this. We should boycott the Walmarts and Devins of the world. Drive them out of business by voting with our dollars.

dandelionsnow · 2024-03-12T19:24:04 1710271444

Handcrafted artisinal stuff is a luxury because that's the only niche that makes economic sense for it now, given mass production and other recent developments (too lazy to list, sorry).

Consider how you can't really get by in most of America without a car because we designed our cities to require them. It would be a mistake to conclude that, because life is harder in a car-optimized society without a car, society must be better off optimizing for cars.

qqqwerty · 2024-03-12T20:09:13 1710274153

The capitalist system keeps you poor by design. And you feed the system by purchasing the mass produced garbage. Sure, it is nice to afford stuff when poor, but we don't need to live in a society where being poor is common, or where mass produced garbage is the default option for most.

CaptainFever · 2024-03-20T09:08:26 1710925706

> ...but we don't need to live in a society...

Please describe the alternative. "Mass produced garbage" is exactly how we are able to feed the world. It's not some sort of conspiracy, just scarcity.

(Welfare and UBI is still capitalist.)

xtreme · 2024-02-05T15:58:36 1707148716

There are size limits when deploying on serverless or edge infrastructure so developers have to care about that. The providers also typically charge by compute seconds * memory consumed so a larger executable costs real money as well.

fshr · 2024-02-05T16:43:07 1707151387

The compiled option isn’t intended for serverless or edge uses, you just deploy the source files and the platform takes over.

dmattia · 2024-02-05T17:43:27 1707155007

Some serverless use cases work like you say, but Docker-based options such as AWS ECS, Docker-based Lambda functions, or Kubernetes would all commonly make use of compiled options

0x457 · 2024-02-05T22:34:07 1707172447

No, you won't use compiled option in k8s or any other container based deployment.

You would use deno as base layer, your dependencies in another layer, your code is in the last layer.