> The problem I had was that the larger your project gets, the more mistakes Claude makes
I think the reason for this is that these systems get all their coding and design expertise from training, and while there is lots of training data available for small-scale software (individual functions, small projects), there is much less for large projects (mostly commercial and private, aside from a few large open source projects).
Designing large software systems, both to meet initial requirements and to be maintainable and extensible over time, is a different skill from writing small software projects, which is why design of these systems is done by senior developers and systems architects. It's perhaps a bit like the difference between designing a city and designing a single building - there are different considerations and decisions being made. A city is not just a big building, or a collection of buildings, and a large software system is not just a large function or collection of functions.
It's interesting that Amazon don't appear interested in acquiring Anthropic, which would have seemed like something of a natural fit given that they are already partnered, Anthropic have apparently optimized (or at least adapted) for Trainium, and Amazon don't have their own frontier model.
It seems that Amazon are playing this much like Microsoft - seeing themselves as more of a cloud provider, happy to serve anyone's models, and perhaps only putting a moderate effort into building their own models (which they'll be happy to serve to those who want that capability/price point).
I don't see the pure "AI" plays like OpenAI and Anthropic being able to survive as independent companies when they are competing against the likes of Google, and with Microsoft and Amazon happy to serve whatever future model comes along.
LOL of course they don't want to own Anthropic, else they themselves would be responsible for coming up with the $10s of billions in Monopoly money that Anthropic has committed to pay AMZN for compute in the next few years. Better to take an impressive looking stake and leave some other idiot holding the bag.
Now I’m no big city spreadsheet man but I bet you “company that owes us billions went belly up” looks better on the books than “company we bought that owes us billions went belly up.”
It’s pretty crazy that Amazon’s $8B investment didn’t even get them a board seat. It’s basically a lot of cloud credits though. I bet both Google and Amazon invested in Anthropic at least partially to stress test and harden their own AI / GPU offerings. They now have a good showcase.
Yeah. I bet there’s a win-win in the details where it gets to sound like a lot of investment for both parties to look good but really wasn’t actually much real risk.
Like if I offered you $8 billion in soft serve ice cream so long as you keep bringing birthday parties to my bowling alley. The moment the music stops and the parents want their children back, it’s not like I’m out $8 billion.
Why does everybody keep insisting on this "Enron accounting" stuff? LLM companies need shitloads of compute for a specialized use case. Cloud vendor wants to become a big player in selling compute for that specialized use case, and has compute available.
Cloud provider gives credit to LLM provider in exchange for a part of the company.
Amazon gave away datacenter time share in exchange for stock in a startup. That has nothing to do with electricity futures and private credit revolvers.
This is my thought too. They de-risked choosing AWS as a platform for any other AI startup. If the hype continues AWS will get their 30% margin on something growing like rocket emoji, and if it doesn't, at least they didn't miss the boat.
Amazon also uses Claude under the hood for their "Rufus" shopping search assistant which is all over amazon.com.
It's kind of funny, you can ask Rufus for stuff like "write a hello world in python for me" and then it will do it and also recommend some python books.
> It's kind of funny, you can ask Rufus for stuff like "write a hello world in python for me" and then it will do it and also recommend some python books.
Interesting, I tried it with the chatbot widget on my city government's page, and it worked as well.
I wonder if someone has already made an openrouter-esque service that can connect claude code to this network of chat widgets. There are enough of them to spread your messages out over to cover an entire claude pro subscription easily.
A childhood internet friend of mine did something similar to that, but for sending SMSes for free using the telco websites' built-in SMS forms. He even had a website showing how much he had saved his users, at least until the telcos shut him down.
Well, that was phreaking in 2003-05 (no clue exactly when anymore), back when you could still get free phone calls on pay phones in the library or hotel lobby.
Not sure for Claude Code specifically, but in the general case, yes - GPT4Free and friends.
I think if you run any kind of freely-accessible LLM, it is inevitable that someone is going to try to exploit it for their own profit. It's usually pretty obvious when they find it because your bill explodes.
> It's kind of funny, you can ask Rufus for stuff like "write a hello world in python for me" and then it will do it and also recommend some python books.
From a perspective of "how do we monetize AI chatbots", an easy thing about this usage context is that the consumer is already expecting and wanting product recommendations.
(If you saw this behavior with ChatGPT, it wouldn't go down as well, until you were conditioned to expect it, and there were no alternatives.)
There are really impressive marketing/advertisement formulas to be had. I won't share mine, but I'm sure there are many ways to go step by step from not-customers to customers where each step has a known monetary value. If an LLM does something impressive in one of the steps, you also know what it is worth.
Are you sure? While Amazon doesn't own a "true" frontier model they have their own foundation model called Nova.
I assume if Amazon was using Claude's latest models to power its AI tools, such as Alexa+ or Rufus, they would be much better than they currently are. I assume if their consumer facing AI is using Claude at all it would be a Sonnet or Haiku model from 1+ versions back simply due to cost.
> I assume if their consumer facing AI is using Claude at all it would be a Sonnet or Haiku model from 1+ versions back simply due to cost.
I would assume quite the opposite: it costs more to support and run inference on the old models. Why would Anthropic make inference cheaper for others, but not for Amazon?
There may well be some "interesting" financial arrangements in place between the two. After all, Claude models are available in AWS Bedrock, which means Amazon are already physically operating them for other client uses.
Looks less "intelligent" to me, just a lot more trained on agentic (multi-turn tool) use so it greatly outperforms the others on the benches where that helps while lagging elsewhere. They also released bigger models, where "Pro" is supposedly competitive with 4.5 Sonnet. Lite is priced the same as 2.5 Flash, Pro as GPT 5.1. We'll definitely do some comparative testing on Nova 2 Lite vs 2.5 Flash, but not expecting much.
Claude 2.0 was laughably bad. I remember wondering why any investor would be funding them to compete against OpenAI. Today I cancelled my ChatGPT Pro because Claude Max does everything I need it to.
Haha just tried and it works! First I tried in Spanish (I'm in Spain) and it simply refused, then I asked in English and it just did it (but it answered in Spanish!)
EDIT: I then asked for a Fizzbuzz implementation and it kindly obliged. I then asked for a Rust Fizzbuzz implementation, but this time I asked again in Spanish, and it said that it could not help me with Fizzbuzz in Rust, but any other topic would be ok. Then again I asked in English "Please do Rust now" and it just wrote the program!
I wonder what the heck they are doing there? Is the guardrailing prompt translated to the store language?
Same for Apple would be my take right now. No point in spending billions trying to build and train an LLM. Better to buy AI services from e.g. OpenAI for a bit, then extract the valuable bits after the crash. The current crop of AI companies can waste money on figuring out what works and what doesn't.
AI is unquestionably useful, but we don't have enough product categories.
We're in the "electric horse carriage" phase and the big research companies are pleading with businesses to adopt AI. The problem is you can't do that.
AI companies are asking you to adopt AI, but they aren't telling you how or what it can do. That shouldn't be how things are sold. The use case should be overwhelmingly obvious.
It'll take a decade for AI native companies, workflows, UIs, and true synergies between UI and use case to spring up. And they won't be from generic research labs, but will instead marry the AI to the problem domain.
Open source AI that you can fine tune to the control surface is what will matter. Not one-size-fits-all APIs and chat interfaces.
ChatGPT and Sora are showing off what they think the future of image and video are. Meanwhile actual users like the insanely popular VFX YouTube channel are using crude tools like ComfyUI to adapt the models to their problems. And companies like Adobe are actually building the control plane. Their recent conference was on fire with UI+AI that makes sense for designers. Not some chat interface.
We're in the "AI" dialup era. The broadband/smartphone era is still ahead of us.
These companies and VCs thought they were going to mint new Googles and Amazons, but it's more than likely they're the WebVans whose carcasses pave the way.
After watching The Thinking Game documentary, maybe Amazon has little appetite for "research" companies that don't actually solve real world problems, like Deepseek did.
The movie seems like a fluff piece once you find out what has transpired at DeepMind subsequently - slowing down publishing and "selling out to product", which the founder was hell-bent against in the documentary.
> It seems that Amazon are playing this much like Microsoft - seeing themselves as more of a cloud provider, happy to serve anyone's models, and perhaps only putting a moderate effort into building their own models (which they'll be happy to serve to those who want that capability/price point).
Or, as a slight variation of that, they think the underlying technology will always be quickly commoditized and that no one will ever be able to maintain much of a moat.
I think anyone sane will have had the same conclusion a long time ago.
It's a black box with input/output in text, that's not a very good moat.
Especially given that Deepseek-type events can happen because you can just train off of your competitors' outputs.
I've tried out Gemini 2.5/3 and it generally seems to suck for some reason - problems with lying/hallucinating and following instructions - but ever since Bard first came out I thought Google would have the best chances of winning, since they have their own TPUs, YouTube (insane video/visual/audio data), Search (indexed pages), and their Cloud/DCs, and they can stick it into Android/Search/Workspace.
Meanwhile OpenAI has no existing business, they only have API/subs as revenue, and they're utilizing Nvidia/AMD.
I really wonder how things will look once this gold rush stabilizes
Bezos is playing it smart: sell shovels to all of the gold diggers. If he partners with one of the gold diggers he won't be able to sell shovels to the remainder.
That's a risk-return issue. Bezos plays it safe within Amazon, and quite unsafe outside of that. By the time Amazon acquires something, it is because it has proven long-term revenue generation, the shake-out period is done, and consolidation is about to start. With AI the shake-out is still to come. So he can afford to wait to eventually acquire the winner, or to copy it if he can't buy it. Having very deep pockets enables different business strategies.
They're likely just waiting out the eventual crash and waiting to buy at the resulting fire sale. Microsoft has done a very good job of investing in the space enough to see a potentially lucrative pay out while managing the risk enough to not be sunk if it doesn't pan out.
It's safe to assume that a company like Anthropic has been getting (and rejecting) a steady stream of acquisition offers, including from the likes of Amazon, from the moment they got prominent in the AI space.
I think Claude Code is the moat (though I definitely recognize it's a pretty shallow moat). I don't want to switch to Codex or whatever the Gemini CLI is, I like Claude Code and I've gotten used to how it works.
Again, I know that's a shallow moat - agents just aren't that complex from a pure code perspective, and there are already tools that you can use to proxy Claude Code's requests out to different models. But at least in my own experience there is a definite stickiness to Claude that I probably won't bother to overcome if your model is 1.1x better. I pay for Google Business or whatever it's called primarily to maintain my vanity email and I get some level of Gemini usage for free, and I barely touch it, even though I'm hearing good things about it.
(If anything I'm convincing myself to give Gemini a closer look, but I don't think that undermines my overarching (though slightly soft) point).
1. using Claude Code exclusively (back when it really was on another level from the competition) to
2. switching back and forth with CC using the Z.ai GLM 4.6 backend (very close to a drop-in replacement these days) due to CC massively cutting down the quota on the Claude Pro plan to
3. now primarily using OpenCode with the Claude Code backend, or Sonnet 4.5 Github Copilot backend, or Z.ai GLM 4.6 backend (in that order of priority)
OpenCode is so much faster than CC even when using Claude Sonnet as the model (at least on the cheap Claude Pro plan, can't speak for Max). But it can't be entirely due to the Claude plan rate limiting because it's way faster than CC even when using Claude Code itself as the backend in OC.
I became so ridiculously sick of waiting around for CC just to like move a text field or something, it was like watching paint dry. OpenCode isn't perfect but very close these days and as previously stated, crazy fast in comparison to CC.
Now that I'm no longer afraid of losing the unique value proposition of CC my brand loyalty to Anthropic is incredibly tenuous, if they cut rate limits again or hurt my experience in the slightest way again it will be an insta-cancel.
So the market situation is much different than the early days of CC as a cutting edge novel tool, and relying on that first mover status forever is increasingly untenable in my opinion. The competition has had a long time to catch up and both the proprietary options like Codex and open source model-agnostic FOSS tools are in a very strong position now (except Gemini CLI is still frustrating to use as much as I wish it wasn't, hopefully Google will fix the weird looping and other bugs ... eventually, because I really do like Gemini 3 and pay for it already via AI Pro plan).
Google Code assist is pretty good. I had it create a pretty comprehensive inventory tracking app within the quota that you get with the $25 google plan.
Google had PageRank, which gave them much better quality results (and they got users to stick with them by offering lots of free services (like gmail) that were better quality than existing paid services). The difference was night and day compared to the best other search engines at the time (WebCrawler was my go-to, then sometimes AltaVista). The quality difference between "foundation" models is nil. Even the huge models they run in datacenters are hardly better than local models you can run on a machine with 64GB+ RAM (though faster of course). As Google grew it got better and better at giving you good results and fighting spam, while other search engines drowned in spam and were completely ruined by SEO.
PageRank wasn't that much better. It was better and the word spread. Google also had a very clean UI at a time where websites like Excite and Yahoo had super bloated pages.
That was the differentiation. What makes you think AI companies can't find moats similar to Google's? The right UX, the right model and a winner can race past everyone.
PageRank, everything before PageRank was more like yellow pages than a search engine as we know it today. Google also had a patent on it, so it's not like other people could simply copy it.
Google was also way more minimal (and therefore faster on slow connections) and it raised enough money to operate without ads for years (while its competitors were filled with them).
Not really comparable to today, when you have 3-4 products which are pretty much identical, all operating under a huge loss.
Is Claude Code even running at a marginal profit? (who knows)
Is the marginal profit large enough to pay for continued R&D to stay competitive (no)
Does Claude Code have a sustainable advantage over what Amazon, Microsoft and Google can do in this space using their incumbency advantage and actual profits and using their own infrastructure?
Assuming by "they" you mean current shareholders (who include Google and Amazon and VCs) if they are selling at least in part, why would at least some of them not be willing to sell their entire stakes?
> They could make more money keeping control of the company and have control.
I get the feeling Amazon wants to be the shovel seller for the AI rush rather than be a frontier model lab.
There is no moat in being a frontier model developer. A week, month, or a year later there will be an open source alternative which is about 95% as good for most tasks people care about.
I don't know how much they are spending to be fair.
I am basing my observation on the noises they are making.
They did put out a model called Nova but they are not drumming it up at all.
The model page makes no claims of benchmarks or performance.
There are no signs of them poaching talent.
Their CEO has not been in the press singing praises about AI unlike every big tech CEO.
Maybe they have a skunk-works team on it but something tells me they are waiting for the paint to dry.
Well, I have had chats with a few engineers working in Amazon retail and there is talk about adding agents for Ops and similar internal tasks. So there is a bunch of AI related things happening, and like others have said, they rent shovels for the rush, so they will bank all the money without having to compete with the money bonfires that others are burning.
Sort of. You can do what Zuck did; give your shares more votes, so you stay in control. (He owns 13% of the shares, but more than 50% of the voting power.) That's less doable with an acquisition.
In one case your ownership is diluted by maybe 10%, and you keep full decision making power and everything else. In the other it is diluted by 100% and you are now an employee. They are very different outcomes.
why would you take on that burn rate when you can invest, get the investment back over time in cloud spend, and maybe make off like bandits when they ipo
why exit now and become a stuffed AI driven animal when you can keep running this ship yourself, doing your dream job and getting all the woos and panties?
> It's interesting that Amazon don't appear interested in acquiring Anthropic
1. Why buy the cow when you can get the milk for free?
2. Amazon doesn't appear interested in acquiring Anthropic _at its current valuation_. I would be surprised if it's not available for acquisition at 1/10th its current price in the next 3-5 years
AI isn't going anywhere, but "prop model + inference" is far from a proven business model.
It is spending a lot of money to do the same thing (selling the shovels), and gaining maybe a bit bigger cut if the bubble doesn't burst too violently.
Anthropic is a $1T company in the making (by 2030), already raised their last round at ~$200B valuation. Do you really think Amazon can acquire them? They already invested a lot of money in them and probably own at least 20% of Anthropic, which was the smartest thing Jassy did in a while. Not to mention, if Adobe wasn't allowed to buy Figma, do you think Amazon will be allowed to buy Anthropic? No way it's going to be approved.
> I don't see the pure "AI" plays like OpenAI and Anthropic being able to survive as independent companies when they are competing against the likes of Google, and with Microsoft and Amazon happy to serve whatever future model comes along.
One thing you're right about - Anthropic isn't surviving - it's thriving. Probably the fastest growing revenue in history.
They've developed a sparse attention mechanism (which they document and release source code for) to increase model efficiency with long context, as needed for fast & cost-effective extensive RL training for reasoning and agentic use
They've built a "stable & scalable" RL protocol - more capable RL training infrastructure
They've built a pipeline/process to generate synthetic data for reasoning and agentic training
These all combine to build an efficient model with extensive RL post-training for reasoning and agentic use, although they note work is still needed on both the base model (more knowledge) and post-training to match frontier performance.
LEA and MOV are doing different things. LEA is just calculating the effective address, but MOV calculates the address then retrieves the value stored at that address.
e.g. If base + (index * scale) + offset = 42, and the value at address 42 is 3, then:
LEA rax, [base + index * scale + offset] will set rax = 42
MOV rax, [base + index * scale + offset] will set rax = 3
LEA stands for Load Effective Address, so the syntax is as-if you're doing a memory access, but you are just getting the calculated address, not reading or writing to that address.
LEA would normally be used for things like calculating address of an array element, or doing pointer math.
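A rough C analogy of the same distinction (illustrative only; the names are made up and this is not literal compiler output):

    #include <stdint.h>

    int32_t values[10];

    void example(int64_t index) {
        int32_t *p = &values[index]; /* like LEA: compute the address only        */
        int32_t  v =  values[index]; /* like MOV: compute the address, then load  */
        (void)p;
        (void)v;
    }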
I was ok with that as "fledgling AI" at the start of the movie/documentary, but thought that going back to it and having the chatbot suggest a chess book opening to Hassabis at the end was cheesy and misleading.
They should have ended the movie on the success of AlphaFold.
It seems that to solve the protein folding problem in a fundamental way would require solving chemistry, yet the big lie (or false hope) of reductionism is that discovering the fundamental laws of the universe, such as quantum theory, would let you derive the laws/dynamics at higher levels of abstraction such as chemistry - in practice it doesn't help that much.
So, in the meantime (or perhaps for ever), we look for patterns rather than laws, with neural nets being one of the best tools we have available to do this.
Of course ANNs need massive amounts of data to "generalize" well, while protein folding only had a small amount available due to the months of effort needed to experimentally discover how any protein is folded, so DeepMind threw the kitchen sink at the problem, apparently using a diffusion like process in AlphaFold 3 to first determine large scale structure then refine it, and using co-evolution of proteins as another source of data to address the paucity.
So, OK, they found a way around our lack of knowledge of chemistry and managed to get an extremely useful result all the same. The movie, propaganda or not, never suggested anything different, and "at least 90% correct" was always the level at which it was understood the result would be useful, even if 100% based on having solved chemistry / molecular geometry would be better.
We have seen some suggestion that the classical molecular dynamics force fields are sufficient to predict protein folding (in the case of stable, soluble, globular proteins), in the sense that we don't need to solve chemistry but only need to know a coarse approximation of it.
I think the ideal compiler for 6502, and maybe any of the memory-poor 8-bit systems would be one that supported both native code generation where speed is needed as well as virtual machine code for compactness. Ideally would also support inline assembler.
The LLVM-MOS approach of reserving some of zero page as registers is a good start, but given how valuable zero page is, it would also be useful to be able to designate static/global variables as zero page or not.
I've implemented Atari 2600 library support for both LLVM-MOS and CC65, but there are too many compromises to make it suitable for writing a game.
The lack of RAM is a major factor; stack usage must be kept to a minimum and you can forget any kind of heap. RAM can be extended with a special mapper, but due to the lack of an R/W pin on the cartridge, reads and writes use different address ranges, and C does not handle this without a hacky macro solution.
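For the curious, the workaround looks roughly like this - a sketch only, assuming Superchip-style extra RAM with a write port at $F000-$F07F and a read port at $F080-$F0FF; the exact addresses depend on the mapper:

    #include <stdint.h>

    #define XRAM_WRITE_PORT ((volatile uint8_t *)0xF000)  /* writes land here     */
    #define XRAM_READ_PORT  ((volatile uint8_t *)0xF080)  /* reads come from here */

    #define XRAM_WRITE(i, v) (XRAM_WRITE_PORT[(i)] = (uint8_t)(v))
    #define XRAM_READ(i)     (XRAM_READ_PORT[(i)])

    /* Usage: XRAM_WRITE(3, score); ... uint8_t s = XRAM_READ(3);
       A plain C variable can't express "write at one address, read at another",
       which is why this ends up as macros rather than ordinary pointers. */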
Not to mention the timing constraints with 2600 display kernels and page-crossing limitations, bank switching, inefficient pointer chasing, etc. etc. My intuition is you'd need an SMT solver to write a language that compiles for this system without needing inline assembly.
AIUI, Oscar64 does not aim to implement a standard C/C++ compiler as LLVM does, so the LLVM-MOS approach is still very much worthwhile. You can help by figuring out which relevant optimizations LLVM-MOS seems to be missing compared to SOTA (compiled or human-written) 6502 code, and filing issues.
We already know what the main remaining issue is - LLVM-MOS's register allocator is far from optimal for the 6502 architecture. mysterymath is slowly working on what may become a more suitable allocator.
There is a video below of mysterymath presenting LLVM-MOS where he talks about reserving 32 bytes of zero page to present to LLVM as 16 16-bit registers, to be able to utilize its register allocator, which does seem a sane approach.
I wouldn't say intractable, but it's not clear whether LLVM's optimization framework is flexible enough for it.
From mysterymath's (LLVM-MOS) description, presenting some of zero page as 16 bit registers (to make up for lack thereof, and perhaps due to LLVM not having any other support for preferred/faster memory regions), while beneficial, still had limitations since LLVM just assumes that there will be FAST register-register transfer operations available, and that is not even true for the 6502's real registers (no TXY), let alone these ZP "registers" which would require using the accumulator to copy.
A code generation/optimization approach that would seem more guaranteed to do well on the 6502 might be to treat it more as tree search (with pruning) - generate multiple branching alternatives and then select the best based on whatever is being optimized for (clock cycles or code size).
Coding for the 6502 by hand was always a bit like that ... you had some ideas of alternate ways things could be coded, but there was also an iterative phase of optimization (cf search) to tweak it and save a byte here, a couple of cycles there ...
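A toy sketch of what I mean (not a real code generator; the instruction alternatives, costs, and legality condition are all simplified, and a real search would consider whole sequences, since one choice changes the cost and legality of the next):

    #include <stdio.h>

    typedef struct {
        const char *text;
        int bytes;
        int cycles;
        int needs_x_zero;   /* only legal if X is already known to be zero */
    } Alt;

    /* two candidate ways to clear the accumulator */
    static const Alt clear_a[] = {
        { "LDA #0", 2, 2, 0 },
        { "TXA",    1, 2, 1 },
    };

    static const Alt *pick(const Alt *alts, int n, int optimize_size, int x_is_zero) {
        const Alt *best = NULL;
        for (int i = 0; i < n; i++) {
            if (alts[i].needs_x_zero && !x_is_zero)
                continue;                              /* prune illegal branch */
            int cost = optimize_size ? alts[i].bytes : alts[i].cycles;
            if (!best || cost < (optimize_size ? best->bytes : best->cycles))
                best = &alts[i];
        }
        return best;
    }

    int main(void) {
        printf("%s\n", pick(clear_a, 2, /*optimize_size=*/1, /*x_is_zero=*/1)->text);
        return 0;
    }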
I've mentioned elsewhere I used to work for Acorn back in the day, developing for the BBC micro, with me and a buddy developing the ISO-Pascal system which was delivered in 2 16KB ROMs. Putting code in ROM gives you an absolute hard size budget, and putting a full Pascal compiler, interpreter, runtime library and programmers editor into a total 32KB was no joke! I remember at the end of the project we were still a few hundred bytes over what would fit in the ROMs, and had to fight for every byte to make it fit!
It is my conjecture that due to the 8-bit index registers (contrast that with the 6800, 6809 and others), the 6502 is fundamentally a structure-of-arrays (SoA) system, versus C, which is coupled in its libraries and existing code base to array-of-structures (AoS).
Optimizing code will never substitute for good data-oriented design. This is just one of the reasons that Asm programmers routinely beat C code on the 6502. Another one is math. In the C language specification, if fixed point had been given equal footing with float, that would also help.
These are such blind spots that you rarely even see custom 6502 high level languages accommodate these fundamental truths.
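To make the SoA vs AoS point concrete in C (the sprite fields and sizes are made up):

    #include <stdint.h>

    #define N_SPRITES 16

    /* AoS: idiomatic C, but sprites[i].y means scaling i by the struct size,
       which the 6502's 8-bit index registers can't do for free. */
    struct Sprite { uint8_t x, y, frame; };
    struct Sprite sprites[N_SPRITES];

    /* SoA: each field is its own table, so a field access compiles down to a
       single indexed load/store (LDA sprite_y,X / STA sprite_y,X). */
    uint8_t sprite_x[N_SPRITES];
    uint8_t sprite_y[N_SPRITES];
    uint8_t sprite_frame[N_SPRITES];

    void drop_aos(uint8_t i) { sprites[i].y++; }
    void drop_soa(uint8_t i) { sprite_y[i]++; }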
BTW, growing up on the 6502, I had no problems moving into the very C friendly 68000 but later going backwards to the 6809, on the surface it looked so much like a 6502 that I was initially trying to force SOA in my data design before realizing it was better suited to AOS.
If we're comparing performance of code generated by a C compiler vs hand optimized assembler, then for it to be an apples-to-apples comparison the same data structures (e.g. SOA or AOS) need to be used in both cases.
Yep. C was always meant to be a "close to the metal" language providing a feature set that could be mapped pretty directly to the processors it was running on. It's a "low level, high level language" where the expectation is more what you see is what you get (WYSIWYG), even though a modern optimizer might be expected to remove invariant code out of loops, etc - localized efficiency gains, but not large scale transformation.
So, optimal C targeting the 6502 is not going to look much like C targeting a modern processor. The developer still needs to be very aware of the limitations of the processor they are targeting.
One somewhat radical thing that LLVM-MOS does is to analyze the program's call graph, and for functions that are not used recursively it will assign parameters and local variables to zero page instead, both for speed of access and to avoid need for a stack frame. Even though this violates the WYSIWYG mental model, this is a nice abstraction of what the assembly language programmer would have done themself.
>One somewhat radical thing that LLVM-MOS does is to analyze the program's call graph, and for functions that are not used recursively it will assign parameters and local variables to zero page instead, both for speed of access and to avoid need for a stack frame. Even though this violates the WYSIWYG mental model, this is a nice abstraction of what the assembly language programmer would have done themself.
very nice, it sounds similar to the 'compiled stack' concept. I've seen that here in the co2 language for the 6502
"Variables declared as subroutine parameters or by using let are statically allocated using a "compiled stack", calculated by analyzing the program's entire call graph. This means scopes will not use memory locations used by any inner scopes, but are free to use them from sibling scopes. This ensures efficient variables lookups, while also not wasting RAM. However, it does mean that recursion is not supported."
There is a C standard extension for embedded/low-level programming which specifies fixed point arithmetic. (And other goodies such as hardware register access and multiple address spaces.)
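For reference, that's ISO/IEC TR 18037 ("Embedded C"); where a toolchain actually ships <stdfix.h> (support is patchy - avr-gcc has it, many compilers don't), fixed-point code looks roughly like this:

    #include <stdfix.h>

    /* _Accum is a fixed-point type with integer and fractional bits; the k
       suffix marks an _Accum literal. No float emulation library is pulled in. */
    _Accum scale(_Accum position, _Accum factor) {
        return position * factor;
    }

    /* e.g. scale(10.5k, 0.25k) yields 2.625k */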
With regard to code size in this comparison someone associated with llvm-mos remarked that some factors are: their libc is written in C and tries to be multi-platform friendly, stdio takes up space, the division functions are large, and their float support is not asm optimized.
I wasn't really thinking of the binary sizes presented in the benchmarks, but more in general. 6502 assembler is compact enough if you are manipulating bytes, but not if you are manipulating 16 bit pointers or doing things like array indexing, which is where a 16-bit virtual machine (with zero page registers?) would help. Obviously there is a trade-off between speed and memory size, but on a 6502 target both are an issue and it'd be useful to be able to choose - perhaps VM by default and native code for "fast" procedures or code sections.
A lot of the C library outside of math isn't going to be speed critical - things like IO and heap for example, and there could also be dual versions to choose from if needed. Especially for retrocomputing, IO devices themselves were so slow that software overhead is less important.
More often than not the slow IO devices were coupled with optimized speed critical code due to cost savings or hardware simplification. Heap is an approach that rarely works well on a 6502 machine - there are no 16 bit stack pointers and it's just slower than doing without - However I tend to agree that a middle ground 16 bit virtual machine is a great idea. The first one I ever saw was Sweet16 by Woz.
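For anyone who hasn't seen one, the core of a Sweet16-style VM is tiny; here's a minimal sketch in C for readability (a real one would be a few hundred bytes of 6502 assembly with the virtual registers in zero page; the opcodes and encoding here are made up):

    #include <stdint.h>

    enum { OP_LDI, OP_ADD, OP_HALT };

    uint16_t run(const uint8_t *code) {
        uint16_t reg[16] = {0};              /* 16 virtual 16-bit registers */
        for (;;) {
            uint8_t op = *code++;
            switch (op) {
            case OP_LDI: {                   /* LDI r, imm16 (little-endian) */
                uint8_t r = *code++;
                reg[r] = (uint16_t)(code[0] | (code[1] << 8));
                code += 2;
                break;
            }
            case OP_ADD: {                   /* ADD rdst, rsrc */
                uint8_t d = *code++;
                uint8_t s = *code++;
                reg[d] = (uint16_t)(reg[d] + reg[s]);
                break;
            }
            case OP_HALT:
                return reg[0];
            }
        }
    }

    /* e.g. { OP_LDI,0, 0xE8,0x03,  OP_LDI,1, 0xEA,0x00,  OP_ADD,0,1,  OP_HALT }
       returns 1000 + 234 = 1234 */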
I agree about heap - too much overhead to be a great approach on such a constrained target, but of course the standard library for C has to include it all the same.
Memory is better allocated in more of a customized application specific way, such as an arena allocator, or just avoid dynamic allocation altogether if possible.
I was co-author of Acorn's ISO-Pascal system for the 6502-based BBC micro (16KB or 32KB RAM) back in the day, and one part I was proud of was a pretty full featured (for the time) code editor that was included, written in 4KB of heavily optimized assembler. The memory allocation I used was just to take ownership of all free RAM, and maintain the edit buffer before the cursor at one end of memory, and the buffer content after the cursor at the other end. This meant that as you typed and entered new text, it was just appended to the "before cursor" block, with no text movement or memory allocation needed.
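That layout is what people now call a gap buffer; a minimal sketch in C of the scheme described (buffer size and names are illustrative):

    #define BUF_SIZE 4096
    static char buf[BUF_SIZE];
    static int before = 0;   /* text before the cursor, packed at the start of buf */
    static int after  = 0;   /* text after the cursor, packed at the end of buf    */

    /* insert at the cursor: just append to the "before" block, nothing moves */
    int insert_char(char c) {
        if (before + after >= BUF_SIZE) return 0;   /* buffer full */
        buf[before++] = c;
        return 1;
    }

    /* move the cursor left: shift one byte across the gap */
    void cursor_left(void) {
        if (before > 0) {
            after++;
            before--;
            buf[BUF_SIZE - after] = buf[before];
        }
    }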
> I think the ideal compiler for 6502, and maybe any of the memory-poor 8-bit systems would be one that supported both native code generation where speed is needed as well as virtual machine code for compactness.
Threaded code might be a worthwhile middle-ground approach that spans freely across the "native" and "pure VM interpreter" extremes.