
ripvanwinkle · 2025-05-23T05:50:22

Shouldn't the comparison be with GPT-4o or 4.5, not 4.1 or o3?

ripvanwinkle · 2025-05-02T20:06:34

A well-written postmortem, and it raised my confidence in their product in general.

ripvanwinkle · 2025-04-01T19:31:33

I think this is the best April Fools' article I've seen.

ripvanwinkle · 2025-01-18T18:41:55

Very cool to have his kids closely involved in his work

ripvanwinkle · 2024-10-29T05:53:49

It's cyclical. A lot of companies overhired and are trying hard to cut down.

aaronbrethorst · 2024-10-29T05:55:13

Cyclical processes and appallingness aren’t mutually exclusive. In fact they may well be related.

ripvanwinkle · on May 3, 2024

About time. You also need a clawback provision, since it can take a while for flaws to be detected and the execs could be in new jobs by then.

ripvanwinkle · on April 27, 2024

As a newish user of Apple (MacBook and the iPad mini), it was not as big a leap from Android and Windows as I had feared. I still live in Google services, including Gboard on the iPad mini, and apart from klutzing around with system settings occasionally, the mini feels "not terribly different" from the Android devices I use. The MacBook is a bigger challenge though.

I only picked the mini because I couldn't get the same performance with that form factor in Android.

I only entertained the mini because I was forced to use a MacBook for work and realized that, apart from annoyances with keyboard shortcuts and system settings, I could continue to live in a Firefox + Chrome + Edge + Google services ecosystem.

I will now definitely consider Apple hardware if I don't find a good fit in the Android + Windows world.

phmqk76 · on April 27, 2024

And yet, in Apple’s preferred world, they suck up 30% of all of the revenue made by developers who develop for their devices. The Mac model may not exist in 10 years if Apple can get rid of it and replace it with a locked down App Store from which they charge rents.

ripvanwinkle · on March 31, 2024

It would be interesting to feed it a formal specification of a language it hasn't seen, then ask it to write code and see how it does.

That could be a test of reasoning and reading comprehension.

CuriouslyC · on March 31, 2024

I've been thinking about a benchmark designed this way for a while. It doesn't even need to be code, particularly, it could be basic reasoning problems. The key is that you define a new, random language that has never before been seen (maybe it has statistical similarity to existing languages, maybe not), create a translation key, then ask a question in that language.
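A minimal sketch of that idea, assuming the simplest possible "new language": a word-for-word substitution with random pseudo-words. The function names, the fixed pseudo-word length, and the sample question are all hypothetical, and a real benchmark would likely need new grammar as well as new vocabulary.

```python
import random
import string


def make_translation_key(words, seed=0, length=5):
    """Map each distinct word to a unique random pseudo-word (illustrative scheme)."""
    rng = random.Random(seed)
    vocabulary = set(words)
    used = set()
    key = {}
    for word in sorted(vocabulary):
        while True:
            candidate = "".join(rng.choice(string.ascii_lowercase) for _ in range(length))
            # Avoid collisions with other pseudo-words and with the source words.
            if candidate not in used and candidate not in vocabulary:
                break
        used.add(candidate)
        key[word] = candidate
    return key


def translate(sentence, key):
    """Translate a whitespace-tokenized sentence word by word."""
    return " ".join(key[word] for word in sentence.split())


def build_prompt(question, key):
    """Combine the translation key and the translated question into one prompt."""
    glossary = "\n".join(f"{new} = {old}" for old, new in sorted(key.items()))
    return f"Translation key:\n{glossary}\n\nQuestion:\n{translate(question, key)}"


# Hypothetical sample question; any reasoning problem would do.
question = "if all cats are animals and tom is a cat is tom an animal"
key = make_translation_key(question.split(), seed=42)
prompt = build_prompt(question, key)
```

The prompt string would then be sent to the model, and its answer translated back with the inverted key before scoring.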

ape4 · on March 31, 2024

Reasoning vs being a completion engine (I could make a guess at how well that would work)

CuriouslyC · on March 31, 2024

Reasoning is a form of (logical) completion; the problem is that LLMs aren't language-agnostic in their learned semantic reasoning.

ripvanwinkle · on March 3, 2024

>> AI is in the solution-space, not the problem-space,

This feels like an oversimplification. It's like saying the internet was in the solution space: companies were still selling things, and now they needed to use the internet as well.

I think there's a ton of new scenarios that open up. Some are already underway, like self-driving cars, and others are further down the road, like home robots that function like butlers.

Also, companies that leverage AI in the best possible way for a domain will differentiate, and that does open up the possibility of disrupting incumbents. It may get to the point that, to set up a business, you pay for the data and the model until you have the critical mass to generate your own data.

ripvanwinkle · on Feb 13, 2024

Tesla ought to have C'en this at the start