Hacker News new | past | comments | ask | show | jobs | submit | ripvanwinkle's comments login

shouldn't the comparison be with gpt4o or 4.5 and not 4.1 or o3

a well written postmortem and it raised my confidence in their product in general


I think this is the best april fools article I've seen


Very cool to have his kids closely involved in his work


Its cyclical, a lot of companies over hired and are trying hard to cut down


Cyclical processes and appallingness aren’t mutually exclusive. In fact they may well be related.


about time. you also need a clawback provision since it can take a while for flaws to be detected and the execs could be in new jobs by then.


As a newish user of Apple (Macbook and the IPad mini) it was not as big a leap from Android and Windows as I had feared. I still live in Google services including Gboard on the IPad mini and apart from klutzing around with system settings occasionally the mini feels "not terribly different" from the android devices I use. The Macbook is a bigger challenge though.

I only picked the mini because I couldn't get the same performance with that form factor in Android.

I only entertained the mini because I was forced to use a Macbook for work and realized that apart from annoyances with keyboard shortcuts and system settings I could continue to live in a Firefox + Chrome + Edge + Google services ecosystem.

I will now definitely consider Apple hardware if I don't find a good fit in the Android + Windows world


And yet, in Apple’s preferred world, they suck up 30% of all of the revenue made by developers who develop for their devices. The Mac model may not exist in 10 years if Apple can get rid of it and replace it with a locked down App Store from which they charge rents.


It would be interesting to feed it a formal language specification of some language it hasn't seen and then ask it write code and see how it does.

That could be a test of reasoning and reading comprehension


I've been thinking about a benchmark designed this way for a while. It doesn't even need to be code, particularly, it could be basic reasoning problems. The key is that you define a new, random language that has never before been seen (maybe it has statistical similarity to existing languages, maybe not), create a translation key, then ask a question in that language.


Reasoning vs being a completion engine (I could make a guess at how well that would work)


Reasoning is a form of completion (logical), the problem is that LLMs aren't language agnostic in their learned semantic reasoning.


>> AI is in the solution-space, not the problem-space,

this feels like an oversimplification. Its like saying the internet was in the solution space, companies were still selling things and now they needed to use the internet as well.

I think there's a ton of new scenarios that open up some are already underway like self driving cars and others further down like home robots that function like butlers.

Also companies that leverage AI in the best possible way for a domain will differentiate and that does open up the possibility of disrupting incumbents. It may get to be that to set up a business you pay for Data and the Model until you have critical mass to generate your own data.


Tesla ought to have C'en this at the start


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: