More

tunesmith · 2025-09-25T19:35:29 1758828929

Data point: I run a site where users submit a record. There was a request months ago to allow users to edit the record after submitting. I put it off because while it's an established pattern it touches a lot of things and I found it annoying busy work and thus low priority. So then gpt5-codex came out and allowed me to use it in codex cli with my existing member account. I asked it to support edit for that feature all the way through the backend with a pleasing UI that fit my theme. It one-shotted it in about ten minutes. I asked for one UI adjustment that I decided I liked better, another five minutes, and I reviewed and released it to prod within an hour. So, you know, months versus an hour.

paulryanrogers · 2025-09-25T22:55:56 1758840956

Is the hour really comparable to months spent not working on it?

rpdillon · 2025-09-25T23:10:06 1758841806

He's referring to the reality that AI helps you pick up and finish tasks that you otherwise would have put off. I see this all day every day with my side projects as well as security and customer escalations that come into my team. It's not that Giant Project X was done six times as fast. It's more like we were able to do six small but consequential bug fixes and security updates while we continued to push on the large feature.

jama211 · 2025-09-26T19:33:31 1758915211

“If you make a machine that can wash the dishes in an hour, is that more productive than not doing the dishes for months?” - Yes! That’s the definition of productivity!! The dishes are getting done now and they weren’t before! lol

tunesmith · 2025-08-19T22:52:43 1755643963

Too bad it's misogynistic. I'm not sure you already knew that. If I were rude enough to call you a name, I wonder what term I could use that would work either way!

bezier-curve · 2025-08-19T22:58:17 1755644297

As a gay man, I disagree. I think it's a word that describes someone who doesn't consider others without realizing it.

gausswho · 2025-08-19T23:13:14 1755645194

You could try scumbag, but you might not want to know what it originally meant.

tunesmith · 2025-08-11T03:32:15 1754883135

For those who don't know, OpenAI Codex CLI will now work with your ChatGPT plus or pro account. They barely announced it but it's on their github page. You don't have to use an api key.

tunesmith · 2025-08-08T18:26:56 1754677616

I felt like it was getting somewhere and then it pivoted to the stupid graph thing, which I can't seem to escape. Anyway, I think it'll be really interesting to see how this settles out over the next few weeks, and how that'll contrast to what the 24-hour response has been.

My own very naive and underinformed sense: OpenAI doesn't have other revenue paths to fall back on like Google does. The GPT5 strategy really makes sense to me if I look at this as a market share strategy. They want to scale out like crazy, in a way that is affordable to them. If it's that cheap, then they must have put a ton of work in to some scaling effort that the other vendors just don't care about as much, whether due to loss-leader economics or VC funding. It really makes me wonder if OpenAI is sitting on something much better that also just happens to be much, much more expensive.

Overall, I'm weirdly impressed because if that was really their move here, it's a slight evidence point that shows that somewhere down in their guts, they do really seem to care about their original mission. For people other than power users, this might actually be a big step forward.

theahura · 2025-08-08T22:52:11 1754693531

I agree that they don't have other revenue paths and think that's a big issue for them. I disagree that this means they care about their original mission though; I think they're struggling to replicate their original insane step function model improvements and other players have caught up.

If you liked the general analysis of OpenAI and the AI space, you may appreciate https://open.substack.com/pub/theahura/p/tech-things-gemini-... Or https://open.substack.com/pub/theahura/p/tech-things-gpt-pro...

Which focus much more on macro analysis and less on memes

tunesmith · 2025-08-07T18:21:32 1754590892

The 2h 15m is the length of tasks the model can complete with 50% probability. So longer is better in that sense. Or at least, "more advanced" and potentially "more dangerous".

tunesmith · 2025-08-01T22:05:39 1754085939

What's the thing you should replace Jenkins with at scale?

flakes · 2025-08-01T23:30:44 1754091044

Im a firm believer that there will never be a perfect general purpose job scheduler. The priority for how jobs are scheduled is always deeply coupled to your business needs. General purpose schedulers always end up as a jack of all trades but master of none. With a custom built scheduler you get that control, but do have to re-invent the wheel for a lot of features. Jenkins, Argo, Airflow, Cron, etc, all have their own pros and cons.

tunesmith · 2025-08-01T19:14:59 1754075699

They should invent a programming language that only compiles if the why is still true.

9rx · 2025-08-01T20:07:02 1754078822

They have, but they're beyond grasp of most developers.

Tests were invented to express the "why" for the normal guy. They don't strictly prevent compilation, but a proper workflow will see them halt your process in the same way, offering the same outcome.

Granted, there are a lot of horribly written tests out there that don't tell you "why" — or, well, anything. As always, people will find a way to abuse anything you put in front of them. But when used well...

tunesmith · 2025-08-01T22:03:53 1754085833

With a test, it might link up some functionality with "why" and pass, but then what happens if a business requirement just isn't a requirement anymore? The test will still pass. I'm thinking of something sillier, like a language that forces you to justify why for your code, and then regularly quizzes you if the business reasoning is still true. If anything changes, it rips out the code and breaks your site. :) So then you have to go in to fix it.

I'd also love it if this were applied to politics and laws.

9rx · 2025-08-02T16:09:39 1754150979

It wouldn't be too hard to add such logic to your tests. If it proves useful, someone will no doubt turn it into a language feature.

tunesmith · 2025-08-01T02:12:53 1754014373

Not the authorities. The sources.

tunesmith · 2025-07-30T23:57:11 1753919831

Yeah this seems like the specification/implementation problem. One can perfectly implement a bad spec, but coming up with the perfect spec is a human problem.

a_cardboard_box · 2025-08-03T14:34:16 1754231656

It's possible you could at least mitigate the problem by checking that what you've proven isn't trivial. If you slightly change a mathematical statement, it frequently becomes either trivially true or trivially false. So if you accidentally proved the wrong thing, there's a good chance that your proof can be shortened to a point that it becomes obviously wrong. For example, if you accidentally put "there exists" instead of "for all" in Fermat's last theorem, the proof is 1^3 + 1^3 != 1^3. That is obviously too short to prove FLT - it would have fit in Fermat's margin.

cubefox · 2025-08-01T09:35:52 1754040952

It's interesting that this rather fundamental problem doesn't seem to have an established name and Wikipedia article. At least I couldn't find one. Perhaps the problem is too obvious.

tunesmith · 2025-07-30T20:31:54 1753907514

It's just that they take turns. The top rivals in darts, or pool, or any other turn-taking game is the same way. They have to react to what their rival left them.

lordnacho · 2025-07-30T21:46:46 1753912006

> there are people on both teams who really are tackling those guys.

This is the critical distinction.

Peyton isn't deciding the defense when Brady is on offense. Brady isn't reacting to whatever Peyton left on the pool table or the dart board.