My work has been adding more and more AI review bots. It's been like 0 for 10 for the feedback the AI has given me. Just wasting my time. I see where it's coming from, it's not utter nonsense, but it just doesn't understand the nuance or why something is logically correct.
That said, there have been some reviews where the AI flagged issues that were ignored and later became outages.
So... I don't know. Is it worth wading through 10 bad reviews if 1 good one prevents a bad bug? Maybe. I do hope the ratio gets better, though.
Yes, specifically when it comes to open-ended research or development, colocation is non-negotiable. There are greater-than-linear benefits in creativity of approach, agility in adapting to new intermediate discoveries, etc. that you get by putting a number of talented people who get along into the same space, where they form a community of practice.
Remote work and flattening communication down to what digital media (Slack, Zoom, etc) afford strangle the beneficial network effects.
I think they were talking about total time spent working rather than remote vs. in-person. I've seen more than a few studies over the years showing that going from 40 to 35 or 30 hours/wk has minimal or positive impacts on productivity. Idk if that would apply to all work environments though, and I don't recall any of the studies being about research productivity specifically.
You’re being downvoted but you’re right. The number of people who act like a webcam reproduces the in-person experience perfectly, for good and bad, is hilarious to me.
$89,000 GDP per capita vs $46,000 rather proves the point about productivity per butt. US office workers are extraordinarily productive in terms of what their work generates (thanks to numerous well understood things like the outsized US scaling abilities). Measuring beyond that is very difficult due to the variance of every business.
Weird take. Norway has about the same GDP per capita as the USA with stricter regulations than France. Ireland’s GDP per capita is higher than that of the USA, with less bureaucracy than France but more than the US. Not to mention that all of these are before adjusting for PPP. Almost as if GDP per capita is not a good measurement of productivity.
First, one should probably look at GNP (or even GNI) rather than GDP to reduce the distortionary impact of foreign direct investment, company headquarters for tax reasons, etc.
Next, one needs to distinguish between market exchange rates and PPP, as you highlight.
Lastly, these are all measures of output (per capita), while productivity is output per input, in this context output per hour worked. There the differences are less pronounced.
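To make that concrete: the GDP figures below are the ones quoted upthread, while the hours figures are purely illustrative placeholders, just to show the arithmetic.

```python
# Output per capita vs. output per hour worked.
gdp_per_capita_a = 89_000   # figure quoted upthread
gdp_per_capita_b = 46_000   # figure quoted upthread
hours_per_capita_a = 900    # hypothetical: annual hours worked / total population
hours_per_capita_b = 600    # hypothetical: fewer workers, shorter hours

print(gdp_per_capita_a / gdp_per_capita_b)          # ~1.93x gap per capita
print((gdp_per_capita_a / hours_per_capita_a)
      / (gdp_per_capita_b / hours_per_capita_b))    # ~1.29x gap per hour worked
```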
> $89,000 GDP per capita vs $46,000 rather proves the point about productivity per butt.
So if I work 24h/day on a farm in Afghanistan, I should earn more than software developers in Silicon Valley (because I'm pretty sure they sleep)? Is that how you're saying GDP works?
I think maybe we should completely switch to admitting this: every extra second you sit in the (home)office adds to productivity, it just doesn't necessarily convert into market value, which can be inflated by hype. Also, longer hours are not necessarily safe or sustainable.
We only wish that more time != more productivity because it would be inconvenient in multiple ways if it did. We imagine a multiplier in there to balance the equation, some factor that can completely negate production, using mere anecdotal experience as proof.
Maybe that's not scientific, maybe time spent very closely matches productivity, and maybe both production and productivity need external, artificial regulation.
> Every extra second you sit in the (home)office adds to productivity
I'm not sure I believe that. I think at some point the additional hours worked will decrease the output per unit of time, and eventually you'll reach a peak after which every extra hour worked leads to an overall productivity loss.
It's also something that I think is extremely hard to measure consistently, especially for your typical office worker.
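Just to make the shape of that claim concrete, a toy model (the constants are made up; this only shows that a peak can exist, it doesn't measure anything):

```python
# Toy model: each successive hour is a bit less productive than the last,
# so eventually an extra hour costs more (mistakes, rework) than it adds.
def total_output(hours, base=1.0, decay=0.015):
    return sum(base - decay * h for h in range(hours))

for h in (30, 40, 50, 60, 70, 80):
    print(h, round(total_output(h), 1))
# Total output rises, peaks around 67 hours with these made-up constants,
# then falls as the marginal hour goes negative. Where (or whether) that
# happens in real work is exactly the hard-to-measure part.
```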
Music players, including car radios and portable CD and MiniDisc players, did that around 25 years ago. It's sort-of a standard UI pattern for variable-length text in a fixed-size display.
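A toy version of that pattern, for anyone curious (width, timing, and the sample title are arbitrary):

```python
# Scroll a too-long title through a fixed-width "display" by sliding a
# window over the text and wrapping around, like those players did.
import time

def marquee(text, width=16, delay=0.3, gap="   "):
    if len(text) <= width:
        print(text)
        return
    looped = text + gap + text          # wrap-around source to slice from
    for i in range(len(text) + len(gap)):
        print("\r" + looped[i:i + width], end="", flush=True)
        time.sleep(delay)
    print()

marquee("Some Very Long Track Title - Some Artist")
```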
Yep. 17 year old me working alongside a 70 year old dude working the same job as me... I knew that's not what I wanted for my life.
That said, I think I've still wafted through life on tracks. I just concluded that FAANG was the next track after uni so I made it happen. Not sure I'm happy any more though. Maybe I need to reinvent myself.
That's not been my experience so far. LLMs are good at mimicking existing code; they don't usually bring in new things when not asked. Sometimes I have to go out of my way to point to other bits of code in the project to copy from because the model hasn't ingested enough of the codebase.
That said, a negative prompt like we have in Stable Diffusion would still be very cool.
I'm in the camp of 'no good for existing code'. I try to get ~1000-line files refactored to use different libraries, design paradigms, etc. and it usually outputs garbage - pulling db logic into the UI, grabbing unrelated api/function calls, or entirely just corrupting the output.
I'm sure there is a way to correctly use this tool, so I'm feeling like I'm "just holding it wrong".
Which LLM are you using? What LLM tool are you using? What's your tech stack that you're generating code for? Without sharing anything you can't, what prompts are you using?
I would suggest using an agentic system like Cline, so that the LLM can wander through the codebase by itself, do research, build a "mental model", and then set up an implementation plan. Then you iterate on that and hand it off for implementation. This flow works significantly better than what you're describing.
It doesn't need the entire codebase, it just needs the call map, the function signatures, etc. It doesn't have to include everything in a call - but having access to all of it means it can pick what seems relevant.
Yes, that's exactly right. The LLM gets a rough overview of the project (as you said, including function signatures and such) and will then decide what to open and use to complete/implement the objective.
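Something like this is all the map needs to be, e.g. for Python (a rough sketch using the ast module; the model gets this first, then asks for the full files it actually needs):

```python
# Build a compact "repo map": file paths plus class/function signatures only.
import ast
from pathlib import Path

def repo_map(root="."):
    lines = []
    for path in sorted(Path(root).rglob("*.py")):
        lines.append(str(path))
        tree = ast.parse(path.read_text())
        for node in ast.walk(tree):
            if isinstance(node, ast.ClassDef):
                lines.append(f"    class {node.name}")
            elif isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
                args = ", ".join(a.arg for a in node.args.args)
                lines.append(f"    def {node.name}({args})")
    return "\n".join(lines)

# Paste repo_map() output into the first prompt instead of the whole codebase.
print(repo_map())
```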
If your repo map fits into 1000 tokens then your repo is small enough that you can just concatenate all the files together and feed the result as one prompt to the LLM.
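i.e. something like this (rough sketch; the ~4 chars/token figure is just a rule of thumb, not an exact count):

```python
# Dump every source file into one prompt, with headers, and eyeball the size.
from pathlib import Path

def repo_as_prompt(root=".", exts=(".py",)):
    parts = []
    for path in sorted(Path(root).rglob("*")):
        if path.suffix in exts:
            parts.append(f"### {path}\n{path.read_text()}")
    prompt = "\n\n".join(parts)
    print(f"~{len(prompt) // 4} tokens")   # crude: ~4 characters per token
    return prompt
```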
No, current LLM technology does not allow processing actual (i.e. large) repos.
I've refactored some files over 6000 LOC. It was necessary to do it iteratively with smaller patches: "Do not attempt to modify more than one function per iteration." Otherwise it would just gloss over stuff. I would tell it repeatedly: "I noticed you missed something, can you find it?" I kept doing that until it couldn't find anything. Then I had to manually review and ask for more edits. Also lots of style guidelines and scope-limit instructions. In the end it worked fine and saved me hours of really boring work.
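The loop was roughly this shape (a hypothetical sketch: call_llm() stands in for whatever chat API or tool you use, and the DONE sentinel is something the instructions have to ask for; neither is a real API):

```python
# Iterative refactor: small patches, one function per round, nudge until done.
RULES = (
    "Refactor toward the stated goal. Do not attempt to modify more than one "
    "function per iteration. Follow the style guidelines. Stay within scope. "
    "Reply with only the word DONE when nothing is left to change."
)

def call_llm(messages):
    raise NotImplementedError  # plug in your provider/tool here (hypothetical)

def iterative_refactor(source, goal, max_rounds=30):
    messages = [{"role": "system", "content": RULES},
                {"role": "user", "content": f"Goal: {goal}\n\n{source}"}]
    for _ in range(max_rounds):
        reply = call_llm(messages)
        if reply.strip() == "DONE":
            break                      # still needs a manual review pass after this
        messages.append({"role": "assistant", "content": reply})
        messages.append({"role": "user",
                         "content": "I noticed you missed something, can you find it?"})
    return messages
```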
I'll back this up. I feel constantly gaslit by people who claim they get good output.
I was hacking on a new project and wanted to see if LLMs could write some of it. So I picked an LLM friendly language (python). I picked an LLM friendly DB setup (sqlalchemy and postgres). I used typing everywhere. I pre-made the DB tables and pydantic schema. I used an LLM-friendly framework (fastapi). I wrote a few example repositories and routes.
I then told it to implement a really simple repository and routes (users stuff) from a design doc that gave strict requirements. I got back a steaming pile of shit. It was utterly broken. It ignored my requirements. It fucked with my DB tables. It fucked with (and broke) my pydantic. It mixed db access into routes which is against the repository pattern. Etc.
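For reference, the shape I was after, roughly (a minimal sketch, not my actual code; the model and session names are made up, and it assumes Pydantic v2 plus SQLAlchemy 2.0-style queries):

```python
# Repository pattern: the route orchestrates, the repository owns all DB access.
from fastapi import APIRouter, Depends, HTTPException
from pydantic import BaseModel, ConfigDict
from sqlalchemy import select
from sqlalchemy.orm import Session

from myapp.db import get_session   # hypothetical session dependency
from myapp.models import User      # hypothetical SQLAlchemy model

class UserOut(BaseModel):
    model_config = ConfigDict(from_attributes=True)
    id: int
    email: str

class UserRepository:
    """Only this class touches the session; routes never build queries."""
    def __init__(self, session: Session):
        self.session = session

    def get(self, user_id: int):
        return self.session.scalar(select(User).where(User.id == user_id))

router = APIRouter()

@router.get("/users/{user_id}", response_model=UserOut)
def read_user(user_id: int, session: Session = Depends(get_session)):
    user = UserRepository(session).get(user_id)
    if user is None:
        raise HTTPException(status_code=404, detail="User not found")
    return user
```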
I tried several of the best models from Claude, OpenAI, xAI, and Google. I tried giving them different prompts. I tried pruning unnecessary context. I tried their web interfaces and I tried Cursor and Windsurf and Cline and Aider. This was a pretty basic task I expect an intern could handle. They couldn't.
Every LLM enthusiast I've since talked to just gives me the run-around on tooling and prompting and whatever. "Well maybe if you used this eighteenth IDE/extension." "Well maybe if you used this other prompt hack." "Well maybe if you'd used a different design pattern."
The fuck?? Can vendors not produce a coherent set of usage guidelines? If this works so well, why isn't there a set of known best practices? Why can't I ever replicate this? Why don't people publish public logs of their interactions to prove it can do this beyond a "make a bouncing ball web game" or basic to-do list app?
Prisma and Drizzle... those gave me a bit too much heck. Kysely is close enough to SQL while offering some benefits, typings being one of them, but also query builders are often helpful when I need to run subtle variations of the same query, e.g. depending on the user's permissions or to add search filters.
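The "subtle variations" bit is basically conditional composition; here's the same idea sketched with SQLAlchemy, since Kysely is TypeScript and the other code in this thread is Python (the model and fields are made up):

```python
# Build one base query and bolt on filters depending on permissions/search.
from sqlalchemy import select
from myapp.models import Document   # hypothetical model

def documents_query(user, search=None):
    query = select(Document)
    if not user.is_admin:                 # permission-dependent variation
        query = query.where(Document.owner_id == user.id)
    if search:                            # optional search filter
        query = query.where(Document.title.ilike(f"%{search}%"))
    return query
```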