It's super intelligent but it can't be bothered to run tests unless specifically...

simonw · 2026-02-13T17:42:33 1771004553

Personally I prefer my agents not to run random commands on my machine without me telling them to first.

Imagine you just cloned some random project from GitHub and fired up Claude Code in that folder, but it turned out to be malicious and running 'npm test' stole all your files.

suddenlybananas · 2026-02-14T08:56:23 1771059383

If it's super intelligent, surely it could glance at tests before running them and figure out whether it was malicious or not.

simonw · 2026-02-14T13:44:44 1771076684

Tests have dependencies. Crawling all of those dependencies to check for malicious code could require inspecting millions of lines of code, if you could even obtain the code.

It's also beginning to sound like needing to solve the halting problem.

suddenlybananas · 2026-02-14T17:11:26 1771089086

Come on man. You're being unserious here.

simonw · 2026-02-14T17:32:03 1771090323

I'm really not. You're the one arguing about a "super intelligent" strawman.

suddenlybananas · 2026-02-15T13:18:47 1771161527

Look, I know you have a lot invested in this project but I don't see why you think it is somehow unreasonable to expect an AI agent to run tests in a repository. You don't need super intelligence for that.

simonw · 2026-02-15T15:15:38 1771168538

Of course I want agents to run tests in a repository - I do that all the time.

I don't want the agent to run tests in a new repository until I've given it the go-ahead to do that.