Agents __SHOULD NOT__ verify their own code. They know they wrote it, and they a...

iagooar · 2025-09-23T21:54:42 1758664482

Codex uses this principle - /review runs in a subthread, does not see previous context, only git diff. This is what I am using. Or I open Cursor to review code written by GPT-5 using Sonnet.

daxfohl · 2025-09-23T21:58:04 1758664684

Do you have examples of this working, or any best practices on how to orchestrate it efficiently? It sounds like the right thing to do, but it doesn't seem like the tech is quite to the point where this could work in practice yet, unless I missed it. I imagine multiple agents would churn through too many tokens and have a hard time coming to a consensus.

CuriouslyC · 2025-09-24T00:21:43 1758673303

I've been doing this with Gemini 2.5 for about 6 months now. It works quite well, it doesn't catch big architectural 100% but it's very good at line/module level logic issues and anti-patterns.