Perfect is the enemy of good. Current LLM systems plus "traditional tools" for scanning can get you pretty far toward catching the low-hanging fruit. Hell, I bet even a semantic search with small embedding models could give you good insight into whether what's in the release notes matches what's in the code. Anything suspicious can simply be flagged and delayed a few hours until a human can review it, or until additional checks run.
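For what it's worth, here's a minimal sketch of that kind of check, assuming the sentence-transformers package; the sample claims, diff summaries, threshold, and flagging logic are all illustrative placeholders, not a tuned pipeline.

```python
# Sketch: flag release-note claims with no semantically similar code change,
# so the release can be held a few hours for human review.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")  # small, CPU-friendly model

# Claims pulled from the release notes (hypothetical examples).
release_notes = [
    "Fix memory leak in the connection pool",
    "Add optional telemetry upload on startup",
]

# One-line descriptions of the actual diff hunks (hypothetical examples).
diff_summaries = [
    "pool.c: free idle connections when the pool shrinks",
    "http.c: retry failed requests with exponential backoff",
]

note_vecs = model.encode(release_notes, convert_to_tensor=True)
diff_vecs = model.encode(diff_summaries, convert_to_tensor=True)
similarity = util.cos_sim(note_vecs, diff_vecs)  # notes x diffs matrix

THRESHOLD = 0.4  # illustrative cutoff: below this, no diff plausibly backs the claim
for i, claim in enumerate(release_notes):
    best = similarity[i].max().item()
    if best < THRESHOLD:
        # Low-hanging-fruit heuristic: queue this claim for human review.
        print(f"FLAG for review: {claim!r} (best match {best:.2f})")
```

In this toy run the telemetry claim would have no matching diff and get flagged, which is exactly the "delay and let a human look" behavior described above.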
I'm not sure why everyone is so hostile. Your idea has merit, along the lines of a heuristic that triggers a human review as a follow-up. I'd be surprised if this isn't exactly the direction things go, though I doubt the tools will be given away for free; more likely they'll be made part of the platform itself, or offered as an add-on service.