I do this with our AI PR review checks. We have AI review every PR and commits t...

stickfigure · 2026-05-10T05:10:49 1778389849

About 30% of our AI PR review checks are flawed at a fundamental level, based on poor comprehension of the whole. If I told the AI "fix everything and keep going until it's green" I would be terrified of the result.

I disbelieve this works in anything other than a toy codebase (or an incredibly fine-grained microservice).

The 70% is amazing! But a 30% failure rate requires intense supervision.