The governance gap nobody is filling
Anthropics' internal debate about pausing self-improving AI development is a governance question at a civilisational scale. Your agency's version of the same question is more modest but structurally identical: who has the authority to say no, and at what point in the process?
Most teams have not answered this. They have adopted AI coding tools because the productivity gains are obvious, and they have assumed that existing code review processes will catch what needs catching. That assumption is under pressure.
Code review was designed for human-paced output. When a pull request contains 800 lines written in 40 minutes, the review is not happening with the same depth it would for 800 lines written over two days. The reviewer knows less about the reasoning behind each choice, because there was no visible reasoning process to observe. The model does not leave notes about what it considered and rejected.
A few things actually help here. First, shrink the review unit. Smaller PRs reviewed more often beat large PRs reviewed quickly. This requires slowing down the commit cadence, which feels counterintuitive when the whole point was to go faster. Do it anyway. Second, make the engineer who prompted the output explain the approach out loud before the review. Not the code, the approach. If they cannot, the code should not ship. Third, build explicit decision logs for anything architectural. Not a document nobody reads, a short record of what was considered, what was chosen, and why. AI can help write those too, but a human needs to confirm they are accurate.