How many times does review actually catch bugs? Why should review wotk better for AI generated code than human written? For both you need some rigorious, i.e. non-AI, systems ti catch bugs: tests and static code analysers. If you have too many bugs, up your testing. Use the tests to safely refactor provlematic code. Review is at best a way to align developers and train newbees, not Q.