Why would you assume "a bad computer program with a dozen eyes is likely to be better than a bag of meat with only two"?
I'm not up on state of the art on computer image/object recognition but the experience I have from about 10 years ago leads me to believe that there are still challenges to be solved, especially when it comes to recognizing movements and intentions. As a driver, some of the cues I rely on include turning indicators, wheel positions, other driver/pedestrian/cyclist eye contact as well as sounds.
I'm not saying that the challenge of coming up with software that allows a car to autonomously drive itself better than a human isn't possible. I just challenge the assertion that a computer with multiple cameras is likely superior to a human.