"1) While the FLOPs per megawatt ratio is going to change over time, the limiting factor right now isn't compute per se, it's the ability to get power to the computers."
Citation please. Obviously, billionaires want free power and are making a point of this, but so what? Furthermore, while the watt is a unit of power, it is NOT a measure of inference.
"But the fact that memory bandwidth is a problem doesn't make FLOPS any less obsolete."
FLOPS are not obsolete, nor have FLOPS ever been the sole measure of computing capability. They are one measure, and one chosen here by people like you for the sole purpose of arguing against it. The topic is megawatts, the dumbest possible way of expressing inference capability.
"What matters for AI inference is being able to do 50 TFLOPS across tens of gigs of RAM (or hundreds?)."
Which is why FLOPS are not obsolete, despite your ignorant claim.
"But we don't have a good unit-of-measure for that. But we do have a perfectly good unit-of-measure of today's limiting factor: gigawatts."
Sure we do, even if you don't, and gigawatts are NOT a measure of today's limiting factor, just as horsepower is not a measure of how many freeways we have.