This is no different from scraping images (DALL-E), code (GitHub Copilot), text (ChatGPT), Facebook data (Cambridge Analytica), news, etc. All of this is content that requires licensing and protections to access. A lot of this has been happening for some time, but with the recent leaps in capabilities and productization, it is now front and centre.
This will definitely backfire. It is also bad timing for Google, given the recent attention on people moving to Bing.
Honestly, why do we still need a massive indexer like Google? Just as the HTML5 standard killed Flash, we need a metadata standard for search, and we should create some type of hierarchical, distributed search across the internet.