Re:You can literally just download the whole site (Score 4, Informative)
My thoughts exactly! I have a few (very old) copies of Wikipedia hanging around somewhere. I should go torrent a fresh copy. Way back when, I used to keep a text-only copy on my phone (Kiwix, which appears to still be a thing) for when I didn't have data. I bet I still have that SD card somewhere. I think it was about 10GB uncompressed back then.
I guess it goes to show how stupid and greedy these AI companies are. I'm sure that a lot of the primary training data for most models *is* Wikipedia. So letting all these AI bots go nuts hitting the public servers over and over again for slightly updated content is just plain lazy. Grabbing diffs from a mirror every month and updating a local copy isn't even hard. Or they could just spend an infinitesimal amount of that VC money on a Wikipedia API subscription. Sheesh.
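For what it's worth, here's roughly what the "refresh a local copy once a month" part could look like. This is only a sketch: it assumes the usual dumps.wikimedia.org layout (point `BASE_URL` at a mirror instead), it re-downloads the full articles dump rather than applying actual diffs, and the names `dump_url`, `is_stale` and `refresh` are made up for illustration.

```python
import datetime as dt
import urllib.request
from pathlib import Path

# Assumption: the standard dump layout; replace with a mirror's base URL.
BASE_URL = "https://dumps.wikimedia.org"


def dump_url(wiki: str = "enwiki", base: str = BASE_URL) -> str:
    """Build the URL of the latest articles dump for a given wiki."""
    return f"{base}/{wiki}/latest/{wiki}-latest-pages-articles.xml.bz2"


def is_stale(path: Path, now: dt.datetime, max_age_days: int = 30) -> bool:
    """True if the local copy is missing or older than max_age_days."""
    if not path.exists():
        return True
    modified = dt.datetime.fromtimestamp(path.stat().st_mtime)
    return now - modified > dt.timedelta(days=max_age_days)


def refresh(path: Path, wiki: str = "enwiki") -> bool:
    """Download a fresh dump only if the local one is stale.

    Returns True if a download happened, False if the copy was recent enough.
    """
    if not is_stale(path, dt.datetime.now()):
        return False
    urllib.request.urlretrieve(dump_url(wiki), str(path))
    return True
```

Run `refresh(Path("enwiki-latest.xml.bz2"))` from a monthly cron job and the public servers see one request per month instead of a crawl.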