Comment Re:JPL??? (Score 1) 21
Personal opinion: As others noted, JPL has expertise in handling incredibly large amounts of data, generally. JPL also has expertise in web-crawling (https://github.com/nasa-jpl-memex) and in open source file parsers. Specifically, the Apache Tika project (https://tika.apache.org/) was co-founded by Chris Mattmann, a JPL'er.