Use the Archive's crawler (Score 2, Informative)
How about using Heritrix, the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler?
In a five year period we can get one superb programming language. Only we can't control when the five year period will begin.