Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Map the Internet... In One Day?

Posted by jamie on Fri Nov 14, 2003 03:09 PM
from the graph-paper dept.
rjbrown99 writes "There have been numerous stories over the past few years on Bill Cheswick's Internet Mapping Project. The Lumeta folks even created a company out of it. Well, now there is a competitor. A single guy with a single computer is working to accomplish the same feat - within ONE DAY and using open-source tools to do it. The new project is called Opte and can be found at www.opte.org." He's made some progress and is looking for volunteers.
This discussion has been archived. No new comments can be posted.
Map the Internet... In One Day? | Log In/Create an Account | Top | 263 comments (Spill at 50!) | Index Only | Search Discussion
Display Options Threshold:
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • This server will die ! (Score:5, Informative)

    by Anonymous Coward on Friday November 14 2003, @03:10PM (#7476252)
    Who
    This project was started by me (Barrett Lyon) as a response to a conversation with my colleagues at Network Presence. Over a lunch we were discussing William Cheswick and Hal Burch's Internet Mapping Project. I was not very impressed with the results of their project, they produce beautiful maps but they don't seem to be very useful nor do they release their code freely. Their mapping also takes nearly six months to generate a single map. My comment was that, "I can write a program that can map the entire net in a single day." The comment was met with some hostility. Thus, this project was born.

    What
    The goal of this project is to use a single computer and single Internet connection to map the location of every single class C network on the Internet. It is obvious that the Internet is not routed as a bunch of class-c networks, but it is easy to see that by treating the Internet IP space as a bunch of class C networks, it will be possible to make a detailed map of the entire Internet. The global Internet address space currently offers 32 bits worth of unique host addresses, or a theoretical maximum of 2^32=4,294,967,296 hosts. In reality, the address space has been allocated in fairly large contiguous blocks, which renders strictly optimal utilization difficult. The smallest block that is logically routed via BGP or allocated by ARIN is a class C network (CIDR /24.)
    At the rate of 194 traceroutes per-second it is possible to scan the entire theoretical 2^24 space within a single day. Thus about 16,777,216 class C networks could be processed by a single computer in a single day. Yet, there are huge portions of network blocks that are no longer used, many network blocks fall into the RFC 1918 standard and other blocks that are reserved by ARIN.

    According to ARIN there are about 47 class A networks in the reserved status (search ARIN for OrgName "Internet Assigned Numbers Authority".) Doing the math results in a reduction of 3,080,192 class C blocks to be removed from the scan list, leaving us with a theoretical list of 13,697,024 blocks.

    Applying some additional thought large portions of the 13.7 Million blocks may route to the same place. By testing about 20 routes at random within a class B and comparing the results, it is possible to see if there are multiple routes worth investigating or if the entire thing goes to the same place. By applying that logic it increases the speed of the scanning.

    After some testing and beta code I proved that with enough bandwidth it is possible to scan the entier Internet with a single computer. The 1/5th of the Internet map only took about 2 hours to create, yet it generated nearly 200k/sec of traffic and put my machine at a load of 60+ while scanning. If you apply the math, the entire internet would take about 10 hours to scan and another hour or two for the visual map output.

    I found a lot of value in the project, so after the proof of concept was completed I continued to program. I turned the entire system into a distributed client/server model. The clients request a chunk of random IP space from the server and when it is completed the IP space is registered with the server. This is done until all of the IP space has been scanned. I'm also working on a stats system so I can monitor the productivity of the different scanning nodes and users involved in the project.

    By taking a more distributed approach the data will look more like the real Internet. It will show more of the backup routes, more of the smaller links in different countries, etc. When the first version of the code is done I should have about 5 to 10 different scanning nodes running on the Internet. If you would like to donate a computer and some bandwdith to this project, please contact me. I can give credit where credit is due!

    When
    The first scanning tests began in late October 2003 and I wish to have the project generate a new map every week.

    Where
    Currently the project is hosted in San Francisco on a multi-homed fiber ba
  • Here ya' go... (Score:3, Funny)

    by swordboy (472941) on Friday November 14 2003, @03:10PM (#7476255)
    (Last Journal: Monday December 08 2003, @09:32PM)
    Several maps of the internet right here [google.com]
  • He needs more bandwidth (Score:5, Funny)

    by defMan (175410) on Friday November 14 2003, @03:12PM (#7476268)
    I am in serious need of more bandwidth and hardware power. If anyone has a Co-Located system on a nice network to donate to this project for a few months, I would be very happy!

    Slashdotting was never easier!
  • by bcolflesh (710514) on Friday November 14 2003, @03:12PM (#7476272)
    (http://www.redstream.org/)
    Go past the burnt-out Cray and then right at the Commodore64 Contiki server - you'll see my drive lights.
  • Are we overlooking something? by bigjnsa500 (Score:1) Friday November 14 2003, @03:13PM
  • Things look good so far... by sk3tch (Score:2) Friday November 14 2003, @03:13PM
  • It has to be asked.... by SpaceCadetTrav (Score:1) Friday November 14 2003, @03:15PM
  • Ok heres my part... (Score:5, Funny)

    by GoofyBoy (44399) on Friday November 14 2003, @03:15PM (#7476307)
    (Last Journal: Monday October 11 2004, @09:43PM)

    IP Address: 127.0.0.1
    Computer: The one from Microsoft with the Start button in the bottom left hand corner.
    Location: my bedroom.
  • His Map Is Wrong (Score:5, Funny)

    by DoctorMabuse (456736) * on Friday November 14 2003, @03:15PM (#7476308)
    (http://www.advantura.com/)
    SCO IPs are in the Mordor address space.
  • Great! by phaetonic (Score:1) Friday November 14 2003, @03:16PM
    • Re:Great! by gregfortune (Score:3) Friday November 14 2003, @03:44PM
    • Re:Great! by Orien (Score:1) Friday November 14 2003, @03:57PM
      • Re:Great! by The Original Atrox (Score:1) Friday November 14 2003, @07:00PM
    • 1 reply beneath your current threshold.
  • Been there, done that (Score:4, Informative)

    by Fux the Penguin (724045) on Friday November 14 2003, @03:17PM (#7476322)
    (Last Journal: Wednesday July 28 2004, @08:06PM)
    Okay, yes, I fully admit that it's cool to map the internet in one day. Regardless...I think I hear about some internet every other day.

    There's John Quarterman [mids.org] who's been doing it for years, and then the CAIDA [caida.org] visualization tools, and Cybergeography [cybergeography.org] and the Internet weather report [internetweather.com] and damn maps [mapblast.com] and more maps [mapquest.com].

    Note to everyone: please stop mapping the internet.
  • He should try this: by Anonymous Coward (Score:1) Friday November 14 2003, @03:17PM
  • Slashdotted ! (Score:5, Funny)

    by Goody (23843) on Friday November 14 2003, @03:17PM (#7476327)
    (http://technocrat.net/ | Last Journal: Monday March 29 2004, @12:53PM)
    Well, there's one less server to map...

  • Perhaps he should aim for something lower... by SageMadHatter (Score:1) Friday November 14 2003, @03:17PM
  • What can is the purpose of it ? by zymano (Score:1) Friday November 14 2003, @03:18PM
    • 1 reply beneath your current threshold.
  • by burgburgburg (574866) <splisken06@email.cUMLAUTom minus punct> on Friday November 14 2003, @03:18PM (#7476340)
    half a day with a broken computer, dial-up access and a guy with no hands.

    Top that!

  • slashdotted by nate1138 (Score:1) Friday November 14 2003, @03:20PM
  • Distributed effort? by kamukwam (Score:1) Friday November 14 2003, @03:21PM
  • zerg by Lord Omlette (Score:1) Friday November 14 2003, @03:22PM
  • And one day... by Lugor (Score:2) Friday November 14 2003, @03:24PM
  • Creepy (Score:5, Interesting)

    by Seanasy (21730) on Friday November 14 2003, @03:25PM (#7476403)

    When I first saw the image on the right [opte.org] it looked like a human brain. It would be creepy if the Internet had a sort of fractal self-similarity to our physiology.

    • Mod Parent Up by handy_vandal (Score:2) Friday November 14 2003, @03:30PM
    • Re:Creepy by Anonymous Coward (Score:2) Friday November 14 2003, @03:46PM
    • Re:Creepy by zangdesign (Score:3) Friday November 14 2003, @03:54PM
      • Re:Creepy by The Cydonian (Score:2) Saturday November 15 2003, @02:05AM
    • Gaaassspp! by RandomHavoc (Score:1) Friday November 14 2003, @04:24PM
    • Re:Creepy by Zork the Almighty (Score:2) Friday November 14 2003, @11:45PM
    • My thoughts exactly. by The Cydonian (Score:2) Saturday November 15 2003, @02:02AM
      • 1 reply beneath your current threshold.
    • 1 reply beneath your current threshold.
  • hostip.info by Space cowboy (Score:2) Friday November 14 2003, @03:25PM
    • Re:hostip.info by sulli (Score:2) Friday November 14 2003, @03:39PM
    • Re:hostip.info by harmonica (Score:2) Friday November 14 2003, @03:52PM
      • Re:hostip.info by Space cowboy (Score:2) Friday November 14 2003, @04:04PM
        • Re:hostip.info by Bromrrrrr (Score:1) Friday November 14 2003, @04:36PM
          • Re:hostip.info by Space cowboy (Score:2) Friday November 14 2003, @05:31PM
  • You're Welcome by Stalemate (Score:2) Friday November 14 2003, @03:25PM
  • Fantastic. by ToadSprocket (Score:2) Friday November 14 2003, @03:25PM
  • It's too easy... (Score:5, Funny)

    by Fux the Penguin (724045) on Friday November 14 2003, @03:30PM (#7476444)
    (Last Journal: Wednesday July 28 2004, @08:06PM)
    A single guy with a single computer...

    He's mapping the Internet. Why am I not surprised he's single?
  • rsync (Score:3, Interesting)

    Why can't somebody just rsync the Google search cluster? Wouldn't it have the same results this guy is looking for?
    • 1 reply beneath your current threshold.
  • Eh? by dstillz (Score:1) Friday November 14 2003, @03:32PM
  • Cool, best DDOS ever by sammyo (Score:1) Friday November 14 2003, @03:32PM
  • 1 + Volunteers = 1 by Infonaut (Score:2) Friday November 14 2003, @03:32PM
  • Bugs Bunny by bigjnsa500 (Score:2) Friday November 14 2003, @03:34PM
  • Patent by Mamoth (Score:1) Friday November 14 2003, @03:38PM
  • the form by Feyr (Score:2) Friday November 14 2003, @03:39PM
    • Re:the form by chemicallyreliant (Score:1) Friday November 14 2003, @05:58PM
      • Re:the form by Feyr (Score:2) Friday November 14 2003, @06:42PM
    • 1 reply beneath your current threshold.
  • It will already be outdated by ToKsUri (Score:1) Friday November 14 2003, @03:44PM
  • Forget mapping, the whole net just visited by lb746 (Score:1) Friday November 14 2003, @03:44PM
  • by hburch (98908) on Friday November 14 2003, @03:47PM (#7476590)
    As a side comment, now I understand why my connection got so slow.

    [Internet Mapping Project's] mapping also takes nearly six months to generate a single map. My comment was that, "I can write a program that can map the entire net in a single day."

    The Internet Mapping Project maps the Internet in under two hours (105 minutes for this morning's run). I'm not certain where the six months came from. The rate limitation is the packet rate limit we set (500 packet per second).

    Map layout time is not included in that time, but that is not done on a daily basis. A map layout take about six hours, as I recall. It only took a couple weeks to produce all the layouts necessary for a movie of the Internet [lumeta.com] from Aug 1998 to Jan 2001 based on the daily runs.

    CAIDA [caida.org] also creates daily maps of the Internet as part of their Skitter project. Their schedule varies between measurement points. In addition, other projects, such as the Mercator project and the RocketFuel projects, also map or did map the Internet.

    Each project has slightly different goals. Skitter focuses on paths to major web and DNS servers. Mercator attempted to discover networks with limited pre-knowledge. RocketFuel wants a very accurate map of a particular ISP. The Internet Mapping Project is focused on the router connectivity within and between public backbones.
  • He's not gonna get very far... by dacarr (Score:2) Friday November 14 2003, @03:50PM
  • I have considered something similar (Score:5, Interesting)

    by fv (95460) * <fyodor@insecure.org> on Friday November 14 2003, @03:53PM (#7476640)
    (http://insecure.org/)
    As the author of the free Nmap [insecure.org] ("Network Mapper") tool, I have also considered creating a map of the entire Internet. I would have focused on end hosts (where they are, what operating systems and services they run, trending, etc.) instead of routing. Rather than try this from a single high-bandwidth machine (as with Opte), I was going to take a distributed approach. I would release a P2P-like application that users could run and each scan small sections of network space to be contributed to the global database. The app would be called Nmapster :). I also liked to think about it as a "caching service", so that you don't have to spend the time rescanning the Microsoft network if someone else has done so in the last N hours.

    Then I came to my senses and decided to work on more practical and less controversial projects such as Nmap Version Detection [insecure.org]. But the subversive in me still hasn't given up entirely on Nmapster :).

    -Fyodor

  • Brilliant! (Score:3, Funny)

    by Quixadhal (45024) on Friday November 14 2003, @03:56PM (#7476655)
    (Last Journal: Friday May 02 2003, @12:35PM)
    You want to map the internet?

    1 Setup a site saying you want to map the internet.
    2 Get posted on slashdot.
    3 Parse the referer logs.
    4 ???
    5 Profit!
  • Increased Network Traffic if used by too many by ViolentGreen (Score:1) Friday November 14 2003, @03:57PM
    • 1 reply beneath your current threshold.
  • Mirror by markclong (Score:1) Friday November 14 2003, @03:58PM
  • Wonderful Idea by bacon-kidney-pie (Score:1) Friday November 14 2003, @03:59PM
  • Why not doing it the distributed way? by TheRagingTowel (Score:1) Friday November 14 2003, @04:02PM
  • bad for business (Score:3, Interesting)

    by glassesmonkey (684291) * on Friday November 14 2003, @04:03PM (#7476712)
    (http://slashdot.org/ | Last Journal: Tuesday June 24 2003, @04:25PM)
    Maybe people already take this into consideration, but won't this impact webhosting? Won't people try to get their webpage/company closer to the main trunk / center of map? When you look for a hosting service (basically an IP address) right now most people don't consider where in the map the host is.

    I mean with this tool, I would look up where my new IP would land me and try to find a host closer to the main backbones. Is this already done now by most people?

    (on another subject the maps remind me of the species origin stuff)
  • More links by modder (Score:1) Friday November 14 2003, @04:04PM
  • wow by Anonymous Coward (Score:1) Friday November 14 2003, @04:17PM
  • Be afraid by daves (Score:2) Friday November 14 2003, @04:21PM
  • Hierarchy (Score:3, Insightful)

    by sploxx (622853) on Friday November 14 2003, @04:23PM (#7476880)
    Has anyone noticed that nearly all of the maps have a more or less tree-shaped structure?
    This means concentration of power. So, the real, failure-tolerant internet is gone, at least it seems to be.
  • Hosts out in the middle of nowhere? by placeclicker (Score:1) Friday November 14 2003, @04:33PM
  • I'll pitch in! by beckett (Score:2) Friday November 14 2003, @04:34PM
  • limited by the speed of light? by Elminst (Score:2) Friday November 14 2003, @04:42PM
  • Internet Topology by Tacoguy (Score:2) Friday November 14 2003, @04:59PM
    • 1 reply beneath your current threshold.
  • Is there a map of sever locations on a real map? by DoorFrame (Score:2) Friday November 14 2003, @06:01PM
  • AS mapping would be more useful by cpghost (Score:2) Friday November 14 2003, @06:09PM
  • Let the mountain come to Mohamed (Score:3, Informative)

    by Porag_Spliffing (66509) on Friday November 14 2003, @06:21PM (#7477808)
    (http://slashdot.org/)
    I see his trick already. Post on /. that you plan to map the entire net and then wait till the entire net maps its way to you.

    P.S.

    Is there such a thing as trecart ?
  • It could make a good musical... by jejones (Score:2) Friday November 14 2003, @06:39PM
  • The Internet according to Garp (Score:3, Insightful)

    by Alomex (148003) on Friday November 14 2003, @06:42PM (#7477921)
    (http://slashdot.org/)
    Notice that he maps the paths from his computer to the rest of the world. That is not the same as a map of the entire Internet.

    To illustrate, if I map routes from, say Chicago, I'm likely to miss the direct connection between Seattle and San Francisco, as there is no traffic I could generate that would take that path.

    • 1 reply beneath your current threshold.
  • Too late by duffbeer703 (Score:2) Friday November 14 2003, @10:52PM
  • One Day? With a single computer? by Pan T. Hose (Score:1) Friday November 14 2003, @11:58PM
  • Re:Lets face it (Score:3, Informative)

    by nate nice (672391) on Friday November 14 2003, @03:16PM (#7476313)
    (Last Journal: Wednesday October 20 2004, @01:41AM)
    "You can't make money with computers anymore because some jackass is always trying to give away the same thing you're doing."

    Don't feel too bad, the government here (USA) is on your side mainly. I would disagree with you as there is always good money to made here but you have to be creative. The idea is to push each other further to create new ideas and technologies where you can make money.
    [ Parent ]
  • Re:Slashdotted in 3... 2... 1... by gid (Score:1) Friday November 14 2003, @03:26PM
  • Re:Do the math by kamukwam (Score:1) Friday November 14 2003, @03:32PM
    • Re:Do the math by lacheur (Score:1) Friday November 14 2003, @03:45PM
    • 1 reply beneath your current threshold.
  • Re:Lets face it by Anonymous Coward (Score:1) Friday November 14 2003, @03:35PM
    • 1 reply beneath your current threshold.
  • Ummm.. he can do it in parallel by Stone316 (Score:1) Friday November 14 2003, @03:46PM
    • 1 reply beneath your current threshold.
  • Dude. (Score:5, Funny)

    by FreeLinux (555387) on Friday November 14 2003, @03:46PM (#7476586)
    Please explain how one pings a web page. Is this a feature of AOL?

    Web pages are NOT internet hosts.

    Web servers are relatively few compared with other types of hosts on the internet.

    The World Wide Web is NOT the internet.

    The World Wide Web is NOT the internet.

    The World Wide Web is NOT the internet.

    The World Wide Web is NOT the internet.
    [ Parent ]
    • Re:Dude. by Just Some Guy (Score:2) Friday November 14 2003, @03:57PM
      • 1 reply beneath your current threshold.
    • Re:Dude. by freeweed (Score:3) Friday November 14 2003, @04:02PM
      • Re:Dude. by moonbender (Score:2) Friday November 14 2003, @05:18PM
    • Re:Dude. by cpghost (Score:1) Friday November 14 2003, @06:13PM
    • Re:Dude. by mcrbids (Score:2) Friday November 14 2003, @06:44PM
    • Compared to what? by 2fargone (Score:1) Saturday November 15 2003, @01:09AM
    • Re:Dude. by transient (Score:2) Saturday November 15 2003, @05:22AM
    • Re:Dude. by heXXXen (Score:1) Friday November 14 2003, @04:05PM
      • Re:Dude. by symbolic (Score:2) Friday November 14 2003, @04:46PM
        • Re:Dude. by symbolic (Score:2) Saturday November 15 2003, @01:13AM
          • 1 reply beneath your current threshold.
        • 1 reply beneath your current threshold.
    • 2 replies beneath your current threshold.
  • Re:Do the math by stratjakt (Score:1) Friday November 14 2003, @03:48PM
  • Re:Lets face it by Stone316 (Score:2) Friday November 14 2003, @03:52PM
  • Re:Well, if the site takes too long to load... by garwain (Score:1) Friday November 14 2003, @04:06PM
  • 22 replies beneath your current threshold.