87M Hosts on the Internet? 69
NTT writes "The Telcordia Internet Sizer provides daily updates on the size of the Internet. The Telcordia solution to quantifying Internet growth statistics is based on an internally developed unique sampling method. In this approach, over 150,000 randomly generated IP addresses are sampled on a daily basis and checked for their existence. Check out the other stats they have here"
Re:Only 87 Million? (Score:1)
Re:Accept it! There is no "internet backbone". (Score:1)
Just like anus: big, brown, and hairy. Cut it and you just eventually end up with more anuses. Each operates in its own little anus world not really worrying what the other anus is doing. Ejaculation (eventually) reroute around dead anuses. Only in the old DARPANET days could one logically speak of a central anus.
Love,
Bongo
Inventor of Anal Penetration
Re:Named hosts only... (Score:1)
Re:The net's backbone (Score:1)
I have to disagree with that. The bandwidth is always being added. The bandwidth of the internet may lag behind what is needed by a bit; but it is maintaining a steady pace with bandwidth growth. New pipes are added every day.
If there is an overhaul, it may need to be at major peering points. Not so much upgrades as much as new ones.
Accuracy? (Score:2)
What about many computers with the same IP, or many IPs pointing to the same computer?
---
Re:Named hosts only... (Score:3)
Also, this test doesn't really consider network address translated addresses with public DNS entries. For example, suppose I have an address for www.mydomain.com with my own authoratative domain server. The address is, say, 172.16.1.1 and anyone can connect to it. However, I actually have my firewall round robin the requests for that address to my web farm of 10 machines, 192.168.1.1-10, none of which are in external DNS. The survey would only catch one address, which actually has *no* machine directly associated with it. DNS is a reasonable measure of the size of the internet, but it is hardly an authoritative one.
This isn't even counting DMZ machines (those external to firewalls) that are connected to the internet "directly", but don't have a DNS entry. Why would you want a machine like that? Well, how about IP addresses on routers? Would you want those in DNS? How about intrusion detection servers, which monitor incoming traffic for attempted break ins. You really want to make yourself publicly known, making it easier for script kiddies to find you?
A better test would be an aggragate test of DNS reverse resolution, ping & traceroute. I'm sure that there are many machines out there that are open to some of these but not all three.
Two or three I haven't visited. (Score:1)
How acurate are surveys? (Score:1)
However, a recent consumer association (I think) survey reported that 40% of UK households now have internet access. That would make about 25m and then you need to include the number of people that have access at work etc..
Whilst the method they use may return a acurate(ish) report on the number of hosts on the internet, I can't see how they have extrapolated the number of users.
In a recent survey, 45% of those surveyed admited that they lied in surveys.
Tomorrow's stats (Score:1)
Immediately following posting on slashdot.org of the statistic of
approximately 87 million hosts being connected to the internet, the
statistics increased jumped to 186 trillion hosts. "How in the hell is
this possible," one spokesman was quoted as saying, "There aren't that
many people on this whole damn planet, and Hell! There can't be _THAT_
many addresses under IPv4!"
Logs indicate connections above and beyond the standard 255.255.255.255
range, showing such IP addresses as 1.4m.3l337.b147ch and 666.666.666.666.
Federal officers have been subsequently summoned to investigate whether or
not this is actualy a function of a new Distributed Denial of Service
[DDoS] such as the one that struck Yahoo! and other major sites recently.
This phenomenon is being classed as a new variant of well-known Trinoo and
TFN, labelled curiously "Slashdot Effect".
Re: true error source (Score:2)
They presume reverse DNS implies IP address usage. This is not correct, of course. There are many machines that don't reverse lookup. Also, there are many IP addresses that reverse lookup and aren't there. The most glaring data is to look at Lucent in their enterprise list [netsizer.com] Apparently, Lucent has 48 machines for each employee. Lucent will successfully reverse DNS every IP that they are asked about, into something like h135-1-1-1.outland.lucent.com. Splitrock.net apparently has a similar scheme, although the naming method is a little more opaque.
When your estimate is 87 million, of which 8.3 million of your count are highly suspect, it's not the 3 per cent sampling error that you should be concerned about.
Script kiddies rejoice (Score:1)
Re:Imagine the possiblities (Score:1)
*Ping* (Score:1)
The Ping heard 'round the world!
Re:What about WAP enabled devices (Score:2)
Since my freephone ISP (lineone)are stopping access soon i'll have to start paying for access again soon
Re:Named hosts only... (Score:1)
Re:Only 87 Million? (Score:1)
Besides, if you assume it doubles every one and a half years, I'd say you've got maybe 8 more years. Not 20.
Re:Does it matter? (Score:1)
I'm guessing you're not a network administrator.
It works because the infrastructure can support all the traffic that's currently on it. If your infrastructure is build to support 10 billion hosts, and your survey reveals you have 10 million active hosts, you can relax.
On the other hand, if it reveals you have 900 million hosts, and you only had 500 millon two weeks ago, you're in trouble and you need to get some new hardware, fast.
So aside from a general curiosity as to how many people out there want to download my mp3's, there's a legitimate reason for the 'net community to be interested and even concerned by the size and growth rate of the Internet.
I dunno how accurate they can be... (Score:1)
---
click a button, feed a hungry person!
Re:Only 87 Million? (Score:1)
I did a "measure-the-internet" script a long time ago.
I generated random IP addresses then tested to see if there was a webserver running; From what I remember ~25% of machines had a server running..
Steve
---
Does it matter? (Score:3)
But then, I've always been cynical like that.
Thoughts?
Re:Named hosts only... (Score:1)
I doubt that machines using IPs reserved for local networks (machines that therefore never can be reached directly from the Internet) really should be counted as "hosts on the internet"... (this is the case with masqueraded machines, etc)
--
Imagine the possiblities (Score:2)
Assuming that each host were to have 10 megs of original, non duplicate pr0n online (yes I know thats a very very low estimate), with 87 million hosts out there, that would mean that there 830 terrabytes of luscious luscious pr0n out for your downloading pleasure! Excuse me while I check out the newsgroups...
"The most fortunate of persons is he who has the most means to satisfy his vagaries."
Re:Only 87 Million? (Score:1)
We'd better. If we start using IPv6 now, we'll put them out of business!
"OK, only 26 billion more IP addresses to go..."
netclock.class (Score:1)
Allocated vs Reserved blocks? (Score:2)
Pretty silly assumptions (Score:2)
With all the mostly unused but allocated Class As and Class Bs that were given out long before we ever knew how popular the net was going to be, firewalls, masquerading, dynamic IPs and God knows what else, how good can sampling be on this network?
--
Re:plus minus/ ??? (Score:1)
Err...If I remember correctly from my stats class the general rule for margin of error is actually 1/sqrt(n). Common sense says that as the sample size increases, the margin of error should decrease, so sqrt(n) doesn't seem right.
(IIRC, this is because we are sampling from binomial distribution (either an IP exists or it doesn't), where the margin of error in the normal approximation is given by z_star*sqrt(p_hat*(1-p_hat))/sqrt(n). Using z_star~2 for 95% confidence and p_hat=.5 in the worst case, this reduces to 1/sqrt(n)).
Anyway, a sample size of 150,000 is incredibly good, and I think margin of error will be so small that it's not worth calculating (yes I'm lazy). So a better statistical question is whether the IP addresses tested were a random sample of all possible IP addresses? (For example, I know that some addresses are reserved and may not be used, so it would be a mistake to sample such addresses.)
Comments/corrections are appreciated.
Wil
ipchains DENY (Score:2)
This is a good point. If they do a scan at an IP address and none of the priviledged ports are responding (accepting connections or indicating that they're closed), the best you can do is assume that there's no computer there. Right?
How will this have skewed the results?
hrmmm... (Score:2)
Re:hrmmm... (Score:1)
Re:plus minus/ ??? (Score:2)
1/sqrt(n) gives you the margin of error as a percentage, to figure out the number of IP addresses which this accounts for, the calculation is 1/sqrt(n) * n which becomes sqrt(n) which is what he was talking about.
Anyway, a sample size of 150,000 is incredibly good, and I think margin of error will be so small that it's not worth calculating
It's 0.258%
More importantly, (Score:1)
Only 87 Million? (Score:3)
Re:The net's backbone (Score:1)
The net's backbone (Score:1)
Re:Only 87 Million? (Score:1)
Yhcrana
How the hell can it grow? (Score:4)
How many are serious hosts (Score:1)
Re:The net's backbone (Score:1)
Yhcrana
Filtering PINGs ? (Score:1)
that filter out ICMP completely (because there are many evil attacks that use ICMP
(besides on not counting them
Samba Information HQ
87 at last! (Score:2)
sig:
Re:This approach is flawed... (Score:2)
Doing a full service scan of them might reveal something, but that would be dangerous considering the number of people who take a dim view of being probed.
Guess the size... (Score:1)
The difficulty I think is that I have no concept of size above about a megabyte. It just loses all meaning, and becomes purely "big". The same applies to the count of the hosts on the net, or the number of people reading this. It means nothing more than a number to me...
Re:How many are serious hosts (Score:1)
All's you need is an IP address and a TCP/IP stack (oh, and since they probably check for the existence of hosts using ICMP messages, you might want to have your host reply to ping requests in order to become part of this statistic).
Re:How the hell can it grow? (Score:1)
- Joe
Re:plus minus/ ??? (Score:1)
It's something to be borne in mind when you see polls on TV. Frequently the sample is so small that any lead one party has is lost in statistical noise. Say in a poll of 400 people, you have a statistical error of 20 or in other words, 1 in 20 or 5%. Thus if for example, in suchg a small sampled poll, Bush leads Gore by 8%, with a 5% error on each candidate's poopularity or 10% overall, it's statistically insignificant and doesn't show a thing.
Rich
Re:Does it matter? (Score:1)
Re:Pretty silly assumptions (Score:1)
Re:Accuracy? (Score:1)
Think about for a second. If they take a random sample it doesn't matter if IPs are not evenly distributed. Simple statistics. Now say Do'h.
Moore's Law? (Score:1)
What about WAP enabled devices (Score:2)
Remember that nokia expect there to be a worldwide market for 500 million wap enabled phones!? that'll eat into the ip space.
Re:plus minus/ ??? (Score:1)
Rich
Re:What about WAP enabled devices (Score:1)
Because of the way they work...
Phone -> Gateway -> WapsiteThis does make some sense, since a random probe (you know, you get them all the time on any machine that's been connected for longer than 10 minutes) could upset this little (generally buggy) device. An nmap scan is enough to crash some of the early 7110's (which btw have had a lot of stack problems so don't be suprised about different ping replies between different firmware versions). Even with the always-on GPRS, it's likely some kind of cut-down and reshaped DHCP will be used, since even though the phones can stay on-line all the time a lot of people don't leave them switched on all the time... :) silly really, I have about 3 weeks uptime on my Motorola Timeport :)
The plural of 'anus' (Score:1)
Re:Imagine the possiblities (Score:1)
+/- ??? (Score:2)
Just curious, 'cause I couldn't find an exact number that they published on the site...
Hi Hi Hi Hi (Score:2)
to be contiunued...
long live IP address surveys.
That is besides all poor souls writing lame messages on Slashdot from a MASQed machine.
Re: ID'ing people (Score:1)
Named hosts only... (Score:3)
Re:I host sites on a single PC & I own 2 static IP (Score:1)
You don't know the first thing about high-traffic web serving, do you?
Re:Only 87 Million? (Score:1)
Umm, false. From here: [telcordia.com]
The Telcordia solution to quantifying Internet growth statistics is based on an internally developed unique sampling method. In this approach, over 150,000 randomly generated IP addresses are sampled on a daily basis and checked for their existence.
I haven't checked http://www.argreenhouse.com/netsizer [argreenhouse.com] ('cause it seems to be taking just forever to load), but they appear to be generating random quads and seeing if they exist, and running statistics against their findings over time.
Re: I am going to sue your thieving ass (Score:1)
Re:How many are serious hosts (Score:1)
I always look at these kinds of stats with extreme prejudice. Just exactly how do they determine if a host is alive, where it is, how many IPs are on the same physical machine, etc? "Ping" is a VERY bad way to locate devices. It has the lowest possible priority (read: NONE) and in fact is ignored/filtered at most sites.
As for location... With the increasing use of non-IP transports for IP (read: ATM and SONET), it's even more impossible to guess where something is. Looking at the traceroute to one of my machines, one might think it's in CLT -- that's just the last place it was IP along the ATM path to the IDSL router 200miles away. (Heck, ALL of Interpath's dialup -- three entire states -- looks like the machines are at RDU.)
downtime? (Score:2)
Re: I am going to sue your thieving ass (Score:1)
But AOL is a major corporation; surely they wouldn't do anything wrong like that!
- Joe
Re:How the hell can it grow? (Score:2)
Is growth a good thing? (Score:1)
Corporations patenting obvious ideas left and right to try to gain some control over the network. So-called "intellectual property concerns" become dominant features in internet policy making. Commercialism seems more dominant than community. What happened to it all? Is there any hope for a populist revival to restore more of the old community feel?
This approach is flawed... (Score:2)
My firewall hides (NATs) blocks of hundreds of IPs, and no 'check' will confirm their existence or non-existence.
This is nothing new (Score:2)
"-iR This option tells Nmap to generate its own hosts to scan by simply picking random numbers
If you are ever really bored, try
nmap -sS -iR -p 80
to find some web servers to look at."
The only difference is that most normal people aren't bored enough to keep going after the 500th or so 403 Forbidden error.
And 'mid this tumult Kubla heard from far
Ancestral voices prophesying war!