87M Hosts on the Internet?
NTT writes "The Telcordia Internet Sizer provides daily updates on the size of the Internet. The Telcordia solution to quantifying Internet growth statistics is based on an internally developed unique sampling method. In this approach, over 150,000 randomly generated IP addresses are sampled on a daily basis and checked for their existence. Check out the other stats they have here"
not sure, but if you wrote a script or a piece of software that could, I'm sure you could find some sucker to buy it so they could claim to have mapped IP addresses
The voices in my head don't like you
In my own world, twisted as it may be, has a keen fascination with the word "starfish". I have come to discover that starfish's look like anuses. So let us do an experiment. I'm going to replace everytime you say starfish with the word anus. Let's take a look at the insuing hilarity, shall we?:
Just like anus: big, brown, and hairy. Cut it and you just eventually end up with more anuses. Each operates in its own little anus world not really worrying what the other anus is doing. Ejaculation (eventually) reroute around dead anuses. Only in the old DARPANET days could one logically speak of a central anus.
Love,
Bongo
Inventor of Anal Penetration
Correct. Especially if you're on a shoddy dialup like me, and become a part of the 'net only to disconnect, constantly, over and over again! :)
I would be willing to bet that we will need a major upgrade of the backbone in the next 3-7 years.
I have to disagree with that. The bandwidth is always being added. The bandwidth of the internet may lag behind what is needed by a bit; but it is maintaining a steady pace with bandwidth growth. New pipes are added every day.
If there is an overhaul, it may need to be at major peering points. Not so much upgrades as much as new ones.
How do they account for IPs not evenly distributed? Say some countries have used more of their IPs than others.
What about many computers with the same IP, or many IPs pointing to the same computer?
---
There are many instances where internal hosts (that is, those behind a firewall) have real registered address space addresses. RFC 1918 addresses are nice, but not even close to every company uses them, even for their internal network.
Also, this test doesn't really consider network address translated addresses with public DNS entries. For example, suppose I have an address for www.mydomain.com with my own authoratative domain server. The address is, say, 172.16.1.1 and anyone can connect to it. However, I actually have my firewall round robin the requests for that address to my web farm of 10 machines, 192.168.1.1-10, none of which are in external DNS. The survey would only catch one address, which actually has *no* machine directly associated with it. DNS is a reasonable measure of the size of the internet, but it is hardly an authoritative one.
This isn't even counting DMZ machines (those external to firewalls) that are connected to the internet "directly", but don't have a DNS entry. Why would you want a machine like that? Well, how about IP addresses on routers? Would you want those in DNS? How about intrusion detection servers, which monitor incoming traffic for attempted break ins. You really want to make yourself publicly known, making it easier for script kiddies to find you?
A better test would be an aggragate test of DNS reverse resolution, ping & traceroute. I'm sure that there are many machines out there that are open to some of these but not all three.
My UID is the product of 2 primes.
The most common sites I can think of that are open all of the time without firewalls are porn sites, therefore...
"..don't you eat that yellow snow."
According to this netsizer site there are 18.2m users of the internet in the UK (click the java world map, then go to europe).
However, a recent consumer association (I think) survey reported that 40% of UK households now have internet access. That would make about 25m and then you need to include the number of people that have access at work etc..
Whilst the method they use may return a acurate(ish) report on the number of hosts on the internet, I can't see how they have extrapolated the number of users.
In a recent survey, 45% of those surveyed admited that they lied in surveys.
Appearing tomorrow:
Immediately following posting on slashdot.org of the statistic of
approximately 87 million hosts being connected to the internet, the
statistics increased jumped to 186 trillion hosts. "How in the hell is
this possible," one spokesman was quoted as saying, "There aren't that
many people on this whole damn planet, and Hell! There can't be _THAT_
many addresses under IPv4!"
Logs indicate connections above and beyond the standard 255.255.255.255
range, showing such IP addresses as 1.4m.3l337.b147ch and 666.666.666.666.
Federal officers have been subsequently summoned to investigate whether or
not this is actualy a function of a new Distributed Denial of Service
[DDoS] such as the one that struck Yahoo! and other major sites recently.
This phenomenon is being classed as a new variant of well-known Trinoo and
TFN, labelled curiously "Slashdot Effect".
They presume reverse DNS implies IP address usage. This is not correct, of course. There are many machines that don't reverse lookup. Also, there are many IP addresses that reverse lookup and aren't there. The most glaring data is to look at Lucent in their enterprise list Apparently, Lucent has 48 machines for each employee. Lucent will successfully reverse DNS every IP that they are asked about, into something like h135-1-1-1.outland.lucent.com. Splitrock.net apparently has a similar scheme, although the naming method is a little more opaque.
When your estimate is 87 million, of which 8.3 million of your count are highly suspect, it's not the 3 per cent sampling error that you should be concerned about.
87 million hosts with which to unleash DDoS'es upon our beloved ethical enemies. (heyhey! doubleclick.com =)
-Billco, Fnarg.com
Now THAT is a whole lotta pink and brown cylinders and hemispheres!!
"..don't you eat that yellow snow."
The Ping heard 'round the world!
Because you can't, you won't, and you don't stop...
Mine just dials my usual selection of ISPs. As it happens one of them is a fixed IP dialup (demon internet since the have mobile phone access numbers for very fast conencting) and what I was pinging was definitely my phone since the ping time was about 900ms from a cable modem and when i hung the phone up the responses stopped.
:( Does anyone else know any particularly Wap friendly 0800 isps in the uk?
Since my freephone ISP (lineone)are stopping access soon i'll have to start paying for access again soon
should we count google's 6000 clustered linux servers? They don't have public ip addresses, and I'd say we should.
ok then your [sic] infringing on my copyright! Could you as [sic] me next time before STEALING my comments for your own?
Sure, as long as you can talk the people who aren't using all of their class A's and class B's into loaning you a couple addresses.
Besides, if you assume it doubles every one and a half years, I'd say you've got maybe 8 more years. Not 20.
I'm always curious as to why people are interested in the size of the internet. As long as it works, and people think it's running nicely, does it really matter?
I'm guessing you're not a network administrator.
It works because the infrastructure can support all the traffic that's currently on it. If your infrastructure is build to support 10 billion hosts, and your survey reveals you have 10 million active hosts, you can relax.
On the other hand, if it reveals you have 900 million hosts, and you only had 500 millon two weeks ago, you're in trouble and you need to get some new hardware, fast.
So aside from a general curiosity as to how many people out there want to download my mp3's, there's a legitimate reason for the 'net community to be interested and even concerned by the size and growth rate of the Internet.
Verio isn't listed at all under ISPs with the most hosts, and yet Verio's PR says it's the world's largest business webhosting provider? Or am I missing something, here?
---
click a button, feed a hungry person!
Get off my launchpad!
I did a "measure-the-internet" script a long time ago.
I generated random IP addresses then tested to see if there was a webserver running; From what I remember ~25% of machines had a server running..
Steve
---
I'm always curious as to why people are interested in the size of the internet. As long as it works, and people think it's running nicely, does it really matter? I can't see any competitors to the internet for various institutions to be battling down, so I'm assuming that these reports are issued as nothing more than a cheap way to get some hits on reporter's website and to raise their meagre profile a little.
;-)
But then, I've always been cynical like that.
Thoughts?
This only measures how many hosts are listed within DNS, not the total number of machines on the internet. It doesn't measure IPs used by dialups, machines behind firewalls, IP masquaraded machines, etc. In other words, there are more than 87 million computers on the internet, quite a few more I would guess. In fact, I would say that the exact number is almost impossible to figure out.
I doubt that machines using IPs reserved for local networks (machines that therefore never can be reached directly from the Internet) really should be counted as "hosts on the internet"... (this is the case with masqueraded machines, etc)
--
Ner lbh sebz gur HFN? Gura lbh'ir whfg ivbyngrq gur QZPN!
Now that this survey has shown us how deep the penetration of the Internet is, let's figure out the stats for the real meaning of the net: PORNO!!!!!
Assuming that each host were to have 10 megs of original, non duplicate pr0n online (yes I know thats a very very low estimate), with 87 million hosts out there, that would mean that there 830 terrabytes of luscious luscious pr0n out for your downloading pleasure! Excuse me while I check out the newsgroups...
"The most fortunate of persons is he who has the most means to satisfy his vagaries."
"The most fortunate of persons is he who has the most means to satisfy his vagaries."
- Marquis De Sade
We'd better. If we start using IPv6 now, we'll put them out of business!
"OK, only 26 billion more IP addresses to go..."
icqqm [ICQ:11952102]
Especially interesting is how hosts are added 87 per minute, every minute. Someone needs to teach netsizer about significant digits...
ok then your [sic] infringing on my copyright! Could you as [sic] me next time before STEALING my comments for your own?
I wonder if they take into account which netblocks have been issued by regional registries like ARIN
With all the mostly unused but allocated Class As and Class Bs that were given out long before we ever knew how popular the net was going to be, firewalls, masquerading, dynamic IPs and God knows what else, how good can sampling be on this network?
--
Care about electronic freedom? Consider donating to the EFF!
Err...If I remember correctly from my stats class the general rule for margin of error is actually 1/sqrt(n). Common sense says that as the sample size increases, the margin of error should decrease, so sqrt(n) doesn't seem right.
(IIRC, this is because we are sampling from binomial distribution (either an IP exists or it doesn't), where the margin of error in the normal approximation is given by z_star*sqrt(p_hat*(1-p_hat))/sqrt(n). Using z_star~2 for 95% confidence and p_hat=.5 in the worst case, this reduces to 1/sqrt(n)).
Anyway, a sample size of 150,000 is incredibly good, and I think margin of error will be so small that it's not worth calculating (yes I'm lazy). So a better statistical question is whether the IP addresses tested were a random sample of all possible IP addresses? (For example, I know that some addresses are reserved and may not be used, so it would be a mistake to sample such addresses.)
Comments/corrections are appreciated.
Wil
This is a good point. If they do a scan at an IP address and none of the priviledged ports are responding (accepting connections or indicating that they're closed), the best you can do is assume that there's no computer there. Right?
How will this have skewed the results?
Fire and Meat. Yummy.
Does this measure all IP's, or just addresses in DNS or what? If its all IPs then that means theres only 87000000 out of a possible 256^4=4294967296 which means we are only using 2% of the possible address space. So why all the noise about IPv6?
If I remember correctly from my stats class the general rule for margin of error is actually 1/sqrt(n). Common sense says that as the sample size increases, the margin of error should decrease, so sqrt(n) doesn't seem right.
1/sqrt(n) gives you the margin of error as a percentage, to figure out the number of IP addresses which this accounts for, the calculation is 1/sqrt(n) * n which becomes sqrt(n) which is what he was talking about.
Anyway, a sample size of 150,000 is incredibly good, and I think margin of error will be so small that it's not worth calculating
It's 0.258%
20% of them are at FSU.
Hell, that means we can put off IPv6 for another 20 years or so, right? :)
DrLunch.com The site that tells you what's for lunch!
I thought "...the backbone is going to collapse any day now..." FUD was behind us now.
With how fast the net's expanding will it's backbone be able to handle the traffic in the coming years?
My sister says she has the internet on her iMac, which has only got a 6 gig HD, so how the hell could it be growing if she's never upgraded?
Acting stupid isn't much fun when there's someone around who knows better
How many are serious hosts and not someone running a server off of their pc?
What about those ignorant Website Administrators ;) and dont't know what consequences it will have ? ;)
that filter out ICMP completely (because there are many evil attacks that use ICMP
(besides on not counting them
Samba Information HQ
sig:
sig:
See the "..for smart people" banners Wired runs here? Look elsewhere guys.
I have a site with a 7 IP block ... only the gateway IP address is accessible by pinging, etc.
Doing a full service scan of them might reveal something, but that would be dangerous considering the number of people who take a dim view of being probed.
- Michael T. Babcock (Yes, I blog)
I've whiled away many a dreary evening by trying to make a guess at the amount of data available on the Web. Once upon a time I thought there were no more than maybe a couple of terabytes out there. NOw I know that this is wrong by several orders of magnitude. I'm starting to think that we're on the order of tens of petabytes here- maybe more. The *useful* information is of course a tiny fraction of that.
The difficulty I think is that I have no concept of size above about a megabyte. It just loses all meaning, and becomes purely "big". The same applies to the count of the hosts on the net, or the number of people reading this. It means nothing more than a number to me...
Said it couldn't last, said it wouldn't last... This is the last stand against tomorrow's world.
It's something to be borne in mind when you see polls on TV. Frequently the sample is so small that any lead one party has is lost in statistical noise. Say in a poll of 400 people, you have a statistical error of 20 or in other words, 1 in 20 or 5%. Thus if for example, in suchg a small sampled poll, Bush leads Gore by 8%, with a 5% error on each candidate's poopularity or 10% overall, it's statistically insignificant and doesn't show a thing.
Rich
Interestingly, the number of hosts seems to be doubling every 18 months. Cooincidence?
These behave very strangely. The TCP/IP stack on my nokia 7110 does respond to pings (although i'd hazard a guess that not all models do. It does produce very odd results in nmap - which kicks up dozens of errors about unrecognised responses.
Remember that nokia expect there to be a worldwide market for 500 million wap enabled phones!? that'll eat into the ip space.
Rich
Anybody know what the margin for error with this thing is? I mean, with the millions of possibilities for IP addresses, how accurate can a "random sampling" of 150,000 IP addresses?
Just curious, 'cause I couldn't find an exact number that they published on the site...
ipchains -P input DENY
to be contiunued...
long live IP address surveys.
That is besides all poor souls writing lame messages on Slashdot from a MASQed machine.
Baker's Law: Misery no longer loves company. Nowadays it insists on it
http://www.sigsegv.cx/
Maybe this would be a cool way to store fingerprints for ATM terminals or something. Has anyone ever considered ASCII?
Acting stupid isn't much fun when there's someone around who knows better
This only measures how many hosts are listed within DNS, not the total number of machines on the internet. It doesn't measure IPs used by dialups, machines behind firewalls, IP masquaraded machines, etc. In other words, there are more than 87 million computers on the internet, quite a few more I would guess. In fact, I would say that the exact number is almost impossible to figure out.
You don't know the first thing about high-traffic web serving, do you?
Now I know why she was complaining earlier when she tried to connect and it wouldn't. First I am going to sue you for the theft of the internet. Then I am going to sue AOL for trying to sell a product they no longer owned.
Acting stupid isn't much fun when there's someone around who knows better
what if a machine wasn't up when it was checked... could there be a lot of machines that just arn't connected all the time?
But AOL is a major corporation; surely they wouldn't do anything wrong like that!
- Joe
-Joe
I have to wonder if all of this hype about the ever-growing size of the internet is just blind optimism. Growth doesn't seem to have brought very much of any good. For every Slashdot or Linux kernel there's a thousand new pr0n or warez sites, a thousand badly designed web pages...
Corporations patenting obvious ideas left and right to try to gain some control over the network. So-called "intellectual property concerns" become dominant features in internet policy making. Commercialism seems more dominant than community. What happened to it all? Is there any hope for a populist revival to restore more of the old community feel?
Visit the
150,000 randomly generated IP addresses are sampled on a daily basis and checked for their existence
My firewall hides (NATs) blocks of hundreds of IPs, and no 'check' will confirm their existence or non-existence.
M$: "We're #2!"
From the nmap man page:
:). It will never end. This can be useful for statistical sampling of the Internet to estimate various things.
"-iR This option tells Nmap to generate its own hosts to scan by simply picking random numbers
If you are ever really bored, try
nmap -sS -iR -p 80
to find some web servers to look at."
The only difference is that most normal people aren't bored enough to keep going after the 500th or so 403 Forbidden error.
And 'mid this tumult Kubla heard from far
Ancestral voices prophesying war!