Metcalfe's Law Refutation Explained
sdpinpdx writes "According to this article in the July 2006 IEEE Spectrum Metcalfe's Law (that the value of a network is n^2) is wrong (it's probably only n log(n)). The authors speculate this had something to do with the .com bubble, and that their more conservative model might help alleviate the next one. The article includes an interesting quote from Metcalfe: 'The original point of my law (a 35mm slide circa 1980, way before George Gilder named it...) was to establish the existence of a cost-value crossover point--critical mass--before which networks don't pay. The trick is to get past that point, to establish critical mass.'" This would seem to be an update to a story we ran more than a year ago.
Hrm. The editors can't recognize that an article is a dupe of one submitted two days ago, but they can recognize that an article is related to one posted a year ago? Weird. :P
If you want to know:
http://en.wikipedia.org/wiki/Metcalfe's_Law/
It has been said that 63% of all statistics are made up
So what if I spend 10 minutes devising some silly and arbitrary, yet very simple formula to place value on something as subjective as the value of a network. But, I'll do one for patent lawsuits...though this formula will give the relative ignorance of the original patent, measured in PES-Bs (patent examiner smoke-breaks.) I think I will square it by LEMIs (Large Entity Monetary Incentives) and divide that by the total number of patents submitted in that CY (Calendar Year.)
So we get: (PES-B ^ LEMI) / CY Patent total.
Can I get that formula named after me??
Why go fast when you can go anywhere? O|||||||O
Look, just tell me how many pipes I need and when to buy a truck. Anything else is beyond our comprehension. Plus, does this even compensate for the poker chips and horses? I don't think so.
Developers: We can use your help.
I don't think this is as much a "refutation" as it is three people's assertion countering some other guy's assertion. Since the one guy is richer than the other three put together, I'd say the burden of proof is on them.
What I'm listening to now on Pandora...
That law treats a network as if its only value is its interconnectedness. Especially while some nodes send more info than they receive, some nodes are more valuable, and some connections are more valuable. Unless the actual information transmitted has no value to the network.
Which is what I've gathered from Metcalfe's InfoWorld columns since then.
--
make install -not war
Depending upon the uniqueness of each node.
Having two different Legal dictionaries offers less definitions than having both a Legal and a Medical dictionary.
Two bricklayers or two Carpenters may build a house slower than one carpenter and one bricklayer.
And a car wouldn't get very far if all it's wheels spun clockwise.
Back when computers were more specific purpose (This one is for Payroll, this one for Budget, this one for Customer tracking, this one for the actual Job...) linking them together had amazing potential, but now when an entire operation could be run off one machine (Quickbooks, Photoshop, Coreldraw, Web Browser, Fax server were all together one one machine I know of, and all critical for the business) there's not that much data that needs to move over a network to run the business.
Wikipedia, for example, would still be very useful even if it had zero links to external sites, because in itself it encompasses so much. Amazon does not need to offer links to other retailers, because they sell near everything.
The thing we seem to know from things like process control, is that it takes a finite amount overhead to manage any group, and a very finite amount of resources to bring an outsider into a group. This is Brookes; Law, that says bringing more people onto a late project will only make it later. We see this action around us right now.
What I find most fascinating is how easily people will allow themselves to be deluded by a model, even though the reality is all around them. If we look at something like graph theory we see certain features. For instance, no one has an extremely large number of close friends. Most of us have what can be considered concentric circles of people we know, each group out is usually bigger, but more loosely connected. Communicating with the outer circles are very inefficient. Business are arranged the same way.I think what confused people is that the internet, like the telephone, made geographic distances less important, so it is easier to keep up communications with someone across the world, but that does not mean that the person's ability to relate has been increased.
Additionally, not everyone, or everything, can competently complete all tasks, and not all processes can be factored to take advantage of all resources. At some point one is paying for overhead that does not deliver any added efficiency. I think this is what we are seeing in many international corporations. The corporation supports non-productive real estate, managers, IT, which forces the productive parts of the company to work harder and be less responsive to market forces.
I would say that that a network initially has a n^2 benefit, but quickly transitions to nlog(n). This is not so. If anything cause the dot com crash, it was not understanding that at some point the overhead begins to be the dominant factor, and efficiency is lost.
"She's a scientist and a lesbian. She's not going to let it slide." Orphan Black
Either I don't get it or this is virtual dick-waving. For what I'd say, it's not size that matters, it's how you use it.
It's not the number of connected hosts that tell you about the value or quality of a network, or how much can be accomplished with it. You can network the biggest LAN in the world and have it play Quake all day, I'd put my money on the 5 computers calculating some more primes back in the basement.
The value of a network lies in what it connects. Not in its size.
We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
According to the article (and common sense, because Metcalfe is not a short-sighted fool), Metcalfe acknowledges that his original reason for stating his "law" was simply to illustrate that even though small networks might not be interesting, once a certain size was reached, they would become compelling. For this, the distinction between n^2 and n.log(n) is pretty irrelevant -- the significant feature is that both are superlinear (as the article notes). Metcalfe was absolutely correct.
This is not to say he was unique in recognizing this, or that it'd be surprising for someone invested in selling networks to claim they'll become important. The point is he was not attempting to carefully quantify the scaling effects of networking. Rather, he had an instinct that said networks will be big when they get big. The quickest back-of-the-envelope estimate of the scaling law says n.(n-1)~n^2, so he used that for his talk.
When networks started to catch on, someone (the name is in the article but I'm too lazy to go back and look it up) grabbed ahold of this tidbit and named it Metcalfe's Law. Doing anything quantitative with this is ridiculous. It's obvious to everyone involved, Metcalfe included, that his "law" was just the simplest superlinear curve, not some carefully constructed value function. Even the new estimate -- n.log(n) -- is on pretty crude footing. I'm sure you can find a good analysis that gives this result, but there is so much ambiguity in what the value function should actually measure that it's hard to know you're doing the right thing.
Basically, Metcalfe was right. Networks grow in value faster than they grow in node size. His "Law" may be wrong, but it was just a heuristic to begin with. Anyone basing a business model on the details of that law deserved to have their bubble burst.
(Aside from the fact that 2^n and n^2 are both absurd in any kind of network I can think of), n log(n) has the advantage over all the other models mentioned in that it correctly gives a zero value for a network of one, which is not a network at all and obviously adds no value. Or if you want the combined value of the network and the networked, maybe it would be n + n log(n).
So... what are the units of the result? Dollars? Web 2.0 beta credits?
It is possible to refute this hypothesis, given that "value" is so vaguely defined? Stated as the original "law" states it, I'm not sure that value is something you can state in dollars and cents. If it isn't, I'm not sure what it means.
Different aspects of the network which have "value" scale different ways.
For example while the Internet has probably grown considerably since 1999, I don't really use more web sites regularly, or buy from more vendors than I used to. Yes there is Froogle and so on, but I'd bet most consumers after trying this sort of thing settle on a small set of vendors who have good but not necessarily optimal and have good enough service. I tend to go right to New Egg for example, rather than searching Froogle.
It seems to me that the maximum number of providers (speaking in a broad sense) that a consumer is apt interact with, multiplied by the number of consumers, is a rough measure of the Internet's economic potential. Naturally this leaves aside B2B, but to be meaningful you'd have to disaggregate this sector. Businesses dealing in commodities are likely to expand in an N^2 fashion, but value for relationship oriented business is probably O(n).
On the other hand... The Internet's disruptive potential probably is O(n^2). That is I may not connect to some random node out there, but I could and if there was a reason to I would. It creates the potential for "killer apps" that shift consumer behavior in unexpected ways.
Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
The article is a moderately interesting and loooooong account of how to tweak a growth law based on observations that the previous law does not explain history well enough. For those among us who have fortunes depending on this (let's see a raise of hands, please), it may be very important. But for the rest of us it really doesn't matter much. My concern is that /. may take Nlog(N) seconds longer to load (where N=number of posts), not whether CmdrTaco will make a bazillion instead of a gazillion next year.
Can I get that formula named after me??
Only if you can get some analyst like the George Gilder mentioned in TFA to popularize it. Remember, this is the guy who says Bernie Ebbers i sa nice guy and is a favorite editior fo Forbes (of SCO quality 'journalism') fame.
...which everyone seems to be forgetting, was for Metcalfe to raise the value of 3COM stock. If networks weren't seen as "valuable," then people wouldn't buy networking equipment...
There exists no way of exchanging information without making judgments. --Bene Gesserit Axiom
...it never even realizes that the boat existed.
That is: the authors' analysis is fundamentally flawed in a couple of different respects.
(1) They don't even attempt to establish an actual metric for the value of a network. Without that, any counterarguments to any previous assertions regarding network value that one might make are basically so much handwaving. (One can of course make the same objection to Metcalfe's Law, but saying "My hand-wavy claim is better than yours!" isn't much of an argument.)
On a related note, any discussion of network value that doesn't even take into account the semantics of the network (i.e., what the edges represent) is even more useless.
(2) Let's say that we will approximate the value of a network as being proportional to the number of edges (links); we'll ignore, for the moment, the possibility that edge weights might differ.
_In practice_, the number of individuals that one can be meaningfully connected to in a network--i.e., the number of incident edges to a node--will be limited. _This_ is the real problem with Metcalfe's Law.
For example, suppose that the edges represent friendships (as identified by the individuals, i.e., A is connected to B if A and B each agree that they are friends.) The number of friends that I can have is limited: I can't even _meet_ six billion people for one minute each (even assuming that I could remember them as individuals) before my personal clock runs out. This same argument applies (although the limitations will be different) to financial transactions, telephone calls, IM buddies, and so forth. Sure, if all 6 billion people were signed into Your Favorite IM Client and I could open a group chat with them, they could all read my words...but that's not a meaningful relationship, it's a broadcast--and it has really nothing to do with any network topology.
I hate to burst your bubble, and I don't disagree with your statements.
However the network that he is talking about is not analogous to a LAN. Think of it more as a social network, like the six degrees of separation thing. Who you know matters very little until your network of friends gets to a certain critical size.
What he was trying to say with algorithm is that you can measure how valuable a social network, not computer network, is.
42 - So long and thanks for all the fish.
I mean, with all the Nazi stuff going around these days. It'd be really useful to call a spade a spade!
"Speaking the Truth in times of universal deceit is a revolutionary act." -- George Orwell
"Laws" my ass. These buggers don't even have proper beards.
For example, does the community that forms around an open source project have n log(n) value where n is the number of members in that community?
You don't get it.
The point is not that 10 granny smith apples tied together with string form a more or less valuable network than 5 crays. The comparison is between different networks of the same thing but of different sizes. And it is about the value of the network itself, not what the networked things accomplish. There is an underlying assumption that the metric under consideration values the networking; it is irrelevant that one can always think of a task for which networking doesn't improve performance.
If the node cost, x, is $100 and there are 100 users, n, then the cost for the network is $10,000.
If the single user business value, v, of the network is $10 for one user then the ROI for different valuation methods is:
Linear: vn = $1,000 -- no business case, don't even think about it
Metcalf's Law: (n(n-1)=2)v = 49,500 -- winner
Metcalf's Law as misused by dot-bombers: N^2 * V = 100,000 -- "Proves" selling frozen mud on the net is a winner
As restated by the authors: n long (n) * v = 2000 -- no business case, but better than a flat linear
There really are two problems here. The scaling formula and setting the business value. If you set the business value for a single connection greater than the cost of the network then it is a no brainer, but back when Metcalf as pushing networking that was a hard case to make and given how many people use /. at work that may still be the case.
Here's what Metcalfe had to say about this same paper from these same academics when they first circulated it in '05:
2
http://www.networkworld.com/community/?q=node/635
It kind of annoys me: that author throws together some "rules of thumb" sentences and "it seems plausible to assume", and thinks it makes something worthwhile to publish? I don't know if Metcalfs Law is correct or wrong, and I don't care as long as my internet bandwidth doubles every year, but I am a little bit shocked by this display of extremely low standards. Perhaps the author wanted to prove his own point, though: clearly not all additions to the internet are very valuable...
I got a chance to review this article before publication and in my commentary on the draft version I point out not only that Metcalfe's law is wrong, but that often any positive law is wrong, because in many cases, particularly mailing lists, the value of the network eventually starts dropping as the size increases, due to noise and excess signal.
That's why may people prefer smaller mailing lists to larger communities, and in fact some topics simply can't be handled properly in large groups, even with moderators.
Has it been over a year since you last donated to the Electronic Frontier Foundation
At least you should differentiate between personal communication networks and information networks. If I want to talk to David D. Johnson on IM, the metrics are simple - either he's on the network or he isn't. On the other hand, if I wanted sports results it wouldn't matter if 90% of all news sites got knocked off the Internet - the remainding 10% would do. Networks are also self-specializing, you get on a network because your friends/coworkers/contacts are there. You have also issues like multi-networking (MSN and Yahoo and ICQ instead of DSL or cable), non-uniform network access (e.g. phoning cheaper to phones on your own network, but you can reach any phone), if the network creates cliques (e.g. MMORPG and guilds) or plain interconnects (e.g. phone networks), lock-in like the work put into a myspace profile and so on.
This in one assertion about another assertion where I can guarantee that the answer is far more complicated than either, or that the actual assertion is only applicable in theoretical textbook situations. Most that operate such a network aren't really that interested in quanitfying how important - they're way too busy trying to find out how to grow instead. The rest fo the time, you need to look at much more of a business case to evaluate how much it's worth - the network size is just one factor.
Live today, because you never know what tomorrow brings
The number of lines to connect 3 dots = 3*(3-1)/2 = 3
The number of lines to connect 4 dots = 4*(4-1)/2 = 6
The number of lines to connect 5 dots = 5*(5-1)/2 = 10
The number of lines to connect n dots = n*(n-1)/2 = Hey, let's just use an approximation this time.
The analogy works in social networks too. If your social network consists only of people of the same group (like, say, you're in some sect that only allows contact with others from the sect), your group may be huge but it still offers little insight or input, or "value". If your network consists of people with a very narrow field of expertise, the value is quite limited too.
If your network consists of few people who are a source of tremenduous insight, every single one of them, your network is small but its value is incredible.
We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
I sort of see the key insight of Briscoe, Odlyzko, and Tilly that, if you are going to pull a function out of your ass, it makes more sense if the differential of the function flattens out rather than slopes linearly upwards forever, because there is ultimately a decreasing value of each connection as the number of connections increases.
So they were correct to pull a log function of of their ass, but they could have just as easily pulled out n*ln(n) or some other base. They made no attempt to "calibrate" the model.
A good insigt is this quotation:
"Iif Metcalfe's Law were true, it would create overwhelming incentives for all networks relying on the same technology to merge, or at least to interconnect. These incentives would make isolated networks hard to explain. Consider two networks, each with n members. By Metcalfe's Law, each one's value is on the order of n 2, so the total value of both of these separate networks is roughly 2n 2. But suppose these two networks merge. Then we will effectively have a single network with 2n members, which, by Metcalfe's Law, will be worth (2n)2 or 4n 2--twice as much as the combined value of the two separate networks.
Surely it would require a singularly obtuse management, to say nothing of stunningly inefficient financial markets, to fail to seize this obvious opportunity to double total network value by simply combining the two."
Inflating these "synergies" was exactly what led to the Bombing Off of the Bubble.
Original BOT paper:
http://spectrum.ieee.org.nyud.net:8080/print/4109
Give a man a fish and you have fed him for today. Teach a man to fish, and he'll say "WHERE'S MY FISH, YOU IDIOT?"
there is no difference between n(n-1)/2 and n^2 you fool. the only difference is a constant since n(n-1) ~ n^2. since metcalfe didn't include a constant of proportionality in his theory this makes no difference, the point is about asymptotics.
there is a difference in your numerical example only because it is misconceived.
my password really is 'stinkypants'
Um. That's not how Big-O notation works. O(n(n+1)/2) is the same as O(n^2). Constant terms don't matter. So your n log n might as well be 10n log n. Or n ln n. Or whatever. You can't plug in your n into the function and expect a useful number out of it. Big-O growth terms describe how quickly it grows, not what a particular value is.
The point Metcalfe was making is that after a certain point, the cost of the network grows linearly but the value grows more than linearly (he suggested O(n^2), this paper says O(n log n), same difference). These functions all start under linear growth, cross it, and continue to grow faster. So until you meet that crossing point, there is no business case. But your functions tell you nothing about where that crossing point is - that's real-world data.
Otherwise we'll see articles about "Metcalfe's Law Disproven! It actually grows at 2n^2 - 4n + C" or whatever hyperaccurate nonsense.
I wish I could remember the name so as to give proper credit, but someone pointed out that Metcalfe may have severely understated matters.
The option value of a network depends on how many groups can form using it. Every time a thousand specialized message boards like "people who audit for security in CUPS on Solaris" form, the network becomes more valuable. The number of possible groups is easy to calculate. A group can be represented as a bitmap with as many bits as there are endpoints, each bit representing whether the node is a member or not.
The number of possible groups, each of which adds some sliver of value, is then 2^N.
I don't believe that line of argument(does the value of the Internet really double when one more person joins AOL?) but it does point to interesting lines of thought.
The definition of "value" is a huge problem too, but others have covered it well in this discussion.
Yes, actually that was the whole point of the article. They assumed the values followed a particular distribution (in this case, zipf, one of the most fun to say distributions...) and then derived the "law" based on that, so that's where the nlogn thing comes from.
Now their assumed distribution might be a little naive, and certainly seems to have been chosen because it results in a relatively clean derived formula, but that's academic. This is a refinement of the orginal observation. A higher-order approximation if you will.
Can you be Even More Awesome?!
And a car wouldn't get very far if all it's wheels spun clockwise.
It's called NASCAR, and you're right: the race tends to tend exactly where it began.
Back in my day, it was really hard to come up with a "Law." You had "hypotheses" (otherwise known as wild-ass guesses), "theories" ("well we tested it and it *looks* right") and after those theories have been tested over and over again without failing ONCE... you called it a "Law." It is a testament of today's liberalization that laws are passed without any supporting evidence for the hypothesis. This apparently now applies to science as well as politics.