Wikipedia Mining Algorithm Reveals the Most Influential People In History
KentuckyFC writes: 'In 1978, the American researcher Michael Hart published The 100: A Ranking of the Most Influential Persons in History, a book that became an international best seller. Since then, various others have published similar lists. But all suffer the same drawback: they are subjective list ultimately influenced by numerous cultural factors. Now data scientists have come up with a way to extract an objective list of the 100 most influential people in history using the network of links between biographical articles on Wikipedia and how they vary between 24 different language editions, including English, Chinese, Russian Arabic and so on. The researchers assume that people who are highly ranked in different language editions are influential across both language cultures and that the more appearances they make in different language editions, the more influential they are. But the actual ranking is done by PageRank-like algorithms that consider a biographical article important if it is pointed to by other important articles.
The resulting lists of the most influential men and women might surprise. The top PageRanked individual is Carl Linnaeus, the 18th century Swedish botanist who developed the modern naming scheme for plants and animals, followed by Jesus. The top PageRanked women are: Elizabeth II followed by Mary (mother of Jesus). For comparison, just under half of the top 100 most influential also appear in Hart's 1978 book. But this is just the beginning. By counting the individuals from one culture that influence other cultures, the team is able to work out which cultures have dominated others. And by looking only at people born before certain dates, they can see how the influence of different cultures has waxed and waned throughout 35 centuries of recorded history.'
The resulting lists of the most influential men and women might surprise. The top PageRanked individual is Carl Linnaeus, the 18th century Swedish botanist who developed the modern naming scheme for plants and animals, followed by Jesus. The top PageRanked women are: Elizabeth II followed by Mary (mother of Jesus). For comparison, just under half of the top 100 most influential also appear in Hart's 1978 book. But this is just the beginning. By counting the individuals from one culture that influence other cultures, the team is able to work out which cultures have dominated others. And by looking only at people born before certain dates, they can see how the influence of different cultures has waxed and waned throughout 35 centuries of recorded history.'
subjectively titled ...
Given those of us the world calls "nerds" seemingly have a weakness for championing the lesser-known, and given that nerd-driven edits are a disproportionately large percentage of Wikipedia edits... it's not surprising someone like Linnaeus has the top spot.
Really, the biggest surprise isn't that Linnaeus outranks Jesus - it's that Jesus managed to outrank Joss Whedon.
#DeleteChrome
An interesting study, but nothing about the rankings has anything to do with measuring being 'influential'.
What do you expect of a man that practically nobody had heard of until centuries after his death?
--- Most topics have many sides worth arguing, allow me to take one opposite you.
It's weird that Jesus made the list but Batman didn't. In fact, he was the only fictional character mentioned.
You want to know why Carl Linnaeus is on top of that list? Every Wikipedia article about an Animal or a Plant has an infobox, containing their binomial name. And the person who got to name the animal or plant is linked in said infobox. Since Mr. Linnaeus basically created the binomial nomenclature, he named thousands upon thousands of species. Thus, he is linked from thousands upon thousands of articles about all kinds of animals and plants. Here's a random example. Notice the "L." at the bottom of the infobox. So, basically, Mr. Linnaeus is being Google.. ahem, Wikipedia-bombed.
that he invented the classification system for organisms.
And there are a _LOT_ of stub articles for the Lesser Spotted Garden Slimy Thing, that link to 'biological classification' and hence Carls page. (can you tell I can't spell his second name?
Correct me if I am wrong, but even from the summary I get a strong suspicion this "research" is heavily flawed. I mean, the only way for "Carl Linnaeus" would be on the top spot would be if you blindly applied a sort of page-rank algorithm forgetting to only include non-standardized parts of pages. A significant percentage of Wikipedia pages on all languages are about the various species of plant or animal life, all of which have a stub which contains the link to "Scientific classification" perhaps also to "Binomial name", both of which feature Linnaeus prominently.
It reminds me a spider my boss had built to get a few thousands of pages to construct a word frequency list, and I had to point out that it needed some work, since words like "print", "home" etc were not in the top-5 most common words of the English language.
Violence is the last refuge of the incompetent. Polar Scope Align for iOS
Want to guess how I know that you didn't RTFA? The main article even has a photo of him!
On the overall ranking, Hitler is ranked #5 after Carl Linnaeus, Jesus, Aristotle, and Napoleon
For the 2DRank (places emphasis on outgoing links as well as incoming) he's #1
Michael Jackson, and Hitler?
What utter bullshit!
This is like mining Facebook to decide who the best rock band ever was! Think there's any bias?
My vote goes to Gutenberg. You want to talk about inflection points in human knowledge? Gutenberg, and then Tim Berners-Lee.
Jeff
there are 3 kinds of people:
* those who can count
* those who can't
Actually the only piece of actual evidence of the existence of Christ as a real person is an entry in the histories written by the Jewish historian Josephus. Those histories are not originals of course, in fact they are all copies made by the Catholics. There are no shortage of changes made by those copiers including additions and changes from the original text that have been detected by comparing various copies that were altered in different ways. The only thing that makes people who know what they are talking about say Jesus probably lived vs probably did not live is the subjective opinion of a few scholars (mostly theist scholars) who studied the passages in question and subjectively think they seem like the style of Josephus.
Given the thinking of the day it would be the most natural thing in the world for a Monk to "correct" a "mistake" in a history that failed to record the trial and death of Jesus and to patch it up with the details from their bible.
if you're not a Christian. If you don't believe he was the son of god then he was just preacher whose particular sect took off. The Roman Emperor that convertered to Christianity after being 'saved' is the real power behind Christianity...
Hi! I make Firefox Plug-ins. Check 'em out @ https://addons.mozilla.org/en-US/firefox/addon/youtube-mp3-podcaster/
I know who Michael Jackson is. I can't say he influences my actions very much. I know his name. I don't actually know anything about him. I know who Napoleon is. I can't say that he influences my actions much either. Etc etc etc..... By their methodology, I just name dropped these two guys. Big whoop. Doesn't make them influential.
I would have to agree. I think that Linnaeus has gamed the system a bit. Every (or at least most) Wikipedia articles about a plant or animal species would have a link to back to Linnaeus or his nomenclature system. While he was certainly a notable scientist, he was in no way as influential as most of the others on the list. Perhaps I should change my name to "Citation Needed" so I would be the most influential person in history (according to this methodology).
He gamed the system more than that... Every Wikipedia article about a species contains a link to whoever named that species. And Linnaeus named a lot of species, something close to 10,000! He had a good head start on everyone else seeing as he came up with the naming system. He especially named pretty much all of the species that have the most "mindshare", the same ones that now have long and highly ranked Wikipedia articles.