Slashdot Mirror


Wikipedia Adds No Follow to Links

netbuzz writes "In an attempt to thwart spammers and search-engine optimization mischief, Wikipedia has begun tagging all external links on its site "nofollow", which renders those links invisible to search engines. Whether this is a good thing, a bad thing, or simply unavoidable has become a matter of much debate." This topic has come up before and the community voted to remove nofollow back in 2005. This new round of nofollow comes as a directive from Wikia President, Jimbo Wales.

37 of 264 comments (clear)

  1. Wikipedia and Internet-Topology by P(0)(!P(k)+P(k+1)) · · Score: 4, Insightful

    From TFA:

    Although the no-follow move is certainly understandable from a spam-fighting perspective, it turns Wikipedia into something of a black hole on the Net. It sucks up vast quantities of link energy but never releases any.

    The situation is a classic tragedy of the commons: does the interest of malificent spammers outweigh Wikipedia's rôle as a semantic mediator between alien but related nodes?

    Should Wikipedia transition to leaf from cut-point, it may have significant and unforeseen effects on internet-topology.

    1. Re:Wikipedia and Internet-Topology by spun · · Score: 4, Insightful

      Read the wiki article you link to. The tragedy of the commons only applies to unmanaged resources. Wikipedia is a communally managed resource, so the analogy is less than apt. Your speculation regarding the impact of a no-follow wiki on the rest of the Internet is interesting, though.

      I bring up the point about the Tragedy of the Commons because the parable has been used as an excuse to privatize communally managed resources, when such resources do not fall prey to the Tragedy. Reasoning such as yours could be used to justify the 'privatization' of wikipedia, turning it into an experts-only publication where the public has no input. This would be as bad a misapplication of the lessons of the Tragedy parable as it is when governments and industry collude to privatize such things as water cooperatives, which are public but managed resources and not vulnerable to the Tragedy at all.

      --
      - None can love freedom heartily, but good men; the rest love not freedom, but license. -- John Milton
    2. Re:Wikipedia and Internet-Topology by rossifer · · Score: 4, Insightful
      Should Wikipedia transition to leaf from cut-point, it may have significant and unforeseen effects on internet-topology.
      Wikipedia will remain a node-cluster in the larger web. The only difference is that for Google ranking, they no longer contribute to the ranking of outside websites. This will not stop people from putting relevant external links on Wikipedia pages, it just reduces the benefit to the linked site.

      In my experience as a forum webmaster, there is simply no other choice. Any place where the unverified public can put up links, spammers will put up links to their crap, which do more than just use your resources for their ends. If Google notices that your site seems to have become a spammer link-farm, you're entire site will very likely be removed from Google, with all of the bad mojo that entails. So, any page where the unverified public can put up links, those links must be "nofollow", or else...

      Personally, I'm astonished that Wikipedia hasn't done this from the beginning.

      Ross
    3. Re:Wikipedia and Internet-Topology by David+Gerard · · Score: 5, Interesting
      "If somebody were really intent on "overgrazing" wikipedia, automated troll-bots would have no difficulty spewing crap all over it faster than the community could work to revert it. I'll be honest, I'm surprised I haven't seen more if it already."


      You will be utterly unsurprised to know this happens already ...

      In general, any obvious objection to the idea of a wiki encyclopedia already happens and is already dealt with day to day. We have a ridiculous array of spambots and vandalbots already attacking Wikipedia and trying to turn it to their use, never mind our work trying to write an encyclopedia. So we have an EQUALLY ridiculous array of antivandalbots to deal with these things as needed. Our immune system is quite frightening to contemplate at times ...
      --
      http://rocknerd.co.uk
    4. Re:Wikipedia and Internet-Topology by spun · · Score: 5, Insightful

      Why don't you read the original essay? The tragedy of the commons happened when a common ground was abused because no effective method of management was in place to ensure the integrity of the common land. That method of management could be a single stakeholder, or a communal system of management. The original essay was very clear in regards to the fact that there is more than one way to effectively manage a resource.

      Now, we could argue all day as to whether the system of management wikipedia has in place is effective or not, but we cannot argue that it has such a system. Imagine, would there be a tragedy of the commons if everyone felt free to simply kill all the cos of the offenders? If there weas, it would certainly be a different tragedy. That is akin to the management system of wikipedia. No overgrazing because any one person can nuke every single cow on the planet, and any other person can resurect every dead cow on the planet.

      An experts only publication would not be a bad idea. Why don't you start one up and tell me when you get say 1/1,000 the number of articles wikipedia has, or 1/10,000 the readers. But don't do it to wikipedia, start your own. Wikipedia already has a system that works well enough. Sorry if you don't like it, but in this free market of ideas, enough people find it useful, as is, to make it one of the most popular sites on the Internet.

      --
      - None can love freedom heartily, but good men; the rest love not freedom, but license. -- John Milton
    5. Re:Wikipedia and Internet-Topology by David+Gerard · · Score: 4, Interesting
      "Personally, I'm astonished that Wikipedia hasn't done this from the beginning."


      All the Wikipedias other than English have had this in place already. It's just that the flood of spammers has been so bad on English Wikipedia we've finally had to put it on there too.

      --
      http://rocknerd.co.uk
    6. Re:Wikipedia and Internet-Topology by Lazerf4rt · · Score: 5, Insightful
      ...outweigh Wikipedia's rôle as a semantic mediator between alien but related nodes?

      False premise. Wikipedia is not a "semantic mediator between alien but related nodes". Wikipedia is just a free encyclopedia.

      The only reason why an external link should be placed in Wikipedia is because that external link is already significant in some way. Wikipedia does not exist to make those external links any more significant than they already are. It seems to me that is the essential point of the Wikipedia policy, Wikipedia is not a soapbox.

      So, since there is no such "tragedy of the commons", Wikipedia is free to tag their links "nofollow" if they want to. If it raises Wikipedia's search results over the external links in Google, good for them. That's the way it should be. These bloggers who nitpick about Google PageRanks 24/7 strike me as a bunch of whiners, frankly.

    7. Re:Wikipedia and Internet-Topology by danpsmith · · Score: 5, Insightful
      Oh the horror. Just imagine if Wikipedia was written only by people who knew what they were talking about. Terrors like that keep me up at night.

      I'm sick and tired of this particular beef with wikipedia. Just because you can't quote wikipedia in your thesis for your doctorate doesn't mean its useless. If you want reliable source material look elsewhere, if you want an exorbitant quantity of information, Wikipedia has that. It's the quick and dirty resource for people who might just need to know a few things about a subject without having to fact check and such. That's what it should be treated as. The fact that non-experts are allowed to edit entries is what made it grow to be the resource it is today.

      If some of the information is inaccurate, so what? It's not like heart surgeons are looking up how to conduct an operation on Wikipedia. People need to stop beating on its potential for inaccuracy and instead see it as what it is, a great resource for learning about topics or at least a starting point given no other resources. The Internet as a whole tends to have a large amount of inaccurate information, but that doesn't make the Internet useless. The quantity of information largely and fully outweighs the risk of inaccuracy. Everything has inaccuracies anyway, and Wikipedia's usefulness makes any mistakes it has well worth the benefit of having it versus not having it. It's a mighty powerful resource, and I'm tired of hearing it bashed just because some random vandal could and sometimes does screw up a few entries (even though they are usually fixed in a pretty timely manner). It's an online resource, take it for what it is and quit bitching about how one entry out of 10,000 is inaccurate, and just be thankful you have the 10,000 entries. Or better yet, just don't use it if you find it offensive.

      --
      Judges and senates have been bought for gold; Esteem and love were never to be sold.
    8. Re:Wikipedia and Internet-Topology by David+Gerard · · Score: 4, Informative

      What Jimbo did was remove his previous objection to nofollow, rather than dictate its presence. Slight distinction.

      --
      http://rocknerd.co.uk
    9. Re:Wikipedia and Internet-Topology by hotdiggitydawg · · Score: 3, Insightful

      These bloggers who nitpick about Google PageRanks 24/7 strike me as a bunch of whiners, frankly. Amen, brother. Want to boost your PageRank? How about getting some decent f***ing content for a change...
    10. Re:Wikipedia and Internet-Topology by DragonWriter · · Score: 4, Insightful
      There appears to be a neverending conveyorbelt of excuses for Wikipedia's acute failure to be authoritative or reliable.


      A encyclopedia will, even if written by experts, rarely be either authoritative or reliable. It will be at best a rough, selective summary, and usually one which misses much of what is current. An encyclopedia is, at best, a good starting point.

      Let me make it clear - if Wikipedia is not reliable, but full of half-truths and errors, then its not simply useless, it's potentially dangerous.


      Only if misused. The average library is full of half-truths and errors, and yet no (sane person) says libraries are dangerous. If you are using information you cannot directly evaluate from Wikipedia for any important use, you should be checking the sources cited (and discarding the information if it isn't cited), and evaluating the credibility of those sources and consulting them more fully.

      An information resource that is unreliable as history IS PROPAGANDA.


      No, its not. It's only "propaganda" if its all written to advance the interests of the same faction. Otherwise, it might contain propaganda (and Wikipedia no doubt in some cases does.) But as a whole it is not a work of propaganda.

      You have no idea whether 1 out of 10,000 is inaccurate or 1 in 100 or 1 in 2. You have no idea whether the article has just been vandalized or whether key information is missing. I'm willing to bet that you don't expect surgeons make the same excuses that their work is generally acccurate apart from occasional slips which kill patients, or your college textbooks to have key formulae wrong after someone at the printers decided to improve an equation for the good of Mankind. It's the special pleading for Wikipedia that amazes me.


      A free tertiary reference source is neither a surgeon nor a college textbook. Applying the standards applicable to either of those is inappropriate. Also, its not a ham-and-cheese sandwich, so you shouldn't eat it and expect it to taste like one. It's not inappropriate "special pleading" to suggest that things which are unalike in kind from other things should not be evaluated by the standards applicable to the other, unlike, things.

      Wikipedia is, in practice, useful to me for things I care about (even though I have found, and corrected, errors.) I therefore think it has value, in many cases unique value for which no comparable resource of would offer a suitable, reasonable substitute.

      Is it perfect? No. Are there other tools which are better for some uses? Certainly. Is it as inappropriate as any encyclopedia as an ultimate source? Certainly.

      Is it valuable, and in some cases uniquely so? Yes, I'd say so.
    11. Re:Wikipedia and Internet-Topology by Rakshasa+Taisab · · Score: 3, Insightful

      I don't know if it has been discussed, but why can't you just set nofollow on new links, and let them ferment for a few weeks before removing it?

      --
      - These characters were randomly selected.
  2. Neither good nor bad. It's immaterial. by XorNand · · Score: 5, Insightful

    "nofollow" only exists because Larry Page and Sergey Brin had a (at the time) brillant idea of ranking webpages according to how many sites linked back to it... and now that method of determining relevance is broken. Prior to this innovation, most search engines relied upon META tags... which also eventually broke. Google is where it is today because they recognized that the web had evolved past META tags (and other techniques of self-describing content).

    My point is that the Internet as a whole souldn't be tripping over ourselves because Google's invention too is now obsolete. The "nofollow" attribute is just an ugly hack created to accommodate the frequently-gamed PageRank algorithm. We should instead find new ways to determine relevance. Hey, if your idea is good enough, you might even find yourself a billionaire someday too. Who knows, maybe the next wave will also wash away all those god-forsaken AdSense landing pages and domain squatters (oh please, oh please, oh please...).

    --
    Entrepreneur : (noun), French for "unemployed"
    1. Re:Neither good nor bad. It's immaterial. by RAMMS+EIN · · Score: 4, Insightful

      ``Google is where it is today because they recognized that the web had evolved past META tags (and other techniques of self-describing content).''

      More like meta tags never worked. Much better to judge the content of a page by...looking at the content. Only a fraction of pages included meta tags, anyway.

      --
      Please correct me if I got my facts wrong.
    2. Re:Neither good nor bad. It's immaterial. by Dan+Farina · · Score: 4, Interesting

      Actually this sort of flow model was well documented in IR, AI, and mathematic research for a period long before Google. While credit should be delivered for implementing this scheme in a world of already-entrenched search engines, it falls into the category of age-old computer science. This same scheme is also used to compute the final likelihood of states in Markov models -- a technique at least 30 or 40 years old.

      In a nutshell: the eigenvalues of the adjacency matrix.

    3. Re:Neither good nor bad. It's immaterial. by David+Gerard · · Score: 4, Funny

      My Wikipedia user page makes me the number one hit on "David Gerard" (with and without quotes) because I use it as the link when responding to blog quotes about Wikipedia (in my role as volunteer press contact). I finally beat the Dutch painter!

      --
      http://rocknerd.co.uk
  3. Re:Search Strategy by bluelip · · Score: 5, Informative

    RTFA - This only affects external links.

    Your method of searching wikipedia through google is safe.

    --

    Yep, I never spell check.
    More incorrect spellings can be found he
  4. Jimbo...who are the founders? by dreddnott · · Score: 4, Insightful

    While I don't necessarily disagree with the reinstatement of Wikipedia's nofollow policy, I do have to say one thing: Jimbo Wales is a tool.

    Yesterday, after reading and noting glaring inconsistencies in the Wikipedia articles and talk pages for Wikipedia, Larry Sanger, and Jimbo Wales, as well as Jimbo Wales' user page, I have lost a bit of respect for Wikipedia and a lot more for one of its cofounders. I can't believe he's trying to manipulate his encyclopedia project this way!

    --
    I may make you feel, but I can't make you think.
    1. Re:Jimbo...who are the founders? by AlexMax2742 · · Score: 4, Informative

      You are a fool if you think that the stupidity stops there: When Wikipedia gives sysop priviliages to batshit insane people like this guy, and he somehow managed to keep said privilages for as long as he did (the only reason he lost said priviliages is because he picked a fight with another abusive admin), you know that there is something fundamentally wrong with Wikipedia.

      Now if only someone can unprotect this article...

      --
      I'm the guy with the unpopular opinion
  5. pointless by nuzak · · Score: 4, Insightful

    Nofollow doesn't work if you just put the URL directly in the text, and google will treat them more or less as links (to the site at least, though possibly not the path).

    The way to fix this is with stable versions -- you don't let search engines see unstable versions at all. But having looked at the craptastic mediawiki codebase, I can sympathize with them not wanting to bother with adding such a major feature.

    --
    Done with slashdot, done with nerds, getting a life.
  6. Better for Google, not Wikipedia by fyoder · · Score: 3, Insightful

    This won't solve the problem, since humans may still follow the links, so it's still worthwhile for spammers to have links in Wikipedia. Even if it doesn't up their pagerank, Wikipedia can still serve them as a spam delivery system.

    However, it helps Google by not uping spammer's page rank. And less noise in the search results is good for the users of Google.

    --
    Loose lips lose spit.
  7. Idea for a New Search Engine with Unique Ranking? by MBraynard · · Score: 4, Funny

    How about creating a new Google-style Ranking system that only ranks sites based on the number of no-follow links heading towards them?

  8. Not invisible by truthsearch · · Score: 3, Interesting

    which renders those links invisible to search engines.

    Uh, not really. The big search engines choose to not follow those links.

    Using nofollow reduces the incentive for spammers, but in this case it will hurt search engines. Google wants to provide the most worthy links at the top of search results. Being linked from wikipedia is supposed to denote reliable sources or very relevant information. Therefore Google is slightly more accurate for having those links to follow in wikipedia. The nofollow will make search engines slightly less useful.

  9. Re:Search Strategy by shawb · · Score: 3, Informative

    I don't think this would really affect your search strategy. Wikipedia gets a high score on pagerank because so many site link to it. What spammers etc. have done is then alter existing Wikipedia articles to add links to their own sites. Since Wikipedia has a high pagerank, any links out from Wikipedia will be higher rated than from many other websites. Altering Wikipedia pages in this way allows spammers, spoofers, phishers, etc to get their pages ranked higher on Google. These alterations were probably done in the links section on the bottom, so wouldn't be directly followed by people visiting Wikipedia. Making the link too visible would also make it more prone to reversion by a benevolent Wikipedia user.

    I agree... when I want to look something up on Wikipedia I usually just do a Google search to find it if my initial search term doesn't come up with what I want. Chances are that it is a simple misspelling, as topics I am going to look up on Wikipedia are probably topics that I am not entirely familiar with. Google will then make suggestions based on it's vast knowledge (probably based on a dictionary created from crawling various web sites combined with data from what people followed from google after actually doing a search.

    --
    I'll never make that mistake again, reading the experts' opinions. - Feynman
  10. Wikia is not Wikipedia - please correct story! by David+Gerard · · Score: 4, Informative
    Jimbo is the president of Wikia and the founder of Wikipedia. These are separate and distinct roles.


    Speaking as a Wikipedia press volunteer, it's a goddamn nightmare keeping them separate in press perception. Because Jimbo is Mr Wikipedia, so even though Wikia is COMPLETELY UNASSOCIATED with Wikipedia, they keep conflating the two.

    I ask that Slashdot not perpetuate this. Jimbo asked this as the founder of Wikipedia and the Final Authority on English Wikipedia, and Brion (the technical lead and Final Authority on MediaWiki) switched it on.

    May I say also that we've been watching the spamming shitbags^W^WSEO experts bitch and whine about it, and it's deeply reassured us this was absolutely the right decision. We would ask Google to penalise links from Wikipedia, except the SEO experts^W^Wspamming shitbags would just try to fuck up each other's ranking by spamming their competitors.

    To the spammers: I commend to you the wisdom of Saint Bill Hicks: "If you're a marketer, just kill yourself. Seriously."

    --
    http://rocknerd.co.uk
    1. Re:Wikia is not Wikipedia - please correct story! by David+Gerard · · Score: 5, Insightful
      We're not 'reliable' and we don't claim to be. This is important: we don't save the reader the trouble of having to think when reading.


      Most of the complaints that 'Wikipedia isn't reliable' appear to be complaints that we haven't saved them the trouble of thinking. I have to say: too bad. It's useful or it wouldn't be a top 10 site. But it's just written by people. Keep your wits about you as you would reading any website. We work to keep it useful, but if you see something that strikes you as odd, check the references and check the history and check the talk page.

      Wikipedia does not save the reader from having to think.

      --
      http://rocknerd.co.uk
  11. Call this version 1.0 by victim · · Score: 4, Interesting

    This should be considered a step in an evolving policy. The next step should be that old links, ones that have survived many edits and time as well as links added or edited by known and trusted editors should omit the no-follow tag. Then wikipedia can continue to serve as an interpreter of the WWW.

  12. I doubt it'll stop wiki spamming by jesterzog · · Score: 4, Interesting

    I don't think this will do much to stop Wikipedia link spamming for several reasons:

    • Many spam links on Wikipedia aren't commercially motivated spam, but just people who've naively put external links in articles without properly understanding or caring about the editing policy. They're not thinking so much about search engines as about pointing people to their website (or their favourite website) because they think it's more important than it probably is. If it's a relatively obscure article, it might stay there for months or longer before someone goes through and reviews the links.

    • Wikipedia is only one of the websites that publishes Wikipedia content. There are lots of other sources that clone it, precisely as they're allowed to under the licence, and re-publish it. They usually add advertising to the content, or use it to lure people to some other form of revenue. These sites are easy to find by picking a phrase from Wikipedia and keying it in to a search engine like Google, and I doubt they'll add the nofollow attribute to their reproductions of the content.

      Wikipedia is probably treated as a more important source of links by search engines, but whatever's published on Wikipedia will be re-published in many other places within the weeks that it takes for the new content to be crawled and to propagate. And links on any Wikipedia articles will propagate too, of course.

    • Even if you ignore search engines, having external links from a well written Wikipedia article that gets referenced and read a lot is probably going to generate at least some traffic to a website. Wikipedia articles are often a good place to find good external sources, probably because they get audited and the crappy ones get removed from time to time. This is exactly what provides motivation for spammers to try and get their links added, though.

    Good on them for trying something, but I don't think it'll stop spammers very much.

  13. Let the search engines do this themselves by Bananenrepublik · · Score: 3, Insightful

    If this is of benefit to the search engine operators, then it should be simple enough for the search engine operators to follow or not follow external links from wikipedia, with or without NOFOLLOW. Wikipedia has a high enough profile that search engines already treat it differently from Average John's Incredibly Boring Blog, and they will know if it is of benefit for them to follow those links, without wikipedia putting some policy in place.

  14. Re:Idea for a New Search Engine with Unique Rankin by interiot · · Score: 4, Interesting

    Yahoo Mindset lets you search for sites that are more commercial, or more informational. Sites with the most nofollow incoming links may fit into the "more commercial" group. (by the way, does anybody know how Mindset actually works?)

  15. Overlooking the reason for this change by Distan · · Score: 5, Insightful

    I think the article noted that the last time this came up for vote by the community, the community voted it down. I think it also notes that this is something that Jimbo Wales dictated, and not something that went through the normal community approval process.

    Why?

    Why would Wales simply dictate this change be made?

    Because Wikipedia is a source of high-quality links. Editors have increasingly been making sure to put high-quality references in articles, mainly as links to other web sites. A single Wikipedia article can often contain links to the best websites related to that subject.

    So ask yourself why would Wales want to make those links private, and no longer harvested by Google.

    Is it that hard to figure out?

    If you still don't know, then ask yourself what business Wales has announced that he wants to pursue with his new for profit company, Wikia?

    Search Engines.

    In the words of Paul Harvey, now you know the REST of the story.

    1. Re:Overlooking the reason for this change by Bluephonic · · Score: 4, Insightful

      Sorry, that doesn't make sense. As other people have mentioned, nofollow is not a magic incantation that search engine crawlers have no choice but to obey. Google can do whatever it wants with any link (they could choose to completely ignore the nofollow attribute when it's on wikipedia pages, for example).

  16. Re:Solves the Wrong Problem by Short+Circuit · · Score: 3, Informative
    The reason for the wikispam is an SEO contest...a kind of contest where one wins by having the highest search engine rankings at the end of the contest. The contest mentioned specifically points to Wikipedia as a resource.

    So, to recap:
    • The spammers are participating in a contest.
    • To win the contest, you must have the highest rank in search engines.
    • Adding nofollow to links removes Wikipedia's value as a tool in raising one's pagerank, which removes it's primary value to the wikispammers participating in the contest.
  17. Could be a tax issue for Wikipedia by Animats · · Score: 3, Interesting

    Wales' behavior may be an issue for Wikipedia. If the same person is involved with a profit-making venture and a nonprofit in the same area, the tax status of the nonprofit becomes questionable. When a US nonprofit files their tax return, they have to list any officers or directors involved with profit-making ventures in the same field.

    The IRS is concerned because if you have a nonprofit and a for-profit organization under the same management, it's often possible to structure things so that the for-profit corporation shows a phony tax loss.

  18. Could they not do it smarter? by mi · · Score: 3, Interesting

    For example, auto-add the "nofollow" only to the links added in recent edits (for some definition of recent). Once a particular link was part of the page long enough (and survived other people's edits), it can be followed by the search engines...

    I, for one, contributed a number of wild-life pictures to Wikipedia, but am also selling them in my own shop. I don't think, it is unfair for me to expect links to my shop from the contributed images to be followed...

    --
    In Soviet Washington the swamp drains you.
    1. Re:Could they not do it smarter? by TheSpoom · · Score: 3, Insightful

      You might not, but I'd imagine several editors would. You released the images under the GFDL when you uploaded them to Wikipedia. That means they can effectively do whatever they want with them, including removing the advertisement in the caption. Not to be blunt, but if you don't like it, don't upload the images.

      --
      It's better to vote for what you want and not get it than to vote for what you don't want and get it.
      - E. Debs
  19. Re:uncommonly tragic? by PopeRatzo · · Score: 3, Insightful

    Maybe the problem is in the "metrics" themselves. Perhaps we should stop assigning a quality value to something because it has lots of links. It's no more valid than the notion that a piece of music is better than another because it has been played lots of times on the radio, or even because more people buy it.

    Which is why Wikipedia is a pretty good way to add value to the vast information of the Internet, using the collective judgment (and a little bit of enlightened meritocracy thrown in for good measure).

    --
    You are welcome on my lawn.