Slashdot Mirror


Google Cans Comment Spam

fthiess writes "Comment spam is in many ways even more annoying than regular email spam, since you generally have to do more than just hit the delete button to get rid of it. Its defining characteristic is that spammers abuse websites where the public can add content (blogs, wikis, forums, and even top referrer lists) to increase their own ranking in search engines. It seems, however, that the days of content spam are numbered: today Google announced that, in partnership with MSN Search and Yahoo!, that they have implemented a way to block content spam." (More below.)

"Briefly, you just change your blogging/wiki/forum/etc. software so that any hyperlinks in publicly-contributed text have a new rel=nofollow attribute added to any anchor tags. Google, MSN, and Yahoo! will now no longer index any such links, so the motive for content spamming disappears. Especially hopeful is the fact that a slew of makers of blogging software, including Six Apart, have announced they are supporting the new attribute."

21 of 434 comments (clear)

  1. It's one way... by freitasm · · Score: 4, Interesting

    It certainly will help filtering some of the spam sites out of Google rank and so on, but the links will still be there in blog comments, bulletin boards, etc. The Googlebot will not follow the links, but human readers won't see the NOFOLLOW tag - and they'll click. It means that moderators still have manual work to do.

  2. Re:A gift to Microsoft by ch3 · · Score: 4, Interesting

    Well, since MSN Search seems to apply the same policy as Google it would do them no good either.

  3. Is the result valid HTML/XHTML? by maxwell+demon · · Score: 5, Interesting

    Does HTML/XHTML allow "rel" attributes on links? And if so, is "nofollow" an allowed value for that tag?

    --
    The Tao of math: The numbers you can count are not the real numbers.
    1. Re:Is the result valid HTML/XHTML? by joeljkp · · Score: 2, Interesting

      Yes, this appears to be valid. I can't find the part of the actual spec, but w3schools' XHTML reference lists it as an acceptible attribute to <a>.

      "rel" is short for "relationship" - it can contain values like "previous", "next", "contents", "index", etc.

      --
      WeRelate.org - wiki-based genealogy
  4. Re:A gift to Microsoft by It+doesn't+come+easy · · Score: 1, Interesting

    Worse...how about if a competitor hacks a site and adds the tag to their competition's links...that would let the evil competitor up their Google page rank in a reverse sort of way...

    --
    The NSA: The only part of the US government that actually listens.
  5. Now if only... by deltwalrus · · Score: 4, Interesting

    Slashdot could implement something like this, it would make article comments meaningful again.

    --
    --- "When I think back on all the crap I learned in high school, it's a wonder I can think at all..."
    1. Re:Now if only... by PetiePooo · · Score: 5, Interesting

      Slashdot could implement something like this, it would make article comments meaningful again.

      They could even selectively add or omit it based on the comment's moderation. Include the nofollow tag by default, but if a comment with a link in it is moderated highly, remove the tag so search engines can use it. Sounds like the best of both worlds..

  6. Band aid by eddy · · Score: 4, Interesting

    I'm not really into blogging so I don't know how big of a problem this is. I get some spam in my guestbook, which I promptly remove. The spam iteself is what's really irritaing, not the potential "elevating" of the spamvertised site in search-engines, where I've never personally run across one that I can remember.

    Am I correct in assuming that these sites pops up and down relatively often? Maybe it'd be possible to use temporal component to the rating. Say if the link points to a site which was just registered two days ago, it's given a very very low weight, and then you ramp up as time goes by. As spam gets deleted from blogs and guestbooks, time would work against these spammers. Or? I dunno.

    --
    Belief is the currency of delusion.
    1. Re:Band aid by eddy · · Score: 2, Interesting

      the incentive to spam goes away.

      For a sane rational person maybe, but that's not how spammers work.

      A quick example: For the last four years I've been getting spam to the account "stef" at my domain.

      There is no account "stef" on my domain, and there haven't been for at least four years (previous owner I guess). So mail is rejected at RCPT TO.

      ... still they keep coming, year after year after year. Tell me that's rational.

      Not every opportunity will disappear with this new link-attribute, and so there's still reason for spammers to spam. After all, it's really no effort on their part.

      --
      Belief is the currency of delusion.
  7. It's a pagerank question, not indexing by Underholdning · · Score: 2, Interesting

    Google, MSN, and Yahoo! will now no longer index any such links
    Not quite. What happens is, that the link wont add anything to the site in question. As you probably all know, most search engines rank pages by incoming links - it's not just google. By adding this tag, the incoming link wont count.
    I think this is a great idea. It will probably break the w3c compliance, but hey - anything to piss off a spammer.

  8. Opportunity for Firefox (plugin) by Schweg · · Score: 4, Interesting
    Why not modify Firefox (or provide a plugin) that allows such links to be grayed out or otherwise marked specially?

    Actually, are there any plugins already in existence that modify the appearance of a link based on a regexp match?

    1. Re:Opportunity for Firefox (plugin) by Catiline · · Score: 3, Interesting
      Actually, you probably want to use
      a[rel~="nofollow"]
      instead of just an equals. There are other defined values for the rel attribute and you don't want to have your CSS miss this just because it contains more than just the "nofollow" tag.
  9. rel="nofollow" on Slashdot based on karma? by Anonymous+Cowherd+X · · Score: 2, Interesting

    Will this be implemented on Slashdot as well? Perhaps those with karma lower than neutral would get a rel="nofollow" tag added to the URLs they post?

  10. Google vs. God... by PHPgawd · · Score: 1, Interesting
    it seems, however, that the days of content spam are numbered: today Google announced that...
    ..that they are infinitely wise, all-knowing, and all-powerful. They HAD to announce this since this is the ONLY way they are going to end search engine spam. What, do they think the spammers are stupid are and going to roll over and play dead? Oh that's right, that's what they have been doing with email spam. A few sendmail filters and poof, no more spammers!

    I'd LOVE to hear somebody explain exactely why they are not theoretically screwed here. As near as I can tell, no matter what they do, people are still going to be able to make the "miserable failure" trick work, and if they can do that, all of the spammers can/will end the usefullness of search engines as we know it.

  11. Re:Only one problem. by miu · · Score: 2, Interesting
    There is something of an incentive here though, spammers seeking to elevate their pagerank will focus on sites that give them the best return. When large ISPs shut down open relays the spammers went after small ISPs, because of the increased attention from spammers the small ISPs shut down their open relays.

    So if the big blogs use the attribute then spammers will go after the slow to upgrade folks, in self defense most of them will upgrade eventually.

    Really even for a custom designed visitor book or blog it is not that hard to add the attribute to every hyperlink in user comments. Most such programs already do mangling and vetting of submitted html.

    --

    [Set Cain on fire and steal his lute.]
  12. Re:Good by godlikenerddotcom · · Score: 2, Interesting

    There is plugin for WordPress. However, the problem is that now even legitimate comment links won't have an effect, which is going to skew Google's results to favor only story links. I'm not sure we appreciate the full ramifications of this quite yet.

  13. Re:A gift to Microsoft by Anonymous Coward · · Score: 1, Interesting

    If your competitor can 'hack' your website then you have more to worry about then them adding 'nofollow' attributes to it, so i don't see why this would be an issue

  14. Much better solution - spamassassin by EJB · · Score: 2, Interesting
    Why doesn't Google use spamassassin or another spam filtering tool? One based on Bayesian analysis - to determine if a page is just spam, and give it a lower score.
    This can be done whether it is linked in a blog or not, and will improve the overall quality of the search database.

    - Erwin

  15. No, no, they should BLOCK all BLOGS by default. by doublem · · Score: 1, Interesting

    You're right. This "solution" will do about as much as against Comment SPAM as recent US Federal Legislation has done for e-mail SPAM.

    The BLOGS that collect this kind of SPAM in their comments are being run by people who either don't know how to, or don't care enough to update to the most recent version of, for example, MovableType. This "solution" requires action on the part of the people hosting the BLOG, something that I can guarantee will not happen with the idiots who can't even be bothered to take the existing rudimentary steps necessary to limit BLOG SPAM.

    Anything that requires action on the part of the people administering the BLOG will fail to make an impact, plain and simple. How many MovableType BLOGS are out there with literally pages upon pages of SPAM comments? How many of them are EV ER edited, moderated or subjected to a little house cleaning?

    No, a far better solution would be to NOT index ANY BLOGS unless the bloger take action, such as adding something a SPAMMer can't inject with comment SPAM, like a specific Metatag in the document Head.

    I hear a bunch of people wine about how this will end up restricting BLOGs from coming up in search results. Fine, I'm OK with that. If the people administering the BLOG can't be bothered to take action to reduce BLOG SPAM, then the site is unlikely to contain anything I'd be interested in anyway.

    --
    "Live Free or Die." Don't like it? Then keep out of the USA
  16. Re:This is open to severe abuse by jaydonnell · · Score: 2, Interesting

    and if you trade links you now have to check that people aren't using nofollow links

  17. Incredibly Off Topic by SirTalon42 · · Score: 2, Interesting

    "(an example here)"

    Wow! Thanks for that link! That site is awesome! It's amazing what you can accomplish using pure CSS magic!

    Too bad IE still doesn't support all of CSS1 even :-(. So much stuff you can't do without making it look broken to IE users (though I guess you could check the user agent string via PHP and modify the page based on that...

    Again, thanks for the link! Everyone whos into webdesign should be forced to read that site before they start ruining the Internet.