Slashdot Mirror


Google Cans Comment Spam

fthiess writes "Comment spam is in many ways even more annoying than regular email spam, since you generally have to do more than just hit the delete button to get rid of it. Its defining characteristic is that spammers abuse websites where the public can add content (blogs, wikis, forums, and even top referrer lists) to increase their own ranking in search engines. It seems, however, that the days of content spam are numbered: today Google announced that, in partnership with MSN Search and Yahoo!, that they have implemented a way to block content spam." (More below.)

"Briefly, you just change your blogging/wiki/forum/etc. software so that any hyperlinks in publicly-contributed text have a new rel=nofollow attribute added to any anchor tags. Google, MSN, and Yahoo! will now no longer index any such links, so the motive for content spamming disappears. Especially hopeful is the fact that a slew of makers of blogging software, including Six Apart, have announced they are supporting the new attribute."

16 of 434 comments (clear)

  1. Re:It's one way... by the_pooh_experience · · Score: 4, Informative

    It is not a solution meant to change the content on a website (that would be tantamount to censorship). It only changes how the search engines handle the links (note: the supporters/developers of such a standard are search engine companies).

    The best question raised in this post is if such a tag is standards acceptable.

  2. Re:Band aid by arthurs_sidekick · · Score: 2, Informative

    The point is that the motivation to spam blogs rests on the assumption that posting links to one's site on blogs elevates the Google (Yahoo, MSN Search) rank for the sites linked to. Once that assumption is invalidated, the incentive to spam goes away. It should actually help quite a bit.

    --
    "Oh, I hope he doesn't give us halyatchkies," said Heinrich.
  3. Yes, if you use quotes by Anonymous Coward · · Score: 2, Informative
    Yes, the attribute rel is allowed in anchor elements in HTML. The value "nofollow" is not on the list of recognized types, but that's not so important since the value of rel can be anything.

    It's an interesting idea, but it's probably a matter of short order before MS starts to use this to cut out non-MS sites.

  4. Wikipedia by wikinerd · · Score: 4, Informative

    Wikipedia already implemented this feature. See here.

  5. Re:Is the result valid HTML/XHTML? by darkpurpleblob · · Score: 5, Informative

    Yes and yes.

    From the W3C:

    Links in HTML documents - The A element:
    rel = link-types [CI]
    This attribute describes the relationship from the current document to the anchor specified by the href attribute. The value of this attribute is a space-separated list of link types.
    Basic HTML data types - Link types:
    Authors may use the following recognized link types, listed here with their conventional interpretations. In the DTD, %LinkTypes refers to a space-separated list of link types. White space characters are not permitted within link types.

    These link types are case-insensitive, i.e., "Alternate" has the same meaning as "alternate".

    User agents, search engines, etc. may interpret these link types in a variety of ways. For example, user agents may provide access to linked documents through a navigation bar.

    ...

    Authors may wish to define additional link types not described in this specification. If they do so, they should use a profile to cite the conventions used to define the link types. Please see the profile attribute of the HEAD element for more details.
  6. Re:Is the result valid HTML/XHTML? by leoboiko · · Score: 2, Informative

    Too lazy to search, huh? Ok, I'll give up moderation and search for you :)

    And the answer, of course, is yes. "rel" attribute, valid for "a" and "link" element types. Take a look at the source of any Wordpress weblog and you'll see it being used for many things already.

    The caveat is that you should define a profile about the valid keywords you'll be using in "rel"; I don't know if Google is using a profile, but it's not mandatory.

    --
    Prescriptive grammar:linguistics :: alchemy:chemistry. Stop being a nazi and learn some science.
  7. Re:Opportunity for Firefox (plugin) by Anonymous Coward · · Score: 5, Informative

    Why not modify Firefox (or provide a plugin) that allows such links to be grayed out or otherwise marked specially?

    Actually, are there any plugins already in existence that modify the appearance of a link based on a regexp match?

    Let me introduce you to the wondeful world of userContent.css.

    Something like this should work:

    a[rel="nofollow"] {
    text-decoration: line-through ! important;
    border-bottom: dotted thin gray ! important;
    color: gray ! important
    }
  8. This solves only 1/2 of the problem by PeeAitchPee · · Score: 3, Informative

    While this will prevent spammers from bumping up their sites' Page Rank (probably their primary motivation for comment spam anyway), it doesn't prevent their bots from spamming targeted blogs etc. in the first place. That is still best handled by the blog software providers.

    For example, WordPress has a variety of different plugins for handling comment spam. The best one I've seen renders a series of characters graphically (a la TicketBastard) which the user (a human, of course) has to type into a text field on the comment form before their comment is accepted. Blogs implementing this type of mechanism typically have spam coming from bots drop down to zero.

  9. Re:Useful links by BohemianCoast · · Score: 5, Informative

    Links in the main body of the blog post will be fine. Blogs of course, have high page rank because bloggers comment on each other's blogs. This tag may have a side effect of generally reducing the page rank of blogs.

    As for useful links in comments; if they're really good sites, people are bound to blog about them more generally. And my poor blog gets few enough hits that it will be no problem for me to manually edit genuine comments to remove nofollow tags.

  10. Re:Band aid by Threni · · Score: 2, Informative

    > still they keep coming, year after year after year. Tell me that's rational.

    It's rational because removing the account from their spam lists won't make them any less profitable, but removing even 1 live address might, so all things being equal (no limit on the number of addresses being spammed, not much cost difference in a list of email addresses which is this rather than that big) you might as well leave them in.

  11. Use CSS by JamesHenstridge · · Score: 3, Informative

    It is pretty easy to make rel="nofollow" visible to normal users too in modern web browsers using CSS. You could use something like this:

    a[rel="nofollow"]:before {
    content: url(an-image-representing-nofollow-links.png);
    }

    That will display the given image before any links marked as nofollow.

    1. Re:Use CSS by Anonymous Coward · · Score: 1, Informative

      No but it does work in Firefox.

  12. Re:It's one way... by HTH+NE1 · · Score: 2, Informative
    but human readers won't see the NOFOLLOW tag - and they'll click.

    They will if they put
    a[rel=nofollow]:after { content: " [NOFOLLOW]"; }
    in their client-side stylesheet, or the blog owner puts it in the site's stylesheet. I do similar things to put "[PDF]" after PDF links and "[reg]" after nytimes.com links.
    --
    Oh, say does that Star-Spangled Banner entwine / The myrtle of Venus with Bacchus's vine?
  13. Re:It's one way... by HTH+NE1 · · Score: 2, Informative
    Or even
    a[rel]:after { content:" [rel=" attr(rel) "]" }
    if you're generally curious for what/if people use the rel attribute on anchor tags.

    There's lots of power you can exert over the appearance of web pages through your client-side stylesheet.

    If only there were a way to restrict a set a rules to particular sites, or that you could trust sites to put ID attributes on their BODY tags to uniquely identify their pages to the world, even just the domain name (substituting some other character for the dots).
    --
    Oh, say does that Star-Spangled Banner entwine / The myrtle of Venus with Bacchus's vine?
  14. Largest company in the world? What the hell? by Anonymous Coward · · Score: 1, Informative

    You clearly have no idea what you're talking about. Microsoft is not even a large company. What do they have, 10,000 employees or something? IBM has well over 300,000. IBM takes in way more revenue than Microsoft, too. There are companies that employ more people than medium-sized governments. Microsoft is not one of them.

  15. Re:No, no, they should BLOCK all BLOGS by default. by ReelOddeeo · · Score: 2, Informative

    This new tag will not restrict blogs from comming up in search results. It only restricts the Spamvertised sites from search results, not the blog with the spam links.

    --

    Those who would give up liberty in exchange for security and DRM should switch to Microsoft Palladium!