Slashdot Mirror


Spam Rapidly Increasing In Weblog Comments

dsurber writes "BBC News has a nice article discussing 'flyblogging', the phenomenon of spammers leaving advertising-related posts on personal weblogs. The writer comments: 'None of the other blogs I contribute to or run has been affected yet, but I can only assume it is a matter of time before the spammers move in, as they did first with UseNet and then with e-mail. It depresses me to think that any open medium can be so easily undermined by people with no scruples, no sense of responsibility and no idea of the damage they are doing.'" It seems a little surreal that people are having to develop anti-spam weblog tools.

28 of 387 comments (clear)

  1. Google? by The+Tyro · · Score: 2, Interesting

    How much truth is there to the statement that increased links equal increased google rank?

    This article implies that all these postings are an effort to stack the google rankings, in order to place spam sites near the top. I'm not a google wizard... is this actually a usable loophole in google's ranking system?

    --
    Even if a man chops off your hand with a sword, you still have two nice, sharp bones to stick in his eyes.
    1. Re:Google? by devphil · · Score: 3, Interesting


      How much truth is there to the statement that 2 + 2 = 4? A lot. Why? Because that's how it's defined to work.

      How much truth is there to the statement that increased links equal increased google rank?

      Uh, that's how Google documents it. That's how all of Google's employees define it. That's how everybody's experience pans out. Maybe they're all just making shit up with nobody ever calling them on it, but I'd argue for "that's actually how it works" myself. Try going to Google and clicking "About".

      is this actually a usable loophole in google's ranking system?

      Only if the log owners let the spam sit there long enough to be googled. If they do that, then my guess would be quite possibly yes.

      Maybe compile a list of such spammers, then a list of the advertised sites. I'd like a checkbox on my google searches that says, "Ignore results on sites whose page rank is mostly due to asshole tactics."

      --
      You cannot apply a technological solution to a sociological problem. (Edwards' Law)
    2. Re:Google? by Lagged2Death · · Score: 5, Interesting

      My hobbyist project was picked up by Google after a while, but it wasn't until I retroactively changed my comment signature here on Slashdot and on Kuro5hin (thereby creating many links to my project page) that it went to the top of the search results. It wasn't my intent to subvert Google in any way - I was quite surprised by the dramatic result.

      There have been some less-than-scrupulous advertising companies in the business of that publishing dummy machine-generated web pages to exploit this trick. The dummy pages were typically filled with repitions of some nonsense paragraph, with self-links (to other dummy pages) and client-sponsored links interspersed here and there. The idea was that the self-linking would make the site look like a large, legit site to Google, which would mark it as relatively well-trusted and influential. Then Google would dutifully note the client-sponsored links and rank them highly. I believe Google has worked on ways to stop this; I don't know how successful they've been, or if the dummy-site makers are still around.

    3. Re:Google? by Blimey85 · · Score: 2, Interesting

      Nearly useless? I thought it was just me but for the past couple of months I've been hating Google more and more. It used to be so easy to find what I wanted but I try a simple search for a name brand item and the manufacturer is the last page listing to come up it seems. What really irks me is how many listings show up for other search engines... or search services as they call themselves. It's a shame that Google has not done anything to solve this problem. Surely they could change things a bit to completely drop these so-called search services since they offer no real information and are just basically advertisements.

      --
      How is it that one careless match can start a forest fire, but it takes a whole box to start a campfire?
  2. Arms race in the making by jmerelo · · Score: 2, Interesting

    The arms race has just started: spambots becoming increasingly more sophisticated, and bloggers having to go to greater lengths to avoid spam.
    The root of the problem might be in the impact a weblog link has on google ranking. Spammers have taken note, and they're acting on it.

  3. It is natural by Zanek · · Score: 1, Interesting

    This is as natural as evolution. The spammers remind me of roaches. You can try to engineer
    them out of existance but life always finds a way.
    I dont hate them, but I do find this phenomenon curiously interesting with its parallels to life

    --


    Help pay for my wedding! Go to my kickass website
  4. I have a quick and dirty solution. by Ignorant+Aardvark · · Score: 5, Interesting

    Use the same type of human verification system that Yahoo uses when signing up for an e-mail account. If you can't type in the mangled letters in the image, then your post to the weblog is ignored. This would only be required for anonymous postings - if you're logged in, presumably you've already passed the human verification test upon account creation, so you don't have to go through the hassle each time you want to post.

    1. Re:I have a quick and dirty solution. by GeorgeH · · Score: 4, Interesting

      That's called a CAPTCHA, and James Seng wrote a Moveable Type plugin to do this with MT. CAPTCHA stands for Completely Automated Public Turing Test to Tell Computers and Humans Apart, you can read more in this story

      --
      Why can't I moderate something "Wrong" or at least "Grossly Misinformed"?
    2. Re:I have a quick and dirty solution. by madstork2000 · · Score: 2, Interesting

      I have a human verification engine that I use to "protect" web logs, email, and other public, but sensitive pages (like contact pages on my website). My image mangling uses base images that each have three sets of letters Red, Blue and Black, then forms random words out of 5-8 images. So to verify you have to enter either the RED, BLUE or BLACK word. If you entered it correctly you are allowed to post, send mail, look at the page, etc. There are no accounts to create, profiles, etc. I set it up to work as a module in the content management system I worte (which can accept POST NUKE plugins). Anyway, in the limited testing I have done thus far it seems to work, but since none of my sites have had a huge SPAMBOT problem, it is hard to tell. Though I am pretty confident they cannot read and understand the form, because as of now one of my bigger problems is getting people to properly read and recognize what to do. -MS2k

  5. Solution to the problem by Anonymous Coward · · Score: 3, Interesting

    1) Only allow people with verified accounts to post.
    2) With every post, display the advertising policy (buying an ad on the site is $5000)
    3) Make sure they confirm that if their message is an ad, they agree to pay the $5000
    4) Host their ad for them, and collect your money. Small claims is helpful here.

  6. I have already seen this with my blog by chrisgeleven · · Score: 2, Interesting

    It is a huge pain in the butt, especially considering that I have not found an easy way to mass delete comments with Movable Type yet...so I have to go to each comment individually and delete them.

    This past week alone I cleaned out about 20 spam comments.

    1. Re:I have already seen this with my blog by Alan · · Score: 2, Interesting

      Just go into the database (assuming you're running off of mysql) and run something like "delete from mt-comments where author = 'spammer@whatever'" or "where website = 'http://blahpornspam.com'". Run a rebuild afterwards and all spam will be gone. Make sure you run a select on the author = or url = to make sure you know what you're deleting. Note: the actual fields aren't correct I'm sure, run 'desc ' to get them first.

      I've seen the requests for a mass delete of comments in the support forums for MT as well, you're not the only one.

  7. I've seen far worse from spammers. by Rahga · · Score: 3, Interesting

    I've got a website.

    Last year, I closed my hotmail account and two spammed-to-heck e-mail accounts. To keep old friends and family from getting shafted, I had an autoreply attatched to those addresses, announcing that those addressess were closed and that I could be reached through the contact form on my website, prior to sending those e-mails to /dev/null .

    To date, through this manual entry, effort-draining contact form, I have had at least 20 offers to increase my manly-ness, 10 offers to find the love of my life, and 5 death threats from annoyed spammers. Only one charitable organization had a problem with my auto-reply, because a spammer was using their e-mail address to send junk to me over and over again.

  8. Legislation by Schmucky+The+Cat · · Score: 3, Interesting
    This isn't that new but it's becoming a nuisance because spammers now have automated tools.

    It's taken eight years since email spam became an issue for signifigant legislation to pass.

    We need an easily amendable federal law that simply says unwanted, unsolicited, uncompensated advertising is simply illegal.

    Usenet, fax, email, public chat, blogs, RPC messenger, any forum that allows public input for free has become a spammer magnet. They don't own it, get them out.

    We need a law that says this, as a statement that to live under our social contract you can't be an annoying louse.

  9. To hell in a handbasket, i tell you! by Dwarfgoat · · Score: 2, Interesting

    This is why I had to shut down the guestbooks on several of my sites. It didn't help when I changed the input form, then used a new URL for the posting page, THEN deleted any connection to the CGI script whatsoever. It was only after deleting the script from my webspace that it stopped.

    My hosting company was unsympathetic to my pleas for help. Needless to say, I now host elsewhere. I mean, sheesh...my mother reads that that thing. The last thing I want to think about is her and my dad...and viagr^H^H^H

    *shudder*

    --
    That? That was a pigeon.
  10. How about... by revmoo · · Score: 1, Interesting

    Why couldn't we just create a new html tag, something like , then blog coders could simply set the comments sections of their sites up with the tag.

    Then, google would still spider the page, but any spam would fail to be indexed.

    Of course, blogs aren't the only application for such a thing, any time you take user input to be posted online you could surround it with a tag to aleviate any spam possibilities.

    --
    I would expect such blatant racism on Fark, but on Slashdot? Mods please ban this asshole.
  11. Personal Guestbooks have been targeted also... by Diplo · · Score: 2, Interesting
    This relatively new phenomena isn't just confined to blogs - it's been happening to personal guestbooks for a while. I discussed this recently on the Mozzaline forums and it's apparent I'm not the only one to suffer from this automated spam. A brief summary of what I said :
    Recently I've had 3 enteries in my guestbook that are blatant adverts for rather-dodgy commercial websites. I've deleted them, but wondered if anyone has had similair problems? One was an advert for 'bingo cards' and another for one of those dodgy 'casino' types. Now, what's interesting is that I log the user_agent that was used for all enteries, and all of these adverts stated the user_agent as Snoopy 0.95. If you follow the link you'll see that Snoopy is, infact, a PHP class that emulates a web browser.

    Obviously someone has been using it to automate the task of spidering the web and looking for guestbooks and then filling them in with this blatant spam. What suprises me, though, is that I custom wrote my own guestbook, so I'm a little suprised that what appears to be an automated process can work out how to fill in all the fields correctly. I guess my field names are fairly common, but it still managed to work out which was the 'sign' page and fill in the form, including checking radio boxes etc.

    I have feeling that the reasoning behind this spam is that it automatically creates a link from my website to the spammer's website (since I have a field for guests to fill in their own website). My guess is that this is a way to generate lots of links back to the spammers' site and increase their Google page ranking. It just amazes me the lengths these people will go!
  12. The article misses the point by HealYourChurchWebSit · · Score: 4, Interesting


    The BBC article misses the point, as does a similar article in Wired. Seems the editors are more focused on name-dropping and doomsdaying than on focusing on some recent solutions. For example:

    Point is ... perhaps we'd all be better service if said articles spent less time on the hype and a bit more investigation on some of the solutions ... whether they succeed or fail ... as both are educational.

    Just so long as no one attempts to use a rather evil solution I discovered here on /... ... that would be wrong ...

    --
    --- have you healed your church website?
  13. Re:I've Noticed by brianosaurus · · Score: 4, Interesting

    I think the ads in the blogs are going for better Google PageRank scores, rather than for direct exposure. Most blogs don't get a whole lot of traffic, mostly just family and friends, if even that much. Only a very small percentage of that audience will click, and they surely won't fall for it more than once.

    But google reads a lots of blogs. If a spammer gets their link onto a whole lot of blogs, Google PageRank would see hundreds or thousands of links to their site and bump up its rank. They exploit everyone's blog in order to improve their score on searches.

    That's the theory anyway. Whether or not it works is another story.

    --
    blog
  14. SPAM will kill the open nature of the internet by jhendow · · Score: 2, Interesting

    Or at least it'll be forced to evolve into something more restrictive. When only adventuresome geeks were using the net, it was like we were the earliest settlers in a vast ancient forest. I remember getting maybe two or three messages a month and being elated at each. It was like meeting a fellow pioneer and being mutually pleased at having anyone else to talk to. Eventually the web was born and even my mom got an email account (ZOINKS!). And then the first annoying ads starting showing up in my inbox. And now... well, we already know what happened.
    Seems like there won't be any real solution to filtering spam and the internet will have to go from being a wide-open crosslinked universe to a collection of private nodes/networks. Commercial interests supported the explosive growth of the internet/web, and a lot of us got neato jobs in the process. But now that same commercialism (and human greed/stupidity) have clearcut that beautiful old forest and built up sleazy strip malls.

    I know I'm at risk of sounding like one of those "I was here before it sucked" types. Lamenting the loss of the good old days won't bring 'em back.

    So, what do we do? The idea of charging a token fee for email delivery, which could be rejected by the recepient (thus resulting in a charge for spam, but not for mail we really want) is a good idea. But it might already be too late for that kind of solution. Make spam illegal? Sounds like yet another unwinnable "war-on-a-concept".
    Many usenet groups already require approval for membership, and even that doesn't guarantee that new accounts won't become a source of spam.

    I predict that more and more organizations and individuals will simply build fences around their cyber-outposts, only allowing recognized friends past the gate. At my house we NEVER answer the phone unless the caller ID displays a name we recognize. Ditto for email. Ditto for newgroups as well. I guess my mom was right... I don't talk to strangers any more.

  15. something similer happened on my BBS by night_flyer · · Score: 3, Interesting

    but it was a little different, the messages that were already there were replied to, but they had "empty" response, unless you looked reallu close one "character" in the reply to message now had a link attached to it.

    I dont remember where it was linking to but I think it was a seach index or something similer.

    were they trying to boost the ranking on search engines by having these so called links in place?

    --


    Thanks to file sharing, I purchase more CDs
    Thanks to the RIAA, I buy them used...
  16. The real issue is trust management. by androse · · Score: 2, Interesting

    Just like spam on other media (email, usenet, web forums, etc), you can apply quick and dirty fixes :

    • IP # based black lists
    • URL based black lists
    • CAPTCHA (images and/or audio) authentication
    • keyword filtering
    • bayesian/statistical filtering
    • etc...

    But the real issue is always the same : trust management. You want to be able to grant as much trust as possible to trustworthy (non-spamming) strangers, while revoking all trust to others.

    So why do we always want to build trust management systems on top of other systems, and not design a stand-alone one, that can be used by a wide range of media (email, usenet, blogs, etc) ?

    Note: identifying "personas" does not mean identifying "real people", so there are no privacy issues in such a system.

  17. Re:This was happening to my guestbook too by Another+AC · · Score: 3, Interesting

    We run DreamBook, a free guestbook service with about a million members, and recently the guestbook spam started getting to the point we had no choice but to do something about it. We think the way they get the list of our user's URLs is just through a google search (which has the added benefit of returning the most trafficed books where their spam will potentially be the most widely viewed).

    Originally the spam was just huge lists of porn sites, from a few specific spammers. To fight that, we kludgingly added some specific urls we wouldn't allow in any post.

    They figured that out, and we started getting more from all sorts of different people. So we started adding various heuristics that were kind of lame to block posts (no domains with a - in them for example).

    They figured that out, and started to post all sorts of random spam, unrelated to porn, usually with just links to some other dreambook url. We were kind of puzzled about those, because when you went to their dreambook, it was blank. Viewing the source though, they'd added hidden links to their sites at that book. So it seemed they were spamming to get higher google results. Super.

    So then we added system-wide a check for the same IP posting to multiple books a lot within a certain amount of time. That worked really well for a few months, but recently they've started using I guess a whole slew of proxies! So finally we now look for any URLs in their posts instead of IPs (they vary the messages they post so there's nothing else you can really look for) and filter on that.

    So far it's working okay (but now with some false positives) but it's only a matter of time until they work around that as well.

    Bastards!

  18. Re:I've Noticed by Saeger · · Score: 2, Interesting
    The solution is simple: change human nature

    Well, we are going to have to change human nature eventually, if we want to survive alongside exponentially advancing technology where any random psychopath will be able to "press The Red Button" with exponentially decreasing effort.

    I think humans are basically good when resources are abundant and life is good, but when resources are scarce (artificial or not), then the "selfish gene" goes into overdrive and people get desperate. But there's also that rare minority who have their selfish gene stuck in high gear even though they're already living like [spam]kings, because, hey, more power and more money secures *MY* genes even further, right? Screw the commons. I only care about ME and MY family and MY tribe.

    --

    --
    Power to the Peaceful
  19. Re:Here's My Solution by jparp · · Score: 2, Interesting

    Interesting idea!

    How about, a "spam" button beneath every comment, accessible to rigestered users. The message then gets put in the spam pile, to be deleted after a certain amnount of time.

    Also, if the editor notices a registered user labels non-spam as spam, he could ravoke that users use of the spam button.

    If it still gets out of hand, it would have at least been an interesting experement.

  20. Re:Here's My Solution by JWSmythe · · Score: 2, Interesting


    I wrote something like that into a messaging system that I wrote once..

    If you go to voyeurweb.com (warning, porn site), and select any set of pictures, at the bottom there's a link where users can post their comments.

    Anyone can write there, and frequently enough they write really rude comments. The people contributing the pictures don't like it, the people posting nice comments don't like it, so I added in a button, that simply keeps a record of how many people have clicked the alert button. The text of it is:

    "Alert! Click Here to let the VW Ops know if this is a rude message."

    The idea is simple enough, it remembers (SQL DB, of course) how many unique complaints were made about a particular message, and the message monitors get that list, sorted by the number of complaints. The users are pretty good about complaining, and are more than happy to click the button.

    It's fairly free of abuse, because messages that have more complaints from more users are the bad ones. Of course, there are people who complain about perfectly normal messages, but that's why we have people actually reviewing the messages before they're removed.

    There's a whole lot more to it than just the alert button,

    To me, it's very wierd, it's an adult site, and you'd think that most people are just there to look at the pictures, but there are a significant number of people posting messages there, and they are just about as fanatical about it as /. posters are about /. . :)

    The system as a whole works very well. We have 3,363,465 messages in the system (I purge old messages every few months), 5 alerts that haven't been read, and 43 IP's or networks that have been blocked. They have the power to prevent any size network from posting in the future, if the abuses have been bad enough. Most of the abuse and filtering features have grown with the messaging system over the years. When I originally wrote it, it didn't have or need any of it. It's fairly complete now, I haven't done any significant changes in years.

    --
    Serious? Seriousness is well above my pay grade.
  21. privacy, openness, spamfree by epine · · Score: 3, Interesting


    Wake up and smell the bacon, people. The techno-utopianism of Wired when it was boosting the dotcom era into orbit has proven itself a poor match with human nature on all fronts.

    The benificient fathers of the internet made two horrendous design decisions concerning the final destination of a global internetwork: excessively strong anonimity and a near zero cost for dumping pollution into public media.

    Privacy, openness, spam-free: pick any two.

    For anyone who looked into ECC yesterday, you might have noticed that RSA has ideal properties for preventing some of this mess: expensive to sign a certificate, cheap to verify, and the ratio becomes worse as you scale up.

    If every spam artifact was signed with an anonymous RSA cert (anyone could make as many of these as they wish), as soon as one spam is confirmed, every other post signed by the known-spam cert could be instantly revoked.

    This would force the spammers to create a new anonymous cert for every spam instance. Yet with RSA certs, the computational cost to generate a cert is vastly greater than the cost to verify the cert.

    As an added step, the cert could require the IP address of both endpoints to be embedded inside (the server would reflect back the IP source address it sees, and then ask for an anonymous cert to be generated at a desired RSA key size).

    We won't have to damage anonymity very much to vastly increase the cost of dumping pollution.

    In this respect, weblogs would be a good place to start. This is a relatively new technology that could be retrofitted at one percent of the cost of a global e-mail infrastructure upgrade. It really doesn't matter if you inconvience a few bloggers working out the kinks, these people have not much useful to do in any case.

  22. Re:I've Noticed by 24-bit+Voxel · · Score: 2, Interesting

    Do you know of any scams from the centuries you listed? (I am not confronting, I am just curious.) Vox