Comment Spams Straining Servers Running MT

Wow by Anonymous Coward · 2004-12-18 09:09 · Score: 3, Funny

It's a good thing Slashdot doesn't have this problem.

Re:Wow by Anonymous Coward · 2004-12-18 13:08 · Score: 0

I dont get it?
Re:Wow by Anonymous Coward · 2004-12-18 14:01 · Score: 0

Search for each word on Google. The links point to the first hit for each word individually.

I have a plan by Anonymous Coward · 2004-12-18 09:10 · Score: 0

I'm going to start a comment spam deletion/marking service. I'll charge bloggers 1/10th of a cent per comment checked (1000 comments for a dollar), and hire people in some foreign country, like India or China, paying them 1/20th of a cent per comment read. For every proven mistake they make, I will fine them 10 cents, and credit 5 cents to the blogger. Sound workable?

Re:I have a plan by tomjen · 2004-12-18 09:17 · Score: 1

That is all good and well, but if i remember correctly some company offered such a service for your email. Dont you think they will do this if they can make money that way?

--
Freedom or George Bush
Re:I have a plan by the-banker · 2004-12-18 09:31 · Score: 3, Interesting

No this doesn't sound workable, since a person operating at 99.5% accuracy would not make any money.

For example, they check 2,000 e-mails to earn a dollar, so they check 200 to earn 10 cents. If they make one mistake in that 200, then their entire payment for the 200 goes away.

Besides, you are throwing a human resource at a technology problem and when the technology is fixed, *poof* your business is gone.

In the case of MT the problem isn't the amount of spam, its the way in which static pages are rebuilt when they don't need to be, and mostly manifests itself in shared user environments (per the article). Your service wouldn't help this, because the problem isn't in the spam being displayed its the generation of the pages with the spam on it, which would have to be completed before your spam auditors could ever even see the copy.

Not to mention all the problems around fulfillment. So they see spam, what do they do? Send an e-mail? Do you think people would give your little spam army access to delete comments on the spot? Or do you plan on using some sort of live filtering to further slow down a bottle necked process?

Some things, like voting, should have human intervention and control. Others like this aren't as suited to the task.
Re:I have a plan by AndroidCat · 2004-12-18 10:59 · Score: 1

1. Get paid to comment spam for customers.
2. Get paid for removing your own comments after a delay to get spam hits.
3. ???
4. Profit!

--
One line blog. I hear that they're called Twitters now.
Re:I have a plan by kmmatthews · 2004-12-18 11:57 · Score: 3, Funny

3. Ruin your business plan by posting it to slashdot.
:)

--
feh. stuff.

So it's dead? by miyako · 2004-12-18 09:10 · Score: 1, Funny

So...Netcraft confirms it, blogging is dead?

--
Famous Last Words: "hmm...wikipedia says it's edible"

Re:So it's dead? by macshit · 2004-12-18 10:06 · Score: 1

So...Netcraft confirms it, blogging is dead?

Hold on -- blogging was once alive?

Whoa.

--
We live, as we dream -- alone....
Re:So it's dead? by Anonymous Coward · 2004-12-18 18:17 · Score: 0

You got modded Flamebait. Maybe the Sensitive Moderator that read your post also blogs.

Easy Solution by goodgoing · 2004-12-18 09:12 · Score: 1

Why don't bloggers just disable HTML in comment posts, the spammers are looking for Google PR aren't they?

Re:Easy Solution by Anonymous Coward · 2004-12-18 09:15 · Score: 2, Interesting

Or make an in-between page for every URL linked. So, someone leaves a link, it gets made into http://www.example.com/linkout.php?linkid=23890 (or whatever), then linkout.php just SHOWS the link (not a redirect) with a noindex,nofollow tag (for Google) and robots.txt entry. No PR, yet a user can still click. Another alternative would be to be use javascript since Googlebot doesn't seem to parse it yet.
Re:Easy Solution by Eric+Giguere · 2004-12-18 09:41 · Score: 1

Hmm... semi off-topic, but it would be neat if search engines like Google could be trained to ignore negative score Slashdot comments. On systems where there's built-in feedback, that would be one way to combat the spam, just train the search engine crawlers to ignore comments with poor scores.
Eric
See your HTTP headers
Re:Easy Solution by Rob+Carr · 2004-12-18 09:46 · Score: 1

"On systems where there's built-in feedback, that would be one way to combat the spam, just train the search engine crawlers to ignore comments with poor scores." 1. Google should punish URLs with negative feedback! 2. Or Google should ignore URLs in comments. Dang, I'm still shaking - Steelers 33, Giants 30. Great game.

--
This sig seemed like a good idea at the time....
Re:Easy Solution by tepples · 2004-12-18 09:47 · Score: 3, Informative

it would be neat if search engines like Google could be trained to ignore negative score Slashdot comments

Given that the static page is written at a Score:1 threshold, and that Google obeys Slashdot's suggestion in robots.txt not to index the dynamic pages, this is already the case.
Re:Easy Solution by blowdart · 2004-12-18 10:06 · Score: 1

Well referral spam has been going on for ages (I list mine, but don't link to the urls) and people still publish web logs.
Ease of use is going to win every time.
Re:Easy Solution by AmericaHater · 2004-12-18 11:26 · Score: 1

Can you expand on this with a code example? do you mean the link is show as text with no anchor tag or what? If you use a robots file wouldnt you have to confine all links to one page in its own directory?
Its an interesting sounding idea that I might hack up.
Re:Easy Solution by Anonymous Coward · 2004-12-18 11:48 · Score: 0

I'm thinking something like tinyurl. To be more clear on the steps:

1. Comment writer writes a comment and includes a link, say to www.google.com.
2. Blog software parses comment, stores the link in a link database with a unique ID, then replaces the link in the comment to your outgoing linkpage (with the link id #)

Visit my favorite search engine [a href="http://www.google.com/"]Google[/a]!

becomes

Visit my favorite search engine [a href="/outgoing.php?linkid=1"]Google[/a]!

3. The outgoing link page would be pretty simple. It would look at the link ID #, pull it from the database and present it to the user. This page would contain a meta tag for Google telling it not to follow it or index it. I'd probably also add the page to the robots.txt so they don't even waste their time. I wouldn't use a header redirect (301 / 302), but you might be able to get away with a meta header refresh redirect. I'm not sure how Google handles those.

[html]
[head]
[meta name="robots" content="noindex,nofollow"]
[/head]
[body]
[a href="http://www.google.com/"]http://www.google.co m/[/a]
[/body]

Not just comment spam by cybrthng · 2004-12-18 09:12 · Score: 3, Interesting

But DoS attacks as well. Running several political blogs I often get "freeped"

The best solution for me:

1. User email address verification
2. server generated images to verify real user for registration
3. Regular cookie expiration after x amount of time
4. host filtering (referr filtering usually gets ride of "freepers" unless they open a new window

However - nothing beats good moderators, quality users and sticking to your nich. Don't go pissing people off tossing your blog around the world yourself and not expect to get anything in return.

It's a jungle out there :)

Re:Not just comment spam by doormat · 2004-12-18 09:20 · Score: 3, Informative

Some context: This is a "freeper". They have also been known to use militant mob-style tactics to bother/silence those who dont agree with them, as parent has dealt with. Kinda ironic ya know... they are freepers yet they work hard to silence those who dont agree with them.

--
The Doormat

If you're not outraged, then you're not paying attention.
Re:Not just comment spam by LiquidCoooled · 2004-12-18 09:21 · Score: 3, Informative

sage advice :)

The worst part of being a slashdot member is watching people devistate and ruin a server because of childish acts of vandalism.

Take for instance whenever slash points towards wikipedia, within minutes the page will be modified to some trolls' agenda.
Having to wade through the crapflood of comments on blogs and forums after slash has been there is almost embarassing sometimes.
The servers can generally cope with a slashdotting and work perfectly just hours or days after the initial hit, however the trolls handywork can end up staying for longer.

--
liqbase :: faster than paper
Re:Not just comment spam by kv9 · 2004-12-18 09:28 · Score: 0

Harry Fuecks has some ideas also.

--
Stop Computers/Cars Analogies on S
Re:Not just comment spam by nkh · 2004-12-18 09:31 · Score: 1

2. server generated images to verify real user for registration

I don't know if something like that have already been done but there was a paper on neural networks used to crack captchas. It was very efficient on basic text (even with a medium amount of distortion) and showed that intelligent spam bots could be written in the future (not that I want to scare you though ;)
Re:Not just comment spam by eschipul · 2004-12-18 09:35 · Score: 1

IANAB (I am not a blogger) but it seems to be that track back is at least a partial solution. Perhaps assumed negative on the automatic track back post until it is activated by the author. http://www.movabletype.org/trackback/beginners/
Re:Not just comment spam by LordNimon · 2004-12-18 09:46 · Score: 1

3. Regular cookie expiration after x amount of time
I really hate it when web sites do that. Does anyone know of a Mozilla plug-in or something that will let me edit the expiration date of any cookie, preferably when the cookie is being set?

--
And the men who hold high places must be the ones who start
To mold a new reality... closer to the heart
Re:Not just comment spam by tepples · 2004-12-18 09:49 · Score: 1

server generated images to verify real user for registration

Use a visual CAPTCHA and completely disrespect readers with impaired vision.
Re:Not just comment spam by Anonymous Coward · 2004-12-18 09:50 · Score: 0

Not really that ironic, considering Slashdot still has "Your Rights Online" moderated by michael and timothy. Militant mob-style tactics sure are popular on the internet.

(Oh, sorry, that should be "internets.")
Re:Not just comment spam by Firethorn · 2004-12-18 09:55 · Score: 1

Yeah, but the amount of power it takes to decode them at least limits the amount of posts it allows.

The question becomes one of spam. Whether it's in your email box, or the comments of your blog, it's the same.

You want it to be easy to filter out the spam and still make it easy for legitimate readers to make comments.

Looking at the slashdot system, a mail-verified registration system seems to be mostly sufficient.

On my blog the spambot was putting porn weblinks into the webfield, and a generic 'dude that's cool' or 'I want to know more' type of text in the comment field.

However, mine was easy, it was all coming from one subnet, so I blocked that.

--
I don't read AC A human right
Re:Not just comment spam by Anonymous Coward · 2004-12-18 10:03 · Score: 0

Post information on the Internet and completely disrespect readers with impaired Internet connectivity.
Re:Not just comment spam by tepples · 2004-12-18 10:25 · Score: 2, Insightful

Correcting lack of access to text on the Internet is easy: just buy a PC with a screen reader and an account with an ISP. Correcting lack of access to distorted images of text on the Internet, on the other hand, is non-trivial: if the CAPTCHAs are easy enough for blind people's OCR, then they're easy enough for spammers' OCR. If you must use a CAPTCHA, then make it something other than an image. Ask yourself: what questions can a blind person answer that a spambot can't?
Re:Not just comment spam by Anonymous Coward · 2004-12-18 11:00 · Score: 0

Why don't we limit the entire world such that nothing is inaccessable to the lowest common denominator human? Time to get rid of everything that can't be fully utilized by a blind, deaf, dumb, anosmic, quadrapalegic, retard with no no limbs.
Re:Not just comment spam by Anonymous Coward · 2004-12-18 11:03 · Score: 0

You insensitive clod!!!

I am a blind, deaf, dumb, anosmic, quadrapalegic, retard with no limbs!
Re:Not just comment spam by cybrthng · 2004-12-18 11:57 · Score: 1

yeah, i hate it too - but it works. Keeps those "one timers" who come in just to hammer the board with crap and then get re-prompted for a login they most likely forgot before and have to go through a registration process again and usually just give up..

Ofcourse you could also just regenerate the cookies bsaed upon post scoring - for example if people get modded up lengthen cookie time and such because there is some trust being given.

Give a reward for participation of sorts
Re:Not just comment spam by LWATCDR · 2004-12-18 14:19 · Score: 0

frankly it is typical behavor of "Liberals" Often those that scream that they are the most open minded are not. Of course the exterm right does the same the only real difference is that they do not tend to claim Open mindedness.

--
See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
Re:Not just comment spam by minus_273 · 2004-12-18 14:49 · Score: 1

hmm how is this any differnt from slashdot? freeping is just another name for the slashdot effect.

--
The war with islam is a war on the beast
The war on terror is a war for peace
Re:Not just comment spam by cybrthng · 2004-12-18 15:24 · Score: 1

Open minded doesn't mean they necessarily agree with you - just that they see things differently then you.

Put a 100 liberals in the room and no one will speak ot one another.. stick 100 righties in a room and you have an organized gang..

just the way it seems to work. "No 2 thinkers, think alike" :)
Re:Not just comment spam by ceejayoz · 2004-12-18 15:37 · Score: 1

As I understand it, "freeping" a site means intentionally manipulating something like a poll so it swings in your political favor. For example, sites on both sides were encouraging their users to "freep" the CNN/MSNBC/etc. polls after the Presidential debates this year.

The Slashdot effect is more mindless.
Re:Not just comment spam by polyiguana · 2004-12-18 16:44 · Score: 1

Ask yourself: what questions can a blind person answer that a spambot can't?

Audio questions. Livejournal (blech) uses that for registration.
Re:Not just comment spam by LWATCDR · 2004-12-18 17:12 · Score: 1

I have no problem with people disagreeing with me. I have problem with closed mindedness. Sorry they have just as closed minded view on the world as any right winger. I suggest you consider pretending to be pro life, or pro the dropping the bomb on japan to end WWII, or say that Bush is not that horrible and see if you are not attacked by that liberal gang that forms.

--
See my blog http://ilovecookes.blogspot.com/ for light hearted technical information.
Re:Not just comment spam by tepples · 2004-12-18 17:23 · Score: 1

Audio questions? Then you shut out the deafblind, who use a Braille terminal. But I'm guessing you consider them collateral damage in the war on spammers.
Re:Not just comment spam by ZorinLynx · 2004-12-18 18:20 · Score: 1

Just provide an address where people can email you if the distorted text thing is unusable for them. Very few users can't use that system, and you can just create accounts of those who can't manually.
Re:Not just comment spam by Anonymous Coward · 2004-12-18 22:57 · Score: 0

Looking at the slashdot system, a mail-verified registration system seems to be mostly sufficient.

You must be new here...
Re:Not just comment spam by Anonymous Coward · 2004-12-18 23:38 · Score: 0

The Slashdot effect is more mindless.
Hah. Tell that to Alan Ralsky!

Old news. by 1_interest_1 · 2004-12-18 09:12 · Score: 3, Insightful

This has been going on for quite awhile now, and still no official fixes from SixApart?

Shame on them.

Re:Old news. by Anonymous Coward · 2004-12-18 14:06 · Score: 0

They should rename it to Four Apart. ;)

Netcraft confirms ex-MT users love WordPress by IO+ERROR · 2004-12-18 09:14 · Score: 4, Informative

There are many reasons to use WordPress instead of Movable Type.

First and foremost, it's free (speech and beer) and distributed under the GPL.

Second, the actual developers of the software actually participate in the support forums, so if you do have a question, it's likely to be answered very fast by someone intimately familiar with the software.

Third, it's a lot less susceptible to comment spam, especially after applying a few plugins and hacks. I've never received a single one, and that's not for lack of spammers trying.

Fourth, it's very easy to customize the look and feel of the site without knowing any PHP. HTML and CSS is about all you need to know. Knowing PHP helps a lot if you want to really customize it, but it isn't a requirement.

Finally, they've already included a Movable Type import utility, so those of you who are sick of MT for this and many other reasons can move over with little hassle.

Signed,
A very happy WordPress user and occasional contributor.

--
How am I supposed to fit a pithy, relevant quote into 120 characters?

Re:Netcraft confirms ex-MT users love WordPress by Xofer+D · 2004-12-18 10:33 · Score: 2, Interesting

The down side to WordPress is that it's really very immature code. Not only does it handle UTF-8 characters poorly, but even casual usage turns up a number of bugs in various different parts. This suggests to me that the developers fixed it in one section but didn't fix it in other parts of the code - not exactly thorough. I ran into all this stuff inside my first three hours of usage.
Of course, all of this is fixable, and just calls for more people to jump in and get involved. I learned a bit of PHP and hacked myself a fix for the UTF-8 issues I was having, inside five hours of my first wordpress installation (note that's two hours after I found the problem and figured out how to replicate it reliably). I also installed and improved upon some of the comment spammer blacklist plugins, which ended up working very well. Prior to fiddling with wordpress, I had no PHP experience at all. I am not a programming god, either.
The developers are also responsive to suggestions - I posted a bug about some of the UTF issues I could not solve, and it was resolved for me. Thanks, matt!
I think that it's important to manage expectations when advocating software, which is why I want to make it clear the wordpress does not yet seem rock-solid stable. However, I think that with enough eyeballs (Hi, everyone!), it will definitely become the secure, flexible platform that most of everyone wants.
Spammers need not apply.

--
The Signal/Noise ratio can be improved in two ways. Remaining silent is the OTHER way.
Re:Netcraft confirms ex-MT users love WordPress by gad_zuki! · 2004-12-18 12:04 · Score: 1

Wordpress has its pros, but the support forum is a ghost town. Maybe when more people migrate over to it this will change, but I think only a small percentage of my questions even had some kind of reply. The wiki is out-dated and full of tips that dont even apply to the current version.

The current version is buggy (password reset, no way to link to user's profile, etc), but runs well enough and now that MT costs money I'm sure there will be more WP users out there soon. Then again, blogger is great for technophobes and experts alike and offers free hosting.

I use only two plug-ins because other plugins break my site. The problem is "this plug in breaks this plug in or this plug in will break the site if that plug in is installed, etc" Plug-in compatibility really needs to be addressed before we start singing the praises of "plugins for everything!!!" As a dabbler in PHP I really didnt like the lack of commenting in the code either.

That being said, its a good project, has an excellent installer, and I hear the current beta addresses some of these issues. But I'd wait for the next version (1.3?) before trying to sell it to a current MT user. I doubt the cries of "its GPL" is going to convert anyone but the already converted.
Re:Netcraft confirms ex-MT users love WordPress by Anonymous Coward · 2004-12-18 12:06 · Score: 0

Amen to Wordpress. Using that and the AuthImage plugin, which requires commenters to type in a code displayed on an image (like Wil uses, although his is with MT) has been a GODSEND.
Re:Netcraft confirms ex-MT users love WordPress by Anonymous Coward · 2004-12-18 12:29 · Score: 0

I get a significant amount of comment spam on WordPress, so it's not better.
Re:Netcraft confirms ex-MT users love WordPress by saxmatt · 2004-12-18 12:33 · Score: 1

The support forum has hundreds of posts per day, if that's what you mean by ghost town.

The login problem I think you're referring to was fixed in the latest release, if not please file a bug and we'll get to it ASAP.
Re:Netcraft confirms ex-MT users love WordPress by saxmatt · 2004-12-18 12:37 · Score: 1

Strange that you mention UTF-8 issues, was this from before 1.0? Anyway I guess if you filed a bug and we fixed it then there's nothing to worry about now. :)
Re:Netcraft confirms ex-MT users love WordPress by Oliver+Aaltonen · 2004-12-18 12:50 · Score: 1

From the WordPress Dev Blog: Fight Spam
Re:Netcraft confirms ex-MT users love WordPress by spike2131 · 2004-12-18 13:57 · Score: 1

I use wordpress. Its nice but comment spam is a real problem. Or at least it was. I had the same online poker guy spaming me 5 times a day until I changed the name of the php file that comments get submitted to. That seem to have done the trick, at least as far as automated spamming.

--
SpyDock: Scientific Python in a Docker container
Re:Netcraft confirms ex-MT users love WordPress by generationxyu · 2004-12-18 15:09 · Score: 1

Second here. I got sick of MT when I tried to upgrade from 2.6 to 3.01, and while I was at it, switch from Berkeley DB files to MySQL. The upgrade alone took me 6 hours or so (over a number of days), I posted questions on the forums and go no answers. This is for a *paid* product. The BerkDB->MySQL switch simply did not work. They have a script that supposedly does the conversion, but it doesn't work with all versions of BerkDB files, even though MT pretty much does.
I posted this problem to the forums and got a response within about 15 minutes telling me to upgrade to 3.01. I told them I already did. A week later, I hadn't gotten any response, and started looking for alternative CMSs. I had heard good things about Greymatter, but it turned out that it wasn't the style I was looking for at all -- GM is more for journals and such, not blogs. But within 15 minutes of asking about GM on the GM forums, the lead developer told me, no, this probably isn't what you're looking for, your feature wishlist sounds a lot like WordPress.
20 minutes after hearing of WordPress, I had it installed and my MT bloggings imported (therefore importing all my authors). Another hour to mess with the stylesheet and I haven't messed with it since (except to update it, and add a captcha for comments).

--
I mod down pyramid schemes in sigs.
Re:Netcraft confirms ex-MT users love WordPress by generationxyu · 2004-12-18 15:14 · Score: 1

If you install a captcha such as AuthImage (the only one I've gotten to work), the comment spam really drops down to null. You kind of have to hack it in there -- I do wish plugin support was better -- but it does work quite well.

--
I mod down pyramid schemes in sigs.
Re:Netcraft confirms ex-MT users love WordPress by jseng · 2004-12-18 17:24 · Score: 1

Beyond Wordpress, perhaps you should also consider looking at Drupal which is also GPL?
Specifically, my Drupal4Bloggers project is to recreate all the features that MT users are used to in Drupal. Comment spams and the constant need for rebuild after spam attacks is the main reason that drive me away from MT and to start the project. It is pretty stable now and has all the anti-comment spams features I build for MT plugins (e.g. the popular captcha plugin mt-scode). It also has moderation features Wordpress users are used to. Work is underway to build anti-Trackback spams.
Re:Netcraft confirms ex-MT users love WordPress by jseng · 2004-12-18 17:43 · Score: 1

Beyond Wordpress, perhaps you should also consider looking at Drupal which is also GPL?

Specifically, my Drupal4Bloggers project is to recreate all the features that MT users are used to in Drupal. Comment spams and the constant need for rebuild after spam attacks is the main reason that drive me away from MT and to start the project.

It is pretty stable now and has all the anti-comment spams features I build for MT plugins (e.g. the popular captcha plugin mt-scode). It also has moderation features Wordpress users are used to. Work is underway to build anti-Trackback spams.

comment spams made me switch by SethJohnson · 2004-12-18 09:15 · Score: 2, Informative

I had to ditch Moveable Type explicitly due to comment spam. The real problem with it was that there was no way to delete more than one at a time. The web app only displays the last five comments and then you have to go digging through every article to find the other spams. Real pain in the ass. I switched to Wordpress, which is also beseiged by comment spam from Online Poker outfits. In Wordpress, however, you can mass-edit with all comments listed with checkboxes to delete whichever are spams.

In Moveable Type and Wordpress, you can pretty much eliminate the script-driven spambots by renaming the comment cgi handler and then editing all other files that reference it. I didn't think of this till after I swtiched to Wordpress, though.

--
$5 / month hosted VPS on linux = awesome!

Re:comment spams made me switch by happyemoticon · 2004-12-18 09:25 · Score: 1

That looks a lot more robost than MT (mind you I'm still using 2.65). When this whole comments thing started getting out of hand, I actually edited every damn post since last year to be comments-closed.

Maybe I'll switch too. I was planning to do a redesign during the break. Does it have pretty versatile templating?
Re:comment spams made me switch by eggboard · 2004-12-18 09:43 · Score: 1

MT 3.x has a Comments page that lets you review 20, 50, etc., comments at a time, select them all to delete, etc.

Much improve and appreciated. I also turn on comment moderation and this fixed the problems I had with comment spam.

--
Freelance tech journalist for the Economist, MIT Technology Review, Macworld, and others
Re:comment spams made me switch by Echo5ive · 2004-12-18 10:13 · Score: 1

Sorry, but renaming mt-comments.cgi to something else takes a spammer all of two seconds to bypass. They just sniff for the text field names in the comment form, and find out the name of the comment handler that way.

I'm a user at TextDrive, and a bunch of users and admins there have a mailing list where we are VERY aggressive in defeating spam. mod_security is great for blocking based on the contents of a POST payload ("contains texas holdem? Sorry, you get an Error 412.") and mod_dosevasive, which is great for hindering a mini-DDoS of comment spam.

Every addition to the block lists is peer-reviewed by the members of the mailing list, to make sure that we only catch spam, and not innocent comments.

We've pretty much put a stop to comment and referral spam on TextDrive thanks to this effort.

--
Leveling up builds character.
Re:comment spams made me switch by Sethb · 2004-12-18 10:23 · Score: 2, Informative

I just implemented their TypeKey service on my MT blog when it came out. I used to get comment spam nearly daily, but in the five months since I turned on TypeKey I haven't had a single instance of it. I don't know why more blogs aren't using it, since it is free, and it works quite well for me...

--
When in danger or in doubt, run in circles, scream and shout. --Robert A. Heinlein
Re:comment spams made me switch by jacobito · 2004-12-18 10:35 · Score: 2, Interesting

Perhaps this was added in version 3.x, but you certainly can delete more than one comment at a time in Movable Type, and there is no need to "dig through" each post to find the latest comments, whatever the number. I believe that the comments page displays 20 comments at a time by default. It's unfortunate, though, that Six Apart pissed everyone off by licensing 3.x as they did, or more people would be taking advantage of 3.x's small but worthwhile improvements.

I agree with other posters that renaming the comment CGI handler is ineffective. It's ineffective because enough people have tried that technique that it has become worthwhile for spammers to work around it. Other potential solutions will probably end up with similar results. Want to stop spammers by forcing comment previews? Then the spammers will preview their comments. Want to stop spammers by throttling x number of comments per hour? Then you'll end up with exactly x number of comments, fewer legitimate comments, and you'll still have spam. Want to stop spammers by forcing a login from a central authentication server? Spammers will register their own accounts on that central authentication server, too. Etc.

I'm sorry to say that spam cannot be prevented, only mitigated. The best you can hope for is not having to manually delete every single comment you receive, as automated solutions weed out some (hopefully) high percentage of them. Meanwhile, any solution short of refusing comments altogether will eventually be defeated to some extent by spammers, assuming that enough people use that solution to make it worth the spammers' time and effort to defeat. One consequence of this is that switching from one popular blogging platform to another popular blogging platform is not going to save you from spam in the long run.
Re:comment spams made me switch by jacobito · 2004-12-18 10:59 · Score: 1

Blog spammers are starting by pursuing the low-hanging fruit. As more and more weblogs switch to central authentication systems like TypeKey, I expect that spammers will find it worthwhile to figure out how to spam using TypeKey accounts. If I'm wrong in thinking this, I still haven't heard a good reason from Six Apart or anyone else why that would be the case. I would be happy to be wrong about this, though.
Re:comment spams made me switch by rscrawford · 2004-12-18 11:02 · Score: 1

Renaming the comment cgi handler worked for a little while until the spambot authors figured out a way around it. I've now added a hidden text field to the comment form, and the comment cgi handler will not accept the comment unless it includes that hidden form element. It's a temporary solution until the spammers figure it out and bypass that too, but for now it seems to work okay. I haven't gotten hit since I implemented it a couple of weeks ago -- before that I was getting a dozen comments from online poker sites every few minutes (none of them got posted, because I have comment moderation turned on by default -- still, playing whack-a-mole with the comments was really annoying).

I also have wp-blacklist installed, and that works great, though it seems to have issues with some of the earlier versions of WP.

--
-- The reason it's called the right wing? Irony.
Re:comment spams made me switch by Anonymous Coward · 2004-12-18 11:17 · Score: 0

"Sorry, but renaming mt-comments.cgi to something else takes a spammer all of two seconds to bypass."

This is not necessarily true. Not all of the spammers will bother rescanning the source of the page. I have done this on my blog and I have yet to have anymore comment spam since I did so. It's also been over a month now.
Re:comment spams made me switch by Echo5ive · 2004-12-18 12:16 · Score: 1

Oh, just you wait. The serious spammers only target blogs with a good (6+) PageRank in Google. Then renaming mt-comments.cgi doesn't change anything at all.

--
Leveling up builds character.
Re:comment spams made me switch by Anonymous Coward · 2004-12-18 12:39 · Score: 0

TypeKey doesn't protect at all against Trackback spam.
Re:comment spams made me switch by jesser · 2004-12-18 13:41 · Score: 1

TypeKey will be to protect blogs by banning abusive IPs. Before TypeKey, each IP address a spammer acquired allowed him to spam every MT blog. With TypeKey, each IP address will probably get banned before it spams a hundred blogs.

--
The shareholder is always right.
Re:comment spams made me switch by jacobito · 2004-12-18 14:51 · Score: 1

A) Is there even any mechanism to report problem users to the TypeKey admins?
B) If there is, is it wise for the TypeKeys to ban first and investigate later?
C) Why would banning by IP addresses by effective at the TypeKey level when it's already been proven to be ineffective at the individual weblog level? Spammers certainly don't restrict themselves to a single IP. I thought it was already well-established among those who are fighting comment spam that banning by IP address is pointless.
Re:comment spams made me switch by (negative+video) · 2004-12-18 15:19 · Score: 1

Why would banning by IP addresses by effective at the TypeKey level when it's already been proven to be ineffective at the individual weblog level?
Consider M blogs and a spammer with N zombies in his botnet. Banning IP addresses on individual blogs means that M×N spams will get through at a minimum. Centralized banning would only let N spams get through. If M and N are large, that's easily hundreds of millions of spams prevented.
Re:comment spams made me switch by tapin · 2004-12-19 12:59 · Score: 1

Perhaps it defeats the purpose of a web-driven administration tool, but the times when I've had to purge spam comments I've simply done it through the database.
A few "delete from mt_comment where...", and one rebuild later (back in the web admin tool) it was all done. Very little fuss.
Of course, this talk of alternatives has me interested anyway...

Uhh.. it's not that difficult. by Anonymous Coward · 2004-12-18 09:15 · Score: 0

Just disable URL's in comments, and in user information.

Disabling comments is just silly.

Re:Uhh.. it's not that difficult. by jonhuang · 2004-12-18 14:37 · Score: 1

I did that. They don't care; they spam anyway. I auto-blacklist anything that looks like a link. I still get hundreds of *misconfigured* and ineffective spam.

A simplistic solution by happyemoticon · 2004-12-18 09:17 · Score: 3, Interesting

If your case is like mine, where mt is stored in a directory just off of your public web site, do this: use a .htaccess to put a password on your whole MT directory. They can't access comments.cgi (assuming it's just a bot doing the spamming), they can't post comments. I don't really like the idea of people touching my CGIs anyway. Make sure your robots.txt excludes the MT directory as well.

That is, assuming you don't give a damn about people's comments.

Re:A simplistic solution by Anonymous Coward · 2004-12-18 09:41 · Score: 0

That is, assuming you don't give a damn about people's comments.

I'll make a naive comment but: If you don't give a damn about people's comment, why would potential readers care about what you write in your blog?
Re:A simplistic solution by happyemoticon · 2004-12-18 10:04 · Score: 1

For the most part, the only people who read it are a few close friends and my girlfriend. I mostly use it as a design testbed and a place to rant.

However, there's nothing preventing you from giving your password out to some of your friends, or even putting it on the webpage itself. In a gif, better yet. The scripts that run these things aren't that smart, and clearly the 1000 odd posts on my website weren't done by a human. I'm not important enough..
Re:A simplistic solution by GeorgeH · 2004-12-18 10:38 · Score: 1

That is, assuming you don't give a damn about people's comments.

Who posts comments on websites anyway? It's not like anyone reads them.

--
Why can't I moderate something "Wrong" or at least "Grossly Misinformed"?
Re:A simplistic solution by (negative+video) · 2004-12-18 15:28 · Score: 1

+1 Troll
Re:A simplistic solution by scotty · 2004-12-18 18:50 · Score: 1

Nor can anyone search anything on your blog through MT's search CGI script. D'oh.

Now then... by commieboyredux · 2004-12-18 09:17 · Score: 1

How long until we have content/poster filtering for blogs like we have for e-mail? If someone got coding right now, they might make a pretty penny off of this...

Re:Now then... by jacen_sunstrider · 2004-12-18 09:39 · Score: 2, Informative

Already done! And they're for wordpress! My favorite is Blacklist, and it works pretty well, long as I update the definitions every once and awhile.
Re:Now then... by jacksonj04 · 2004-12-19 00:46 · Score: 1

I'm writing a Bayesian filter for comments based on a peer review system (like /.)

--
How many people can read hex if only you and dead people can read hex?

Why your Moveable Type blog must die by __aajqwr7439 · 2004-12-18 09:20 · Score: 2, Funny

You are all pretentious twats

Every last one of you. You're all latte-sipping, iMac-using, suburban-living tertiary-industry-working WASPs who offer absolutely no new insights on anything whatsoever apart from maybe one specialist field if we're lucky.

Quite an enjoyable rant.

xox,
Dead Nancy

Re:Why your Moveable Type blog must die by Anonymous Coward · 2004-12-18 09:34 · Score: 0

Roger that. One of the highlights of kuroshin. Gotta love it.
Re:Why your Moveable Type blog must die by happyemoticon · 2004-12-18 09:37 · Score: 2, Funny

I live in the urbs, I drink cappuccinos, and I work for an academic research unit. My computer is not an iMac, but a PC with XP and Slackware. I'm a euromutt of catholic derivation, and I have pretty broad interests.

But that's pretty damn funny, I'll admit. They forgot, though, that they're all writing dark fantasy novels which will never be published.

There are far too many weblog addicts out there who are excessively vain, and are under some kind of bizarre pretense that they matter, and they seem to exist solely by jacking each other off. Hrmph. But you have to admit, MT users are a little less likely to be whiny baby-bats than, say, livejournal users.
Re:Why your Moveable Type blog must die by Anonymous Coward · 2004-12-18 13:11 · Score: 0

You shouldn't make generalizations about a site that you've never visited. That's what they do on kuro5hin.
Re:Why your Moveable Type blog must die by SQL+Error · 2004-12-18 13:19 · Score: 1

But you have to admit, MT users are a little less likely to be whiny baby-bats than, say, livejournal users.

Or Kuro5hin readers...
Re:Why your Moveable Type blog must die by Anonymous Coward · 2004-12-18 15:06 · Score: 0

Yes, some weblogs are bad. Some webloggers are annoying. 'Blogosphere' is a ridiculous word. How insightful. How funny. *yawn*
Re:Why your Moveable Type blog must die by Anonymous Coward · 2004-12-19 01:54 · Score: 0

But you have to admit, MT users are a little less likely to be whiny baby-bats than, say, livejournal users.

Don't knock those whiny livejournal baby-bats. Fucked up teen goth chicks are the best lays on the planet!

Nucleus CMS by einolu · 2004-12-18 09:24 · Score: 1

besides WP, Nucleus is also a good blogging tool, easy to use and its secure. I use this and WP, both are nice. Also I was getting a lot of comment spam using WP, but I turned off letting other sites know when I update and the online casion spam stopped.

Re:Nucleus CMS by jacobito · 2004-12-18 10:54 · Score: 1

but I turned off letting other sites know when I update and the online casion spam stopped.

I've seen this observation mentioned once before, and I'd like to see this explored further. It seems that spammers are harvesting URLs from sites like weblogs.com and blo.gs. I don't doubt that their finding blogs via Google searches, though, so turning off update notifications is probably a temporary solution at best.
Re:Nucleus CMS by glyneth · 2004-12-20 02:07 · Score: 1

I have a blog that doesn't ping anything when it gets updated. It started to get comment spam, so I just turned on an .htaccess file, and put the login and password right in the login popup window.

No more comment spams, though it takes a second for people who want to read/click-through to do it.

challenge the user by lseltzer · 2004-12-18 09:24 · Score: 4, Informative

We had a similar problem on our ziffdavis.com blogs (like my security blog) and we think we have solved it with with one of those graphic field challenges to the user (enter the value in the nearby graphic).

Re:challenge the user by jacobito · 2004-12-18 10:45 · Score: 2, Insightful

Captchas are currently great for weeding out automated spammers; unfortunately, they're also great at weeding out people who cannot see. This unnecessarily renders your site inaccessible to a portion of your audience. From a geekier perspective, this sort of assumption-laden web design runs completely contrary to the accessible, device-independent spirit of the original WWW.

Of course, since the blog you linked doesn't even work at all as I write this, maybe you're not concerned with accessibility for anyone!

http://blog.ziffdavis.com/seltzer GET /seltzer HTTP/1.1 HTTP/1.x 200 OK Server: Microsoft-IIS/5.0 Date: Sat, 18 Dec 2004 22:39:46 GMT X-Powered-By: ASP.NET X-AspNet-Version: 1.1.4322 Transfer-Encoding: chunked Cache-Control: private Content-Type: img/jpeg; charset=utf-8
Re:challenge the user by Anonymous Coward · 2004-12-18 10:55 · Score: 0

WFM, using linux right?
Re:challenge the user by lseltzer · 2004-12-18 12:33 · Score: 1

Works fine for me, sorry if you have a problem.

I have seen this sort of challenge with an audio option for the sight-impaired. I'll see if that's an option for us.

In the meantime, if my choice were between having the spam and this accessibility problem, I'll put up with the accessibility problem for now and look for a solution to it. The spam was intolerable and the only thing blind users are denied is the ability to post.
Re:challenge the user by jesser · 2004-12-18 13:35 · Score: 1

WFM using Firefox 1.0 on Windows and using web-sniffer.net. I have no idea why you're getting an img/jpeg content type for http://blog.ziffdavis.com/seltzer.

--
The shareholder is always right.
Re:challenge the user by jacobito · 2004-12-18 14:58 · Score: 1

My apologies for jumping on the temporary issue with your web site, which was occurring for me on Firefox 1.0 for Windows and Mac OS X, but which righted itself shortly after I made the comment.

DCC for comments? by Anonymous Coward · 2004-12-18 09:25 · Score: 0

How about something like the Distributed Checksum Clearinghouses for comments? Comments shouldn't generally be exact duplicates, and DCC is good at catching email duplicates which are often spam. It uses some fuziness factors so some alterations will still be caught.

easy fix by GirTheRobot · 2004-12-18 09:26 · Score: 0

To submit a comment on a blog, you must type in a series of letters and numbers for a non-machine-readable image (like when you forget your password here on Slashdot). This will at least prevent automated blog spam. ...I don't know why this solution isn't deployed already.

Netcraft confirms... by Anonymous Coward · 2004-12-18 09:35 · Score: 0

Moveable Type is DYING.

DotComments by meehawl · 2004-12-18 09:38 · Score: 1

Call me untrendy, but I still like dotcomments.

--

Da Blog

SixApart is partly to blame by ShatteredDream · 2004-12-18 09:41 · Score: 1

They hired Jay Allen, creator of MovableType blacklist, as project manager, but MT BL is not part of the standard distribution. It's not a standard feature, nor is there anything designed in house that provides the same functionality if God-forbid Jay Allen won't let them bundle it as a standard feature. The worst part is that it is having major problems working with MT 3.121, the latest release.

Personally I think MT needs to just scrap the entire comment system and start over again. They need to implement a MT BL like system comprehensively, they need to ban ips tied to spam bots and they need to collect the information about the spammers so that MT users can try legal challenges.

Spam bots should be not only a civil offense, but a crime to use. The way that they are used against blogs is basically on par with defacing a website and often the stuff they push is illegal for minors to view. This is why we need something like the Child Online Protection Act. With something like that we could get spammers on criminal offenses for using spam bots indiscriminately.

--
Click here or a puppy gets stomped!

Re:SixApart is partly to blame by gad_zuki! · 2004-12-18 11:18 · Score: 1

>This is why we need something like the Child Online Protection Act.

This is exactly why we DON'T need "won't someone think of the children" legislation. You're going to put up with massive censorship because of some blog spam that can be easily fixed with typekey, blacklists, etc? For some useless blog comments we're going to censor the web? Wow. Amazing, how Americans can even suggest such a thing. So much for the land of the free, eh?

Like all mediums, parents should be making sure their children are using it in a way they approve of. Lazy parents and religious nuts (and now the spam ridden) should try to understand this simple concept. The job to keep whatever content you dont like from your children is YOURS, not the state's job.

You don't need to protect "children" (whatever that means nowadays, like a 17 yearold has no idea what sex is). What you need to do is start your own kid-safe internet or TLD or run some censorship softwarre on YOUR machine, not on the global web. Think client side solutions and leave the rest of us alone. Thanks.
Re:SixApart is partly to blame by sakusha · 2004-12-18 11:26 · Score: 1

You're a little behind the curve. MT hired Jay Allen specifically so he could integrate his antispam tools into the standard MT distribution. He's only worked there a short time, do you seriously expect quality software to appear overnight?
Re:SixApart is partly to blame by bareshiyth · 2004-12-18 19:33 · Score: 1

Jay Allen's Mt Blacklist is a big plus. I have cut the spam by about 90 to 95%, finally, as I get most the big guys (like the infamous puke "Bob", and "sexpics" guys blacklisted. New ones, with new domains, of course, keep appearing, but one can delete ALL their posts at once, getting around the awful "one at a time" deletion method of MT, once you list them or the significant (and common) par of their URL.

But, I really don't understand how those of you who talk about "free speech" and "freedom of the internet" and bewail "censorship" think massive peddling of the most incredible sorts of porn (like animal and feces and grandma-incest type sex) spam or opportunities to buy fake and killer drugs and penis enlargement are (1) worthwhile and of any socially redeemable value, or worth more than the blogosphere they are destroying (MT will not be the only victim)! You are pretty lame if you think that is worth defending ... it's about like defending murder as a "free expression" or right to "pursuit of personal happiness". Get real, the net will eventually die if you always protect the ones who flood it with hate and puke and feces of their mindless and worthless souls.

I ask again, when will some hacker or virus writer really be as noble and high-minded as they claim to be, and wipe out the operations of such spammers? Kill terrorists, not bloggers!

--
Alcaide's Cafe,
Re:SixApart is partly to blame by fooljay · 2004-12-19 15:27 · Score: 1

Well, You're partially right anyhow. I was hired by Six Apart only slightly more than a month ago and anything that I have my hand on will take a little time to simmer in development.

However, they did NOT hire me "specifically to integrate MT-Blacklist". Just because they hire me does not give them the right to my software.

They hired me because I am a damn good Product manager (not project manager mind you) who also happens to have more hands-on experience with Movable Type than just about anyone else in the world not already working at Six Apart.

Now, certainly, my experience fighting comment spam will help us put up the best defense possible, but my job here isn't over when comment spam is finished on the MT platform.

Or if it is, no one has told me so. :-)

Definitely by casuist99 · 2004-12-18 09:43 · Score: 1

I've been using MT for 2 years now, and the comment spam is actually making a significant bump in the traffic to my server (I doubt anyone else actually reads my stuff...). I had looked at Wordpress a while back and didn't think it was quite "on par" with Movable Type, but MT has done it's best to alienate even myself.

I share my MT installation with my brother. Not surprisingly, we like having our own weblogs. MT now charges for something that simple.

The fact that Wordpress is released under the GPL and is actively developed gives me some further impetus to make the switch.

Thanks for the links - should be useful as I change over from MT over Christmas break.

Re:Definitely by IO+ERROR · 2004-12-18 10:26 · Score: 1

You can take a look at my blog to get some idea of what is available, but be aware that I run nightly builds (don't try this at home, kids!) so a few things you see might not be available. And the Google search box at my site definitely is not part of WordPress, and might never be; I developed that bit myself. I can't imagine anything you can do with MT that you can't with WP.

--
How am I supposed to fit a pithy, relevant quote into 120 characters?

Can someone fill me in? by bigberk · 2004-12-18 09:46 · Score: 1

I am entirely unfamiliar with the issue of spam as it pertains to blogs. Are spammers placing ads (as in, posting their URLs) to random peoples' blogs? Or is the problem that they are just polluting the comment list with random garbage?

If the issue is posting of URLs, then it should be a simple matter of the blog site checking any URLs against SURBL, a spam URL blocklist.

What am I missing here? When did this become such a huge issue?

Re:Can someone fill me in? by crayz · 2004-12-18 10:04 · Score: 3, Informative

A few problems, as a Wordpress user and as someone who's run into problems w/ other people's MT blogs:
- spam bots attack WP and MT through various means, one of the most common being to simply POST to the mt-comments.cgi or wp-comments-post.php URLs on peoples sites
- the bots mainly post huge amounts of links to stupid websites, like viagra or poker strategy. the goal is to get a higher google ranking by having links from many different sites
- the biggest problem for WP users is that you get flooded with literally hundreds of comments per day. if you have good filtering you'll at worst just have to sit around and delete some manually
- the biggest problem for MT users(or that MT users cause) is that because of the poor design of MT, the comments script takes up a huge amount of CPU time. apparently it actually goes through the process of rebuilding the static post pages even when comments are moderated or auto-deleted. now imagine you have 500 posts and they all get hit at the same time - it's something close to a forkbomb on the server

The best solution to all of this is to find a way to prevent the stuff from ever getting posted. Once it's submitted you're going to have to analyze it in some way and decide if its SPAM or its good. There are some simple solutions like renaming the comment post scripts, and some more complicated ones like using a verification number or requiring users to register. In any case, it's a very major problem for almost anyone with a blog.
Re:Can someone fill me in? by 68kmac · 2004-12-18 10:06 · Score: 2, Informative

Yes, they post comments which are basically just a list of URLs with lost of links to their sites. The theory being that this will increase their page rank. Luckily, MT already has a blacklist to filter those out but it has to be updated constantly.

The funny thing is that we (another weblog system, but suffering from the same problem) are seeing a lot of spam posts recently where they put the link text into the href attribute and the actual URL as the link text. Not sure what they're trying to accomplish with that - maybe it's just more proof that spammers are actually stupid ...
Re:Can someone fill me in? by Anonymous Coward · 2004-12-18 10:52 · Score: 0

Well, an efficient and simple solution in SURBL (see parent)... there is already a well maintained, automatically collected list of spammed URLs. I'm sure spammers send the same addresses via email as they post on web sites.
Re:Can someone fill me in? by cowsandmilk · 2004-12-18 12:46 · Score: 1

except they have totally different purposes, emails don't increase the page rank of a site, blog spam (in theory) does.

--
http://sladm.org Saint Louis Area Dance Marathon The Best One Night Stand of Your Life
Re:Can someone fill me in? by jesser · 2004-12-18 13:45 · Score: 1

How would requiring users to register help? Spammers can register more easily than legitimate users can.

--
The shareholder is always right.
Re:Can someone fill me in? by crayz · 2004-12-18 14:04 · Score: 1

Registering is a good way of filtering people. You can force them to do things like get approved first, or get them to provide a valid e-mail address and recieve an e-mail and click on a link within it, etc. Also makes it easier to then kill anyone who's spamming not just by IP or URL but by username

All this prevents the simplistic SPAM bots from just POSTing to your cgi scripts and forces them to jump through hoops
Re:Can someone fill me in? by Anonymous Coward · 2004-12-18 17:42 · Score: 1, Informative

>>In any case, it's a very major problem for almost anyone with a blog.>>

Not true.

I use Livejournal. Virtually everyone I know who has a blog is also on Livejournal.

None of us are having any of these MT/WP problems.

I AM concerned about the weird shit going on with PHPBB right now, where messageboards are being spammed with hundreds and hundreds of bogus user accounts that lead back to spam sites pusing porn or viagra or the like.

Speculation is they are signing up for these accounts so they can put their website URL in their user profile. Then they get Google to index the board's user list, and bingo, they suddenly have what looks like a legit site linking to their porn or viagra or whatever. Of course it's not the site doing the linking.

These accounts are never activated so these people are never posting actual messages. The issues are the bullshit account clogging the user lists and helping the linking. The biggest hassle is that there's no easy way to purge PHPBB users.

Obligatory OSS Advocacy by RAMMS+EIN · 2004-12-18 09:50 · Score: 0, Troll

Bla bla bla bugs yada yada proprietary yatta yatta use open source!

There, HAND.

--
Please correct me if I got my facts wrong.

yep by crayz · 2004-12-18 09:54 · Score: 1

I work for a web host and we've had this issue. 744 on mt-comments.cgi. Sorry guys.

NoIndex HTML Tag by beebware · 2004-12-18 09:54 · Score: 3, Insightful

At the start of this year (Jan 2004), I actually proposed a possible solution to avoid this sort of thing. Basically, Google et al starts recognising:

/

And then bloggers can put the comments section of their sites inside the HTML "no index" markup and hence if they are hit by comment spam, Google and the other search engines ignore that content.

Reusable Proofs of Work by yerdaddie · 2004-12-18 10:03 · Score: 4, Interesting

I myself run an MT blog and have been contemplating moving to wordpress to dodge the spam bullet, however temporarily.

It occured to me thought that what would really fix this is to push the load onto the spammers by building a Reusable Proofs of Work (RPOW) system.

For those who are unfamiliar, RPOW is a proposal to stop mail spam by asking the sender to do a little "work" that would make sending a lot emails computationally too expensive.

As I'm in the last throws of my PhD I'll have to delay on this one, but maybe the lazy web can help out on this one, so the same thing doesn't happen to wordpress or whatever blogging monocultures exist.

Re:Reusable Proofs of Work by saxmatt · 2004-12-18 10:15 · Score: 2, Informative

That's what the WordPress plugin Spam Stopgap Extreme does.
Re:Reusable Proofs of Work by generic-man · 2004-12-18 11:10 · Score: 1

Good idea. I've found that security by obscurity (by avoiding popular software like MovableType) is an excellent deterrent.

It's not a cure nor a viable long-term philosophy, but it's a deterrent. That's all you need to deter 99.999% of the robot scripts that troll MT comments.

In other news, I've heard that simply renaming mt-comments.cgi is an excellent solution. No sarcasm here: security by obscurity really works as a deterrent.

--
For more information, click here.
Re:Reusable Proofs of Work by Newton+IV · 2004-12-18 15:09 · Score: 1

So if I am a small internet company that sends 10000 LEGIDIMATE notification emails per day to its happy users, I will have to buy many server just to "do a little work"?
I am sure mail.yahoo.com can afford this, but I, as a guy running a site with 40000 users, cannot.

These smart measures from MIT will lock out small guys from making useful websites with several thousands users, and will leave internet to yahoo and msn.
Re:Reusable Proofs of Work by faedle · 2004-12-18 15:19 · Score: 1

What are you using to service 40,000 users, a 486DX?

The reality is, if you are indeed servicing a large userbase who is sending a legitimate volume of E-mail, it will be computationally trivial. As computationally trivial as doing a complicated DNS lookup or a simple MD5 rehash.

Additionally, some of the proposals I've seen allow an end-user to purchase "stamps" directly, offsetting the computational costs of an ISP (this is done by running a Java client on the machine).

Lastly, all of the proposals I've seen allow for a "whitelist wormhole" that would allow a user to whitelist mailing lists that they participate in, removing the requirement that they bear "postage". In fact, one of the proposals even suggested that mail labelled as "Priority: Bulk" be allowed a free ride, because that would, finally, allow a tag that could be easily filtered against (if it contains the tag, it's spam: and if the address isn't whitelisted, it gets sent to /dev/null at the MTA), and would allow intelligent MTAs to decide how much effort to put in to delivery, allowing "stamped" E-mail to get in the front of the line.

In short, stop whining. If you have 40,000 users, and you can't afford to spend a few cycles of CPU time, you have a shitty business model, outdated computers, and/or are probably a spamhaus.
Re:Reusable Proofs of Work by Newton+IV · 2004-12-18 15:32 · Score: 1

you have a shitty business model, outdated computers, and/or are probably a spamhaus.
I challenge you to come up with an idea for a site that will get 40000 subscriptions in 4 months. This is not taught in course 6... Also, a website like Wikipedia does not have the greatest business model, but still needs to send emails. And it is a useful site.

The reality is, if you are indeed servicing a large userbase who is sending a legitimate volume of E-mail, it will be computationally trivial. As computationally trivial as doing a complicated DNS lookup or a simple MD5 rehash

Do you realize, that if it is trivial for me, it will be also trivial for spammers?
Re:Reusable Proofs of Work by faedle · 2004-12-18 15:42 · Score: 1

No, it won't be trivial for spammers.

The whole spammer business model requires the transmission of literally tens of millions of E-mails, essentially at once. That's the whole point of "Proof of Work" systems is that they are computationally cheap for a few, but increasingly expensive as the volume goes up. Much like cracking crypto: if you have the right key, it's trivial, but if you don't, it is quite a challenge.

Wikipedia needs to send E-mails? Oddly enough, I've been a very active participant on Wikipedia, and I think the only E-mail I may have ever recieved from them was one verifying my E-mail address when I signed up. In a "Proof of Work" concept-system, I could provide (as part of my registration) a MD5 hash "stamp" that would allow their system to E-mail me without doing the work. Nearly all "proof of work" systems I've seen discussed have this ability (if not obviously stated, it's inherent in the design).

Again. Stop whining. This will not affect any legitimate E-mailer, and again might return some sanity to the E-mail system. It has the power to balance the scales, so the legit bulk e-mailers (like mailing lists, etc.) and those who want "spam free" inboxes can peacefully co-exist.
Re:Reusable Proofs of Work by Newton+IV · 2004-12-18 15:52 · Score: 1

I wrote "like wikipedia", so do not take it too literally. There are many websites that are very useful, but do not generate tons of money. Look at Xanga.com for example. According to Alexa ranking, it's the 26th largest site on the internet, and it is by far the largest bloging site. They send email notifications when their users message each other through the site. Their userbase is 16 years old on average, which means two things:

a) They make money, but not a LOT of money
b) These kids will never submit any MD5 hash stamps, unless ALL email clients comply and make it very easy for them , which will never happen.

I think you overestimate the intelligence of an average internet user, which I know from experience. Stamps, MD5 hashes will not do.
Re:Reusable Proofs of Work by stinkbomb · 2004-12-18 16:14 · Score: 2, Funny

As I'm in the last throws of my PhD...

What's the saving throw vs. dissertation committee?
Re:Reusable Proofs of Work by faedle · 2004-12-18 16:31 · Score: 1

a. Again, and you have not demonstrated anything to the contrary, many of these sites are running on powerful enough machines that the slightly additional computational load will not affect them dramatically.
b. Most 16-year-olds are waaay more computer literate than you are giving them credit for. To paraphrase George Carlin: "if they can program their #(%*# VCR, they can bloody well learn how to use.." a tool that says "copy this line and paste it into your registration". If that's even required: the reality of it is, it woudn't be, because the Java code would already be on the page. "This webpage will download a small Java program that will generate for us a valid token that will permit us to send you E-Mail. This should only take a few seconds". Guess what, dorkhat? Not only will this be able to generate the "stamp" for the validation E-Mail, but will effectively eliminate automated signups, too, because it will also be impossible to bulk-signup user accounts because of the same process.

I did overestimate the intelligence of one particular Internet user, however.
Re:Reusable Proofs of Work by Newton+IV · 2004-12-18 17:16 · Score: 1

I did overestimate the intelligence of one particular Internet user, however.
Your own?
Re:Reusable Proofs of Work by faedle · 2004-12-18 17:32 · Score: 1

Apparently, for trying to convince an idiot that a well-thought out system might work.

Wasted breath on a fool, apparently.
Re:Reusable Proofs of Work by Anonymous Coward · 2004-12-18 22:49 · Score: 0

Advice:

Please send your idea to marketing, your work is obviously done.
Re:Reusable Proofs of Work by Anonymous Coward · 2004-12-20 02:52 · Score: 0

So if I am a small internet company that sends 10000 LEGIDIMATE notification emails per day to its happy users, I will have to buy many server just to "do a little work"?

You do realise that he was suggesting using this for comments on blogs, not email, don't you? Quit it with the stupid kneejerk flames.

Nobody reads blogs anyway. by Anonymous Coward · 2004-12-18 10:07 · Score: 0

The blog authors doubtless believe that the whole world is beating a path to their little diary but the fact is they're talking only to themselves.

Nobody cares what some zit-faced teenaged virgin thinks about anything, and nobody is going to waste their time reading those thoughts on some angst-ridden, semi-literate webpage.

Hell, they don't have any worthwhile experiences to share, and precious-little -- if any -- knowledge about anything not pertaining to pr0n sites.

This is not a tragedy in any way.

Re: flamebait my ass by jgaynor · 2004-12-18 10:11 · Score: 1

The link above was funny as hell and explained the MT load issue in far more plain language than the original article! Somebody waste some points and get that back up out of the negatives . . .

But isn't that the kind of area you would want? by SuperKendall · 2004-12-18 10:13 · Score: 1

It might help, but I would rather have Google be searching the comments as well as the main post! Even if comment spam is a problem, you don't want to loose all the other comments that might have value.

Perhaps Google could recognize a Moveable Type site and just ignore comments from them.

--
"There is more worth loving than we have strength to love." - Brian Jay Stanley

Re:But isn't that the kind of area you would want? by Anonymous Coward · 2004-12-18 11:23 · Score: 0

Rather than noindex, nofollow might make more sense. Thus comments are indexed, but links aren't counted for PR.

mt-blacklist by stripmarkup · 2004-12-18 10:22 · Score: 1

I tried renaming the comments script and it worked for a while, but spammers are smart enough to work around that. Lately I had been getting spam even a few minutes after renaming the script.

I installed mt-blaclist, which pretty much solved the problem for me. It allows you to search by regular expression and massively de-spam and blacklist the urls they point to. All subsequent comments containing those urls or other known spam expressions get trashed automatically.

--
See charts for twitter trends on Trendistic

Because spambots don't care by stripmarkup · 2004-12-18 10:25 · Score: 1

I disabled html in comment posts a long time ago. Spammers don't care, their spambots keep spamming blindly. Statistically, they will find lots of sites that allow html.

--
See charts for twitter trends on Trendistic

Re:Because spambots don't care by Anonymous Coward · 2004-12-18 22:53 · Score: 0

Exactly. If you disable HTML posting, they'll still spam you - you'll just see <a href="http://www.somespamsite.com">Free Porn!</a> instead of a link. Which is, to me, even more annoying.

that's not the usage in this context by Trepidity · 2004-12-18 10:30 · Score: 1

While it started from FreeRepublic users, the verb "to freep" now can refer to hordes of people from any political blog, whether right- or left-leaning. The two most common sources of freepers are FreeRepublic itself (right-wing) and DailyKos (left-wing).

--
10 PRINT CHR$(205.5+RND(1)); : GOTO 10

Hey I here there's already some software for this by Trepidity · 2004-12-18 10:32 · Score: 1

"Blog" software predates the existence of a separate category of "blog software", and most of the older stuff works better. SlashCode, I hear, has been known to run several high-traffic sites. There is also Scoop, which was developed for kuro5hin.org, and used at a few other places (like dailykos.org). Both are also much more full-featured than your average "blog software", especially in that they include threaded comments.

--
10 PRINT CHR$(205.5+RND(1)); : GOTO 10

It's tough on us serving from home by Biggerveggies · 2004-12-18 10:33 · Score: 1

I've used Wordpress ever since it branched off from b2. Unfortunately, its success has made it a good target for comment spam. The available plugins, such as Farook's WPBlacklist , work really well. However, the amount of incoming spam attempts is sort of like a DDOS attack on us little guys who have servers running on their home cable lines. It just disapointing that we have to put up with this.

multiple blogs by Skeezix · 2004-12-18 10:39 · Score: 1

Do they support multiple blogs with a single installation yet? That was the big reason I didn't move to Wordpress a while back...

--
Celebrate the finer things in life

Re:multiple blogs by IO+ERROR · 2004-12-18 10:48 · Score: 2, Informative

Multiple blogs are partially supported in 1.2, and 1.3 will have much better support for this type of installation (e.g. web hosting, etc.)

--
How am I supposed to fit a pithy, relevant quote into 120 characters?
Re:multiple blogs by jacobito · 2004-12-18 10:50 · Score: 1

This was also a showstopper for me; I passed on Textpattern for the same reason.

(As an aside, solid multiple blog and multiple user support is one of Movable Type's best features, and it irks me that so many MT plugin developers write their code under the assumption that every MT installation only has a single user.)

Authentication Images by Joystickit · 2004-12-18 10:42 · Score: 1

The solution is to impliment authentication images, much like paypal or the like use when you register. It generates some odd-looking image with a few characters and digits in it, and you as the user have to type it in.

There is a system like this for wordpress called wp-authimage that works quite well. You do have to know a bit of php and it requires GD on your websever, but neither of those things are super-difficult. I used it on a blog I run with some friends and it works quite well. Our comment spam went from 100+ per day with MT to 0 with wordpress and this system.

Netcraft? by Chuck+Bucket · 2004-12-18 11:06 · Score: 1

Netcraft comfirms it; Movable Type is dying!

Sorry, had to plug that one. I run Drupal for my CMS, and lately I've been getting some 'free poker' spams in my comments. I've installed the Spam module and am holding my breath. Do modules like that work in MT?

Time for me to go check my friends MT sites...

CB

--
free ipod and free gmail!

Re:Netcraft? by mike3k · 2004-12-18 15:17 · Score: 1

Almost every Drupal site has been hit with those spams. See the discussion at drupal.org. They seem to look for random low numbered nodes and add the comment to them. Using the spam module, closing comments on all but the most recent nodes, and adding a deny for their user agent eliminated all of them on both of my Drupal sites.

Cheap and cheerful spam blocking for MT 2.6 by ianmacd · 2004-12-18 11:11 · Score: 1

Here's a patch to prevent comment spam for those of you left out in the cold when Movable Type abandoned MT 2.6.

--
Ian Macdonald, Linux sysadmin & Ruby hacker

Re:Hey I here there's already some software for th by DrSkwid · 2004-12-18 11:29 · Score: 1

When I was a lad we had the crazy stuff called newsgroups.

You could post to them, they we're threaded, they had an RFC protocol called NNTP and all sorts of programs understood them. Some of them were even moderated.

I wonder what happended to them?

--
There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter

What about a good blog friendly host? by Sam+Jackson · 2004-12-18 12:30 · Score: 1

I remember hearing the horror stories about WP users doing a fresh install and right after getting flooded with all sorts of comment spam. But I've been using it for quite awhile and I've never got one bit of spam in my comments. I am running various spam plugins and I assume they are working like a charm.

Also I'm not sure if this has anything to with it, but my site is hosted on a blog friendly host. BlogOmania supports all types of CMS, and they have a very firendly and reliable support staff.

--
--- hows it taste mother f$#@er!!!

Give this man a prize by SethJohnson · 2004-12-18 12:41 · Score: 1

Here's the deal. Everyone rolls their own solution like rscrawford has. Some people embed their own hidden fields, which is a great idea. Some people code javascript on the client that forces a pause of 20 seconds before the value of a hidden field is embedded.

Obfuscation can really make the work of the spambot writers more expensive than it's worth. Then they'll move elsewhere.

--
$5 / month hosted VPS on linux = awesome!

CAPTCHA - Politically Incorrect, but effective by diggory · 2004-12-18 12:45 · Score: 2, Interesting

I run WordPress and used to get hit by many casino/cialis spams. I found that I get no comment spam after using a WP hack (http://www.gudlyf.com/index.php?p=376) called AuthImage, which is a CAPCHA (basic Turing test based on character recog.) I strongly recommend it, and would be grateful to any OSS vigilante who could port it to a proper WP plug-in.

Re:CAPTCHA - Politically Incorrect, but effective by dourk · 2004-12-18 21:01 · Score: 1

Look into Spam Karma.

--
Wake up.

Poetic Justice? by Anonymous Coward · 2004-12-18 12:47 · Score: 0

Seems like poetic justice to me, seeing as how the vast majority of weblogs are mere "WWW spam" in the first place. And they seriously fuck up google results.

Better colours by Anonymous Coward · 2004-12-18 12:54 · Score: 0

http://shit.slashdot.org/article.pl?sid=04/12/18/1 827225

Not Just Me! by suwain_2 · 2004-12-18 14:21 · Score: 1

I'm kind of excited, kind of disappointed. I run a blog with ten different posters running MT. We've been getting slammed with comment spam lately. I just assumed it was in relation to Google starting to move my site up a bit in the ranks. Apparently not. :(

At first, most of the spam was from obviously-fictitious domains. I earned myself weeks of absolute lack of spam by throwing this into /lib/MT/App/Comments.pm -- I started mine a few lines after line 150 in my case:

# If an e-mail address is given... make
# it resolve to an IP
# Added by suwain_2
require MT::Blog;
my $blog = MT::Blog->load($entry->blog_id);

if($q->param('email')) {
my $email = $q->param('email');
my @email = split(/@/, $email);
unless(gethostbyname($email[1])) {
return $app->handle_error($app->translate("You pathetic loser. Your e-mail address doesn't resolve to a domain."));
}
}

I don't track how many people are being turned away by this; I still find myself cleaning up spam on a regular basis, but at least at first, I completely stopped spam. I now get a fair deal with 'real' domains that I just clean up by hand.

I also whipped up a little PHP utility that shows the 50 most recent comments; clicking on a field will show all results that match that field. I can easily find people posting under a particular address, or from a certain IP, and delete them. It's pretty crappy, but if people are interested, I'll post it? (Another script I have goes through and auto-rebuilds all the blogs.)

Hope this helps someone?

--
________________________________________________
suwain_2 :: quality slashdot p

Re:Not Just Me! by cyways · 2004-12-19 03:40 · Score: 1

unless(gethostbyname($email[1]))

Many people have a legitimate @domain.name address that doesn't resolve to an IP address. In fact, resolution only works if someone assigns an "A" record to the domain itself.

You really should be checking for an MX record. For instance, I use a PHP function in form processing to validate email addresses by running a "host -t mx domain.name" command.

Frankly, though, I doubt any of this would matter much if our experience with spam is any indication. Nearly all spam comes with forged From addresses that resolve to a legitimate domain; it's what's called a "Joe-Job" in the spam world.

Tyranny of the Minority by Esion+Modnar · 2004-12-18 15:04 · Score: 1

(Not minorities. Don't even start.)

Just takes a few assholes to ruin a public resource. They're like the people who steal and/or vandalize phonebooks in the public phone booths.

Bring punch to the party, and somebody will want to piss in it.

--

They say the first thing to go is your penis. Well, it's either that or your brain. I forget which...

Proper Apache configuration helps too... by bschoate · 2004-12-18 15:23 · Score: 1

If you're doing shared hosting and you allow your users to run CGIs-- regardless of what CGI it is-- you should have reasonable limits in place that keep child processes in check. Apache has had such directives for doing this for some time, one of them being RLimitNPROC. This directive allows you to limit the number of subprocesses that Apache will run concurrently.

You can even specify subprocess limits on a per-virtual host basis. With Apache 2, you can even limit based on directory. Using RLimitMEM is also a good idea.

Yes, MT's comment system can use some improvement. We're working on that. But these servers are getting hammered; in effect a denial-of-service style attack.

Even a "Hello, world" type script can be hit hard enough to bring down a server, assuming there are no process limits in place. Invoking a modern interpreter to execute a CGI script is no small feat. Perl, Python, Ruby, and even PHP (when run as a CGI as many shared hosting companies do for security reasons) consume enormous amounts of resources at startup regardless of the size or complexity of the script they are summoned to execute.

So, sure, code can be added to MT to recognize and adapt to a flood of comments coming in, but by the time the CGI runs, it's already chewing up CPU and memory. In my opinion, a better defense for these flood-style attacks is for Apache itself (or third-party in-memory Apache modules) to handle such situations.

mod_security, mod_dosevasive and others are excellent defensive tools for any public Apache server admin to use.

I'd love to know what others have done to configure Apache to prevent denial-of-service attacks.

Re:Authentication Images and Fuck Accessibility by Anonymous Coward · 2004-12-18 18:29 · Score: 0

Thanks very much for making the net harder to use for blind people. As a blind net user I congratulate you on building up yet more barriers to the fluid and accessible use of the net.

At least some of those sites that use auth images are now also using sound samples, which is somewhat better (if not perfect).

Not good for visually impaired users by Anonymous Coward · 2004-12-18 19:03 · Score: 0

As has been discussed in other threads, the use of auth images poses a huge roadblock to visually impaired/blind users. Some other technique that requires a user to interact should be developed... something like a random word/math problem. The answer could still be displayed on the page as an image, for the cognitively impaired... but it doesn't rely on just being able to see the graphic (of course that leaves out those who are both blind and cognitively impaired).

There must be some better method of determining that there is a real person submitting the form, that doesn't penalize those who may have sensory impairments.

not such a big deal by SethJohnson · 2004-12-18 19:41 · Score: 1

Hey, those people can still read your blog. They just can't post comments to it. In the context of all the other shit they're prevented from doing because of blindness, it's not such a big deal.

--
$5 / month hosted VPS on linux = awesome!

The problems are bigger than they say by mcgroarty · 2004-12-19 00:38 · Score: 1

The problems are somewhat bigger than they mention. MT performs some very heavy database activity to even get to the point of finding that comments have been disabled completely. Even without triggering the page rebuilds, several hundred requests coming in will grind the server to a halt. The problem is compounded if you're running a flat database backend like sqlite, which does huge memory allocations and can launch you into a swapfest.

Given that instances of mt-comments.cgi are expensive even when they net no change to your database or your blog pages, server load is unbearable when there are a large number of concurrent instances. Now, there is a problem with either apache or mt-comments.cgi that makes mod_throttle's per-IP connection limiter fail. The current popular comment vandal's script opens a connection, sends a GET request with the post instructions as CGI arguments, then closes immediately. mt-comments.cgi continues running even though the connection has been dropped, and it doesn't count against concurrent connections from that IP. I don't know if mt-comments is ignoring a sigpipe from apache, or if apache is failing to send it. Either way, the cgi keeps running even though it's not being counted anymore.

My solution was to add code to the head of each instance of mt-comments.cgi. It sleeps for a second, then checks for an unreasonable number of mt-comments.cgi running. If too many instances exist, it dies without getting to the expensive database access. Until Six Apart release a new MT, this may be helpful to you. Add this to the "eval" section:

sleep 1;
$numrun = `ps ax |grep [m]t-comments |wc -l`;
if ($numrun > 3 )
{
die "Too many mt-comments running";
}

Caveat: I'm not a perl programmer. Somebody else can write this more elegantly.

Re:The problems are bigger than they say by sparkane · 2004-12-19 04:11 · Score: 1

That solution probably won't work for Windows users. I'm interested in hearing what you think of this idea. Normally there are locks put on database records. What if there were also "locks" put on .cgi files or entire directories? When a user calls mt-comment.cgi (or whatever), the process first adds a "lock" record to a table (or a file or whatever) for the .cgi or parent directory, and subsequent calls to that .cgi check the lock table/file for a call from the same IP (or name your criterion). If that IP/criterion is found, the call dies. At the end of the original .cgi process, the lock is removed and additional calls can be made. Or, the lock could be scheduled to be removed at a later time, say after 2 minutes, or even 1 minute (whatever won't bug the users too much).

Obviously the lock check has to be as inexpensive as possible. If the locks are stored in a database table, though, it would only require a single query. And of course, the criterion for locking is crucial; if a spammer fakes a different IP for each comment-spam, the IP criterion won't work. And also of course, the lock garbage-collection (if locks are scheduled to be removed after an amount of time) has to be fast.

Wow. by Anonymous Coward · 2004-12-19 09:53 · Score: 1, Interesting

This is what they said to us about all of the problems MT causes our servers:

"We have MT running at a number of hosting companies with a variety of
configurations without an issue. "

Sure but what is this?

site1
Top Process %CPU 99.9 /usr/bin/perl -w mt.cgi
Top Process %CPU 12.0 [analog ]

site2
Top Process %CPU 99.9 /usr/bin/perl -w mt.cgi
Top Process %CPU 99.8 /usr/bin/perl -w mt.cgi

Long live PHP-only blogs.

don't forget Sourceforge by gmhowell · 2004-12-20 07:47 · Score: 1

Don't forget that sourceforge, owned by the same company as Slashdot, hosted floodmt for a time. Way to go guys!

(And yes, I'm one of the bloggers mentioned on the floodmt pages)

--
Jesus was all right but his disciples were thick and ordinary. -John Lennon

Slashdot Mirror

Comment Spams Straining Servers Running MT

186 comments