Microsoft Bots Effectively DDoSing Perl CPAN Testers
at_slashdot writes "The Perl CPAN Testers have been suffering issues accessing their sites, databases and mirrors. According to a posting on the CPAN Testers' blog, the CPAN Testers' server has been being aggressively scanned by '20-30 bots every few seconds' in what they call 'a dedicated denial of service attack'; these bots 'completely ignore the rules specified in robots.txt.'"
From the Heise story linked above: "The bots were identified by their IP addresses, including 65.55.207.x, 65.55.107.x and 65.55.106.x, as coming from Microsoft."
Lazy, feckless, inconsiderate crooks.
Looking at another Robots.txt file seems to return what I expect.
Let no rock remain unthrown when it shows Microsoft is in the wrong - even if they aren't
I have mod points and I am not afraid to use them
You don't need the "www" prefix to figure out that requests for port 80 are http, port 21 are ftp, 443 are https, 25 are smtp, and 110 is pop3.
For those wanting to try this at home and work around their providers' traffic blocking: You also don't need a the power consumption of a repurposed box for that when you can use port forwarding on a router. It'll even let you use one of your boxes on your home lan as a public-facing web/ftp/mail/whatever server (and you can set them up to listen to alternate ports, like 8080 for http, and 2525 for running your own private mail server). Throw in a redirect to your external ip from a known web page, and you're in business. You can even run a proxy that way.
Shit happens. People misconfigure things. Even professionals. Someone noticed, complained, and someone else said they'd investigate and get resolved. Wow. Yawn.
Instead we have Slashtroglodytes screaming about conspiracies by MSFT.
No, it doesn't. The rfc the poster quoted was about naming machines in general, NOT specifically about naming web servers. The title was "Choosing a name for your computer".
The pertinent part says"
In other words, don't name your machine "slashdot" and expect it to work all the time.
And:
And, as I point out, it's only a suggestion, now rendered obsolete by 20 years of practice:
Route your traffic to the right server based on the port requested. "cat /etc/services" for the list. No need for subdomains.
All they need to know is example.com.
Q: What's your domain name?
A: example.com
Q: So what's the name of your ftp server?
A: example.com.
Q: What's the smtp mail server?
A: example.com.
Q: What's your pop3 server?
A: example.com.
Q: So they're all on one machine?
A: No. We use magic pixie dust.