Is IRC All Bad?
An anonymous reader writes "IRC is often portrayed by the media as a haven for illegal activity. The author of IRC Hacks set out to find whether or not this was true. His conclusions are quite alarming, suggesting that 99.9% of IRC usage is illegal although he backs up IRC by saying that it is also used for lots of constructive purposes and is used by open source software developers." Update: 01/21 05:17 GMT by P : The author claimed it was merely 99.9% of traffic "to the top 60 channels" that is illegal, not 99.9% of all IRC traffic.
Ah, IRC, where men are men, women are men, and 14-year old girls are FBI agents.
Yeah, I buy 99%, although the last time I logged on it was for help with my Slackware box.
If nothing else, IRC has given us bash.org.
If my answers frighten you, stop asking scary questions.
99.9% is an entirely sensationalized number. It means nothing. If you actually read through, he's claiming 99.9% of the top 60 public channels on IRC are largely illegal behavior. That's not 99.9% of IRC. The warez related channels are large, and there are many people who use IRC just for that. But there are many people who actually use IRC for the purpose it was intended, to chat.
I'm an oper on efnet, so I'm well aware of the fact illegal activity goes on on IRC. Depending on the illegal activity, we can and do take action. We regularly remove drone runners, hacked bots (drones or XDCC), spammers, and other malicious users. Do we actively pursue copyright infringers? Not generally. Besides the fact there's simply too many of them, they're generally not harming our network or each other so they're a low priority.
Me? I use IRC for chat primarily, and most people I know do the same.
His conclusions are quite alarming, suggesting that 99.9% of IRC usage is illegal
From TFA: Based on those keywords being monitored, 99.9% of IRC traffic to the top 60 channels is "illegal".
Clearly, (all) IRC usage != IRC traffic to the top 60 channels.
Duh?
Most people use IM now, so there's less need for the casual user to read the following:
captnitro: hey whats goin on
ice8229: no fuck that
captnitro: what?
peebles: your mother is a whore, you know it
ice8229: i'm not going to buy a goddamn program just to rip
ice8229: anybody know of an open one?
fisher0: i kno cuz i fuckerd her d00d
captnitro: what the hell is going on here?
adbot: MP3Z MOVIEZ WAREZ BAGELZ go to 62.182.100.10
binaryman: 1000100011110101
captnitro: huh?
binaryman: 1001111010111110
sharky: get out n00b
fisher0: i am not a virgin i so fskced her! in the ears
pornking: anybody want to cyber?
10yearold: yes
Clearly the domain of kings.
He goes searching for warez (using four keywords related to popular software) and when he finds it he declares 99.9% of IRC usage is illegal? What about the linux support, gaming forums, etc... and there have to still be people that use IRC for plain old chat. I think these numbers are a bit misleading.
Binaries (if you mean warez packaged in nicely formatted rars or isos) don't get sent over IRC. They use DCC.
This just in over the wires, everywhere is reporting that planet Earth is a haven for illegal activities. Without exception, in every town of every province of every country, earthlings are violating (where applicable)local, state and federal laws. In conclusion, people cannot be trusted and Martial Law must be declared!
O' Big Brother, where art thou?
Warez channels tend to be huge, so no surprise it'd be easy for them to make the top 60.. But what if there are 10 times as many legit channels, but they only average half as many users per channel. Now we've suddenly gone from 99% to 20%...
On that same note you need to figure in how many users are being counted more than once by being in more than one warez channel. Perhaps it's different on other networks, but people that come into channels I op in that have warez channels in their whois list tend to have a dozen or more warez channels listed, while those in only legit channels usually have 3 or less.
Shoot Pixels, Not People!
Nerd: Derogatory term typically directed at anybody with a lower Slashdot ID than you.
From the article, all he found is that most of the time Microsoft, Norton, Symantec, and Jasc are mentioned in 60 particular IRC channels, it's in relation to illegally downloading their products.
He didn't look at the vast majority of IRC channels, and of those he did, he didn't consider the vast majority of the traffic within them -- just those four words. Additionally, he failed to observe any distinction between engaging in an illegal activity and simply mentioning it.
This is a bit like visiting the 60 largest train stations, measuring how many times the word "score" is used in relation to illegal activity, and concluding that 99.9% of the world's public transport users are drug trafficking.
I should buy some cement.
This entire post is like flamebait for some of us.
I've been an oper on DALnet for six years now, and I currently lead up their coding team, so allow me to shed some light on this - assuming this makes me qualified.
The top 60 channels. Who goes to huge channels to chat? Ever tried talking in a channel with 20 active users? Try 800 active users. Nobody goes to large channels to chat, its pointless to even try. The folks that join these channels join looking for something specific, or to offer something. They find what they are looking for, and move on.
On DALnet, we've taken agressive action against warez, child porn, and drones. Drones are unfortunately the only item that I can speak on authoritively - we reject about 300 drones per second on any given server on our network. This is done through pattern matching in their registration. Drones is a serious problem on any network. A while back (five years or so), dianora of efnet did some drone hunting, and concluded that around 60% of "users" on irc were accually drones - hacked end-user computers. Drones are a far worse problem than people realize.
A few years ago, DALnet was seriously DDOS'd - we went from the top network in the world (around 140,000) to next to nothing. Our servers sometimes got hit with DDOS attacks in the range of 60 Gigabits per second. We shut down major providers, rendered entire datacenters useless, and obviously lost servers quickly. We've since changed our routing methods to rely heavily on anycast, and changed a lot of other things.
In my mind, DALnet is one of the networks that accually has one of the lowest noise ratios around. Quakenet, the current leader in usercount, raises questions with me. Their usercount rose very fast, and I wonder about their userbase. I personally know only -one- person who uses quakenet. You mention DALnet, Undernet or EFnet and people identify much more readily. Even more people use small IRC networks with 50-500 users.
99.9% for illegal purposes - bullshit. If you go to irc only to look for warez, then I think you are in the minority. I'd put illegal purposes around 5% at best. And that means real, live people at the keyboard, looking for illegal material.
.
Yes, this is an extreme example of how NOT to conduct a study. He started by chosing the 60 most popular channels - by definition they were not typical. There are 50,000 channels on undernet alone with an average of about 3 users each. Then he chose 4 keywords that are likely to be used much more for warez than legitimate conversation. The results would have been very different if the channels and keywords had been chosen randomly. Of course, if he had chosen a small number of keywords randomly, the results would probably have been 0.00% illegal traffic since the vast majority of the words used on IRC don't name products that are pirated, so the approach of examining the relative rates of legal and illegal use of particular keywords is itself flawed even if your choice of keywords isn't. Relative frequency of many different keywords in some cases could give some clues though there are statistical problems with this. "ROFLMAO" is more likely to be found in legitimate messages whereas "systemworks" is more likely to be found in piracy or SPAM (though it can occur in many legitimate contexts as well). A bayesian filter that looked at ALL keywords could have been used to separate the legal from illegal traffic after extensive training and used to extend the study over more messages and channels than could be done by hand.
And of course, his statistics (or even much better ones) won't tell you if, for example, 37% of the bots offering downloads are run by BSA, RIAA, and MPAA so they can collect IP addresses of pirates and 87% of the download requests are dummy requests they generate to make it look like everyone is doing it (to make it look like it is safe to download so they can entrap people as well as inflating statistics they can trot out later).