Google's Research on Malware Distribution
GSGKT writes "Google's Anti-Malware Team has made available some of their research data on malware distribution mechanisms while the research paper[PDF] is under peer review. Among their conclusions are that the majority of malware distribution sites are hosted in China, and that 1.3% of Google searches return at least one link to a malicious site. The lead author, Niels Provos, wrote, 'It has been over a year and a half since we started to identify web pages that infect vulnerable hosts via drive-by downloads, i.e. web pages that attempt to exploit their visitors by installing and running malware automatically. During that time we have investigated billions of URLs and found more than three million unique URLs on over 180,000 web sites automatically installing malware. During the course of our research, we have investigated not only the prevalence of drive-by downloads but also how users are being exposed to malware and how it is being distributed.'"
Three million out of billions is not bad, assuming randomness (only, say 1 in 1000 chance of using a bad URL), but it is a lot worse than 180k out of billions.
However not all URLs are used equally. Bad URLs linked to some popular pron site, for instance, will get hit a lot more than Joe Sixpack's facebook site.
Engineering is the art of compromise.
H1 = Very important
H2 = Pretty important
H3 = Important
H4 = Less important, but still important
H5 = Less important
H6 = Even less important
JavaScript (yes) = Punish website
JavaScript (no) = Reward website
JavaScript OnLoad = Double punish website
HTML/XHTML compliance = Reward website
HTML/XHTML not compliance = Punish website
RSS feed = Reward website
Hyperlinks that contain more than 10 words = Punish website
Hyperlink directly after hyperlink = Punish website
ActiveX = Punish website
RealAudio = Punish website
QuickTime = Punish website
Ogg Vorbis = Reward website
FLAC = Reward website
Well its in Google's best interest to fight this, as Malware has the potential to affect their business.
Really, as much as I am not a MS basher, malware is almost entirely Microsoft's fault. If they had paid attention back in the day to security, we wouldn't have the steaming swamp of malware we have now.
The only serious way to fight malware is to reduce the potential infection hosts.
fighting this is just like fighting any sort of sickness or plague. If you have enough immunized hosts, they the issue won't be as bad.
x86
Get thee glass eyes, and, like a scurvy politician, seem to see things thou dost not.--King Lear
The problem with that is the number of sites that happen to host malware without meaning to. Too often the malware comes through advertising services or sneak through in user generated content that would be fine if not for a browser vulnerability. Google does a lot as it is, outright blocking the sites goes too far (unless that's the only thing that the site is made for, which is rare and would probably mean that the site is ranked low in the first place).