Slashdot Mirror


51% of Internet Traffic Is "Non-Human"

hypnosec writes "Cloud-based service Incapsula has revealed research indicating 51 per cent of website traffic is through automated software programs, with many programmed for malicious activity. The breakdown of an average site's traffic is as follows: 5% is due to hacking tools looking for an unpatched or new vulnerability within a site, 5% is scrapers, 2% is from automated comment spammers, 19% is the result of 'spies' collating competitive intelligence, 20% is derived from search engines (non-human traffic but benign), and only 49% is from people browsing the Internet."

40 of 125 comments (clear)

  1. Obligatory by Warhawke · · Score: 5, Funny
    1. Re:Obligatory by cosm · · Score: 4, Insightful

      When the singularity arrives, it wont be the T-900 to fear, but instead incessant little gnat-bots that swarm anything with a wallet. It seems the internet is denigrating to just another platform of the one true age-old human behavior--scheming and conning to get the most precious thing you have. Your personal information and your money. And this does not surprise me--for a technological system/network created by humans will be just as full of our flaws and intrinsic 'mental' malfunctions as any non-silicon process our species oversees. Evolutionary my dear Watson.

      --
      'We are trying to prove ourselves wrong as quickly as possible, because only in that way can we find progress.' RPF
    2. Re:Obligatory by Anonymous Coward · · Score: 4, Interesting

      That sounds a lot like the singularity in Accelerando. Basically the computer programs end up being so much richer than the humans that the story follows some humans that run away from the solar system because otherwise they can't afford to live.

    3. Re:Obligatory by lightknight · · Score: 2

      The question is, in a contest to see who could be more evil, who would win, the AI or the humans?

      --
      I am John Hurt.
    4. Re:Obligatory by SharpFang · · Score: 2

      The remainder is kids in Cambodia and Mexico seeking out places to spam and sending messages manually for $0.01 per 100 spams.
      Which is not automated traffic.

      --
      45 5F E1 04 22 CA 29 C4 93 3F 95 05 2B 79 2A B2
    5. Re:Obligatory by DarkOx · · Score: 3, Insightful

      My guess is humans will be more evil, we are innovative in a way its hard to imagine an AI will be. It won't matter though. The AI will adapt, adopt, and iteratively improve on our ideas; using them against us so much effectively than we could ever hope to do.

      --
      Repeal the 17th Amendment TODAY! Also Please Read http://www.gnu.org/philosophy/right-to-read.html
    6. Re:Obligatory by Anonymous Coward · · Score: 2, Interesting

      Just have to remember that Corporations are psychopathic artificial beings that are created by us, so that's how evil our creations can be.
      If you create an AI to trade, merge and "generate wealth", then it would end up being just like a cold blooded Corporation.

  2. Hmmm by koan · · Score: 4, Funny

    "only 49% from people browsing the Internet." I wonder how much of that 49% is porn.

    --
    "If any question why we died, Tell them because our fathers lied."
    1. Re:Hmmm by Mashiki · · Score: 2

      48.93% the other 0.06% is facebook, with 0.01% making up twitter spam.

      --
      Om, nomnomnom...
    2. Re:Hmmm by Anonymous Coward · · Score: 5, Informative

      Hate to bring sources into a slashdot conversation, but Sandvine's 2011 report has 53.6% as "real-time entertainment". 29 percentage points are Netflix, 10 are YouTube.
      So if those numbers are correct, roughly 15% of the Internet is porn.

    3. Re:Hmmm by pjt33 · · Score: 2

      That sounds like US Internet traffic rather than Internet traffic as a whole. Netflix don't operate in most of the world, whereas YouTube does.

    4. Re:Hmmm by Lanteran · · Score: 2

      I am the 0%.

      --
      "People don't want to learn linux" hasn't been a valid excuse since '03.
  3. Aliens! by DangerOnTheRanger · · Score: 2

    I knew it!

  4. Reading tools? by stoborrobots · · Score: 2

    Which of those categories do data analysis and aggregation tools fall into?

    I'm thinking of user-focused tools like RSS Readers, Stock Quote graphers, etc... They're automated non-human tools which access websites, but it's not clear how they are being categorised...

  5. web !=internet by Anonymous Coward · · Score: 5, Insightful

    the article seems to be about websites, not the intetnet

    1. Re:web !=internet by icebraining · · Score: 3, Informative

      Yeap, and that's the title of the ZDNet article, which was then copied by ITProPortal, which not only didn't add anything worthwhile, but also managed to fuck up the title.

  6. Arrogance by Rie+Beam · · Score: 2

    Hey, now, I know the United States isn't exactly the only game in town anymore, but you guys could be a little more sensitive.

    1. Re:Arrogance by pushing-robot · · Score: 2, Funny

      Don't worry, you're probably part of the 49%. The 51% is primarily comprised of furries, klingons, cat videos, our robot overlords, our reptilian overlords, our reptilian robot overlords, and the welsh.

      --
      How can I believe you when you tell me what I don't want to hear?
  7. Re:But what about... by Anonymous Coward · · Score: 5, Funny

    ... PORN?!?

    It says right there in the summary: "only 49% from people browsing the Internet." Although you could argue that it's higher than that since spiders must crawl through porn too. Adding the 20% for the search spiders, we have that 69% of web traffic is porn related. A fitting number, I dare say.

  8. if you run wordpress... by powerspike · · Score: 4, Insightful

    If you run wordpress for your site... It's more like 50% Bots (search engines), 40% Comment Spam, and 8% Content Scanners and 2% Visitors....

    1. Re:if you run wordpress... by fenix849 · · Score: 2

      Wordpress gets a bad rap, because bad sysadmins/developers don't keep it up to date, or enable comments but don't enable akismet.

      But yeah Visitors will often be a fraction of overall web traffic to a given blog, regardless of the platfrom that runs it.

    2. Re:if you run wordpress... by powerspike · · Score: 2

      While i Agree with you, However regardless if you use akismet, or keep it up to date, if your site has any decent SE rankings, you are going to get hit big time by comment spam, and search engines. The more content you have the more you are going to get. From 7 Wordpress sites, in the last 24 hours i have received over 400 blog comments (including ones automaticlly marked as spam). It's pretty bad, and getting worse.

  9. Re:Bad Title / Summary by Dahamma · · Score: 4, Insightful

    The summary is ok, but the title is completely wrong. It could well be 51% of HTTP requests, but far as 'Internet traffic", it's probably a tiny fraction of a percent.

    In fact, why is it even surprising or newsworthy that 50% of HTTP requests are malicious? Anyone who runs a public web server will be able to see that pretty quickly (though as long as it's configured correctly the actual traffic will be tiny (consisting of a whole bunch of 404's).

  10. Re:95% of the 49% are missing a chromosome or two by Anonymous Coward · · Score: 5, Funny

    4chan -- where the men are men, the women are men, and the children are FBI agents.

  11. Re:Weak figure. by mooingyak · · Score: 2

    Any webmaster should already know this, probably way more than 51% for websites in existence for several years.

    Agreed. I was thinking only 51%? I currently toss roughly 65% of my logs out when I'm calculating how much human traffic we've received.

    --
    William of Ockham had no beard. The most likely explanation is that it was chewed off by squirrels every morning.
  12. Re:Bad Title / Summary by mooingyak · · Score: 2

    They're not saying 50% are malicious, just non-human. I get a fairly large chunk of traffic from google's bots, which I don't consider malicious.

    --
    William of Ockham had no beard. The most likely explanation is that it was chewed off by squirrels every morning.
  13. Web != Internet by wiredlogic · · Score: 4, Insightful

    Seriously. Do they have liberal arts majors writing the headlines at /. now?

    --
    I am becoming gerund, destroyer of verbs.
  14. Re:Bad Title / Summary by Dahamma · · Score: 3, Insightful

    If you get a fairly large chunk of traffic from Google's bots, then you must have almost no *actual* daily traffic :)

  15. Want some bot traffic? by SuperCharlie · · Score: 2

    Try using a calendar which has next month and year links (along with every day therein) and doesnt know googlebot is coming.....gigs. seriously.

  16. Executive summary by Anonymous Coward · · Score: 2

    The internet is dangerous, buy our security product.

  17. Better link, crappy story by FrootLoops · · Score: 4, Informative

    Here's the original ZDNet blog post. It's a longer article with more detail; it's also linked at the bottom of TFA, which seems to have plagiarized it. Compare the first paragraphs:

    [TFA] Cloud-based service, Incapsula, has revealed research indicating that 51 per cent of website traffic is through automated software programs; with many programmed for the intent of malicious activity.

    [ZDNet] Incapsula, a provider of cloud-based security for web sites, released a study today showing that 51% of web site traffic is automated software programs, and the majority is potentially damaging, — automated exploits from hackers, spies, scrapers, and spammers.

    The sentence structure and order of ideas is identical, and many phrases are the same or nearly the same. A high schooler should do better. Minor rephrasing is not sufficient.

    That said, both articles are pretty much advertisements. The study doesn't appear to have attempted to actually be comprehensive (so it only used data from this one company). The point was apparently to give this cloud service provider some selling points for businesses to use their service to "secure" their sites. This story is yet another that shouldn't even have appeared on /.; shame on the editors who let it through.

  18. Consider the source by Gravis+Zero · · Score: 5, Insightful

    Incapsula, a provider of cloud-based security for web sites, released a study today showing that 51% of web site traffic is automated software programs, and the majority is potentially damaging, — automated exploits from hackers, spies, scrapers, and spammers.

    and it just so happens that Incapsula has the perfect solution to save you from all this... for a price.

    --
    Anons need not reply. Questions end with a question mark.
  19. Re:Bad Title / Summary by ceoyoyo · · Score: 2

    You'd think a Slashdot poster would know the difference between the web and the Internet. Sigh.

  20. oh and in case you are wondering by ILongForDarkness · · Score: 2

    Bots send short emails usually for throughput reasons. Why waste bandwidth when you are both trying to use little enough so you don't get caught and your peak email send rate is inversely proportional to content size.

    Another tidbit that I'm sure a bunch of people know but is worth throwing out there: spam with images, there is a reason for that. The images round trip to the spamers servers. Usually they set it up so that your email account is tagged somehow in the url that your viewer sends to their server. So opening the email "calls home" and tells the spammer "hey I got a real email addreess" (and likely someone gullable enough to look at spam). The spammer can then add your email address to a list of "live email accounts" which sell easily for 10X what a list of unconfirmed email addresses do. So ... if you don't recognize the sender don't even open it even if it is just your webmail client. If you do expect more spam.

  21. Isnt Human Trafficing Wrong? by Osgeld · · Score: 2

    Last time I checked whenever I sent any data across the net, it was not human, but rather data.

  22. IPv6 to the rescue by WaffleMonster · · Score: 4, Interesting

    With IPv6 no more wholesale scanning of the entire global address space in minutes time looking for expliotable hosts. No more 5 minutes to ownage of unpatched PCs and the associated waste of bandwidth.

    No more self propogating worms using simple algorithms to divide and conquer the global network.

    In the grand scheme of things it won't help much but better than nothing.

  23. Re:49% of population is male... by FrootLoops · · Score: 3, Funny

    Thanks, but I prefer "This." ;)

  24. Its worse than that, hes a bot, Jim... by TiggertheMad · · Score: 2

    Agreed. I was thinking only 51%? I currently toss roughly 65% of my logs out when I'm calculating how much human traffic we've received.

    The interesting thing is that 51% is identifiable as bots. What about bots that are designed to emulate real users?

    I mention that because I have written some bots that are designed to emulate users as closely as possible, so as to not be noticed by paranoid webmasters. Mine follow valid workflow scenarios, and even pause appropriate amounts of time between post backs, so I am fairly certain that they have gone unnoticed.

    I don't think that I am more clever than the average hacker, so I am sure that others are doing this sort of thing, too.

    --

    HA! I just wasted some of your bandwidth with a frivolous sig!
    1. Re:Its worse than that, hes a bot, Jim... by mooingyak · · Score: 2

      Agreed. I was thinking only 51%? I currently toss roughly 65% of my logs out when I'm calculating how much human traffic we've received.

      The interesting thing is that 51% is identifiable as bots. What about bots that are designed to emulate real users?

      I mention that because I have written some bots that are designed to emulate users as closely as possible, so as to not be noticed by paranoid webmasters. Mine follow valid workflow scenarios, and even pause appropriate amounts of time between post backs, so I am fairly certain that they have gone unnoticed.

      I don't think that I am more clever than the average hacker, so I am sure that others are doing this sort of thing, too.

      It depends on a few things.

      First, how much do I actually care? If your bot is pulling down less than a thousand pages in a day, it's not going to be noticed by me, and it would have to get higher still for me to make an effort to filter it out.
      Second, did you fool my advertisers? I keep track of page views mostly to keep them honest. If there's a significant discrepancy between their numbers and mine, I'm going to find out why.
      I only block access in fairly extreme cases. I mostly just don't count bot requests as pages served, but still handle the HTTP request normally, so you would not be able to tell whether or not I've noticed you.
      Also, I have a huge amount of data on what my normal traffic looks like, whereas you're guessing as to what it looks like. Chances are high you're doing *something* that stands out.

      This is assuming you've done some degree of randomization or obscuring on your IP and user agent. If you haven't done that, then it's damn near a given I can spot you if I'm looking for it.

      --
      William of Ockham had no beard. The most likely explanation is that it was chewed off by squirrels every morning.
  25. Re:49% of population is male... by Beat+The+Odds · · Score: 2

    Perhaps you're just considering a specific country. According to Wikipedia, the overall world sex ratio is 101 males to 100 females. At birth, the ratio is more like 106 males to 100 females, though males die earlier than females, especially in their last years. (An aunt who used to be a delivery room nurse told me that female babies are generally stronger than males, so eg. a premature female has a higher chance of surviving.) Some cultures don't like girl babies, leading to infanticide or abortions, so the ratio can get artificially skewed; it also just seems to naturally vary a bit.

    Almost everyone dies in their last year....