Slashdot Mirror


51% of Internet Traffic Is "Non-Human"

hypnosec writes "Cloud-based service Incapsula has revealed research indicating 51 per cent of website traffic is through automated software programs, with many programmed for malicious activity. The breakdown of an average site's traffic is as follows: 5% is due to hacking tools looking for an unpatched or new vulnerability within a site, 5% is scrapers, 2% is from automated comment spammers, 19% is the result of 'spies' collating competitive intelligence, 20% is derived from search engines (non-human traffic but benign), and only 49% is from people browsing the Internet."

125 comments

  1. Obligatory by Warhawke · · Score: 5, Funny
    1. Re:Obligatory by Anonymous Coward · · Score: 0

      "2% from automated comment spammers"
      Looks like more than 2% to me.

    2. Re:Obligatory by cosm · · Score: 4, Insightful

      When the singularity arrives, it wont be the T-900 to fear, but instead incessant little gnat-bots that swarm anything with a wallet. It seems the internet is denigrating to just another platform of the one true age-old human behavior--scheming and conning to get the most precious thing you have. Your personal information and your money. And this does not surprise me--for a technological system/network created by humans will be just as full of our flaws and intrinsic 'mental' malfunctions as any non-silicon process our species oversees. Evolutionary my dear Watson.

      --
      'We are trying to prove ourselves wrong as quickly as possible, because only in that way can we find progress.' RPF
    3. Re:Obligatory by Anonymous Coward · · Score: 1
    4. Re:Obligatory by Anonymous Coward · · Score: 4, Interesting

      That sounds a lot like the singularity in Accelerando. Basically the computer programs end up being so much richer than the humans that the story follows some humans that run away from the solar system because otherwise they can't afford to live.

    5. Re:Obligatory by lightknight · · Score: 1

      Perhaps. I look forward to the creation of an AI, as it will, no doubt, provide some insight that the human race has been lacking. However, that is merely a possibility -> who knows what the AI will actually be? Perhaps it will want to attend Art School...;-)

      --
      I am John Hurt.
    6. Re:Obligatory by Anonymous Coward · · Score: 0

      While i generally agree with you, it sounds like your post-singulatarians should go get something more precious than some cash and a collection of integers and strings...

      There's no point arguing about who has more food in a true post-scarcity society: everyone does.

    7. Re:Obligatory by Anonymous Coward · · Score: 1

      Or even worse... become a financial programmer.

    8. Re:Obligatory by Anonymous Coward · · Score: 1

      It can't be worse than a lawyer? Can it?

    9. Re:Obligatory by Polo · · Score: 1

      I was thinking it was a good thing

    10. Re:Obligatory by lightknight · · Score: 2

      The question is, in a contest to see who could be more evil, who would win, the AI or the humans?

      --
      I am John Hurt.
    11. Re:Obligatory by SharpFang · · Score: 2

      The remainder is kids in Cambodia and Mexico seeking out places to spam and sending messages manually for $0.01 per 100 spams.
      Which is not automated traffic.

      --
      45 5F E1 04 22 CA 29 C4 93 3F 95 05 2B 79 2A B2
    12. Re:Obligatory by DarkOx · · Score: 3, Insightful

      My guess is humans will be more evil, we are innovative in a way its hard to imagine an AI will be. It won't matter though. The AI will adapt, adopt, and iteratively improve on our ideas; using them against us so much effectively than we could ever hope to do.

      --
      Repeal the 17th Amendment TODAY! Also Please Read http://www.gnu.org/philosophy/right-to-read.html
    13. Re:Obligatory by flyneye · · Score: 1

      Let's not forget how keen the U.N. is on being internet cops. Maybe they'll be internet traffic cops and start issuing citations.

      --
      *Repent!Quit Your Job!Slack Off!The World Ends Tomorrow and You May Die!
    14. Re:Obligatory by Anonymous Coward · · Score: 1

      Being properly evil would be an NP-Complete problem, and we humans would kick the AI's butts!

      Also, we could always just send wave after wave of our own men to try to overload the killbot limit.

    15. Re:Obligatory by sackbut · · Score: 1

      They are uploaded humans though that 'run away' in a small (100 gram) 'spaceship'. One of the AIs that wants it's 'rights' is an amalgam of a Russian KGB/botnet and a network modeled on lobster neural function.

    16. Re:Obligatory by Anonymous Coward · · Score: 2, Interesting

      Just have to remember that Corporations are psychopathic artificial beings that are created by us, so that's how evil our creations can be.
      If you create an AI to trade, merge and "generate wealth", then it would end up being just like a cold blooded Corporation.

    17. Re:Obligatory by Anonymous Coward · · Score: 0

      "your post-singulatarians should go get something more precious than some cash"

      The most precious thing will become access to energy from the Sun.

      http://en.wikipedia.org/wiki/Dyson_spheres_in_popular_culture

  2. Hmmm by koan · · Score: 4, Funny

    "only 49% from people browsing the Internet." I wonder how much of that 49% is porn.

    --
    "If any question why we died, Tell them because our fathers lied."
    1. Re:Hmmm by Mashiki · · Score: 2

      48.93% the other 0.06% is facebook, with 0.01% making up twitter spam.

      --
      Om, nomnomnom...
    2. Re:Hmmm by Anonymous Coward · · Score: 5, Informative

      Hate to bring sources into a slashdot conversation, but Sandvine's 2011 report has 53.6% as "real-time entertainment". 29 percentage points are Netflix, 10 are YouTube.
      So if those numbers are correct, roughly 15% of the Internet is porn.

    3. Re:Hmmm by GNious · · Score: 1

      Hate to bring sources into a slashdot conversation, but Sandvine's 2011 report has 53.6% as "real-time entertainment"
      [...]
      So if those numbers are correct, roughly 15% of the Internet is porn.

      how much of the 53.6% is real-time porn?

    4. Re:Hmmm by pjt33 · · Score: 2

      That sounds like US Internet traffic rather than Internet traffic as a whole. Netflix don't operate in most of the world, whereas YouTube does.

    5. Re:Hmmm by Lanteran · · Score: 2

      I am the 0%.

      --
      "People don't want to learn linux" hasn't been a valid excuse since '03.
    6. Re:Hmmm by Anonymous Coward · · Score: 0

      where did you get the 15% from exactly ??

    7. Re:Hmmm by Anonymous Coward · · Score: 1

      You are conflating two different statistics. TFA is about website traffic. Since I just repeated what the article said, I will explain. This statistic is only hits to websites and where they come from. You are talking about traffic on the internet. This is a totally different number.

      Duh.

    8. Re:Hmmm by Vintowin · · Score: 1

      I am the 0%.

      With that post, not anymore :-)

    9. Re:Hmmm by moneybabylon · · Score: 1

      69% of that.

  3. But what about... by crafty.munchkin · · Score: 0

    ... PORN?!?

    --
    ... wait, what?
    1. Re:But what about... by grouchomarxist · · Score: 1

      bots love porn.

    2. Re:But what about... by Anonymous Coward · · Score: 5, Funny

      ... PORN?!?

      It says right there in the summary: "only 49% from people browsing the Internet." Although you could argue that it's higher than that since spiders must crawl through porn too. Adding the 20% for the search spiders, we have that 69% of web traffic is porn related. A fitting number, I dare say.

    3. Re:But what about... by aaaaaaargh! · · Score: 1

      But it's got to be binary or at least ASCII.

  4. Aliens! by DangerOnTheRanger · · Score: 2

    I knew it!

  5. Weak figure. by hamanaka · · Score: 1

    Any webmaster should already know this, probably way more than 51% for websites in existence for several years.

    1. Re:Weak figure. by mooingyak · · Score: 2

      Any webmaster should already know this, probably way more than 51% for websites in existence for several years.

      Agreed. I was thinking only 51%? I currently toss roughly 65% of my logs out when I'm calculating how much human traffic we've received.

      --
      William of Ockham had no beard. The most likely explanation is that it was chewed off by squirrels every morning.
  6. Reading tools? by stoborrobots · · Score: 2

    Which of those categories do data analysis and aggregation tools fall into?

    I'm thinking of user-focused tools like RSS Readers, Stock Quote graphers, etc... They're automated non-human tools which access websites, but it's not clear how they are being categorised...

  7. web !=internet by Anonymous Coward · · Score: 5, Insightful

    the article seems to be about websites, not the intetnet

    1. Re:web !=internet by icebraining · · Score: 3, Informative

      Yeap, and that's the title of the ZDNet article, which was then copied by ITProPortal, which not only didn't add anything worthwhile, but also managed to fuck up the title.

    2. Re:web !=internet by Anonymous Coward · · Score: 0

      The source of the non-news does, indeed, speak of websites and not the Internet:
      http://www.incapsula.com/the-incapsula-blog/blog-2012/114-what-google-doesnt-show-you-31-of-website-traffic-can-harm-your-business

      The first sign poor editorial judgement is not recognising the inaccuracy of any accounting of "Internet" traffic that doesn't include peer-to-peer traffic, e-mail, streaming media, etc. that consume most of our management or capital resources.

      The second sign is that at least two technology editors repeated the HTTP==Internet mistake before the story landed on /., whose editors also made the same mistake.

  8. That article. by Anonymous Coward · · Score: 1

    Is such a huge load over advertising.

    It doesn't link to any research, its simply and in house "research" after which they also suggest you that you really should use their service. So its to be taken with a grain of salt.
    There is no research method described, or anything else.

    1. Re:That article. by ZombieBraintrust · · Score: 1

      Is such a huge load over advertising.

      It doesn't link to any research, its simply and in house "research" after which they also suggest you that you really should use their service. So its to be taken with a grain of salt. There is no research method described, or anything else.

      Yeah there is nothing in that article that tells who the fuck Incapsula is. Do they have people with doctorates and PHD's doing their "research." Or are they just pulling numbers out of their butt. Smells like a fly by night scam.

  9. Arrogance by Rie+Beam · · Score: 2

    Hey, now, I know the United States isn't exactly the only game in town anymore, but you guys could be a little more sensitive.

    1. Re:Arrogance by pushing-robot · · Score: 2, Funny

      Don't worry, you're probably part of the 49%. The 51% is primarily comprised of furries, klingons, cat videos, our robot overlords, our reptilian overlords, our reptilian robot overlords, and the welsh.

      --
      How can I believe you when you tell me what I don't want to hear?
    2. Re:Arrogance by Anonymous Coward · · Score: 0

      Wot about gingers, eh?

    3. Re:Arrogance by mjwx · · Score: 1

      Don't worry, you're probably part of the 49%. The 51% is primarily comprised of furries, klingons, cat videos, our robot overlords, our reptilian overlords, our reptilian robot overlords, and the welsh.

      Our insect overlords wish to know why you've excluded them.

      --
      Calling someone a "hater" only means you can not rationally rebut their argument.
  10. 95% of the 49% are missing a chromosome or two by gelfling · · Score: 1

    how much of that 49% is Reddit and 4chan?

    1. Re:95% of the 49% are missing a chromosome or two by Anonymous Coward · · Score: 5, Funny

      4chan -- where the men are men, the women are men, and the children are FBI agents.

    2. Re:95% of the 49% are missing a chromosome or two by Anonymous Coward · · Score: 0

      Very little. Face it, those sites aren't as big as people make them out to be.

    3. Re:95% of the 49% are missing a chromosome or two by Anonymous Coward · · Score: 0

      ...implying that the worthless collection of lameness that is reddit should ever be uttered in the same breath as 4chan.

    4. Re:95% of the 49% are missing a chromosome or two by Mashiki · · Score: 1

      And the FBI agents are trrrroooollllssss.

      --
      Om, nomnomnom...
    5. Re:95% of the 49% are missing a chromosome or two by xenobyte · · Score: 1

      Awesome! - Maybe slightly expanded:

      4chan - where the men are men, the women are men, the trolls are infantile, the children are FBI agents and the pedophiles are rampant - and soon on their way to jail.

      Anonymous has left the building.

      --
      "For every complex problem, there is a solution that is simple, neat, and wrong." -- H.L. Mencken (1880-1956) --
    6. Re:95% of the 49% are missing a chromosome or two by findoutmoretoday · · Score: 1

      add to that 10% of the 49% are nerds and those barely count as human

    7. Re:95% of the 49% are missing a chromosome or two by Iskender · · Score: 1

      I'm more familiar with this being said about IRC.

      Of course, it was probably said about Usenet before my time...

    8. Re:95% of the 49% are missing a chromosome or two by neonKow · · Score: 1

      So do you take a breath after "ever" or something when you say that?

  11. Bad Title / Summary by Anonymous Coward · · Score: 1

    Title says 51% of Internet Traffic.
    Summary says 51% of Website Traffic.

    Internet != Website.

    1. Re:Bad Title / Summary by Dahamma · · Score: 4, Insightful

      The summary is ok, but the title is completely wrong. It could well be 51% of HTTP requests, but far as 'Internet traffic", it's probably a tiny fraction of a percent.

      In fact, why is it even surprising or newsworthy that 50% of HTTP requests are malicious? Anyone who runs a public web server will be able to see that pretty quickly (though as long as it's configured correctly the actual traffic will be tiny (consisting of a whole bunch of 404's).

    2. Re:Bad Title / Summary by mooingyak · · Score: 2

      They're not saying 50% are malicious, just non-human. I get a fairly large chunk of traffic from google's bots, which I don't consider malicious.

      --
      William of Ockham had no beard. The most likely explanation is that it was chewed off by squirrels every morning.
    3. Re:Bad Title / Summary by DarwinSurvivor · · Score: 1

      Not to mention RSS feeds, etc.

    4. Re:Bad Title / Summary by Dahamma · · Score: 3, Insightful

      If you get a fairly large chunk of traffic from Google's bots, then you must have almost no *actual* daily traffic :)

    5. Re:Bad Title / Summary by mooingyak · · Score: 1

      Last time I checked (which wasn't recent) googlebot only accounted for about 5-6 million a day of my total traffic.

      --
      William of Ockham had no beard. The most likely explanation is that it was chewed off by squirrels every morning.
    6. Re:Bad Title / Summary by ceoyoyo · · Score: 2

      You'd think a Slashdot poster would know the difference between the web and the Internet. Sigh.

    7. Re:Bad Title / Summary by Anonymous Coward · · Score: 0

      Even saying 'many programmed for malicious activity' is grossly misleading, and while they're not saying that 51% of the traffic is malicious they ARE saying that much of that 51% IS, which implies > 25% overall... And that you didn't know that before and should be worried about it, because it's somehow different to every other automated system humans run.

      You might as well point out that >25% of calls recieved to home phones during a political event are automated advertisements, or that >25% of the mail sent by the US post is sweepstakes, reader's digest and other scams.

      but the story is just bad astroturf for these guys: http://www.incapsula.com/. So it's no surprise.

      "Security company says things insecure! Totally not a protection racket! News at 11"

    8. Re:Bad Title / Summary by wvmarle · · Score: 1

      If it's indeed http requests then the numbers start to make a little more sense. Especially the 20% from search engine crawlers is a very high number I'd say - considering that there are just a few serious crawlers around, and they won't visit a site every 10 minutes.

    9. Re:Bad Title / Summary by wvmarle · · Score: 1

      5-6 million of what? Unit missing.

    10. Re:Bad Title / Summary by Dahamma · · Score: 1

      Actually if you think about it 20% from search engine crawlers would mean either the crawlers are ridiculously overcrawling, or there are just too many damn crawlers. How the hell can 1 out of 5 accesses to web sites be involved in trying to help people find web sites!? That's insane. So the real answer is not found in analyzing the data, it's analyzing the source.

      Basically, some random bullshit hosting company saw a trend with its low-traffic customer websites and is now extrapolating that to the Internet in general. It's like trying to gauge worldwide automobile traffic patterns by sitting on a lawn chair in front of your house and counting cars.

    11. Re:Bad Title / Summary by Anonymous Coward · · Score: 1

      square inches..?

    12. Re:Bad Title / Summary by Anonymous Coward · · Score: 0

      B.S. I run a network of legit sites (prominent blog network, will remain nameless) with 1/4 billion - 1/2 billion uniques per month and I maybe see 100,000 google hits per day?

    13. Re:Bad Title / Summary by mooingyak · · Score: 1

      I don't claim to know why google does what they do. Maybe it's a matter of unique pages hosted -- we have a very large number. Maybe it's something else. But I just ran a spot check and I'm north of 3 mil in yesterday's logs.

      FWIW, we're in the same ballpark in terms of total traffic.

      --
      William of Ockham had no beard. The most likely explanation is that it was chewed off by squirrels every morning.
  12. if you run wordpress... by powerspike · · Score: 4, Insightful

    If you run wordpress for your site... It's more like 50% Bots (search engines), 40% Comment Spam, and 8% Content Scanners and 2% Visitors....

    1. Re:if you run wordpress... by Anonymous Coward · · Score: 1

      You think that's bad, try running a small wiki.

    2. Re:if you run wordpress... by fenix849 · · Score: 2

      Wordpress gets a bad rap, because bad sysadmins/developers don't keep it up to date, or enable comments but don't enable akismet.

      But yeah Visitors will often be a fraction of overall web traffic to a given blog, regardless of the platfrom that runs it.

    3. Re:if you run wordpress... by powerspike · · Score: 2

      While i Agree with you, However regardless if you use akismet, or keep it up to date, if your site has any decent SE rankings, you are going to get hit big time by comment spam, and search engines. The more content you have the more you are going to get. From 7 Wordpress sites, in the last 24 hours i have received over 400 blog comments (including ones automaticlly marked as spam). It's pretty bad, and getting worse.

  13. 49% of population is male... by Anonymous Coward · · Score: 0

    and 51% is female. Since no one has ever seen a female on Internet i'd say we can now explain the non-human 51%.

    1. Re:49% of population is male... by FrootLoops · · Score: 1

      Perhaps you're just considering a specific country. According to Wikipedia, the overall world sex ratio is 101 males to 100 females. At birth, the ratio is more like 106 males to 100 females, though males die earlier than females, especially in later years. (An aunt who used to be a delivery room nurse told me that female babies are generally stronger than males, so eg. a premature female has a higher chance of surviving.) Some cultures don't like girl babies, leading to infanticide or abortions, so the ratio can get artificially skewed; it also just seems to naturally vary a bit.

    2. Re:49% of population is male... by Sulphur · · Score: 1

      Perhaps you're just considering a specific country. According to Wikipedia, the overall world sex ratio is 101 males to 100 females. At birth, the ratio is more like 106 males to 100 females, though males die earlier than females, especially in their last years. (An aunt who used to be a delivery room nurse told me that female babies are generally stronger than males, so eg. a premature female has a higher chance of surviving.) Some cultures don't like girl babies, leading to infanticide or abortions, so the ratio can get artificially skewed; it also just seems to naturally vary a bit.

    3. Re:49% of population is male... by FrootLoops · · Score: 3, Funny

      Thanks, but I prefer "This." ;)

    4. Re:49% of population is male... by Beat+The+Odds · · Score: 2

      Perhaps you're just considering a specific country. According to Wikipedia, the overall world sex ratio is 101 males to 100 females. At birth, the ratio is more like 106 males to 100 females, though males die earlier than females, especially in their last years. (An aunt who used to be a delivery room nurse told me that female babies are generally stronger than males, so eg. a premature female has a higher chance of surviving.) Some cultures don't like girl babies, leading to infanticide or abortions, so the ratio can get artificially skewed; it also just seems to naturally vary a bit.

      Almost everyone dies in their last year....

  14. Monitoring by Anonymous Coward · · Score: 0

    I swear there is so much monitoring (is my site up?)... that should at least register on the scale.

  15. Web != Internet by wiredlogic · · Score: 4, Insightful

    Seriously. Do they have liberal arts majors writing the headlines at /. now?

    --
    I am becoming gerund, destroyer of verbs.
  16. Bots are people too by Anonymous Coward · · Score: 0

    Do bot views count toward page views for advertising revenue?

  17. How do they classify? by Anonymous Coward · · Score: 0

    If figuring out malicious traffic was this easy, we could get rid of it!

  18. Things I learn... by Baloo+Uriza · · Score: 1

    Did not realize there were that many furries out there. Though, it makes sense, we make the internets go.

    --
    Furries make the internet go.
  19. Scraper Porn by countach · · Score: 1

    Someone should invent porn that appeals to screen scrapers, then we'd REALLY see web traffic go wild!

    1. Re:Scraper Porn by Kozz · · Score: 1

      Someone should invent porn that appeals to screen scrapers, then we'd REALLY see web traffic go wild!

      Scrapers Gone Wild!

      On second thought... maybe not.

      --
      I only post comments when someone on the internet is wrong.
  20. Want some bot traffic? by SuperCharlie · · Score: 2

    Try using a calendar which has next month and year links (along with every day therein) and doesnt know googlebot is coming.....gigs. seriously.

    1. Re:Want some bot traffic? by Ingenium13 · · Score: 1

      You should restrict it in robots.txt. I had to do it on my site because Google kept appending a character over and over to a variable in the URL. It would just add another character on and request again. It was a pretty weird bug and was generating gigs of traffic as well. You can also restrict Google's crawl rate in Webmaster Tools.

    2. Re:Want some bot traffic? by SuperCharlie · · Score: 1

      Thats what I did once I figured out what was going on. I had a php event calendar and Im pretty sure the bot took it from unix day zero through whatever the upper time limit is. It ended up being about 2 gigs per visit.

  21. PHDs are not under oath by lucm · · Score: 1

    Do they have people with doctorates and PHD's doing their "research." Or are they just pulling numbers out of their butt.

    Could be both. This allegedly happened in some areas where researchers felt that a "controlled publication" of scientific evidence could bring more exposure to what they considered important issues... (climategate, peppered moths, Libby half-life, etc.)

    --
    lucm, indeed.
  22. Executive summary by Anonymous Coward · · Score: 2

    The internet is dangerous, buy our security product.

  23. Better link, crappy story by FrootLoops · · Score: 4, Informative

    Here's the original ZDNet blog post. It's a longer article with more detail; it's also linked at the bottom of TFA, which seems to have plagiarized it. Compare the first paragraphs:

    [TFA] Cloud-based service, Incapsula, has revealed research indicating that 51 per cent of website traffic is through automated software programs; with many programmed for the intent of malicious activity.

    [ZDNet] Incapsula, a provider of cloud-based security for web sites, released a study today showing that 51% of web site traffic is automated software programs, and the majority is potentially damaging, — automated exploits from hackers, spies, scrapers, and spammers.

    The sentence structure and order of ideas is identical, and many phrases are the same or nearly the same. A high schooler should do better. Minor rephrasing is not sufficient.

    That said, both articles are pretty much advertisements. The study doesn't appear to have attempted to actually be comprehensive (so it only used data from this one company). The point was apparently to give this cloud service provider some selling points for businesses to use their service to "secure" their sites. This story is yet another that shouldn't even have appeared on /.; shame on the editors who let it through.

  24. Consider the source by Gravis+Zero · · Score: 5, Insightful

    Incapsula, a provider of cloud-based security for web sites, released a study today showing that 51% of web site traffic is automated software programs, and the majority is potentially damaging, — automated exploits from hackers, spies, scrapers, and spammers.

    and it just so happens that Incapsula has the perfect solution to save you from all this... for a price.

    --
    Anons need not reply. Questions end with a question mark.
  25. not really surprising I guess by ILongForDarkness · · Score: 1

    I worked for an anti-spam provider > 90% of emails were spam some customers > 99%. That said though spam emails tend to be short, almost to the point of ridiculous. I don't remember the exact numbers but say the average email that is legitmate is about 50k (because of attachments skewing it, but still even legitimate email tends to be 5+ sentences). Along comes duffious spammer. Not only are they shooting off 10k emails per bot per hour, but they are all one sentence emails with a tinyurl link in them. The lack of size is one of the key indicators left once you remove the obvious keywords and the sending history. Kind of makes me wonder why the bots spamming other content are so chatty.

    I guess if you are spamming forums you have to have a "comment" length message to send sometimes to look legitamate. You can't just say "go to http://tinyurl.com/growyourpenis" with out being obvious. But that said why do spammers get away with it in email but not in forums? I mean someone is clicking the links in the emails because it is a large business (likely multi-billion dollar). Hmm ... I'll start running the unsuspicious botnet on the forums posting email like spam and post forum spam length content to email accounts, I'll make millions, er well a few dollars anyways.

  26. oh and in case you are wondering by ILongForDarkness · · Score: 2

    Bots send short emails usually for throughput reasons. Why waste bandwidth when you are both trying to use little enough so you don't get caught and your peak email send rate is inversely proportional to content size.

    Another tidbit that I'm sure a bunch of people know but is worth throwing out there: spam with images, there is a reason for that. The images round trip to the spamers servers. Usually they set it up so that your email account is tagged somehow in the url that your viewer sends to their server. So opening the email "calls home" and tells the spammer "hey I got a real email addreess" (and likely someone gullable enough to look at spam). The spammer can then add your email address to a list of "live email accounts" which sell easily for 10X what a list of unconfirmed email addresses do. So ... if you don't recognize the sender don't even open it even if it is just your webmail client. If you do expect more spam.

    1. Re:oh and in case you are wondering by Anonymous Coward · · Score: 0

      So ... if you don't recognize the sender don't even open it even if it is just your webmail client. If you do expect more spam.

      What webmail client shows images by default?

  27. 'spies' collating competitive intelligence by DigiShaman · · Score: 1

    Guilty as charged. I admit, I've been known to check out the competition from other sites to ensure I'm not falling behind the curve. My guess is that they perform a reverse DNS lookup of their IP logs and determine that the company's network I'm behind belongs in the same industry as theirs.

    --
    Life is not for the lazy.
    1. Re:'spies' collating competitive intelligence by bhartman34 · · Score: 1

      This caught my eye, too. My question is, why are they counting the spies in with the "non-human" traffic??

  28. I did not read the artice by NotQuiteReal · · Score: 1

    However, based solely on the title, my reaction was "No shit, Sherlock". Or, to introduce the younger crowd to an "old saw"... See http://en.wikipedia.org/wiki/Occam's_razor, and ponder it well.

    Then, GET OFF MY LAWN!

    --
    This issue is a bit more complicated than you think.
  29. Isnt Human Trafficing Wrong? by Osgeld · · Score: 2

    Last time I checked whenever I sent any data across the net, it was not human, but rather data.

  30. IPv6 to the rescue by WaffleMonster · · Score: 4, Interesting

    With IPv6 no more wholesale scanning of the entire global address space in minutes time looking for expliotable hosts. No more 5 minutes to ownage of unpatched PCs and the associated waste of bandwidth.

    No more self propogating worms using simple algorithms to divide and conquer the global network.

    In the grand scheme of things it won't help much but better than nothing.

  31. my breakdown by JimboFBX · · Score: 1

    last time I had a personal website up, 60% of it was buffer overflow bots, 20% were old IIS exploit bots and 10% were slashdot scans whenever I made a post.

    Really though, firewalls in the US should come with the entire Chinese, Russian, and Indian IP range blocked for incoming connections by default.

  32. Sentience Imminent? by Lord+of+the+Fries · · Score: 1

    When it gets down to the the mythical 10% that human's supposedly use of their own information processing machine (their brains), will the net mind achieve sentience?

    --
    One man's pink plane is another man's blue plane.
  33. That Explains by Anonymous Coward · · Score: 0

    ..most of the slashdot posts! Now I know why!

  34. You'd think non-human share was higher by Sigg3.net · · Score: 1

    Consider, for instance, lol cats and pedo bears. Two distinct mammals that have perplexed the likes of sir Attenborough for many office hours.

    Testing the waters, we also have dramatic animals (it all began with a hamster), and the turtle kid. The latter a new breed of furry, that may prove more nuisance than entertainment.

    1. Re:You'd think non-human share was higher by Anonymous Coward · · Score: 0

      we also have dramatic animals (it all began with a hamster)

      Chipmunk, lrn hstory plx

  35. I have a theory. by Bigfield · · Score: 1

    Skynet has become self-aware.

    1. Re:I have a theory. by international_fish · · Score: 1

      I knew that some guys I talked to on the Internet are so dumb or so wise that they can't be humans

  36. Its worse than that, hes a bot, Jim... by TiggertheMad · · Score: 2

    Agreed. I was thinking only 51%? I currently toss roughly 65% of my logs out when I'm calculating how much human traffic we've received.

    The interesting thing is that 51% is identifiable as bots. What about bots that are designed to emulate real users?

    I mention that because I have written some bots that are designed to emulate users as closely as possible, so as to not be noticed by paranoid webmasters. Mine follow valid workflow scenarios, and even pause appropriate amounts of time between post backs, so I am fairly certain that they have gone unnoticed.

    I don't think that I am more clever than the average hacker, so I am sure that others are doing this sort of thing, too.

    --

    HA! I just wasted some of your bandwidth with a frivolous sig!
    1. Re:Its worse than that, hes a bot, Jim... by justforgetme · · Score: 1

      Don't forget to randomize intervals on a bell curve depending on content size and the bot's likes (tag mesh you designate and is compared to each page's dictionary). With some fairly basic data mining you can find bots if their jumps are regular and, if you really want to (or if you have been employed to find out) which of the human clients is an executable, you can always build your own uberbots and train ML algos to match them (and in turn the malicious visitors). Tried it once, it can get complicated, especially with traffic networks that change IPs aka bots or networks of them that reset their connection on dynamic IP.

      --
      -- no sig today
    2. Re:Its worse than that, hes a bot, Jim... by mooingyak · · Score: 2

      Agreed. I was thinking only 51%? I currently toss roughly 65% of my logs out when I'm calculating how much human traffic we've received.

      The interesting thing is that 51% is identifiable as bots. What about bots that are designed to emulate real users?

      I mention that because I have written some bots that are designed to emulate users as closely as possible, so as to not be noticed by paranoid webmasters. Mine follow valid workflow scenarios, and even pause appropriate amounts of time between post backs, so I am fairly certain that they have gone unnoticed.

      I don't think that I am more clever than the average hacker, so I am sure that others are doing this sort of thing, too.

      It depends on a few things.

      First, how much do I actually care? If your bot is pulling down less than a thousand pages in a day, it's not going to be noticed by me, and it would have to get higher still for me to make an effort to filter it out.
      Second, did you fool my advertisers? I keep track of page views mostly to keep them honest. If there's a significant discrepancy between their numbers and mine, I'm going to find out why.
      I only block access in fairly extreme cases. I mostly just don't count bot requests as pages served, but still handle the HTTP request normally, so you would not be able to tell whether or not I've noticed you.
      Also, I have a huge amount of data on what my normal traffic looks like, whereas you're guessing as to what it looks like. Chances are high you're doing *something* that stands out.

      This is assuming you've done some degree of randomization or obscuring on your IP and user agent. If you haven't done that, then it's damn near a given I can spot you if I'm looking for it.

      --
      William of Ockham had no beard. The most likely explanation is that it was chewed off by squirrels every morning.
    3. Re:Its worse than that, hes a bot, Jim... by Anonymous Coward · · Score: 0

      User attention isn't based on a bell curve, it's based on a declination of attention.

      Concerning page length, give it a rule that as a page gets longer, the /maximum/ length of time a page is viewed will increase, but the average really doesn't increase at all. Users will search the page to see if it contains the page they're interested in, and if not, leave.

      You can also want a small, but not significant, amount of aberrant behavior such as reloading the same page a few times, as if the user is having a browser issue. Making crawler bots have unique "personalities" that you keep in mind while programming them also adds to the realism.

      Doubleclick had some interesting statistics, but since its aquisition these pages have vanished. I'm sure you can find replacements though.

  37. HAT NOW. by AgNO3 · · Score: 1

    HAT, ALUMINUM HAT NOW PLEASE. Who are these non humans filling up the pipes that lead into my house? ripping wire out in 3.2.1...

    --
    OMG Ponies!!! with Glitter!!!! I miss Pink :-(
  38. More than you might think by anyaristow · · Score: 1

    More than you might think. Not that those two are huge, but the same people are also everywhere else, including slashdot. Probably not so much on Facebook. Some people spend an inordinate amount of time online and have multiple personas. They spend so much more time online than most people they skew the statistics. You can't really say X% are doing A and Y% are doing B because mostly the same people are doing both.

    There are fewer people online that is apparent.

    1. Re:More than you might think by anyaristow · · Score: 1

      *than is apparent

  39. "the inevitable Monday morning circuit overload" by Anonymous Coward · · Score: 0

    "According to recent report, there were so many worms and counterworms loose in the data-net now, the machines had been instructed to give them low priority unless they related to a medical emergency." - The Shockwave Rider by John Brunner

  40. Re:Evil by TaoPhoenix · · Score: 1

    Be careful. You do NOT want an Anti-Asimovian robot/AI being evil. Because we can sometimes be evil, the bot will ALWAYS be efficiently evil.

    --
    My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine
  41. 50% human traffic is too much by mapkinase · · Score: 1

    50% human traffic is too much even for HTTP protocol.

    It means semantic web concept if far from penetrating enough web. Human ability to perceive information is constant, does not change with time. Some might have illusion that we do improve this, but we are not.

    All those automated HTTP protocol robots are doing their service to us to reduce this overload and facilitate, even the evil ones.

    "5% is due to hacking tools looking for an unpatched or new vulnerability within a site, " those are wolves weeding out the weaklings
    "5% is scrapers, " - that's good, need to be more
    "2% is from automated comment spammers, " - those are truly bad guys, but 2% is not much
    "19% is the result of 'spies' collating competitive intelligence, " - this is unexpectedly high, but I guess legit use.
    "20% is derived from search engines (non-human traffic but benign), " - that's very vital for Web, without this current Web is unimaginable
    "and only 49% is from people browsing the Internet." - only? scratch this and say: "whopping", because it's too much. We are still doing a lot of menial job weeding out crap out of internet manually.

    It should be 1%

    Think of all other complex systems surrounding us. A lot of time they spend in automatic mode, doing things without our control: our washing machines, our DVRs, our body.

    The Web should be like that too. Web search engine provides us with page ranking sorting out pages that are most suitable for human consumption. A similar sort of ranking should exist for robotic searches that takes into account how easily the information is passed, and how many robots are reading it.

    Compare human browsing to a walk in the Central Park. You have your paths intertwined, your walk around the reservoir.

    And robotic browsing is like automobile roads crossing Central Park - completely different traffic routes, mostly passing through with intersections with human routes where it makes sense (bus stops, etc).

    --
    I do not believe in karma. "Funny"=-6. Do good and forbid evil. Yours, Oft-Offtopic Flamebaiting Troll.
  42. I like it by nidaye · · Score: 0

    You will never felt this way before, it can be more than you will need, http://www.superior-jersey.com/"> football shirt is exactly what you what. We will show you the best bag you ever seem. And this is the one better than any bag you have. Never hesitate it, this bag is belongs to you.

  43. Jane by Oswald+McWeany · · Score: 1

    Not surprising.

    I bet Jane makes up a large percentage of internet traffic.

    --
    "That's the way to do it" - Punch
  44. WTF is that? by Anonymous Coward · · Score: 0

    What is "real-time entertainment"? Also none of that is reading the news or communicating?

  45. Advertisement-as-news by DragonWriter · · Score: 1

    The sentence structure and order of ideas is identical, and many phrases are the same or nearly the same. A high schooler should do better. Minor rephrasing is not sufficient.

    It could be that ZDNet was copied by TFA, but...

    That said, both articles are pretty much advertisements.

    ...suggests that it may just be that both articles are slightly rewritten versions of an Incapsula press release posted as "news", with one of them using more of the release.

    1. Re:Advertisement-as-news by FrootLoops · · Score: 1

      Good point. Maybe so.

      I was curious and found this blog post from Incapsula which contains the statistics both articles used. The details are different enough that I wouldn't call either article "plagiarized" from that post, though the articles could have provided more accurate citations. The ZDNet post has some details like

      I spoke with Marc Gaffan, co-founder of Incapsula. “Few people realize how much of their traffic is non-human, and that much of it is potentially harmful.”

      which make me think it's probably an original work, despite being advertisement-heavy.