How Much Internet Traffic Is Fake? Turns Out, a Lot of It, Actually. (nymag.com)
Long-time Slashdot reader AmiMoJo shared this article from New York magazine:
In late November, the Justice Department unsealed indictments against eight people accused of fleecing advertisers of $36 million in two of the largest digital ad-fraud operations ever uncovered... Hucksters infected 1.7 million computers with malware that remotely directed traffic to "spoofed" websites.... [B]ots "faked clicks, mouse movements, and social network login information to masquerade as engaged human consumers." Some were sent to browse the internet to gather tracking cookies from other websites, just as a human visitor would have done through regular behavior. Fake people with fake cookies and fake social-media accounts, fake-moving their fake cursors, fake-clicking on fake websites -- the fraudsters had essentially created a simulacrum of the internet, where the only real things were the ads.
How much of the internet is fake? Studies generally suggest that, year after year, less than 60 percent of web traffic is human; some years, according to some researchers, a healthy majority of it is bot. For a period of time in 2013, the Times reported this year, a full half of YouTube traffic was "bots masquerading as people," a portion so high that employees feared an inflection point after which YouTube's systems for detecting fraudulent traffic would begin to regard bot traffic as real and human traffic as fake. They called this hypothetical event "the Inversion...."
[N]ot even Facebook, the world's greatest data-gathering organization, seems able to produce genuine figures. In October, small advertisers filed suit against the social-media giant, accusing it of covering up, for a year, its significant overstatements of the time users spent watching videos on the platform (by 60 to 80âpercent, Facebook says; by 150 to 900 percent, the plaintiffs say). According to an exhaustive list at MarketingLand, over the past two years Facebook has admitted to misreporting the reach of posts on Facebook Pages (in two different ways), the rate at which viewers complete ad videos, the average time spent reading its "Instant Articles," the amount of referral traffic from Facebook to external websites, the number of views that videos received via Facebook's mobile site, and the number of video views in Instant Articles.
On Twitter the author also shared a Twitter thread by the Washington Post's director of advertising technology, who shares his own complaints about the ecosystem of online advertising. "The problem isn't just that the internet is full of fakery and bullshit and bad numbers and malfunctioning metrics and bullshitters and fraudsters. The problem is that all the fake shit is layered on top of other fake shit and it just COMPOUNDS itself... Like you get fake users, who get autoplay videos which no one is really watching....
"That's not even counting the entire ad campaigns that are fake where the product is just a bullshit excuse to collect data on you."
How much of the internet is fake? Studies generally suggest that, year after year, less than 60 percent of web traffic is human; some years, according to some researchers, a healthy majority of it is bot. For a period of time in 2013, the Times reported this year, a full half of YouTube traffic was "bots masquerading as people," a portion so high that employees feared an inflection point after which YouTube's systems for detecting fraudulent traffic would begin to regard bot traffic as real and human traffic as fake. They called this hypothetical event "the Inversion...."
[N]ot even Facebook, the world's greatest data-gathering organization, seems able to produce genuine figures. In October, small advertisers filed suit against the social-media giant, accusing it of covering up, for a year, its significant overstatements of the time users spent watching videos on the platform (by 60 to 80âpercent, Facebook says; by 150 to 900 percent, the plaintiffs say). According to an exhaustive list at MarketingLand, over the past two years Facebook has admitted to misreporting the reach of posts on Facebook Pages (in two different ways), the rate at which viewers complete ad videos, the average time spent reading its "Instant Articles," the amount of referral traffic from Facebook to external websites, the number of views that videos received via Facebook's mobile site, and the number of video views in Instant Articles.
On Twitter the author also shared a Twitter thread by the Washington Post's director of advertising technology, who shares his own complaints about the ecosystem of online advertising. "The problem isn't just that the internet is full of fakery and bullshit and bad numbers and malfunctioning metrics and bullshitters and fraudsters. The problem is that all the fake shit is layered on top of other fake shit and it just COMPOUNDS itself... Like you get fake users, who get autoplay videos which no one is really watching....
"That's not even counting the entire ad campaigns that are fake where the product is just a bullshit excuse to collect data on you."
Pay the bills (real ads) so if a server cannot serve those ads then the server is inconsequential and useless
Pond scum feeding on pond scum. I'm having a hard time drumming up concern.
Pretty much any webmaster that views their logs regularly can back up this claim.
One of my websites is a property listings portal in a small foreign county and I have more bot traffic on there than all my other websites combined for some reason. I have like 3 million submitted fake data info from bots in the last couple years.
Even this post is fake!
fake shit is layered on top of other fake shit and it just COMPOUNDS itself.
Our culture is exposed to the digital archeologists of the future already. Time to extract that traffic slice and store it in a nanotube ram for the 10000 year retention time.
A few stories back, the pros and cons of capitalism were discussed, the major flaw I concluded is the human greed and this story is the best confirmation (or is it worst?) The ad business is one of the ugliest of capitalism (among the "barely" legal ones) where you can see deception, greed, cuestionable practices and everything done in the name of the almighty dollar.
No one likes this idea, but this is the answer
Everyone is given a ipv6 block. All your traffic from them comes from this block and can be tracked if needed. You can randomly use any IP from this block as you wish (the block follows you around, is setup after you authenticate. The tech doesn't exist yet, but is simple to do, probably a simple USB key and password system for users) Businesses are treated the same as individuals, their servers also use the business block.
An independent authority knows who the block belongs to, This authority is not under any governments thumb and can only be petitioned threw the courts to release the information, for a darn good reason. No fishing. They cannot be compelled to release the information by anyone or anything, especially to any spy agencies. And there are no back doors for such agencies.
It is illegal for Marketing people to use this information in any form.
This stops 95% of crap, as you can't pretend to be anyone else, if you should get hacked its easy to identify you and get it resolved.
Who can tell the difference?
https://twitter.com/TitaniaMcG...
Titania McGrath can.
#consequences
If it's labeled AI.
Missing some digits...
But I notice that the fuckers (Facebook, Google, etc) are not bankrupt yet. Clearly more fraud is required to have the necessary impact and get these asshole advertizers in the work-house where they belong.
Is there an open source click-fraud site to which one can contribute to speed the demise?
We're getting closer and closer to that XKCD dream life: https://xkcd.com/810/
I'm not sure why the part about Facebook inflating ad display numbers is included here. That was not because of bot activity. The majority of FB traffic is consumed through their mobile apps (95% of it), and you can be sure that is not bot type activity. Facebook has gone to great lengths to prevent scraping of their website, and it is extremely unlikely that the scraping of the site would involve scrolling through a newsfeed so that an ad became visible, began autoplaying and was streamed to a bot.
Facebook misrepresented the amount of time a user sat watching an ad before they scrolled on past it, plain and simple. More than likely they were counting things like a small portion of the video still being visible on the screen as being "watched". For the difference, even by Facebook's own admission, to be off by 60-80% shows this was a misrepresentation of what it meant for a user to watch a video ad on a very large scale (including in the app on mobile platforms).
Better known as 318230.
we had this problem during the last dot com bubble and the ad networks started imploding taking down popular websites and/or forcing people to get "real jobs".
It really is quite difficult for me to feel all that sorry about advertisers and ad sellers being upset that their precious data is wrong/overstated/contaminated. The "ad wars" [on users' eyes, ears, cpu, screen space, bandwidth, patience] are so insane now, the anarchist in me is almost happy about it.
Ooops, another site that wants to shame/annoy/warn/block me because of my ad filters protecting my privacy/sanity/bandwidth/battery/security. Hmmm....
Ads need the data on users.
The brand that takes out the ads pays for it all.
With more advanced browsers getting the ad to display, work, track is getting more complex.
What can the ad brands do?
Make the browsers and OS more ad friendly.
Make users have to view ads in some nations?
Make the browser show an ad?
Domestic spying is now "Benign Information Gathering"
I hate being right.
All of this fuckery is due to ads.
Why can't the operators of these servers join a multi-publisher subscription network? Two decades ago, such a network called Adult Check was popular, founded on the principle that adults can pay for nice things. One $10/mo payment bought access to all sites that took Adult Check, and the network paid publishers per page view. This helped to alleviate the sticker shock from each website charging a separate subscription.
More recently, Google Contributor could have been that network. The biggest problem with Contributor is lack of privacy, as it shares a parent company with AdSense and DoubleClick. This means Google can use page history gathered through Contributor to infer interests of a Contributor user for use on sites using Google adtech.
Advertising is black magic and always has been. The internet has shown ad peddlers that they don't actually know all that much about advertising, but now they can measure every aspect of their ignorance, hubris, and ineffectiveness... and it freaks them out.
Some combination of Trump, UBI, random race baiting and global warming posts also.... wait a second, what does /. post now?
of course, 1st) 40% spam email, 2nd) 40% porn, 3rd) 20% real traffic, of course the 2nd is debatable...
The moderation mechanism described in xkcd #810 already resembles that in use on various forums and Q&A sites, such as Slashdot and Stack Overflow.
1. Each newly registered user sees a page of what Stack Overflow calls "review audits". This resembles Slashdot metamoderation: does what the new user sees as constructive align with what established users see as constructive?
2. Anyone who gets most of the review audits correct has posts placed in "awaiting moderation" state. Only established users can see such a post until at least one established user upvotes the post.
3. Once a user is firmly in positive reputation/karma, the user's posts skip the "awaiting moderation" state.
Yet this hasn't led to any artificial intelligence breakthroughs on the part of the spam industry. Instead, I've noticed that spammers on forums.nesdev.com appear to be humans in low-exchange-rate countries. They search for an old post, reword it, start a discussion, and days later edit the post to include off-topic commercial links. A user who isn't paying close attention is unlikely to see this karma whoring for what it is.
That shit is useless because its like 90% Russian bots hoping you'll wonder where they came from and click their fake domain...i stopped even looking at that shit years ago.
A big chunk of the fake traffic is the fake tits in porn.
Please look into the Brave browser which blocks advertisements and trackers by default, it is also implementing a system whereby people get paid to opt-in to watching advertisements. Also, the content creator will get a larger percentage of the advertising revenue as it is not siphoned off by middlemen. You can also tip people directly through the browser.
So this must be the fake news about all of the fakery that data mines and advertisers splash around like pigs in poop. Oh, horrors! liars and parasites are everywhere.
The majority of FB traffic is consumed through their mobile apps (95% of it), and you can be sure that is not bot type activity.
I would argue that the other way - because so MCUH of Facebook traffic is from mobile apps, that is where most of the bot activity is likely to be from.
All it takes is an Android user logged into Facebook and some background app can have plenty of likes and other things they don't even see happening...
"There is more worth loving than we have strength to love." - Brian Jay Stanley
A lot of the 'fakery' in the media is nothing more than chumming by LEAs.
I've always maintained that the way to beat the panopticon companies isn't with ad blockers and privacy legislation. It's to dilute the value of the data they collect by inserting so much fake data that they can no longer sufficiently distinguish real people from the bots.
There's an apocryphal story that after the end of the Cold War, a bunch of the CIA and KGB got together for drinks. The CIA spooks lamented that theirs had been the harder job. The Soviet Union was such a closed society and had so many restrictions on travel that it was virtually impossible for the CIA to get a spy in there, whereas all the KGB had to do was drive to a town next to a military base and mingle with staff from the base eating lunch there. The KGB spooks disagreed and claimed that theirs had been the harder job. The U.S. produced so much information that it was virtually impossible for them to separate out fact from fiction. If the National Enquirer ran a story about the military working on a, or some conspiracy theorist reported the military was controlling their brain waves with weather balloons, they had to devote resources to figure out if the stories were real or fake.
Patreon shares the same incremental sticker shock issue as individual website subscriptions. Just as your New York Times subscription doesn't let you view a Wall Street Journal article that your friend cited to you, viewing a single patron-only article from each of five different publishers on Patreon incurs a charge for an entire month's subscription to each of those five publishers. The a la carte price structure of Contributor and the flat monthly fee of Adult Check avoid(ed) the problem of it being more expensive to read from multiple publishers.
Shitty metrics are better than no metrics at all. That's why online advertising will continue eat TV and newspaper ad revenues.
Yeah, google news puts out fake stories directed at me, just to see who, what and where I am and what my interests are so I can be sold to advertisers. They learned their craft from the great slavers.
We created machines to get rid of the most mechanical and tedious activities. I will gladly leave it to robots to look and click at ads on my behalf. Let's also teach robots to do the "I'm not a robot" routine, so we humans become free to do meaningful stuff instead.
The best way to identify a real human these days is to look at traffic where ads are blocked.
If builders built buildings the way programmers wrote programs, then the first woodpecker would destroy civilization.
If a few people are able to game the advertisers and make a cool million, I am fine with that. The marketing corporations (and even large corporations in general) always look for ways to fuck over the little guy. If the little guy gets a little revenge, I have to give them 3 cheers.
Rounding error for advertisers. Skies are not falling.
You mean like LinkedIn?
> Studies generally suggest that, year after year, less than 60 percent of web traffic is human; some years,
> according to some researchers, a healthy majority of it is bot.
I frequently visit Web pages that are bot-generated. Among them are price quotes for stocks, daily weather data (rainfall, temperature, wind speed, etc), and what checks have cleared my checking account. These are not human-generated.
The problem is with bots that visit Web sites. Even there, not all bots are bad. After all, without crawler bots, search engines could not tell you what Chinese restaurants are in your ZIP code.
I'm wondering whether you would be willing to comment on the idea of "coopetition" in the TLA world. Having heard of the value of occasional human judgment in certain situations (Cuban missile crisis, a few launch warnings in the '80s, etc.)
I am curious about whether the operators were generally trying to hold the shit together without going too badly sideways, or was the competition among the on-the-ground operators much more hard-played?
I wonder how many people here really clicked to find out what the producers of Forrest Gump said about how Jenny died.