Google's Fight Against 'Low-Quality' Sites Continues
nj_peeps writes
"A couple weeks ago, JC Penney made the news for plummeting in Google rankings for everything from 'area rugs' to 'grommet top curtains.' Turns out the retail site had a number of suspicious links pointing at it that could be traced back to a link network intended to manipulate Google's ranking algorithms. Now, Overstock.com has lost rankings for another type of link that Google finds to be manipulation of their algorithms. This situation has led Google to implement a significant change to their search algorithms, affecting almost 12% of queries in an effort to cull content farms and other webspam. And in the midst of all of this, a company with substantial publicity lately for running a paid link network announces they are getting out of the link business entirely."
we can expect google to get better, e.g. closer to what it used to be in the early days?
on Google to send me exactly where they must know I belong because I can't make that decision for myself.
If you knew the location of the web site where you "belonged", you wouldn't have to search for it to begin with.
Please tell me they are going to start going after the myriad car parts spam sites that flood the google rankings when searching for anything but the most obvious automotive items. I am sick and tired of sifting through a dozen completely worthless sites when googling for a part number I am trying to track down. Ebay is more reliable than google for almost everything I am looking for lately.
Yes, that's what this is about. Freedom to have spam served to me on a silver platter. Please Google, stop filtering all that spam in my gmail inbox too! I hate that you feel the need to protect me; I am a big boy and enjoy sifting through 1000 messages a day looking for the 2 relevant ones! Let freedom ring! /sarcasm.
Completely unpredictable that Google keeps maintaining their product, unfathomable
Next target, those stupid mailing list aggregators that keep popping up first in results, but are a redirect to a redirect to a redirect ... and digg/reddit types
I run a spider. It seems over 95% of pages on the internet are content farm and similar randomly generated crap. They take a hundred sentence fragments, string them together, then see if they can fool Google and other engines into crawling them.
You will not be very happy if they stop filtering the garbage for you.
Rod Taylor
What I can say as guy who sells ad space on his website: My Google AdSense income has gone up by a factor of 5 to 10 in the past two months. No, I'm not gonna be able to retire on this money. But it's an obvious increase. And I see it coming at exactly the same time as I see Google cracking down on rank spamming.
I think Google has "rationalized" a lot of their ad process (both ranking and sales) and the only guys who are hurt, are the ones who were gaming the system to begin with. e.g. click fraud and spamming the ranking.
My searches don't seem to be turning up quite so many fake download sites with "certified full download" links anymore, good riddance to those.
send me exactly where they must know I belong because I can't make that decision for myself.
Congratulations on grasping the purpose of a search engine.
W..w..W - Willy Waterloo washes Warren Wiggins who is washing Waldo Woo.
One of the things I use Google for extensively is the ability to search for wierd error messages, return codes, etc. that appear in commercial software I use for work. It's very frustruating when your very specific search query returns 45 different sites, all of which are rehosting the same forum post or newsgroup article. These get ranked higher up than other unique posts, causing a lot of scrolling through results and wasting time. Also, these aren't queries like "bmw 335i" or "" that are guaranteed to return millions of unique hits. I'm looking for the one other guy in the world who's found this issue and has a workable answer. Google used to be pretty good for that, especially if your query was well formed and incredibly specific.
Real world example - I got an error message trying to install Windows 7 SP1 last week, with a long hex number and a very specifically-worded message. I typed the query into google, and the first hit was some idiot who had no idea what he was talking about on a support forum. The next 5-6 hits were that exact same idiot's post rebroadcast to sites like eggheadcafe.com, techarea.in, etc. I eventually found the answer, but it was on page 3 of the search results.
On another topic, how and why do these content farm sites exist? How does eggheadcafe.com, which just copies newsgroup and forum data, able to pay to keep the site going? Are they all just looking to cash in on ad revenue? Do they really get that much in revenue to justify the site-crawling they must have to do?
Market forces will insure that firms will continue to hack the google algorithm. If Google fights back too much firms will begin to use and promote other advertisers, like Bing. This is a typical case where the end user is not the customer. The customer is the firms that pay Google to advertiser. Then search engine only serves to collect views that raise the value of those ads. Therefore the only issue is if the 'low quality' search results causes substantially fewer people to view ads.
In fact I don't see Google doing anything to make the search results better. All the link farms with Google ads appear to perpetually stay high in the ranks. The only time that anything seems to be done is when a firm fails to pay Google for ads and instead pays other firms to manipulate the rankings. I can imagine that Google, who will doing anything, ethical or not, to be the only ad agency on the web, would find that to be a very bad thing.
"She's a scientist and a lesbian. She's not going to let it slide." Orphan Black
I've just looked back at JCPenney's stock price, and there's no fluctuation or even a news mention about them getting Google-slapped for SEO gaming. They made it through the Christmas season selling tons of stuff, Google has slapped them down, yet there isn't even a bump. An analyst noted they had slightly weaker January sales and blamed it on "Lower inventory clearance coupled with bad weather".
Apparently it means that SEO gaming does not rise to the level of "Corporate Evil" that would divert shoppers or stock traders. I guess the public must just see it as "corporations advertising like normal."
John
Let people tag sites they've found as a result of a search. Build a tagging system which will allow people to exclude linkspam for example.
I've set up Bayesian tagging for my email client and it works quite well, all my mails come in pre tagged, pretty much 99% accurately, only an occasional one comes through with an incorrect tag these days.
I'm aware of the processing overhead involved... which is what the Google Toolbar is for. Or should I have patented this idea first? Maybe they could just buy Stumbleupon.
Deleted
I guess the humor in my original post on this thread was lost somewhere.
It was low-quality humor, obviously culled from a humor farm - and thus downgraded.
If Slashdot were chemistry it would look like this:Cadaverine
If I have a site that google has identified as a "bad link source", I can sell that as a service so companies can lower the rank of their competition.
Of course, Dr Suess saw this long ago http://www.squidoo.com/thesneetches.
The next generation is to get out of generic search. Build a roster of say 5 sites that do a great job on your error code problems and then use advanced search to stay in that domain.
Set up your browser to be specific search domains. (Non error related example) - I typically run IMDB and Wikipedia in a pair, so I do the search on those, one per tab.
My first Journal Entry ever, in 8 years! http://slashdot.org/journal/365947/aphelion-scifi-fantasy-horror-poetry-webzine
How do they know "who" is responsible for the linkages?
Just maybe this will have a wonderful side effect of slowing down or even stopping automated comment link spam..
..well one can hope anyway.
PPN
Funny, two days ago I sent Google a suggestion regarding datasheets. If someone types in 74HC[anyhing] etc... We want the .pdf, not a link to bogusdatasheets.com. Hell, just make it easier to select filetype:pdf in a checkbox, on the search or results page.
The government which is strong enough to protect you from everything is strong enough to take everything from you.