Google Fires Back About Search Engine Spam
coondoggie writes "The folks at Google are taking issue over spam and the quality of Google searches, which some claim has gone down in recent months. Today on Google's official blog, Principal Engineer Matt Cutts said, 'January brought a spate of stories about Google’s search quality. Reading through some of these recent articles, you might ask whether our search quality has gotten worse. The short answer is that according to the evaluation metrics that we’ve refined over more than a decade, Google’s search quality is better than it has ever been in terms of relevance, freshness and comprehensiveness. Today, English-language spam in Google’s results is less than half what it was five years ago, and spam in most other languages is even lower than in English.' Cutts also explained that the company has made a few significant changes to their method of indexing."
"spam in most other languages is even lower than in English."
this is definately not true for Spanish. There has always been a higher level of spam results for Spanish
" according to the evaluation metrics that we’ve refined over more than a decade, Google’s search quality is better than it has ever been in terms of relevance, freshness and comprehensiveness. "
And thus begins the downfall of Google. Once you start drinking your own lemonade and stop listening to the people who use your product, you're on a greased downhill slope.
And the worms ate into his brain.
I've switched to other search engines; from my experience, Google provides too many tangential and corporate references when I do research.
Also, how does Google "know" that their search results were valid? I'll often do a Google search, click a couple of links, and after being disappointed, I'll go to another search engine where I get more useful results.
What bugs me the most are searches on technical or medical topics, where Google give me a dozen "harvester" results -- e.g., I get sites that have stolen conversations from other message boards, and reported them along with tons of ads. Yuck! There must be dozens of hundreds of sites, all with broken answers to questions about JavaScript and/or medicines.
Just because evidence is anecdotal doesn't mean it should be blithely discounted. If I say "Ouch" at being cut, that means the injury hurt me; the pain is quite real even if no one else has felt it.
All about me
Empiricism is all about saying "Here's what I did, and those are the results.". It's not empirical to say "Trust me, I did something I can't tell you about, and the results are really good".
My favorite part is how searching for something that happens to appear in a Stackoverflow question returns dozens of sites that copy and paste the Stackoverflow content surrounded by ads.
And I'm out of moderator points. Between the "oh, you're looking for something obscure... here's something that's spelled similarly" mentalality, and constantly returning pages from 2003 about technical subjects, it's pretty hard to find anything on Google that I care about. Except for using them to find large corporate sites.
Add the fact that spam copies are constantly higher than the original, and I see no solution.
Your ad here. Ask me how!
In the last few years, I've found search results have been dominated more and more by content mills like associated content, ehow, hubpages, about, and others; or some low quality Q&A page, like yahoo answers. The pages are hastily written and edited, and low content. The articles are also typically written by someone without any relevant knowledge or experience - so the information is common knowledge or wrong.
If google's metrics say quality is up, but their users think quality is down, then google's metrics need to be revised to match user experience more closely. I've started using duck duck go because they block content mills, and thus I think their results are as good or better than google, even without the complicated algorithms and all the data google has accumulated.