Slashdot Mirror


Google's Next Challenge, Spam Results

krou writes "The Guardian's tech blog is running an interesting piece on Google's next big challenge, which is dealing with the spammers it helped create. 'Google is the 900-pound gorilla of search, with around 90% of the market (excluding China and Russia), and there's an entire industry which has grown up specifically around tickling the gorilla to make it happy and enrich the ticklers.' They quote Paul Kedrosky who notes that 'Google has become a snake that too readily consumes its own keyword tail. Identify some words that show up in profitable searches — from appliances, to mesothelioma suits, to kayak lessons — churn out content cheaply and regularly, and you're done. On the web, no-one knows you're a content-grinder.' Whether searching for reviews, products, businesses, or even conducting academic research, scraper sites are ranking higher than original content. The article speculates that Google may try fix the problem but, from Google's perspective, most of these type of sites use AdSense ads, and generate revenue for Google (89% of clicks come from the first page of results), so Google may not have an incentive to change things too much. Alternatively, people could stop using Google, 'because its search is damn well broken... The question is whether it would be visible enough — that is, whether enough people would do it — that it would show up on Google's radar and be made a priority.'"

44 of 238 comments (clear)

  1. Broken? by afidel · · Score: 4, Insightful

    Compared to what exactly? I find Bing's results to be far more broken so that rules out Bing and Yahoo. What's left?

    --
    There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    1. Re:Broken? by Joehonkie · · Score: 2

      Yeah, Google's results are not what they used to be (especially when searching for reviews and benchmarks), but I keep hearing people say they are useless and I have yet to see anything I prefer. I keep wondering if they are using the same Google that I am.

    2. Re:Broken? by oldspewey · · Score: 2

      I guess it's time to go back to Archie.

      --
      If libertarians are so opposed to effective government, why don't they all move to Somalia?
    3. Re:Broken? by afidel · · Score: 2

      I got pretty good results when looking for car reviews the last couple weeks, put it $model review and the first two pages were 99% relevant. Put $model problem in and you get back mostly forum results talking about problems with that model car. I guess certain very high value keywords might be attacked because there's enough profit to be made worming your way around the algorithms to make it worthwhile but in general I'd say they do exactly what they were designed to do, give you the most relevant and authoritative information first.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    4. Re:Broken? by beerbear · · Score: 4, Informative

      Duck Duck Go

      --
      Hold my beer and watch this!
    5. Re:Broken? by Gaygirlie · · Score: 2

      Indeed. I do try the other search engines every now and then, but even when searching for something rather obscure Google returns more relevant results than the others. The problem is of course that Google's database has literally bajillions of webpages indexed and thus it's hard to come up with exactly correct results if you only use one or two keywords. But then again, the fault mostly lies with the users: they need to learn to make more coherent searches. Even adding two more keywords helps to narrow the search down, and using the minus-sign in front of one for excluding websites with that keyword helps a LOT.

      Making good searches simply is something one must learn, no search engine can read your mind and find exactly what you mean.

    6. Re:Broken? by beerbear · · Score: 2
      --
      Hold my beer and watch this!
    7. Re:Broken? by whiteboy86 · · Score: 2

      The system is broken, it honors poor websites that are so bad that people are left with no option but to "escape" through Google Adsense links. Most original content websites do not score such click through rate as those keyword spammers planting shitty webs full of Google links.

    8. Re:Broken? by doom · · Score: 2

      I do try the other search engines every now and then, but even when searching for something rather obscure Google returns more relevant results than the others. Seriously? I haven't noticed any limitation in the amount of ground that blekko.com or duckduck.go cover... and it really doesn't matter if there are a bajillion pages you index when two thirds of them are spam pages with content ripped from wikipedia.

      Also, when I search for something really obscure, google always wants to push me in the most obvious direction, which is not typically what I want...

      Making good searches simply is something one must learn, no search engine can read your mind and find exactly what you mean. Tell us more about this fabulous skill you've developed. We're all new to this internet thingie here.

      One of many search engine features I can think of, but have never seen implemented, would be to treat different grammatical forms as rough synonyms. It's a little irritating to have to run a nearly identical search multiple times, once on a term like "indexes" and again on "indicies" and again on "indexing".

    9. Re:Broken? by Omestes · · Score: 2

      It largely ignores quotes too now. Which pisses me off to no end.

      I'm sick of the time relevance problem too. If I type in something about a current thing with previous versions, topics about the previous version push the current one way down in the results. Try searching for a common Ubuntu problem in the current version, for example. You have to narrow it down "in the last year" EVERY single damn time you search. Do I really care about configuring the screensaver in Kubuntu 5.5, when 99% of users are not using it anymore? They really need to at add date to their relevance calculations.

      And let me use quotes, and +/-, and basic boolean logic.

      --
      A patriot must always be ready to defend his country against his government. -edward abbey
    10. Re:Broken? by Threni · · Score: 2

      > And let me use quotes, and +/-, and basic boolean logic.

      You can do all of those with google. Go here:

      http://www.google.co.uk/advanced_search

      create your search term using the UI and see what string it comes up with as its search term.

  2. Uh by moogied · · Score: 3, Insightful

    Why is slashdot providing us with opinions? News = facts and context. Gossip = Some facts, some context, lots of opinions. "because its search is damn well broken..." -- do not want to hear.

    --
    So basically, -1 troll/offtopic is really slashdots way of saying "I hate that you thought of something before me."
    1. Re:Uh by shish · · Score: 2

      Why is slashdot providing us with opinions?

      Because it's a for-profit gossip rag, and more gossip = more ad views. Thankfully you can normally wait 5 minutes then look at the comments and see 10 posts along the lines of "here's why the article is bull and the slashdot editor is a tard" and get some links to actually informative sites.

      What really confuses me is why the editors seem to reject submissions with links to source data, and approve submissions that come in hours or days later linking to some third party's useless opinion blog o_O

      --
      I mod down anyone who says "I will be modded down for this", regardless of the rest of their comment
  3. Mixed metaphor alert by wjousts · · Score: 4, Funny

    So which is it? Is Google a gorilla or a snake? Make your mind up!

    1. Re:Mixed metaphor alert by oodaloop · · Score: 5, Funny

      You want them to fix the summary? I'm afraid that train has sailed.

      --
      Tic-Tac-Toe, Global Thermonuclear War, and relationships all have the same winning move.
    2. Re:Mixed metaphor alert by TheGratefulNet · · Score: 3, Funny

      train has sailed??

      wow. you can lead a whore to water but you can't make her think.

      --

      --
      "It is now safe to switch off your computer."
    3. Re:Mixed metaphor alert by Combatso · · Score: 4, Funny

      I am tired of all the motherf*cking spam results on this motherf*cking search engine!

    4. Re:Mixed metaphor alert by Ryanrule · · Score: 3

      If we can hit that bullseye, the rest of the dominoes will fall like a house of cards. Checkmate.

  4. They are still far better than what came before by Ash+Vince · · Score: 2

    The thing is thought that the other search engines before Google were terrible in this regard.

    Before Google the SEO business was rife with dodgy practices. It was only when google showed that these dodgy practices were not going to help get to the top of their results that the SEO market grew up and started being more constructive for the web as a whole.

    Before this they would just do whatever they could to game their clients page higher up the ranks using whatever means they could just to get their clients page hits. This was a nice easy metric that clients could easily track and understand with minimum of technical knowledge. Their customer to visitor ratios might have been going down as the page hits went up but this was very hard to track before the whole web metric industry grew up. One might even say that the web metric industry owes much to Google in this regard as now any money spent on SEO and advertising usually needs to be justified by also spending money on tracking too.

    --
    I dont read /. to RTFA, I read /. to offend people in ignorance.
    1. Re:They are still far better than what came before by krou · · Score: 2

      The article raises this point in the second paragraph: 'In fact, the problem that plagued the first generation of search engines such as Altavista now seems to be gaining traction on Google, which outdistanced those earlier rivals precisely because it dumped the spam so effectively.'

      --
      'If Christ had tweeted the sermon on the mount, it might have lasted until nightfall.' - John Perry Barlow
  5. People change.... only for something better by djsmiley · · Score: 2, Interesting

    People wont change while theres nothing better to change to...

    I still don't "see" these issues with google that supposidly exist, I know others dont see these issues iether who aren't as web savvy as me, but if they DO exist, it's only when something better comes along that people will switch, I tried bing..... and couldn't even get it to find microsoft security essentials when searching for mse as its normally know.

    --
    - http://www.milkme.co.uk
    1. Re:People change.... only for something better by eddy · · Score: 5, Informative

      People wont change while theres nothing better to change to...

      I see some nerds switching to http://duckduckgo.com/

      --
      Belief is the currency of delusion.
    2. Re:People change.... only for something better by Spliffster · · Score: 3, Insightful

      ...

      I still don't "see" these issues with google that supposidly exist...

      Never got expertSexChange.com (and the like) in your results? I get them frequently and it's annoying.

    3. Re:People change.... only for something better by dotancohen · · Score: 2

      Wow, thanks! I have a few keywords that I use to test a search engine, and this one really is better and more relevant than google. I can't believe it, I've been waiting for this for years. Thanks!

      --
      It is dangerous to be right when the government is wrong.
  6. What scrapers? by savanik · · Score: 4, Interesting

    I don't have this problem - when I search for things on Google, I get relevant results from real pages. Either I regularly search for things that nobody scrapes, or there's actually some skill involved in getting relevant results that most people can't be bothered with.

    The biggest problem I've had of late searching on Google is trying to find reviews of hardware and getting ninety billion pages trying to sell it to me with 'Be the first person to review this product!" I need to find a different keyword on that.

    1. Re:What scrapers? by OolimPhon · · Score: 2

      I find exactly the opposite problem. I want to buy something and all I get is pages of reviews for the damn thing. Finding an actual shop is proving almost impossible, and it's not as if I'm looking for unusual items either.

      Yeah, I have noticed all the extra cruft creeping into Google lately. It's definitely not as good as it was a few years back.

    2. Re:What scrapers? by Anonymous Coward · · Score: 2, Informative

      I've found the easiest way to get around this problem is to remove words from google search. I usually use the following form:

      search: "product name" review -buy -first

      It's not perfect but it's much better than the results w/o the eliminated words.

    3. Re:What scrapers? by fulldecent · · Score: 2

      -"0 reviews" -"first to review"

      --

      -- I was raised on the command line, bitch

    4. Re:What scrapers? by Charliemopps · · Score: 2

      No, you need to search for something hard to find. The worst are the sites that aggregate their content from forums and then pile them above the actual answer. Experts Exchange is a good example. Especially if you are trying to find help with coding. Their results have been stolen from forums where the help was giving freely. But you have to pay to see nothing more than a copy of the original results. The only reason you can't see the free, original result, is because experts-exchange has buried the result on page 400 of goggles search results via their corruption of the search engine.

      There are ways around experts-exchange (firefox plugins) but really Google needs to take this stuff headon as the sole reason they became so dominant was because all of the other search engines had been polluted in this very same way at the time.

    5. Re:What scrapers? by Eil · · Score: 4, Insightful

      Try doing any programming or system administration related search and *not* have at least one of the first five results populated with the following worthless domains:

      - experts-exchange.com
      - ehow.com
      - about.com
      - scribd.com
      - ittoolbox.com

      These sites don't necessarily scrape and repost content, but the content they do provide is invariably worthless or too difficult to navigate in order to be worth my time. In fact, I really don't mind mailing list, wikipedia, and StackOverflow scrapers because at least they provide useful content as long as you block all the ads and javascript by default.

      Spammers have gotten pretty darn good at figuring out how to game Google and Google's countermeasures are increasingly ineffective. What Google really needs to do is place some control over the results returned in the user's hand. I would pay actual money to Google if they would let me customize search results as follows:

      - A way to mark results as useful or not for the query entered, and refine later searches based on those
      - Blacklist certain domains from showing up in my results, ever.
      - Add content qualification (for example, prefer sites that have a certain text-to-graphics ratio)

  7. Re:Playing the game changes the game by Atzanteol · · Score: 5, Insightful

    Believe it or not it's easier to be a good ad company if you're also a good search company. One doesn't have to suffer to the benefit of the other. It's easier to sell a product if it's a good product.

    --
    "Ignorance more frequently begets confidence than does knowledge"

    - Charles Darwin
  8. Hardly limited to search-engine spamming... by seebs · · Score: 2

    Google is now responsible for a fairly large portion of the plain old spam I get. As in, their computers send it. Their latest gimmick is a new "feature" of Google Groups:

    1. You can't send emailed abuse reports, they don't process those.
    2. You have to go to the group's home page and click "Report This Group".
    3. But you can't unless you're logged into a Google account, and your Google account is a member of the group. Otherwise, you just get the "you must be a member of this group to see this page" page.
    4. You can directly navigate to groups.google.com/abuse/, but...
    5. They don't do anything about spam reports anyway.

    Similarly, they are apparently rapidly becoming a world leader in Usenet spam, because they don't have any particular objection to people posting spam. Or, if they do, it has not yet risen to the level of the kind of objection that results in doing something to stop it.

    --
    My blog: http://www.seebs.net/log/ --- My iPhone/iPad app: http://www.seebs.net/seebsfrac/
  9. Re:Somebody else will come along, eventually. by oldspewey · · Score: 2

    Some other search engine will eventually come along. They will provide a better service, and Google will swallow them up like a ripe, juicy tomato.

    FTFY

    --
    If libertarians are so opposed to effective government, why don't they all move to Somalia?
  10. Google created... what? by eepok · · Score: 2

    "Google's next big challenge, which is dealing with the spammers it helped create."

    Except, "No." Creating a profitable system does not mean one helped to create policy-infringers, law-breakers, and exploiters. If we accepted that irrationality, we could say that young, pretty boys and girls create child rapists, cars with windows spontaneously generate car thieves, and political systems create thieving dirty politicians. But that's not true.

    Exploiters and criminals are created through a combination of their own high expectations, the lack of opportunity (by their standards), and their lack of ethical conviction. They only act opportunistically or impulsively on exploitable situations.

  11. mesothelioma by mcgrew · · Score: 2, Interesting

    This puzzled me: "profitable searches — from appliances, to mesothelioma suits, to kayak lessons"

    I'm thinking, "Mesothelioma suits? What's that, a protective suit you wear when you're working around asbestos?"

    Before Google came up I realized he was talking about lawsuits. Gees, lawyers and businessmen talking sure confuse this old nerd sometimes. To a businessman, "suit" is what lawyers bring, to a nerd, it's usually protective gear.

    If you go talking about RAM here, I'm going to think "memory". If you're talking about trucks, you need to say "Dodge RAM". If you're talking about Mesothelioma suits, you need to say "Mesothelioma lawsuits unless you're talking about protection from asbestos.

    1. Re:mesothelioma by circletimessquare · · Score: 2

      i remember an article, awhile back, that said the most expensive word in google's adword program (where advertisers pay a biddable amount each time someone clicks on their ad when someone searches for that word) was not some sex-related term, not some date site term, but... drum roll please... mesothelioma

      if you searched for that word, and clicked on an ad next to the search results, you were costing that advertiser something like $10 just for that click

      jeepers

      --
      intellectual property law is philosophically incoherent. it is your moral duty to ignore it or sabotage it
    2. Re:mesothelioma by windcask · · Score: 2

      The problem with that term is that you have to possess some degree of intelligence to actually be able to spell it, therefore eliminating almost the entire target market for spam pages.

      Then again...*checks common misspellings*...okay. I get it now.

    3. Re:mesothelioma by circletimessquare · · Score: 2

      LOL

      someone needs to introduce you to 4chan and anonymous

      --
      intellectual property law is philosophically incoherent. it is your moral duty to ignore it or sabotage it
  12. Re:Somebody else will come along, eventually. by Anonymous Coward · · Score: 2, Insightful

    If someone does, who will find out about it first: users or Google? If Google finds out first, they just have to stop the revenue-generating pollution to a degree that they remain best, and no one will ever know that the newcomer had briefly been better.

    For all its "brokenness" Google just has to remain best and they'll win. And if that brokenness is a result of allowing noise because it makes them money, rather than a technology limitation, then it's something they have control of. I wouldn't bet on Google losing any time soon.

  13. Normal people is doomed. by Tei · · Score: 2

    I see 3 problems with normal people using google:

      - Normal people can't tell the diference a scam and a honest page. The preferences are reverted, what you know is the honest page of a hacker (peple like Stallman, or the homepage of a project like MediaWiki) will look scary and dangerous, while will love a page full of flash ads, that probably are tryiing to install spyware.

      - Normal people are the target of spammers. If you search for tecnical problems with ocropus, you will see less spam targeted at you. While if you search something popular like soccer of music... you will see a lot of shit.

      - Normal people and spammers have similar mindset. Want everything withouth paying. Don't have any tecnical moral or respect for internet. Is not his home, so see not problem is shitting here, making internet worse. This bias spammers and normal people to the same areas of the internet.

    --

    -Woof woof woof!

  14. Re:Fake product reviews by canajin56 · · Score: 2

    There are two reasons for this, only one of which you can do anything about. The first is synonym matching. That's where you search for something like, I don't know, "website" and it will match "web page" as well. (I'm sure there are better examples). This is the one you can do something about, by putting +website, which forces it to appear as-is. You can also get this by putting a single word in quotes.

    The second reason is that google matches not only text on the page, but also frequent text in anchors that link to it. That is to say, even if Slashdot didn't actually have the word "Slashdot" anywhere on it, it would still show up as the first match because of all of the people linking to it with the word Slashdot in their anchor text somewhere. This is the one that you can't do anything about. I do wish you could put something beside a keyword to tell Google to only show websites that contain that keyword, without any fancy stuff. (Though for all I know, maybe the + operator does that too...)

    The third is that Google can't crawl every website every second. (Three, there are three reasons.) Dynamic pages will always be slightly out of date, so if a Slashdot article slips off the front page between being crawled and when you search, you'll be frustrated. That's mostly avoided because if you match, say, a Slashdot article, your top link will almost always be a link to the actual article+comments, rather than to the front page. Even still, if you find a match to a comment in a Slashdot article, when you click the link that comment might be collapsed, and thus will not show up in a text search unless the keyword was in the first handful of words. There are plenty of non-Slashdot examples as well, I'm sure.

    Fourth, a fanatical devotion to the pope...four...I'll just come in again shall I?

    --
    ASCII stupid question, get a stupid ANSI
  15. Re:You're wrong. by wjousts · · Score: 4, Insightful

    Good enough only involves being better than the competition. Which they were.

  16. Drug Interaction info is unfindable by billstewart · · Score: 3, Informative

    If you want to get information about how Drug $A interacts with Drug $B, Google's pretty useless - you mostly get sites that want to sell you drugs and list $A and $B, or at best lists of medical papers, usually scraped by reformatters, which have some paper on $A and another paper on $B. (Of course, if you want information on how Drug $A interacts with Drug $$V, then you're totally out of luck :-)

    I've given up on Google and use Wikipedia for any medical information.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  17. Re:You're wrong. by DigitalSorceress · · Score: 2

    I think the point a lot of us are trying to make is that Google got where they are today by those means, but there's been a shift over the last couple years toward more short-term thinking. We all came to Google because they did what you said - they did search better than others.

    The problem is that while not intentionally being evil (in my opinion) Google makes huge money from "Made for AdSense" type SEO garbage sites. If they took all those guys out, they'd make less. At the same time, they need to be careful they don't "AltaVista" themselves out of relevance.

    Maybe not by intent, but by action/inaction, Google has betrayed those initial principles in favor of "what works and what makes money".

    This whole article and discussion led me to doing so research on Google itself about blacklist features. For a period of time, if you used Google while logged in, you'd get this option to kind of down-rank search results/domains that popped up in your searches. Over time, the idea is that you'd get more relevant results and Google would learn about what folks found to be useful and not. Then, at some point about a year ago, they removed the ability to blacklist/demote/downlink in favor of "starring" good results.

    I found lots of discussions on the Google support forums about folks asking for BlackList features, and when Google folks replied, they suggested using the - operator... even when folks time and time again said that they knew how to use the - operator but wanted a PERMANENT PERSONAL BLACK LIST, Google folks kind of ignored the issue or deliberately misunderstood, suggesting workarounds.

    (January 2009 Google support thread)

    I'm thinking someone's written a FireFox addon that would provide google searches, but automatically append " -scummydomain.com -crappydomain.com ... etx) to every search before sending. If they haven't I need to teach myself how to write one so I can make it.

    --

    The Digital Sorceress