Slashdot Mirror


The Demographics of Web Search

adaviel sends a link to work out of Yahoo Research indicating that demographics can help Web searches; e.g. a women searching for "wagner" probably wants the 18th-century German composer, while for men in the US "wagner" is a paint sprayer. The Yahoo researchers claim that by taking user demographics into account, "they managed to get the chosen link to appear as the top-ranked result 7 per cent more often than in the standard Yahoo search." New Scientist mentions this research and two other innovative adjuncts to current search practice: following the mouse cursor as a proxy for eye tracking, and taking back bearings on online criminals by studying the searches they make. (The latter raises disburbing privacy questions: would you want Google trolling through your search data? How about governments?)

34 of 131 comments (clear)

  1. who is asking you? by roman_mir · · Score: 2, Insightful

    would you want Google trolling through your search data? How about governments?

    - what do you mean 'would you want', who is asking you, plebes?

  2. Correction: by BrokenHalo · · Score: 4, Informative

    Wagner was a 19th-century composer, not 18th.

    1. Re:Correction: by Yvan256 · · Score: 5, Funny

      Are you sure? I just searched and the first result is this Slashdot article which clearly says that he was an 18th century composer, right in the summary.

    2. Re:Correction: by jonbryce · · Score: 2, Informative

      Richard Wagner is much more famous though.

    3. Re:Correction: by BitterOak · · Score: 2, Insightful

      Are you sure? I just searched and the first result is this Slashdot article which clearly says that he was an 18th century composer, right in the summary.

      Good heavens, why was this modded Insightful? I think the poster was going for Funny. Anyhow, a quick Wikipedia search reveals that Richard Wagner lived from 1813-1883, making him a 19th century composer.

      --
      If I can be modded down for being a troll, can I be modded up for being an orc, or a balrog?
    4. Re:Correction: by Yvan256 · · Score: 3, Insightful

      Modded insightful twice too... I guess some people can't be bothered to think for themselves and just moderate to increase whatever the current moderation is.

    5. Re:Correction: by icebike · · Score: 2, Insightful

      Wagner was a 19th-century composer, not 18th.

      But when I (male) search for Wagner I'm more interested in Jill than Josef or Richard.

      --
      Sig Battery depleted. Reverting to safe mode.
    6. Re:Correction: by Phroggy · · Score: 4, Funny

      Are you sure? I just searched and the first result is this Slashdot article which clearly says that he was an 18th century composer, right in the summary.

      Quick, somebody update Wikipedia! You can cite this Slashdot article as your source.

      --
      $x='S24;r)>63/* h@<5+oZ)32"5cz';$me='phroggy'x$];
      $x=~y+ -xz+\0-Tx+;print$_^chop$me for split'',$x;
    7. Re:Correction: by MagusSlurpy · · Score: 2, Funny

      Are you sure? I just searched and the first result is this Slashdot article which clearly says that he was an 18th century composer, right in the summary.

      No it doesn't, it says he's a 20th century paint-sprayer company.

      --
      My sister opened a computer store in Hawaii. She sells C shells by the seashore.
    8. Re:Correction: by tehcyder · · Score: 2, Informative

      That sound you can hear is an almighty whoosh caused by the Ride of the Valkyries.

      --
      To have a right to do a thing is not at all the same as to be right in doing it
  3. Sauce for the goose. by Aaron+Denney · · Score: 5, Funny

    > would you want Google trolling through your search data? How about governments?

    Heck yes I want Google trolling through governments' search data.

  4. Neat-o. by Bieeanda · · Score: 4, Insightful
    So the search I did last night, for 'how to fix a cracked toilet', might result in 'hire a plumber, lady' instead of 'go to Home Depot for a replacement, dude'.

    (Yes, I'm being facetious, but still. That Wagner example is pretty awful.)

    1. Re:Neat-o. by Capt.+Skinny · · Score: 2, Insightful

      Just because it's true for the group doesn't mean it's true for the individual.

      Improving search results is about aggregates -- returning the best results for the most queries. Individuals don't matter. Google has used this fact to their advantage to show many links to many people while keeping their interface clean: each user only sees three links at the bottom of the main page, for example, but each of n>>3 links displayed in that spot is viewed many times.

      If Yahoo can move relevant links higher in the result list for 15 percent of queries, the only concern is about the quantity of queries for which relevant links have moved lower. If stereotypes do in fact represent the majority of a demographic, then it doesn't really matter to a search provider whether you or I as individuals represent our respective stereotypes.

      Last, what if you want to know what other people not from your demographic group are seeing?

      Why would you want to know? SEO? The goal of search engine optimization is completely at odds with the goal of improving search results: higher rankings of a site in spite of its relevance to the user, versus higher rankings for a site based on its relevance to the user.

      Oh gheeze. A philosophical rant. That wasn't my intention. It really wasn't.

  5. Sexist search engines by loufoque · · Score: 5, Insightful

    Yes, that's really what we need...

    What next, a search result that depends on your religion? If you type "Origin of the Universe", you get articles about the Bible if the engine thinks you're Christian, and scientific material otherwise?

    They need to understand there is little value in subjective data. Their results are already biased enough, they should take steps to fix that, not make it worse.

    1. Re:Sexist search engines by Krahar · · Score: 2, Insightful

      A search engine's purpose is finding what the searcher is looking to find, not in finding what you or someone else think they should be looking for.

    2. Re:Sexist search engines by noidentity · · Score: 2, Insightful

      Just imagine trying to share tips on finding things with someone. "Well, it helps if you're a male Atheist, otherwise I'm not sure how to find things related to this." Again, smart search engines are worse than dumb ones, because you can't predict how a smart one will respond to your query. Either it gets it right, or it gets it wrong and there's little insight you can have into why. Give me a dumb tool that does what I tell it and whose behavior I can predict and thus adjust to.

    3. Re:Sexist search engines by Krahar · · Score: 2, Interesting

      We already don't know how Google works. If you want to tell someone about something, you can give them a link, or you can log out of your Google account if you are doing this on Google and this comes in the way. This technique allows to give people the link they are looking for more often than if it isn't in use, and that's exactly what a search engine is about. I'm sure you can opt out and most people using search engines aren't as knowledgeable about what they are doing as you might be.

  6. Thanks but no thanks by jheath314 · · Score: 3, Funny

    I don't want my neighbors to find out about my obsessive and crippling fear of genetically engineered dinosaurs next time they do a search for "Toronto Raptors" from my computer.

    --
    Procrastination Man strikes again!
  7. Re:Why is it red? by Anonymous Coward · · Score: 2, Funny

    Because you're in the target demographic.

  8. Re:Then again... by TheLink · · Score: 4, Insightful

    > Applying demographic data like this is a non-sequitur.

    What would be useful is if I could choose to search from a different persons/demographic's point of view. Whether for ebay, amazon, google.

    For example say I am looking for a gift for someone else. Or I am helping someone else search for stuff. Or I'm the sort of person who has rather different interests but with search keywords that overlap.

    Same goes for reviews of restaurants/movies/etc. What I like, someone else may detest.

    Lastly, it could also be interesting (and even beneficial) to be able to more easily see things from other people's point of view.

    --
  9. wow... Just, wow.. by Dee+Ann_1 · · Score: 3, Informative

    "e.g. a women searching for "wagner" probably wants the 18th-century German composer"

    A -- women -- ???

    I see a FLOOD of this, women used where woman should be used and woman where women should be used.

    Wow......

  10. Re:Speaking as a female slashdotter by Anonymous Coward · · Score: 2, Funny

    what else would you expect from a site full of paranoid libertarian linux-using pedophile virgins?

  11. ROI at 7 Percent by Sully2161 · · Score: 2, Interesting

    I don't disagree with the general principle, but I have to wonder if 7 percent is worth the time, effort, and privacy issues involved. Also, note that the 7% is of a specific 30% subset; the actual value for all queries is 1.5%. I then have to ask how many of those 'upgraded' top-ranked results were already near the top (i.e. in the top 10/first page of results). I feel that the whole idea is getting less fruitful by the second... - S

  12. highly dubious by Audax_23 · · Score: 5, Insightful

    ... this idea smacks of a tool that's trying to be *too* helpful, and ends up getting in the way. Kinda like the old microsoft paperclip. I went and turned off this function in google accounts when I realized that my search results were being shaped based on my history, since that partially defeats my expectations of how a search engine behaves, and degrades the utility, insofar as the utility (to me the user) is based on receiving an unbiased sampling of the matches. I'm also troubled by this trend in the way that google delivers their news offerings, it seems that the logical progression of this is that we will mostly only be exposed to material that fit our highly individualized pre-existing reality bubbles.

  13. funny .... by smisle · · Score: 2, Insightful

    The first thing I thought of when I read Wagner was the popular brand of jeans.

    There was/are gender predictors out there that will look through your search history and try to predict what gender you are. They were mildly successful (though dead wrong in my case). I think I prefer Google's more invasive yet more accurate method of paying attention to which results I click on and giving me more of the same without regard to gender or age. I DO like getting local results though.

    As far as women vs woman goes ... tsk! just think, "would I use man or men here?", and then add a wo onto the front of it, its not that hard.

    --
    I'm not a bird, I'm a super-advanced flying stealth dinosaur!
  14. This is wrong. by sea4ever · · Score: 2, Insightful

    A search engine is supposed to find things which fit the regexp that you request.
    Often someone will tell me in a forum to "search for x in google", what happens when the results are not exactly the same worldwide because of this technique?
    Also, there are loads of people that use proxies and so on to search the web. (like people in china) Their demographics would appear all skewed because it would seem that someone in the proxy's country of origin is requesting to search for webpage x.
    I don't agree with this technique at all. It just doesn't fit. Imagine if 'egrep' started filtering strings based on additional info that you could not easily control (like timezone), it would be annoying.

    1. Re:This is wrong. by RobertLTux · · Score: 2, Insightful

      what would help is a simple way to toggle custom/standard searches and to see which way the toggle is currently set

      --
      Any person using FTFY or editing my postings agrees to a US$50.00 charge
  15. There is already bias in search results by jmcbain · · Score: 4, Insightful

    The search results are not just a regex matching. A modern search engine, like Google's, returns a ranked list of search results to you, and this ranking already has bias: the Pagerank algorithm sorts the results based on how popular the page is, as measured by the number of incoming links to that page. Of course, that is the general gyst of Pagerank as of the Google founders' research paper back in the late 1990s, and undoubtedly Google and other search engines have fine-tuned their algorithms since then to return "better" results to the user. But the point is still that there is already bias in the results.

    Make no mistake that Google has not already thought of similar search result ranking algorithms similar to that posed in this Yahoo Research paper. The difference is that Google does not have a research arm like Yahoo, so they do not publish ideas like this. In hindsight, the Google founders were foolish to publish their Pagerank algorithm in the first place, but they were still at Stanford then.

  16. Re:wow... Just, wow.. by girlintraining · · Score: 2, Funny

    We are borg. Resistance is futile. Make us a sammich and give us your wallet, man-slave.

    --
    #fuckbeta #iamslashdot #dicemustdie
  17. They could simply not save the info. by Jane+Q.+Public · · Score: 2, Insightful

    This would not be an issue if Google simply did not save that information. Sure, I know: they say they want all that information for "targeted advertising". BUT... surveys have shown that people do not want "targeted advertising" in the first place! Despite claims of the "benefits" to consumers, turns out they're not interested if it means losing privacy.

  18. Wrong century by Raffaello · · Score: 2, Informative

    Wilhelm Richard Wagner was born in 1813 and died in 1883 which makes him a 19th Century German composer, not an 18th c. German composer.

    Remember, here in 2010 it's the 21st century; in 1910 it was the 20th c.; in 1810 it was the 19th c., etc.

    1. Re:Wrong century by digitig · · Score: 2, Funny

      Yes, but the women are searching for Agnes Wagner, the incredibly obscure 18th century German composer. Don't bother doing a web search for her, you're not in the right demographic to find anything.

      --
      Quidnam Latine loqui modo coepi?
  19. Wanger *was* an 18th-century composer by geckoFeet · · Score: 2, Interesting

    That would probably be Georg Gottfried Wagner (1698-1756), who also played violin for Bach (1685-1750), another 18th-century composer, and not to be confused with Leonhard Emil Bach (1849-1902), a 19th-century composer.

    Either that or KDawson thinks that "18 century" means "1800s."

    (I am a musicologist, but I am not your musicologist, and this post is not intended as musicological advice).

  20. Re:If you are surfing from France, you speak Frenc by netsharc · · Score: 2, Insightful

    THIS! I too have major hate of forced localization, everytime I set-up a new browser and load up Google, it goes to google.de (I'm in Germany, I speak the language well enough, but I want the content that I want, you stupid f'ing websites!). Even worse is Comedy Central and their South Park clips, an English-language blog embeds a clip from a South Park from Comedy Central, I click play, and guess what happens? The clip is dubbed in German! Aaarrrrggghhh!!!

    Also trying to read myspace profiles (why, why?) gets pretty fucking irritating when it localizes the standard terms as "Favorite music", "Comments", etc, but then after the ":" displays the stuff the user's filled in, in their original language (usually English), meaning you have to read localized and then English words within the same sentence.

    God damned morons all of them...

    --
    What time is it/will be over there? Check with my iPhone app!