The Demographics of Web Search
adaviel sends a link to work out of Yahoo Research indicating that demographics can help Web searches; e.g. a women searching for "wagner" probably wants the 18th-century German composer, while for men in the US "wagner" is a paint sprayer. The Yahoo researchers claim that by taking user demographics into account, "they managed to get the chosen link to appear as the top-ranked result 7 per cent more often than in the standard Yahoo search." New Scientist mentions this research and two other innovative adjuncts to current search practice: following the mouse cursor as a proxy for eye tracking, and taking back bearings on online criminals by studying the searches they make. (The latter raises disburbing privacy questions: would you want Google trolling through your search data? How about governments?)
would you want Google trolling through your search data? How about governments?
- what do you mean 'would you want', who is asking you, plebes?
You can't handle the truth.
Wagner was a 19th-century composer, not 18th.
> would you want Google trolling through your search data? How about governments?
Heck yes I want Google trolling through governments' search data.
(Yes, I'm being facetious, but still. That Wagner example is pretty awful.)
Yes, that's really what we need...
What next, a search result that depends on your religion? If you type "Origin of the Universe", you get articles about the Bible if the engine thinks you're Christian, and scientific material otherwise?
They need to understand there is little value in subjective data. Their results are already biased enough, they should take steps to fix that, not make it worse.
I don't want my neighbors to find out about my obsessive and crippling fear of genetically engineered dinosaurs next time they do a search for "Toronto Raptors" from my computer.
Procrastination Man strikes again!
Because you're in the target demographic.
> Applying demographic data like this is a non-sequitur.
What would be useful is if I could choose to search from a different persons/demographic's point of view. Whether for ebay, amazon, google.
For example say I am looking for a gift for someone else. Or I am helping someone else search for stuff. Or I'm the sort of person who has rather different interests but with search keywords that overlap.
Same goes for reviews of restaurants/movies/etc. What I like, someone else may detest.
Lastly, it could also be interesting (and even beneficial) to be able to more easily see things from other people's point of view.
"e.g. a women searching for "wagner" probably wants the 18th-century German composer"
A -- women -- ???
I see a FLOOD of this, women used where woman should be used and woman where women should be used.
Wow......
what else would you expect from a site full of paranoid libertarian linux-using pedophile virgins?
I don't disagree with the general principle, but I have to wonder if 7 percent is worth the time, effort, and privacy issues involved. Also, note that the 7% is of a specific 30% subset; the actual value for all queries is 1.5%. I then have to ask how many of those 'upgraded' top-ranked results were already near the top (i.e. in the top 10/first page of results). I feel that the whole idea is getting less fruitful by the second... - S
... this idea smacks of a tool that's trying to be *too* helpful, and ends up getting in the way. Kinda like the old microsoft paperclip. I went and turned off this function in google accounts when I realized that my search results were being shaped based on my history, since that partially defeats my expectations of how a search engine behaves, and degrades the utility, insofar as the utility (to me the user) is based on receiving an unbiased sampling of the matches. I'm also troubled by this trend in the way that google delivers their news offerings, it seems that the logical progression of this is that we will mostly only be exposed to material that fit our highly individualized pre-existing reality bubbles.
The first thing I thought of when I read Wagner was the popular brand of jeans.
There was/are gender predictors out there that will look through your search history and try to predict what gender you are. They were mildly successful (though dead wrong in my case). I think I prefer Google's more invasive yet more accurate method of paying attention to which results I click on and giving me more of the same without regard to gender or age. I DO like getting local results though.
As far as women vs woman goes ... tsk! just think, "would I use man or men here?", and then add a wo onto the front of it, its not that hard.
I'm not a bird, I'm a super-advanced flying stealth dinosaur!
A search engine is supposed to find things which fit the regexp that you request.
Often someone will tell me in a forum to "search for x in google", what happens when the results are not exactly the same worldwide because of this technique?
Also, there are loads of people that use proxies and so on to search the web. (like people in china) Their demographics would appear all skewed because it would seem that someone in the proxy's country of origin is requesting to search for webpage x.
I don't agree with this technique at all. It just doesn't fit. Imagine if 'egrep' started filtering strings based on additional info that you could not easily control (like timezone), it would be annoying.
The search results are not just a regex matching. A modern search engine, like Google's, returns a ranked list of search results to you, and this ranking already has bias: the Pagerank algorithm sorts the results based on how popular the page is, as measured by the number of incoming links to that page. Of course, that is the general gyst of Pagerank as of the Google founders' research paper back in the late 1990s, and undoubtedly Google and other search engines have fine-tuned their algorithms since then to return "better" results to the user. But the point is still that there is already bias in the results.
Make no mistake that Google has not already thought of similar search result ranking algorithms similar to that posed in this Yahoo Research paper. The difference is that Google does not have a research arm like Yahoo, so they do not publish ideas like this. In hindsight, the Google founders were foolish to publish their Pagerank algorithm in the first place, but they were still at Stanford then.
We are borg. Resistance is futile. Make us a sammich and give us your wallet, man-slave.
#fuckbeta #iamslashdot #dicemustdie
This would not be an issue if Google simply did not save that information. Sure, I know: they say they want all that information for "targeted advertising". BUT... surveys have shown that people do not want "targeted advertising" in the first place! Despite claims of the "benefits" to consumers, turns out they're not interested if it means losing privacy.
Wilhelm Richard Wagner was born in 1813 and died in 1883 which makes him a 19th Century German composer, not an 18th c. German composer.
Remember, here in 2010 it's the 21st century; in 1910 it was the 20th c.; in 1810 it was the 19th c., etc.
That would probably be Georg Gottfried Wagner (1698-1756), who also played violin for Bach (1685-1750), another 18th-century composer, and not to be confused with Leonhard Emil Bach (1849-1902), a 19th-century composer.
Either that or KDawson thinks that "18 century" means "1800s."
(I am a musicologist, but I am not your musicologist, and this post is not intended as musicological advice).
THIS! I too have major hate of forced localization, everytime I set-up a new browser and load up Google, it goes to google.de (I'm in Germany, I speak the language well enough, but I want the content that I want, you stupid f'ing websites!). Even worse is Comedy Central and their South Park clips, an English-language blog embeds a clip from a South Park from Comedy Central, I click play, and guess what happens? The clip is dubbed in German! Aaarrrrggghhh!!!
Also trying to read myspace profiles (why, why?) gets pretty fucking irritating when it localizes the standard terms as "Favorite music", "Comments", etc, but then after the ":" displays the stuff the user's filled in, in their original language (usually English), meaning you have to read localized and then English words within the same sentence.
God damned morons all of them...
What time is it/will be over there? Check with my iPhone app!