The 7 Ways That People Search the Web
SpaceAdmiral writes "After the recent release of AOL search logs, Paul Boutin used the site splunkd.com to analyse the logs. His analysis groups searchers into seven categories: The Pornhound, the Manhunter, the Shopper, the Obsessive, the Omnivore, the Newbie, and the Basketcase. My favorite example search is in the Basketcase category: 'i hurt when i think too much i love roadtrips i hate my weight i fear being alone for the rest of my life.'"
The seven ways that people post on Slashdot.
The First Poster - Although this phenominon has been addressed and has somewhat lessened, there are still echoes of "First Post". These people wait on a "Mysterious Furure" story as post stupidities just to get in first.
The Fisher - These posters, rarely named Bobby, check-in with a kingly posts to generate replies and nothing more. Their posts, perhaps at first, seem to make sense, but on closer review contain mnay misstakes, intentionally designed to garner replies.
The old-timer - These posters, who hang around slashdot land, have forgotten to move on. They post just to show off their low slashdot id. This makes some druel, and others comment that low id does not mean more intelligent. However, they're all wrong anyway.
The reposter - Reposters wait for old stories to come up again and find modded-up comments from the old stories to repost. If this is the first time such a story is up, they post a bunch of old buzzwords that realign synergistic paradigm shifts.
The soap stander - Soap-Standers have what to say, and don't care where they say it, such as about why Bush is beery good, and that the UN and its anonymous leader are drunkards, and no amount of coffee will help.
The idiot - Idiots can't count, post moronic comments, and quickly type in useless garbage to fill in a little more space.
Have you read my journal today?
The Newbie.
They just figured out how to turn on the computer. User No. 12792510 is one of many who confuses AOL's search box with its browser address window--he keeps seaching for "www.google." Other AOLers type their searches without spaces between the words ("newcaddillacdeville") as if they were 1990s-era AOL keywords.
You forgot number seven. Should it be a troll? Or perhaps you forgot Poland?
Beyond your ability to count, the article seems quite interesting. My PhD supervisor made an intesresting comment about Google the other day: he said that people at Google must have very interesting information concerning the trends of "common knowledge," this is, before September, 11, 2001 a Google search for "september wtc" would yield totally different results, which surely will show the most "common" of things that people was searching for.
Likewise, if you searched for "Katrina" in Google before August 2005, you maybe ended in the page of someone named like that.
These are basic examples of informaiton that can be obtained with the "time" factor of the Google logs. Remember that time gives another dimension to your data, which lets you extract more information from it. Something among tht lines of image-pattern recognition, it is easier to match patterns from a moving image than from a static image.
Somethingawful posted what is presumably the first part in a series of gold from the AOL search logs: http://www.somethingawful.com/index.php?a=4016 These would definitely fit in the 'basketcase' category...
"'Yrch!' said Legolas, falling into his own tongue."
So was Neo a manhunter, an obsessive, or just an omnivore?
I'm not sure what category I fit in. I live in a padded cell, and just used AOL search for the first time to obsessively shop for Manhunter porn while eating a meat-and-vegetable stew.
Some attitudes replaced or by cgi optimizes
I know that I often can't recall websites I've been to once but want to revisit. I will, however, often remember the search terms that got me there -- sometimes very specific search terms, since I've narrowed it down from my first wide-net search.
For some reason I stubbornly don't use bookmarks often (as when you have too many, they quickly become worthless) so that obscure search term might be in my profile 300 times over the course of a year if it's a site that I visit daily from the office.
Then again, I post on Slashdot a ton... I'm sure it's pretty obsessive anyway.
"Trolls they were, but filled with the evil will of their master: a fell race..." -- J.R.R. Tolkien on Olog-hai
Ok, a lot of this AOL search data is quite amusing, in a sad, pathetic way. Too many people are having their jollies over it, while secretly being scared someone's going to get a peek at their searching record when Google finally loses its mind and makes the data available. It's easy to laugh, and be downright frightened, but in the end, we type our searches in, click the button and don't give it another thought. People wish to judge (myself included); it was a survival instinct in a far distant past and now it manifests itself as a morbid curiosity with the lives of other people.
People come in all colors, size, and mental states, AOL users undoubtedly more so. SO in their you'll find your fair share of freaks or freak wannabes, but mostly you'll just find people trying to find out things. What makes them freakish is not what they type in, but what they do with the information.
GetOuttaMySpace - The Anti-Social Network
The people who switch Tor nodes for every search they perform, so that later, then don't end up having articles written about them calling them weirdos and porn-freaks. Sheesh, what's wrong with horses?
This is a sig. It is appended to the end of comments I post.
"For we are all the Pornhound, the Manhunter, the Shopper, the Obsessive, the Omnivore, the Newbie, and the Basketcase, sincerely, the Breakfast Club"
Probably most people on this board are too young to remember anyway....
... it is nothing compared to the tremendous fallout that would befall the Interweb, should AOL ever unleash accidentally almost 13 years of collected AOL chatroom dialogue. It's one thing to see the search strings of User #24601, but quite another to see just what he says when emboldened by conversational anonymity. Of course, AOL would say now that they don't have that kind of data, that they haven't been logging chat since the earliest days of version 2.0 ... but come on, would you throw away all of that beautiful demographic fodder?
"Ye shall know the truth and the truth shall make you mad."
-- Aldous Huxley
Although AOL represents a certain niche market. i.e. it's heavily skewed towards n00bs.
I wonder if a similar Google sample would show different results or identify other archetypes?
I definitly fall into the "Omnivore" type. I would imagine most Slashdotters do.
Actually, maybe the Basket Case one is a better fit for most Slashdotters.
Execute? [Y/N] _
From TFA: The searches of AOL user No. 672368, for example, morphed over several weeks from "you're pregnant he doesn't want the baby" to "foods to eat when pregnant" to "abortion clinics charlotte nc" to "can christians be forgiven for abortion."
That, right there, tells you why we need to worry about "Uncle Sam" having access to *everyone's* search logs - search terms alone contain an implicit picture of what should be some of the most private aspects of your life. Now imagine if user number 672368 turns out to be, say, John McCain's daughter, and Karl Rove got his hands on this just before the Republican presidential primaries...
what do you think would happen? what do you think http://en.wikipedia.org/wiki/Joseph_McCarthyJoe McCarthy could have done with this kind of data? Write to your elected official and ask them these questions, and what safeguards they are putting in place to prevent any such abuse - and tell them you will be voting this fall. Then call your local news channel, and ask them to run a story on it, and ask the candidates for comment. The big networks won't start a story like this, but if a small station is lucky enough to get a clip of a politician stumbling over an answer, it'll be syndicated faster than you can say "feeding frenzy".
(and for those of you naive enough to think that Karl Rove doesn't have access to the equivalent government databases through some back-room contact or another, I have a bridge you might be interested in buying...)
I thought this was going to be a George Carlin skit.
In a way, it sort of is.
Innovation makes enemies of all those who prospered under the old regime... -- Machiavelli
This guy makes a lot of assumptions in his analysis. I often search for a single topic multiple times - not out of obsession, but to refine my search. Sometimes I didn't get what I was looking for the first time, so I'll go back and sift through the 2nd and 3rd pages. Sometimes I search again because I can't remember where the best page was. Each new search for the same topic may lead me to change my search target - at first I might be looking to buy a product at a major retailer, only to realize later that it might be available used. These are all reasons to repeat a search that have nothing to do with obsession. Also, the author may have labelled someone as "Obsessive" when they are searching for "texas real estate" when in fact they work in the real estate industry.
The article is an interesting read but I'm not buying into his category system.
I think the interpretation of why users google the same words over and over again is wrong. It's not obsessive or OCD at all.
For me, I will goggle words that I know that will contain links that I want to see, but never remember to bookmark. It's much easier to just go to a search engine and type a keyword and scroll for the link in the first 10 hits, rather than go through your hundreds of bookmarks to find exactly the one you're looking for.
The search data released by AOL could be great for research purpose. Even a stupid person will never release such kind of data. This seems very strategic.
If you analyze the search data you'll know that video market is growing rapidly. Search engines are surely driven by porn market. It explains why google was fighting for that data. It could have bought down their revenue. As search engines are useful for the development of internet, user data is useful for the development of future product because you know in advance who are the potential customers for the new product.
Spam: Any activity on internet to gain popularity without paying to advertising companies like Google.
"Do niggers have x-ray vision" Truly frightening. Also note the large religious influence in a lot of the searches.
...that nobody knows how to spell "beastiality"?
But if you've googled yourself and other people, it's a little trickier to determine from the list which one is you.
Though if the list of names contains 25 celebrities and "Joe Smith," it might not be hard to narrow down. At that point, you're the guy in the red shirt who beamed down to the hostile planet with Kirk, Spock, McCoy and Scotty. Yeah, the monsters could kill anyone in the party, but it doesn't take much effort to guess who it'll be.
You've hit upon something there. Perhaps 90% of my wife's usage of the internet is visiting 4 sites: Moviefone, Hotmail, MSN games, and IMDB. Does she use the convenient bookmark function... nope! Instead, her preferred solution is to home page Google and search for the sites there. I've explained the inherent wastefulness of using search for something where just typing into the Firefox's address bar will do the trick... but no dice.
I do have fun with it and occasionally, block Google on my DNS and watch as she complains that the internet is down.
The analysis denotes an astoundingly low level of understanding of how people actually use the web. What the author is seeing is absolutely normal and obvious. The only abnormal thing is his surprise.
The Pornhound. The fact that people search for porn on the web must rank as the discovery of the year!
The Manhunter. Who ever bookmarks other people's web pages? I just type the people's names in Google, and most people I know do just that. We are all manhunters I guess.
The Shopper. Same as above, who uses bookmarks? If I am interested in a treo 700 and I type it 37 times in 3 days, this just means that I find it more convenient to type treo 700, then select from the search results, that bookmark the result pages that I am interested in. And this is reasonable: why should I create bookmarks that become useless once I do buy the treo?
The Obsessive. See above. People that search often for A are simply people who don't bother creating a bookmark for some results about A. Big discovery.
The Omnivore. Ok, so when the pattern is complex, the author gives up. This is a really informative category.
The Newbie. Again, it must rank as one of the big discoveries of the year that there are newbies on AOL...
The Basket Case. This seems to be a repeat of "the omnivore", except that the author found these queries weirder.
Who posted this on Slashdot? It's not interesting research at all! It's junk!
While the pr0n crowd gets its own category, it would seem those who use the Internet to illicitly acquire copyrighted materials would simply fall into a subcategory of the Obsessive, and not an important enough one to be mentioned in the article. What of those brave souls who search for cracks, keygens, nocd patches, torrents, dvd rippers, and the like? Are they less prevalant than some would have us believe, or perhaps because AOL appeals to a less tech-savvy demographic, its searches might underrepresent them.
Okay, the bit about Omnivores who hit IMDb all the time hits a little too close to home here. :)
I use IMDb as much as I use Google. A merging of those two would be quite convenient for me.
Oh, and let's throw in Wikipedia while we're at it. While it may not be as accurate as a paper-published encyclopedia, it's still a zillion times more accurate than the average one-off webpage you're likely to find on any given topic.
How on earth does 'missing out the Googler' show that AOL is biased? The article wasn't even written by somebody at AOL.
I have absolutely no clue as to how the parent post deserves '+5 Insightful', I just guess there's enough people out there that *want* to believe anything bad said about AOL.
Notice one key factor here: These people all use AOL. That's naturally going to self-select your data towards certain segments of the population which might exhibit different inclinations than rest of the group.
I am officially gone from
{
The Pornhound: Lust,
the Manhunter: Envy,
the Shopper: Greed,
the Obsessive: Gluttony,
the Omnivore: Sloth,
the Newbie: Anger,
the Basketcase: Pride
};
*This is my post-RTFA relational array.
I don't know... those kinda look like lyrics...
---k--
</stupid>
Well the same user had such queries as "video pics of men fucking mares and cows free", "killing voyeur neighbors who are satanic cult mem", "old russian nuns for sex", "hillary clinton for sex", "west indian troublemakers in harlem", "female collies afgans vaginas free pics-beastial", "black gay boy sex with overbites porn site free", "fat ass gay black teens sex .com", "explain why people disbelieve the obvious", "emigrating to japan if you are mixed with blk white", "why can't america die", "who were the nazi's of japan and why", "latino pre-teen boys who love older men", "why people say best is'nt good enough", "is george w. bush jr. a american nazi party member", "american nazi party propaganda", "tuskegee university and people who have nightmares about it after attending it", "extermination of niggers in all boros of nyc", "how to end nightmares in the home" and "what makes an adult bully tic".
Truly a scary person. Bzzz.