Objectively Comparing Competing Search Engines?
aendeuryu asks: "My default search engine of choice is, like most of you I assume, Google. That said, some complaints about Google over the years do seem to have some merit -- basically, that sometimes the indices aren't always updated, that it's too easy to manipulate via googlebombing or legislation, and that maybe too many of its featured services never get out of beta stage. Maybe the fact that Google has gone so long without significant competition is enough to make one at least begin to ask questions about it possibly becoming stagnant. Personally, I'm so used to doing things the Google way (and achieving acceptable results quickly) that I'm not really interested in switching -- case in point, all the above links referenced were quickly found via Google. However, what am I missing out on by not giving (for example) Yahoo search a shot? Or, more to the point, how would one go about trying to effectively and objectively compare competing search engines? In what areas have people found Google to have become obsolete for their purposes? Have less ignorant people than myself figured out ways to test a competing search engine's efficacy for themselves?"
If you know how to use google to achieve your results, whats the issue? If a better search comes along, im sure it will be posted on slashdot (twice), so you dont need to worry about missing out.
"Something's wrong with you...and I hope we never do meet again." - Deftones When Girls Telephone Boys
personally I prefer dogpile. I like the organization of results much better.
""I don't see an obvious biosynthetic pathway from allicin (CH2=CHCH2SS(=O)CH2CH=CH2)to isothiocyanates (R-N=C=S) ""
Yahoo search is okay, not as nice as google, but a good second.
Alltheweb.com has found things google hasn't, but in general I rarely use it.
I rarely use MSN because it was awful all the times I tried it. Same for Altavista.
In general, if I'm searching for something I'll use google first and then Yahoo and Alltheweb to catch anything that google may have missed.
God Bless America. Why? Did it sneeze?
I think you have said it already, Google is good for returning acceptable results quickly, but acceptability is something very subjective.
Even by comparing keyword search side by side, one can still consider a worse result better, but who's to judge except the user?
I kept using Yahoo until it's not giving me results that I think are good enough, then I switched to Google, and I'll keep using Google until it's not returning good enough result.
Rock that crushes, Paper & Scissors that don't matter.
Hello.
I have been browsing your internet site for several hours and am generally impressed with your coverage of IT related issues. However, when I saw an article on Google I just had to voice my opinion. I would just like to say how increadibly appalled I am with the Google internet search engine. My main concern with Google is how easy it makes for malicious people to find information on the now illegal Bittorent computer software.
Some background information on Bittorent and what makes it so dangerous:
1. The Bittorent computer software allows distribution copyrighted material.
2. In doing so it inadvertently causes excessive use of bandwidth. Now you might say that this is fairly harmless, but is it really? The effects of electromagnetic radiation pollution caused by this cannot be underestimated. Just think of the millions of wired and wireless connections lighting up and emmiting those deadly electromagnetic rays and all the innocent men, women and children being exposed to them.
Every bittorent user has blood on his (or hers) hands. From this point on, I am boycotting Google and advise any person with a shred of decency to do so too.
Personally, I'm so used to doing things the Google way (and achieving acceptable results quickly) that I'm not really interested in switching -- case in point, all the above links referenced were quickly found via Google. However, what am I missing out on by not giving (for example) Yahoo search a shot?
I ask my wife the same thing. Honey, I'm used to doing things your way.. and I always get acceptable results from you.. but what am I missing out on by not giving (for example) Veronica a shot?
At least Google will never make you sleep on the couch, or give them half of all your assets. Hopefully.
https://www.eff.org/https-everywhere
I open my browser, and see the Google page up and running. I started with Yahoo, I tried meta search engines, altavista, a9, and many others, but I never change my home page to be the other ones. I know Google, I know how to use the results and to view pages all in HTML and to get the cache and to search sites that link to me, or search a specific site. It's easy in the other sites, but I already figured Google out. Google works for me, when I find the wrong thing, I just add "-wrongword" to the end and I find what I need. I see all the blogs and misindexed pages, but I've never really suffered from Google Bombing or any of the other problems that are mentioned.
Make your computer faster: rm -rf
Alternative search engines
-- Knowing too much can get you killed, but knowing who knows too much can make you rich.
This should give you an answer
Unfortunately, comparing search engines is a nearly impossible task, since they probably aren't indexing the same data.
When you measure a search technology, the values you typically look for are precision and recall. precision says "of the X results you gave me, how many of them are relevant". recall says "in the world, there were Y possible pages you could have found, but you gave me X of them".
you can't measure recall for a public search engine, but you can measure precision. Take a set of sample queries, and some users. Have them perform the queries, and go through the first ~100 pages and give them a "thumbs up" (relevant) or "thumbs down" (not relevant).
Your overall score will measure precision: if at N=100, all 100 were relevant, that's 1.0. if only 50 were judged relevant, precision is 0.5.
You can estimate recall by judging say 1,000 documents (phew). Then sample precision at N=10, 100, 500, etc, assuming that is an "exhaustive" list of documents in the world.
Teoma has this great feature called Related search which is very useful. Basically if you look for a particular topic, the search engine identifies all related topics and offers you a one click access to all of them. Makes the search equally usable for both a rookie and a domain expert using the same search term.
One thing I like about askjeeves and a9.com is the way the present the search results. I think the next step is to improve on the presentation of the results (data) to make it more usable/accessable. Hit up askjeeves and run a search. The preview feature is pretty nice. And check out a9.com searches with their Site Info mouse-over.
I followed your logic, searched for this term and google brought up this article! Now i'm trapped in an endless loop, help!
Zoom Player Lead Dev.
I got this from a friend who works at yahoo...
m l
http://www.langreiter.com/exec/yahoo-vs-google.ht
Sorry if it gets slashdotted.
I _used_ to go to altavista everytime i had a search that involved specific punctuation, usually some kind of coding question. Now i just get frustrated with google while trying to find some related term i can add in that will give me the results i want.
This Space Intentionally Left Blank
- Simple interface, quickly loads.
- No graphical Ads
- Paid results are clearly ads and seperated from real results.
That's it, that's why Google is king. Until Yahoo, MSN search, Ask Jeeves and the like get those three points, they will continue to be second fiddle.I usually test search engines by typing in popular keywords that spammers generally go after, ex:
phentermine
home loans
poker
mesothelioma
viagra
miserable failure
Then look at the sites that rank at the top. It's very easy to tell which search engines are more succeptible to manipulation. A quick look at the backlinks for sites favorably ranking in those competitive keywords tells you how that SE is doing.
Here's my opinion on the race between Google, Yahoo & MSN. Google has more sites that are authorities in the top results and Google penalizes over optimization however extreme examples of over optimization continue to show up in Google. Yahoo is a moderate success and does a fair job of filtering out spammy sites as well as authorities like wikipedia - wikipedia will always rise to the top in G but not in Y - and this is good for Y because you get more variety. MSN does an average job of filtering out blog spam but new sites are too favorably ranked and this is because MSN is new and has no recorded history of URLs. My personal preference is to use G simply because it loads the fastest in my browser... Maybe it's also worth pointing out that my company has several URLs ranked favorably in the terms listed above - looking at the change in rankings over time certainly helps give insight into which SE is better. MSN & Y are by far easier to manipulate than G but G gives the most traffic.
Many people don't realize that Yahoo! has a scaled down (Google like) search interface which is actually pretty sweet: http://search.yahoo.com
Lately my Google results have been so Google bombed that I've been going back and forth between the two. I can't say for sure yet, but I may be in the middle of a bit of a personal transition.
Depending on what you're searching for, Google is often so front-loaded with dead-end advertiser links that its results aren't really worth much. Although it has to be said, it depends what type of a search user you are, and what types of things you're looking for.
Google is still the king of advanced search.
------ The best brain training is now totally free : )
Nothing screams objective like this article displaying the Google logo.
I know how to use Windows to achieve the necessary results better than the Mac or Linux. Does that means I should never try to use the Mac or Linux? Does that mean that I won't achieve better results if I learn to properly use the Mac or Linux?
-Daniel
I've stuck with Google for a while, but I used to do surveys pretty often. My approach was to start preparing a couple of days in advance, by keeping notes about things I was searching for. Then I'd take three or four of them, usually the ones that I'd had the most trouble refining, and try them out on a bunch of search engines. For each, I'd keep track of how many searches I had to do and how many junk pages I had to get through before I could get to something useful on that subject. It usually became clear pretty quickly which search engines were allowing me to make efficient use of my time and which were wasting my time.
Another thing you might want to do is check out some of the newer "clustering" or "concept map" search engines such as Vivisimo or Kartoo, to see whether they suit your searching style better. They're really quite different from the search engines we've gotten used to, so the metrics I just described don't quite work for them. That doesn't mean they're better or worse - just different.
Slashdot - News for Herds. Stuff that Splatters.
The other day I needed to know, for obscure reasons, the number of heroin addicts in Dublin. This is the kind of info that you know is probably on the web, but is going to be hard to find with Google.
I used BrainBoost - "How many heroin addicts are there in Dublin?", and, bam, first line of the result - "There are 13,000 heroin addicts in Dublin."
That's damn impressive. Out of curiosity I tried to see if I could find the same info with Google - it was fairly tough. Took three or four searches, eventually resorting to
which is a fairly specialized search that average users probably wouldn't be able to construct. The BrainBoost search, on the other hand, was completely natural, my granma could have done it.So, thumbs up for BrainBoost for question answering.
Still, it's not the kind of thing you'll want every day. For day-to-day search, Google is the tool, but BB is worth a look.
For some time now, Search Engine Watch has provided a good editorial and comparison on various search engines. They focus on marketing topics, but also tend to talk a lot about the underlying technology, etc.
A recent roundup of engines is at http://searchenginewatch.com/links/article.php/215 6221.