Better Search Engines

← Back to Stories (view on slashdot.org)

Posted by ryuzaki0 on Tuesday January 25, 2005 @10:30AM from the finding-nemo dept.

prostoalex writes "Scientific American is seeking better Web searches. They report on all sorts of innovations happening outside the Google-Yahoo-MSN zone that the press is usually reporting on, including GPS-enhanced searches from University of Maryland, Shape Retrieval and Analysis from Princeton, musical search engine from New Zealand Digital Library Project, and some of the projects that A9 and Ask.com have been working on."

9 of 137 comments (clear)

Min score:

Reason:

Sort:

Re:What we need is whitelisting by iminplaya · 2005-01-25 10:37 · Score: 3, Informative

This is kinda close.

--
What?
Clusty = Innovative by int2str · 2005-01-25 10:38 · Score: 4, Informative

Asides from the horrible name, clusty (a clustering search engine) is very innovative and easy to use. I hope more search engines will adapt similar technology soon.

Link to clusty.com search engine
The BBC Search Engine by Anonymous Coward · 2005-01-25 10:43 · Score: 1, Informative

Personally I use the BBC Search engine. Not only does it seem to provide relivant results, it also has recomended links (info here http://www.bbc.co.uk/search/recommended.shtml ) which are editorially selected.

The site seems to return far less porn probably due to the fact they "use a combination of technology and regular human checks to detect and block offensive websites. We aim to be the safest search engine in the UK"

Also slashdot is the first return for "IT News" under the web tag.

http://www.bbc.co.uk/cgi-bin/search/results.pl?g o. x=&tab=www&go.y=&go=go&q=IT%20news
Re:What I want by me+at+werk · 2005-01-25 10:49 · Score: 4, Informative

CopyScape can do the recognizing of copied stuff, but it's purpose is only finding website plagarism. This, however, would definately find all the wikipedia forks unless it's a really old copy and the page has had a major rewrite.

If google could integrate copyscape into their search, you would be happy.

--
For context, click Parent.
Vivisimo by Dan667 · 2005-01-25 10:55 · Score: 2, Informative

Interesting, the first thing I thought is I had seen this with Vivisimo, but I guess no one could spell that so the changed the name?

http://vivisimo.com/

But I agree, it is a great search engine and has gotten better as I have used it.
It's available! by ByteMangler_242 · 2005-01-25 11:39 · Score: 5, Informative

You can do this in google: searchterm1 searchterm2 ~bogus The tilde will look for synonyms. You can see which ones hit back by reading the bold results which are neither searchterm1 or searchterm2. I use ~howto and ~cheats often.

--
Rule of the open mind
People who are resistant to change cannot resist change for the worst.
Re:What we need is whitelisting by Anonymous Coward · 2005-01-25 22:59 · Score: 1, Informative

Try: The Google Directory http://www.google.com/dirhp.

The data is from the Open Directory Project http://dmoz.org/ an almost entirely volunteer-run project http://dmoz.org/about.html. I suggest using the Google version because, for most people, its search facility is better than the ODP search, due to the fact that it works like most Google users would expect a search to work.

The actual directory is variable in quality - some of it is very, very good indeed. However, it suffers from the normal problem that many volunteer-run projects have: parts it are neglected, and rather out of date. Always worth a look though.
Re:Easy (relatively) improvement... by joker784 · 2005-01-26 00:02 · Score: 2, Informative

You mean like this: Google API Proximity Search ?!
Re:What I want by HugeFatty · 2005-01-26 04:36 · Score: 2, Informative
I agree that the things you have listed are problems, and that they'd sure be nice to solve. I just wanted to address one of them for now, as I have been trying to deal with it myself.
The hidden text problem that you mention is a surprisingly hard problem to deal with, as there are so many ways to do it.
You have:
- The <font> tag
- CSS (several ways, such as the :hidden property, changing the colors, using the z order, etc.), both internal and externally linked (for which the search engine must download that file while spidering)
- DHTML positioning over other elements
- A background image the same color as the text
- Javascript to generate any of the above
- Use of nearly identical colors for all of the above (such as #FFFFFF for the background and #FFFFFE for the foreground). In fact, there could be dozens of colors that are all slightly different enough that a human wouldn't be able to detect it without looking very closely, or at all.
I'm sure there are more that I'm missing, but I think you (meaning everyone...I'm not just picking on the parent here...) get the idea. You pretty much have to render the page like a browser to take care of all of those, which really sucks for us search engine developers trying to fight it, and us users that have to deal with that crap.
--

I am clearly fatter than you.