New Clustering Search Engine to battle Google

Klutsy? by mfh · 2004-10-03 01:08 · Score: 5, Insightful

New Clustering Search Engine to battle Google
More like New Clustering Search Engine goes Beta. Let's wait until it's production stable before talking about who it's going to take down in a fist fight reminiscent of the Spock/Kirk battle in Amok Time.

Clusty by Vivisimo? Did I even spell that right? They need to consider naming things that people can:
A) pronounce
B) spell
C) are actual words or at least close to words that qualify for both A & B.

Clusty sounds like something you would call the fat cheerleader. It also will be often mispronounced as Klutsy, so it's a very bad name for a search engine (of all things).

The search engine uses Overture for it's results but offers new features such as an encyclopedia search, clustered results, and a gossip search.

This is a Microsoft tactic: add features to get market share, and it's an evil tactic because nothing new comes out of it, except bloat and bad karma. The fact this is based on Overature leads me to believe that it won't be able to take Google head-on at all. Clusty uses the Google interface but shows sponsored results first (evil), and displays 404 pages in the results. (FYI dteam was the first 3d design guild that is no longer)

I don't think they really have a hope of competing with Google. If it ain't broke don't fix it, so most people will just continue to use Google.

--
The dangers of knowledge trigger emotional distress in human beings.

Re:Klutsy? by LiquidCoooled · 2004-10-03 01:11 · Score: 5, Informative

The original engine was actually called Vivisimo,
and the exact point you make was mentioned back then.

heres the article (january)

http://slashdot.org/articles/04/01/05/1839233.sh tm l?tid=126&tid=185&tid=95

I think clusty.com is better, but now makes me think of unclean prostitutes.

--
liqbase :: faster than paper
Re:Klutsy? by mfearby · 2004-10-03 01:12 · Score: 2, Insightful

It's probably called "clusty" because of all the domain-name hogging scum out there getting fat off registering everything they can think of to extort big bucks! Who would have thought of registering google, huh? Back when it first came out I remember thinking "google: what a stupid name!". Now, it has become both a noun and a verb in most peoples' everyday speech
Re:Klutsy? by Quixote · 2004-10-03 01:23 · Score: 4, Funny

They need to consider naming things that people can:
A) pronounce
Well, Google has got everyone beat in this regard. "Google" is probably the first thing a baby says (and hence I'm sure it is hardwired into our brains). The only thing that could beat "Google" would be "dada" or "burp". Any takers?
Re:Klutsy? by Dark+Twonky · 2004-10-03 01:31 · Score: 1

I totally agree on the naming issue: I was thinking "Klusty the Clown"??? And "Vivisimo" sounds more like a toy company to me.
The interface isn't gonna sell it either. It reminds me more of http://alltheweb.com/ (at least the front page) though - but uglier.

--
My other sig is clever and funny
Re:Klutsy? by meza · 2004-10-03 01:44 · Score: 3, Insightful

Clusty by Vivisimo? Did I even spell that right? They need to consider naming things that people can:
A) pronounce
B) spell
C) are actual words or at least close to words that qualify for both A & B.

The main reason why I used altavista for so long was actually because I didn't manage to spell google right. Honestly. I had to try all kind of combinations everytime I wanted to go there, like gogle, googel, gogel. I should also say that english is not my native language.
Re:Klutsy? by br0ck · 2004-10-03 02:09 · Score: 2, Informative

According to the CEO the awkward name and the Krusty similarity were both intentional.

Valdes-Perez said his company dumped the name Vivisimo for the search engine because it was ``an obstacle.''

``It's a name that is difficult to pronounce and type and spell. Other than that, it's a great name,'' he quipped.

But the new name may face similar challenges, Valdes-Perez acknowledged. Though it is easy to remember, for many people Clusty evokes the name Krusty the Clown, the not-so-kid-friendly character on ``The Simpsons'' television show.

Valdes-Perez said he initially recoiled at the name Clusty, which was conceived by a business partner. But he found it more memorable than the vanilla-sounding names proposed by a professional branding company.

``A mildly negative association,'' Valdes-Perez said, ``will be swamped by a positive experience. And that's what we hope to offer.''
Re:Klutsy? by Anonymous Coward · 2004-10-03 02:58 · Score: 0

I imagine various profanity would also work. Just google it would be replaced by just _________ it.
Re:Klutsy? by strider44 · 2004-10-03 03:12 · Score: 1

I click on your 404 page link and get nothing but links to sites about dteam except for one solitary link... I'm not sure that's a bug as you've just found a loophole in the engine where if you search for a page that's gone, has the name in it's link, and hundreds of sites link to, it's sure to get a 404 page. All that says is they need a routine to get rid of the 404 pages!

However I agree with you totally in one point - Clusty is already totally bloated. If you look at google itself's example who won the previous search engine war by being clean and unbloated you can see what a bad idea for search engines that is!
Re:Klutsy? by Anonymous Coward · 2004-10-03 04:04 · Score: 0

I think clusty.com is better, ...

I think we need a .clown TLD. (Or should that be .crown?)
Re:Klutsy? by It'sYerMam · 2004-10-03 04:21 · Score: 2, Insightful

Google's pretty much a misspelling, anyway. The original word is 'googol' meaning 10^100. Incidentally, this was the £1,000,000 question on WWTBAM, when the cheater was on. He didn't know it, but everyone in my family did... :|

--
im in ur .sig, writin ur memes.
Re:Klutsy? by Fishead · 2004-10-03 04:30 · Score: 1

Evil of all evil's.

Clusty didn't warn me about an impending PDF file!!!

Oh how I loath PDF. Atleast Google warns of one so I can avoid it.
Re:Klutsy? by Donny+Smith · 2004-10-03 04:53 · Score: 1

>The main reason why I used altavista for so long was actually because I didn't manage to spell google right.

Have you heard of bookmarks or autocomplete? ;-)

By the way, I never used Altavista because it reminded me too much of Asta La Vista, I had difficulties remembering which one was the right one.
Re:Klutsy? by melvster · 2004-10-03 05:46 · Score: 0

'Vivisimo (sic): the correct spelling would be vivissimo :-)' according to searchlores anyway http://www.searchlores.org/compound.htm
Re:Klutsy? by BarryNorton · 2004-10-03 05:50 · Score: 1

They need to consider naming things that people can: A) pronounce B) spell
Sure the spelling of Vivisimo will already have been mentioned on Slashdot, but for that reason I call it viv-aiy-simo...
Re:Klutsy? by RotJ · 2004-10-03 06:00 · Score: 2, Insightful

More like New Clustering Search Engine goes Beta. Let's wait until it's production stable before talking about who it's going to take down in a fist fight reminiscent of the Spock/Kirk battle in Amok Time.

Whether it's beta or not doesn't matter. Google picked up most of its steam by word of mouth while it was still in beta and was already on its way to becoming the dominant search engine by the time it took off the beta tag. Just look at Google's own Gmail beta. Hotmail and Yahoo! didn't have to "wait until it's production stable" before worrying their asses off about the marketshare its gaining.

This is a Microsoft tactic: add features to get market share, and it's an evil tactic because nothing new comes out of it, except bloat and bad karma.

So is Google is using evil Microsoft tactics by adding a news search, newsgroup search, image search, directory search, university search, special search, price search, local search, catalog search, definition search, Klingon search, calculator, translator, weblog, email, and photo organizer? Do you think this make Google bloated or a better service?

As far as clustering goes, I'm pretty sure NortherLights.com was marketing it as its key feature back when it was still competing in the consumer search market. Seems their enterprise search engine still has it: "Automatic classification. Northern Light has patented, proprietary technology that classifies every document in the database by subject, type, language, and source. We provide a complete 17,000-node subject taxonomy developed by our expert gang of librarians that is extensible and customizable. Our classification powers advanced search forms, vertical search applications, and our patented Custom Search Folders(TM) for results navigation."
Re:Klutsy? by freejung · 2004-10-03 06:15 · Score: 1

It's not just using pure Overture, it's a meta-search, it seems to be mixing in Google results with Overture/Yahoo results.
Google has sponsored results too. At least they're separating them out from the main index, like G does.

--
My site: Free Nature Pictures
Re:Klutsy? by hazem · 2004-10-03 06:37 · Score: 1

klusty just sounds like a disease.

I wonder if they thought of "vivis"? Kind of sounds cool... most cultures can pronounce it, and it has no tricky syllable combinations. It's evolved from something related to project, and it's easy to remember and not mix up letters from memory.
Re:Klutsy? by shufler · 2004-10-03 06:57 · Score: 1

I remember thinking "Google: That's a rip off of the name of a line of plush platipi toys."
Re:Klutsy? by skraps · 2004-10-03 10:44 · Score: 1

That 404 page isn't actually a 404 page. Check the response headers - the server actually gives a 301 (Moved).
Granted, their crawler should probably be smart enough to sort that out.. but it is strictly standards-compliant.

--
Karma: -2147483648 (Mostly affected by integer overflow)
Re:Klutsy? by panic911 · 2004-10-03 19:14 · Score: 1

hallelujah brother
Re:Klutsy? by Gopal.V · 2004-10-03 22:03 · Score: 1
Did you ever notice that the popular website names often have nothing to do with what they are really are ?. of course they in turn get added to the vocabulary... but see the examples.
- Google -- a huge number , and nothing to do with searching, per se
- Amazon -- WTF does Amazon have to do with book, rather supply the wood ?
- Monster -- Monster ?. and jobs ?.
- Ebay would BuyHere.com sounded better ?
- Slashdot what does "Slashdot" mean actually ?. "Slashdotted" ? (ok, sourceforge makes sense for one).
Very few of the "meaningfully" named sites have taken off in a big way (HoTMaiL.com is an excellent example). often a single syllable catchy name is all it takes :).
--
Quidquid latine dictum sit, altum videtur
Re:Klutsy? by Strange_Attractor · 2004-10-04 01:30 · Score: 1

often a single syllable catchy name is all it takes :).
Not according to your examples - every one is multisyllabic...
Of course, your main point is dead on. I suspect the problem with thematically-related site names is built-in. Take books - you'd remember "related name", then would try book, bookstore, library, text, novel, etc., rarely if ever going back to the right site.* The fact that "Amazon" wasn't one of the taken-for-granted synonyms for book meant that people had to make the unlikely mental connection, which then stood out because it wasn't an existing association. I think that applies to HoTMaiL, too - I bet the vast majority of its users have never noticed the HTML buried in the name.
* Lots of people (including me) forget to bookmark something until they've visited a number of times.

--

----
WWJD...For a Klondike Bar?

The interface looks pretty "cruddy" by mfearby · 2004-10-03 01:08 · Score: 0, Flamebait

Like I said, the interface is crud!

Re:The interface looks pretty "cruddy" by Anonymous Coward · 2004-10-03 01:11 · Score: 1, Interesting

Guess you don't use Linux then. But all in all this actually has a chance against Google. I tried it and it makes way more sense than Google's just throw everything into a huge heap of 1000s of links approach. I think I've got a new search engine.
Re:The interface looks pretty "cruddy" by mfearby · 2004-10-03 01:31 · Score: 2, Informative

After having just searched for "BinaryWrite ASP Stream" to see if it might produce the goods in trying to solve a little web page problem I have, clusty did actually turn up something I hadn't seen before. Maybe it is going to be OK?

Back when Teoma came out I remember thinking the same thing, but soon forgot about it.

And yes, I have seen Linux. I was until a year ago a perpetual new-distro-installing-slut to see if Linux was up to scratch. Sadly, I still have to tinker and fiddle with the thing to even get it to recognise a frigging USB mouse (SuSE 9.0). Said USB mouse (Logitec optical) worked beautifully during the installer, but after rebooting, refused to work in X.

Hey, mabye I should install Syllable 0.5.4 (http://slashdot.org/comments.pl?sid=124146&cid=10 416966), the world's latest addition to the steaming heap of soon-to-be-abandonware :-)

is going to try and take Google head-on. by jdkane · 2004-10-03 01:08 · Score: 1, Insightful

named "Clusty"

A cute name is a start.

Re:is going to try and take Google head-on. by fgb · 2004-10-03 01:13 · Score: 3, Funny

...but not one that reminds me of "Clippy".
Re:is going to try and take Google head-on. by Anonymous Coward · 2004-10-03 01:15 · Score: 0

Who the fuck is Clippy? His name is fucking Clippit dumbass.
Re:is going to try and take Google head-on. by Anonymous Coward · 2004-10-03 01:20 · Score: 0

His name is Clippy dumberer ass.
Re:is going to try and take Google head-on. by ack154 · 2004-10-03 02:30 · Score: 1

or the word "crusty"
Re:is going to try and take Google head-on. by Anonymous Coward · 2004-10-03 03:56 · Score: 1, Funny

A cute name is a start.

And "Clusty" is not a cute name. It's kind of creepy.
Re:is going to try and take Google head-on. by Anonymous Coward · 2004-10-03 04:05 · Score: 0

Or Crusty.
BTW did anyone else read that as "New Cluttering Engine..."?
Re:is going to try and take Google head-on. by bot24 · 2004-10-03 09:10 · Score: 1

http://clusty.com/search?query=Clusty
Did you mean Lusty
They don't even have their own name in their word list.
Re:is going to try and take Google head-on. by camcloud1 · 2004-10-03 21:34 · Score: 0

I think Clusty sounds more like the after effect of a bad Indian curry

Clue! by mfh · 2004-10-03 01:10 · Score: 4, Funny

But anyway, this does look interested.

I think there's your first clue for why your story was rejected.

--
The dangers of knowledge trigger emotional distress in human beings.

Re:Clue! by MrHanky · 2004-10-03 01:41 · Score: 1

Yes, Slashdot has always followed strict standards when it comes to spelling and grammar.
Re:Clue! by jabber-admin · 2004-10-03 01:44 · Score: 4, Funny

Perhap s/he also forgot to include a witty comment about the NYT registration req.
Re:Clue! by sgant · 2004-10-03 03:09 · Score: 1

Lol, yeah, after I submitted this comment I went D'OH and noticed all the spelling errors.

I also got "recieved" wrong also. "I before E except after C" was pounded into my head enough in grade school you'd think I would rememeber it.

Well, I guess that was why it got rejected! Cause I'm an idiot!

Oh well, at least someone got it posted on here, and that's all that matters really.

--

"Leo Fender was in a 'state of grace' when he designed the Stratocaster." -- Paul Reed Smith
Re:Clue! by Wireless+Joe · 2004-10-03 15:31 · Score: 1

And where's the concluding link to Roland Piquepaille's Blog?

Vivisimo... by Anonymous Coward · 2004-10-03 01:11 · Score: 0

I don't know if a clustering Search Engine if worth while. At least not for an average used...They will just use something like yahoo, msn og google because it's simpler. I gave Vivisimo a go and i didn't find it all that good and worth while.

Re:Vivisimo... by beh · 2004-10-03 02:01 · Score: 1

Having used vivisimo for a while, I would say that vivisimo certainly has a lot of potential. The problem the average user has with google, is that google is very good if you have a good idea what exactly you are looking for.

If you only have a word or two to go by, then the clustering really pays off in spades, as it will offer you several routes to follow up in your search results. Very much like, say, kartoo is great in the way that it shows what unites various results.

The problem with kartoo is just that its interface sucks (it by far has the cutest interface, but that alone doesn't make a good search engine).

The problem with vivisimo is that google indexed a lot more pages than vivisimo has so far, hence google often has some results that vivisimo doesn't know about (yet).

BUT - more often than not, vivisimo's clustering helped me to drill down to some very very good search results that didn't rank up highly on google (with the original search).

Gossip filter by IwannaCoke · 2004-10-03 01:12 · Score: 3, Insightful

Instead of being able to search through just gossip, I would be more interested in being able to filter out all the gossip.

Re:Gossip filter by Bazer · 2004-10-03 03:23 · Score: 1

withoutGossip = normalResults - gossipResults;

;-)

Somebody did an "unclean search" for Google using normal search and "clean" search results the same way but I'm too lazy to google for it.

Imagine a Beowulf.. by Anonymous Coward · 2004-10-03 01:14 · Score: 0, Offtopic

ah, nevermind.

Since when is search a solved problem? by hanssprudel · 2004-10-03 01:19 · Score: 3, Insightful

So everybody is waiting for the next great search engine to come along and out-google Google, but it seems to me that they are looking in the completely wrong places.

All Clusty, A9 and the other more recent search engines seem to do is add more gimmicks to search results from yahoo and Google respectively. To some extent, this seems to be exactly what Google is doing recently as well: the searches are hardly getting beter, instead we can search news, search references (try define:), search printed text, do automatic conversions, etc etc.

But the truth is that not only are the searches at Google not getting better: they are getting worse. It seems like PageRank is more or less unused nowadays, and Google just uses easily manipulated things like searchterm in URL, searchterm in Title, how recently updated, to rank pages. I think anybody who uses Google to search for specific things must have observed that it works only a fraction of how well it did when it was new.

So what is going on here? Does everybody consider the basic searching a solved problem, and that we don't need to find pages better than google does? Or is a good search that cannot be manipulated really an intractable problem?

If I owned Google stock, I would really be wondering how many of all those thousands of PhD's at the Googleplex are working on this, and how many are writing gimmicks and elegant webmail applications. Or maybe one of them already proved that the problem can't be solved, and Google is just hoping to make as much money as possible before the secret comes out...

Re:Since when is search a solved problem? by Anonymous Coward · 2004-10-03 01:24 · Score: 0

I agree with this. There are certain things that can actually be quite hard to find on google now. Want to look up a musician's website? Sometimes you have three pages of lyrics4u.com type sites to get through to find it. There are certain topics that have spawned hundreds of automated pages that contain the same spelling errors and unhelpful content.
Re:Since when is search a solved problem? by MrHanky · 2004-10-03 01:36 · Score: 1

All Clusty, A9 and the other more recent search engines seem to do is add more gimmicks to search results from yahoo and Google respectively.
But Clusty's clusters seem like a nice gimmick. Do a search for 'Debian', and it returns the usual, but with options to look at the clusters 'Debian GNU', 'Install', 'Package', 'Reviews', and so on.

Of course, these clusters don't work very well for every search. It's anglo-centric, and searches for things that are unknown to the English speaking world get some really weird options. For instance, a search for a Norwegian author returned 'Med' as the second largest cluster. 'Med' means 'with'. Another search returned something meaning 'I have' as one of the smaller clusters.
Re:Since when is search a solved problem? by otisg · 2004-10-03 01:39 · Score: 1

I think search IS the killer feature. However, just trying to be _better_ than Google is not going to do it. The company/service that will succeed in being better than Google will also have to come up with some alternative ideas and approaches to solving the same problem (finding a needle in a haystack). One of the approaches is to create a service based on humans instead of crawlers - and before you say it - I am not talking Yahoo or DMOZ-like directories and such. I'm talking what some people are calling 'folksonomies'. Instead of describing and explaining, I'll just point you to a demo account of one such service. Of course, this is just one of the alternative approaches, and this alone will not beat Google (at best, it will intrigue Google or Google-like companies to either copy, steal, or acquire). But, it's a start of thinking in a somewhat new direction, in my opinion.

--
Simpy
Re:Since when is search a solved problem? by j14ast · 2004-10-03 01:39 · Score: 1

The problem is optimisation. When something becomes the search engine, moving up 1 or 2 rankings can become a matter of great monitary importance and people start to game the system. To fool page rank people started to use wiki's to artificaly raise the number of links to thier site. To fool word counters(this was a while ago) people used to embed a hole lot of comented out gibrish at the bottom of their pages, and when they atarted parsing it out, they stated just taking on the text and making it visable or matching the color of the background. So basicly and system which becomes dominant will be abused by the invisible hand.

--
Damn the man!
Re:Since when is search a solved problem? by barthrh2 · 2004-10-03 02:05 · Score: 2, Interesting

You point out the exact benefit. In most searches where it could apply, your first five pages are mata-shopping engines. People are using tactics like creating stupid page names based on popular searches that the manage to push to the the top of the rankings.

This is a battle that will always go on. Change your page rank system and people will just start gaming it again.

What Clusty/Vivisimo accomplishes is that by clustering data, it takes sequence out of play. Even if my preferred pages for "Debian's social contract" appear deep into a search on Debian, it comes front & center on a clustered search.

If this catches on, I'm certain that people will figure out how to game that too. One feature that I'm surprised was never implemented is an option to suppress meta-engines from search results. That would clean up results a lot.
Re:Since when is search a solved problem? by Space_Soldier · 2004-10-03 02:36 · Score: 0

The worst problem in search results are the commercial link. If you are looking for technical data for a certain product, you are in for a long search. I understand that everybody wants to make a buck, but it is becoming anoying to have to add and so on to a search.
Re:Since when is search a solved problem? by Space_Soldier · 2004-10-03 02:40 · Score: 0

The worst problem in search results are the commercial link. If you are looking for technical data for a certain product, you are in for a long search. I understand that everybody wants to make a buck, but it is becoming anoying to have to add -buy -sale -sell -store -shop -shopping -buying and so on to a search.
Re:Since when is search a solved problem? by hanssprudel · 2004-10-03 03:23 · Score: 1

I understand that everybody wants to make a buck, but it is becoming anoying to have to add and so on to a search.

That is pretty fresh coming from somebody spamming his every post with commercials for a two-bit pyramid scheme...
Re:Since when is search a solved problem? by whereiswaldo · 2004-10-03 03:53 · Score: 1

There's plenty of room to improve on Google. I wish Clusty good luck and I hope the competition benefits us users all the more.
Re:Since when is search a solved problem? by Dr+Tall · 2004-10-03 04:29 · Score: 1

Even if my preferred pages for "Debian's social contract" appear deep into a search on Debian, it comes front & center on a clustered search.

Why not just search for "Debian's social contract" if that's what you're looking for?
Re:Since when is search a solved problem? by Anonymous Coward · 2004-10-03 04:46 · Score: 0

wow, am i the only one here who Likes the clustered results?
Re:Since when is search a solved problem? by freejung · 2004-10-03 06:05 · Score: 1

Apparently M$ doesn't think so, they're building a new SE for MSN.
Google has been pretty stagnant for months now, but it was always easy enough to manipulate, and is generally considered harder to game now. Right now they are not giving good rank to most new sites for some reason, perhaps to fight "SE spam."
Google is probably in the process of massive overhauls too. It hasn't really updated its algorithm much in months, and many people believe we're in for a big shake-up.
Certainly there is no such thing as search that cannot be manipulated. Search must be based on some sort of criteria, and manipulation is simply knowing those criteria and making your website match them. It is impossible to keep the criteria completely secret, because they can be reverse-engineered by experimentation. The trick from the SE point of view is to get the criteria closer and closer to the criteria of a reasonable person for a "relevant website." That's not easy.
Rest assured, the big G is working on it. But so are others. Competition in the search market is far from dead.

--
My site: Free Nature Pictures

Not impressed; but more competition is good by Quixote · 2004-10-03 01:20 · Score: 4, Insightful

I didn't RTFA (I'm a regular, I don't have to) but I tried out Clusty. In particular, the News section.

Under the heading "House" are the news items:

Gunmen Attack Mauritania Security Chief's Home (Reuters)
U.S. Policies Stir More Fear Than Confidence (Los Angeles Times)
N.Y. Auction Houses Expect High Totals (AP)

And under the heading "Record", are listed:

As Reservoirs Recede, Fears of a Water Shortage Rise (Los Angeles Times)
NASA Delays Plans to Fly Shuttle Soon (NY Times)
San Jose State, Rice Set Scoring Record (AP)

This shows that just a clustering technique isn't enough; you need more context. Google (IMHO) does a better job of clustering their news results.

Having said this, I wish Vivisimo all the luck. Google needs more competition; it is what will give us the Next Great Search Engine(tm).

Ob: I, for one, would like to welcome our new clustering overlords.. ;-)

Re:Not impressed; but more competition is good by jls2151 · 2004-10-03 01:30 · Score: 1

The news section appears to be a search engine for Reuters.
Re:Not impressed; but more competition is good by DissidentHere · 2004-10-03 02:28 · Score: 1

I just gave this a try searching for "SQL 1203" as I have spent the last two days trying to fix a 1203 error in SQL Server. Now, Google of course was the first place I went. While I got many, many results, I had to 'manually' filter out about 50% of them as irrelevant.

I gave 'Clusty' the same search and it really did reduce the amount of time it took to get the relevant information. Yes, most of the results were the same, and I didn't get any new answers for my problem, but I did get to the information faster, which I like.

There's no doubt that Google's handling of news results is pretty hard to beat, but I'm not sure that searching for news is what most people use Google for. At least not anyone I know.

So, while 'Clusty' may have reduced my overall search time, there is no chance I will abandon Google, I'm not about to trust my searching to results based on Yahoo's search engine. And if for no other reason, then the name. It just sounds too much like the way I shorten 'clusterfuck' to 'cluster' when in polite company (which ain't much).

--
"None of us are as dumb as all of us." - meeting mantra

Nothing by Anonymous Coward · 2004-10-03 01:20 · Score: 1, Funny

It's rubbish. Just look at the results when searching for "large breasts". Nothing of any interest what-so-ever. Nothing!

Re:Nothing by AndroidCat · 2004-10-03 02:16 · Score: 1

You probably want to use Booble for that. (I see they've dropped the Google look and .. feel.)

--
One line blog. I hear that they're called Twitters now.

But this of the utmost importance! by Trikenstein · 2004-10-03 01:23 · Score: 0, Redundant

How does it measure up at finding porn?!

Re:But this of the utmost importance! by Anonymous Coward · 2004-10-03 03:59 · Score: 0

More importantly, how does it measure up at filtering it?

Searches for Samus and Her, both of which give pr0n with a GIS, are nudity-free, except for a sketch drawing of some breasts ith Her, and possibly (can't remember, but don't think so) another sketch of sex with Samus.

Good work, guys!

ooh a complete suite of search engines by Savves · 2004-10-03 01:24 · Score: 2, Funny

but i have to say, the gossip search needs to index more sites, and the image search is still no match for google's.

not a very reliable porn search engine.

Some problem with the encoding or something? by MadFarmAnimalz · 2004-10-03 01:25 · Score: 0, Offtopic

Arabic results display alright here (firefox 0.9.3 on windows) but you can't click on them. Other UTF-8 encoded sites display just fine; can anyone else replicate this odd behavior?

Look here for an example. Something with the bidi functionality, perhaps? Any hebrew users?

--
The Egyptian LUG

--
Blearf. Blearf, I say.

Re:Some problem with the encoding or something? by mbaciarello · 2004-10-03 01:33 · Score: 0, Offtopic

Arabic results display alright here (firefox 0.9.3 on windows) but you can't click on them.

Firefox 1.0.1 on OS X, here. Arabic results display fine (well, they look like they're alright...) and are clickable. I used your link to check.

Encyclopedia? Bah! by Zeddicus_Z · 2004-10-03 01:26 · Score: 3, Insightful

The submitter had me all excited there for a minute or so, but unfortunately the "encyclopedia search" he mentions is simply searching the wikipedia.org site. Now don't get me wrong; there's absolutely nothing wrong with wikipedia, however it's already a web resource. You've been able to "encyclopedia search" Wikipedia for AGES by appending "site:wikipedia.org" into a google query.

Now if they'd done some sort of deal with Britanica to gain search access to its online library, THAT would be a resource worth posting to /. about. Bah.

--
Janie took my gun...

Re:Encyclopedia? Bah! by wuice · 2004-10-03 05:04 · Score: 1

There's nothing wrong with Wikipedia, it's just not what I would consider a professional, coherent, "real" encyclopedia.

It does have its uses, though.

I will need... by bogaboga · 2004-10-03 01:27 · Score: 1

...an effort. Google has become a verb, and an adjective! Heck, I hear it (Google) was even added to the English dictionary. The other day I was googling the drug "Cogentin", and sure enough, I got it. I have always googled and would comfortably ask anyone to google in case they cannot find info they need. I wish them (those who hope to compete), the best.

Cb..

Re:I will need... by Anonymous Coward · 2004-10-03 01:38 · Score: 0

How do you use the word "google" as an adjective? I think adjective does not mean what you think it means.

Clusters, Folders, yakkity-yak by papyr · 2004-10-03 01:29 · Score: 1

I occasionally used Vivisimo's search engine years ago (I don't know if it had any association with Overture at the time).

It would take the search results and place them in categorized folders so that you could narrow down the search. Naturally, they picked the categories.

I suppose some filtering would be nice, but do I really need them to do something I should have done when I came up with my search parameters in the first place?

Re:Clusters, Folders, yakkity-yak by HyperChicken · 2004-10-03 07:47 · Score: 0

Ever run a website that has a built-in search function? Ever look through the logs of the search feature? It usually takes a couple of queries before people come up with the right terms to get something close to what they want. The categories feature allows them to narrow down their results in a smart manner.

--
Free of Flash! Free of Flash!

Slashdot Tab by Anonymous Coward · 2004-10-03 01:30 · Score: 1, Informative

One of the default tabs you can add to its interface is for Slashdot.

Hrmm... by t7 · 2004-10-03 01:30 · Score: 3, Interesting

How many people actually jump on the "bandwagon" and switch search engines just because some one says it's "new and fresh"?

I gave a9 a try, I like the interface and some of the new features like the search history and the multiple search panes. But shortly after I found myself using google again. Even though a9 uses google, and the results are almost identical, I didn't find anything compelling enough to make me switch.

Does anyone else feel they might be missing some results if they were to use another search engine?
What must a new search engine provide to "steal" users from google?

Free iPods? Sure!

So... by a.deity · 2004-10-03 01:32 · Score: 0, Offtopic

If this engine beats out Google...

Will they give Clusty the Crown?

/bad joke

--
Option-Shift-K.

Why do Clusty... by stephanruby · 2004-10-03 01:35 · Score: 0, Redundant

Why do Clusty when Vivisimo.com was already working just fine? If I want Google, I'll use Google, not an imitation-Google. And vice versa, if I want Vivisimo, which is useful sometimes, I'll simply use Vivisimo. I certainly don't need a cross-breed of the two.

A better mousetrap by mrshowtime · 2004-10-03 01:35 · Score: 3, Interesting

I have always considered Google's best point, is it's utter simplicity in design. Also, the name is easy to remember. Anyone who wants to up Google has to not only be MUCH better, but also have a good name and be as easy to use as google. Before, in the old days, each search engine produced sometime wildly different results. At the time, HotBot was the best search engine going, but they lost their steam and was ultimately "replaced" with google.

--
"Jeremy, you need to get to an internet cafe and cut and paste some appropriate sentiments about me from the world wide

Re:A better mousetrap by Albert+Sandberg · 2004-10-03 01:42 · Score: 1

Not entirely true.

Back in the days, when the search engine war begun, I had no problem typing in http://www.altavista.digital.com or whatever, it was simply my choice of search engine.

Today, google is my primary choice, but if they will ever bloat it with more than it is today (neat, simple and honest!) I will search for a better one, the one which suits my needs the most, or a page which simply parsers googles results and presents them nicely, in short terms, something nicer.

Google's powers is not in the name, but its simplicity and non-bloat.

Albert

Dada by mfh · 2004-10-03 01:36 · Score: 4, Insightful

Well, Google has got everyone beat in this regard. "Google" is probably the first thing a baby says (and hence I'm sure it is hardwired into our brains). The only thing that could beat "Google" would be "dada" or "burp". Any takers?

You joke, but a search engine named Dada would likely be well received for the name, and if it was a good system it could find a nice user base. I mean it has taken Google *years* to perfect its systems and they started with a good premise: do no evil. That was when all the search engines were cashing in on ads. A lot of people were turned off of the internet because of that, until Google came along. So it was purposeful, not evil, and light/easy to use.

My suggestion to anyone trying to take on Google is that they should do something else unless google becomes evil, and because power corrupts and absolute power corrupts absolutely -- it's just a matter of time before Google turns evil. Maybe not, though. :-)

--
The dangers of knowledge trigger emotional distress in human beings.

Re:Dada by drinkypoo · 2004-10-03 05:59 · Score: 2, Funny

The problem with a search engine named "dada" is that when you searched for articles on cutlery you'd end up with a picture of a haddock impaled on a pitchfork being held by a naked man wearing fake breasts. Probably not what you were looking for.

--
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
Re:Dada by Anonymous Coward · 2004-10-03 11:28 · Score: 0

Probably not what you were looking for.

You'd think that... but...
Re:Dada by NoMercy · 2004-10-03 12:23 · Score: 1

Dada is good but shows a desire to be superiour, Dog's probably better :)

--

And because I'm only going to make one post and crawl into bed, the color scheme is horrid, the layout not very frienly, and the dialogue options for advanced searches so un-organised it's painful to use.

Fix the name, colors and layout, and it'd actually be a contender against google. I'd say borrow from everyone else other thank ask jeeves and go with a nice clean white background with easilly readable results, clearly layed out :)

It's more impressive than Slashdotters realize by Everyman · 2004-10-03 01:40 · Score: 5, Interesting

First of all, "uses Overture results" strikes me as misleading. They have an agreement with Overture to share the proceeds from the sponsored links.

The results include MSN and Gigablast and Lycos. Basically, that means Yahoo's crawling plus Gigablast. Yahoo has ramped up their crawling since March, and is on a par with Google. They've been slow about passing all of it to MSN in a timely fashion, but by now MSN has most of it. I think Lycos, which also uses Yahoo's Inktomi, is about the same as MSN.

The clustering is the best of any search engine, meta or otherwise. You don't have to have JavaScript enabled, which is a big plus over the Vivisimo interface I remember from a year ago.

Finally, I was delighted to see that Clusty.com does not set a cookie unless you customize. Even the cookie for customization looked like it lacked a unique ID. I emailed Clusty and they confirmed for me that they have no plans for a unique ID in their cookie.

Google tracks you with a unique ID across all of their services, and saves everything it knows about you. Google's cookie expires in 2038.

Now I ask you, why do Slashdotters feel the need to dump on Clusty?

Re:It's more impressive than Slashdotters realize by Anonymous Coward · 2004-10-03 03:05 · Score: 0

A) clusty tried to set a cookie on me when I hit the site. Sorry, try again.

2. Paid search results suck. Keep them the hell away from my searches.

Next, Clustering sucks. The human brain is the best clustering device ever. Present a list of search results, and most people will automatically sort "relevant to my search" from "not relevant to my search" Why do you need a computer to do this for you?

Fourth, google's unique id. Set firefox to accept cookies only for current session. Problem solved. Then their "unique ID that lasts forever" lasts about ten minutes.
Re:It's more impressive than Slashdotters realize by celerityfm · 2004-10-03 03:13 · Score: 1

I was going to say, if it were based solely on OVERTURE'S results then the only competition it would have would be Gator/Claria's horrid Searchscout/GAIN system, which is the only "major search engine" I know of that is powered purely by Overture results. Though to be fair when Overture runs out of ads it just dips into the Yahoo/Inktomi pool and runs those listings instead. So I guess its not TOTALLY worthless - try for yourself at http://www.overture.com

By the way, the Gator/Overture partnership has gotten Overture's owner Yahoo in trouble and continues to miff Overture advertisers since Overture does not allow advertisers to "opt out" of their scheme.

--
...unfortunately no one can be told what The Mat^H^H^HGoatse is...they must experience it for themselves...
Re:It's more impressive than Slashdotters realize by shnarez · 2004-10-03 08:26 · Score: 1

Google tracks you with a unique ID across all of their services, and saves everything it knows about you. Google's cookie expires in 2038.
I block all *.google.com cookies. I am still able to use Google's search services. I'm sorry, you were saying something?
Re:It's more impressive than Slashdotters realize by khallow · 2004-10-03 09:31 · Score: 1

Next, Clustering sucks. The human brain is the best clustering device ever. Present a list of search results, and most people will automatically sort "relevant to my search" from "not relevant to my search" Why do you need a computer to do this for you?
Google dumps a list on you. You have to refine the search yourself or evaluate these results by examination. I like the concept of clustering of these search results because *if* it works, then it's work I don't have to do. In practice, clusty appears to group in interesting ways, but I don't have a feel for how useful the groupings are. Seems better, if you start with near zero clue about what you're looking for.

Mozilla search plugin from the actual company by palfrey · 2004-10-03 01:46 · Score: 3, Informative

Now there's a first. Not even Google has ever directly supported Mozilla - the Google toolbar from Google is IE only. And this one now has a Mozilla search plugin link on the front page. Kudos.

--
Beware the psychokinetic mimes!

Re:Mozilla search plugin from the actual company by NeoSkandranon · 2004-10-03 02:22 · Score: 2, Interesting

The biggest "feature" of the toolbar is the popup blocking..which Moz users don't need

At any rate Firefox has a box in the corner that's directly linked to google

--
If you can't see the value in jet powered ants you should turn in your nerd card. - Dunbal (464142)
Re:Mozilla search plugin from the actual company by dubstar · 2004-10-03 03:31 · Score: 1

Why write a version of their toolbar for Moz/Firefox when the functionality it provides is already there by default?

All these toolbars and crap exist for IE mainly because MS seems to have stopped actively developing it.
Re:Mozilla search plugin from the actual company by kulack · 2004-10-03 07:09 · Score: 1

Actually, since I switched to firefox, the thing I miss most about the Google toolbar is the autofill support.
It works quite well, and when you buy more things online than in the real world, its like the difference between writing a check and using a check card. Much quicker and more convenient.
I'm simply too lazy to try to write a firefox plugin to do the same thing, but probably wouldn't too difficult (once you've actually written a firefox plugin... 8-).

--
Re:Mozilla search plugin from the actual company by stephanruby · 2004-10-03 18:33 · Score: 1

Since I switched to FireFox, the thing that I like the most about it is the plugin that allows you to highlight text on a page, right click it, and then search this string on Google (and/or whatever search engines or dictionaries you want).
Nowadays, using autofill is just too much work for me.
Re:Mozilla search plugin from the actual company by stephanruby · 2004-10-03 18:35 · Score: 1

And by the way, if you're looking for other ways to automate your work, take a look at this
Re:Mozilla search plugin from the actual company by Anonymous Coward · 2004-10-03 22:40 · Score: 0

I'm simply too lazy to try to write a firefox plugin to do the same thing, but probably wouldn't too difficult (once you've actually written a firefox plugin... 8-).
now i know how u managed a page full of un-moderated comments.. stop blowing ur own horn.. welcome to slashdot

interesting but ... by sosuke · 2004-10-03 01:50 · Score: 1

how do you add your site to the results? all my searchs so far have come up poorly and none of my sites are on it, so of course i think its lame :D
plus the name, as stated in previous comments, sounds like something ikky, like a cockroach cluster or something, nifty idea and design though sponsered results suck being in the results and not on the side

I use vivissimo in preference to google quite ofte by GuyFawkes · 2004-10-03 01:52 · Score: 1

.n

Simply because for *many* kind of searches, for example looking for a supplier of aluminium extrusions for heatsinking, ALL google's top returns are for OTHER fucking indexing sites like fucking kellysearch and NOT fucking sites owneed and run by aluminium extruders.

This is a trend that has become ever more prevalent over time, and it makes google ever more irellevant.

this will of course get modded troll, see journal

--
http://slashdot.org/~GuyFawkes/journal

Clusty by Vivisimo (powered by overture) by Anonymous Coward · 2004-10-03 01:53 · Score: 0

but basically its just *another* interface to overture who are nothing more than search spammers which pay for traffic/click sites usually are.

It beats Google by danharan · 2004-10-03 01:57 · Score: 2, Informative

on a search for "MILF" by putting the Moro Islamic Liberation Front in one category, and separating the more "mature" content in others.

It's not perfect, but it's a good start. I'm sure /.'ers can think of other ambiguous search words where clustering helps. The UI could use some simplification, but otherwise I'm impressed.

One neat consequence for web marketers will be more targetted traffic. With Google, you have to hope searchers will be savvy enough to use 3-4 keywords to search for exactly what they want- if they can click on two more KWs that refine their search, we'll see the inventory of cheaper 3 KW terms go up significantly.

--
Information: "I want to be anthropomorphized"

Search is a dialog, not a ranking by G4from128k · 2004-10-03 01:59 · Score: 3, Insightful

The basic concept of any kind of PageRank is flawed because it assumes a monotonic ordering of sites on some single scale (e.g., popularity as defiend by linkage). The problem with PageRank is not the use of links to assess popularity, but the presumption of a single scale.

The search of "Apple" illustrates this well. This search, like many is deeply ambiguous. It could refer to the computer company, to the fruit, to the record company, to New York City, to the singer (Fiona), or to Apple Valley (MN or CA). Even if the search engine knows that it refers to the computer company, it's still ambiguous. It could refer to the company (as an investment), the products (for purchase), or a question (as in technical support).

The point is that each of these ambiguous alternatives creates an independent cluster of hits. One cannot even rank hits within a cluster due to a hierarchy of ambiguity. Within the Apple computer cluster are distinct subclusters for computer purchase, investment evalaution, and technical support. Although one can create a ranking within each subsubsubsubcluster, it is impossible to construct a meanful rank for all hits across all clusters - the second hit for "purchasing an Apple computer laptop" is not comparable to the 2nd hit for "Apple Records".

Instead of a pagerank scheme that sorts the universe of hits the instant the user enters the search, search engines should be more interactive. The first page of hits would emphasize breadth -- displaying hits most representative of a broad range of alternative clusters. The UI would enable a "more like this"/"fewer like this" selection process that tells the search engine what the searcher is actually looking for. As the searcher selects hits, the subsequent pages might show popularity-ranked hits within the clusters that seem to interest the searcher.

Each hit and each page would serve a double-duty -- serving the searcher's need to get information from the internet, and answering the search engine's question about the needs of the searcher for that particular search. Until the search engine understands each searcher and each search, it cannot hope to rank the hits.

--
Two wrongs don't make a right, but three lefts do.

Re:Search is a dialog, not a ranking by coolsoldier · 2004-10-03 10:28 · Score: 1

The problem with that idea is that some of the most important aspects of a search engine are for it to be simple and fast. I don't know how many users would prefer to click through a whole series of pages to get to what they are looking for, rather than just looking at several results and filtering them mentally.

Although, Google does have an experimental project that allows you to create a profile of your interests and filter results accordingly, if that's your cup of tea.

nice!!!! by the_2nd_coming · 2004-10-03 01:59 · Score: 1

the clustered results are awesome!!! it is like google and northern-lights combined!!!

nothing really big about the encyclopedia search, it just mirrors wikipedia which already has a good search engine for its articles....unless clusty adds something like natural language search or a close to search I see no benefit for using clusty over wikipedia.

--

I am the Alpha and the Omega-3

Associated with Slashdot? by Anonymous Coward · 2004-10-03 01:59 · Score: 0

Hey,

Click 'Customize!' on Clusty's tabs and Slashdot can be made a full time search tab.
Are Clusty people just fans of Slashdot or there is something more?

While I love Google, I'm quite impressed with this new one. Tons of new little features there, like preview frame, and the side panel is helpful, Mozilla plugin.
The welcome page sucks though, and is uninviting.

This one impressed me way more then the one from Amazon.

Re:Why do Clusty... - publicity by fons · 2004-10-03 02:07 · Score: 1

Because this "rebranding" generates a large amount of publicity they could never buy.

Even on Slashdot, a lot of people didn't know about Vivisimo.

Now you do.

The money they put in the rebranding is FAR less then wath this amount of publicity would cost to buy.

The Porn Factor by Anonymous Coward · 2004-10-03 02:08 · Score: 0

Its very ugly....

All engines should be ranked on their porn finding abilities.

'Clusty' is bad name though. by master_p · 2004-10-03 02:09 · Score: 1

It has the word 'lust' in it. And it sounds real bad in my language (close to 'fart').

Re:'Clusty' is bad name though. by Anonymous Coward · 2004-10-03 18:23 · Score: 0

Fascinating! and, how ironic...

In my language, "googel" means "shit".

tabs by dancedance · 2004-10-03 02:09 · Score: 2, Insightful

Like google, clusty can seach for/through: images, news, ebay, blogs, and . . . SLASHDOT? I was quite supprised to see that it can be customized to have a slashdot tab at the top. The other interesting thing I noticed is that there is a link on the main page to "mozilla search plugin". I am not able to actually follow the link, but it would seem to suggest that they are interested in supporting OSS. Who do you think they are trying to target?

Pretty darn good search engine! by Cryofan · 2004-10-03 02:10 · Score: 0, Redundant

After trying it, it appears to be a little bit better than google. For one, it has that fast, spare interface that set google apart in the beginning.

And these clusters are a great idea.

Definitely worth bookmarking!

--
eat shiat and bark at the moon

Clustering Client for OS X by saddino · 2004-10-03 02:12 · Score: 1

theConcept is a client-based clustering/thematic search engine that works with Google, Wiki, DMOZ and other search engines (it data mines result and analyzes most significant keywords from the source pages). If you have OS X, you can check it out for free.

</shameless plug>

Re:Clustering Client for OS X by Anonymous Coward · 2004-10-03 02:50 · Score: 0

$40

Hooray! Hooray! Hooray! It puts Wikipedia first! by ortholattice · 2004-10-03 02:12 · Score: 3, Interesting

Finally, a search engine that correctly bubbles wikipedia above the spam clones (and read the reply to this post too). Google doesn't even show wikipedia at all on the first page, even if expanded. Kudos, you've won your first (?) customer!

Awesome - clustering seems to be very effective by MarkWatson · 2004-10-03 02:18 · Score: 1

I am a big fan of all things Google (I **love** their SOAP API - free use of the Google backend; the limit of 1000 hits a day is no problem for me because I just use it for a NLP question answering prototype) and one of my best friends works for Overture.

That said, I think that Clusty will do very well. I use Firefox and Safari for my browsers; after breakfast I am going to configure one of them to use Clusty by default.

One thing that make Clusty so interesting to me is that I have been working (for the last year and a half) on version 2 of my KBtextmaster product that uses similar technology: I cluster documents based on what categories I automatically assign to them (for this, I use the Reuter's categories and trained with their excellent 2+ gigabyte tagged news corpus). I also identify people's names, places, etc. and I am planning on using this information to improve clustering also.

oblig simpsons ref by sacrilicious · 2004-10-03 02:21 · Score: 0, Redundant

"Clusty" is going to try and take Google head-on.

Maybe this will give Clusty the crown.

--
- First they ignore you, then they laugh at you, then ???, then profit.

Dada would be... by sam_handelman · 2004-10-03 02:30 · Score: 4, Funny

A search engine that finds pages containing the words you typed which are *least* likely to relate to your actual underlying question. A google of the absurd, as it were.

This could be very, very difficult. How would you implement such a thing, from a technical standpoint?

--
The good and new comes from no quarter where it is looked for, and is always something different from what is expected.

Re:Dada would be... by Anonymous Coward · 2004-10-03 02:58 · Score: 0

Well, you wouldn't, would you? It's an art.
Re:Dada would be... by Anonymous Coward · 2004-10-03 05:19 · Score: 0

Do the Google search and order by ascending PageRank.
Re:Dada would be... by rembem · 2004-10-03 06:14 · Score: 1

A dadaistic search engine would be nihilistic.
Re:Dada would be... by kevincal · 2004-10-03 07:53 · Score: 1

Whose your dada?
Re:Dada would be... by 42forty-two42 · 2004-10-03 08:25 · Score: 1

Simple: Make a list of all non-matching pages, sorted ascending by relevance.

clustering? already got that... by lawngnome · 2004-10-03 02:31 · Score: 1

Google has had this for a while - its the allinurl advanced search option.

So does that mean by floydman · 2004-10-03 02:39 · Score: 0, Offtopic

that soon enough someone is gonna tell us that

Google is DYING

--
The lunatic is in my head

me no like it by Anonymous Coward · 2004-10-03 02:45 · Score: 0

tried it, do not like.

did simple test, searched for boobs. ok got boobs....... ICK! no direct like to the pic, most was advert pages.

i like google's split image page. i like google's "just show it" with out preferance listings.

Clusty works! by Anonymous Coward · 2004-10-03 02:47 · Score: 0

A search for "the the" (a band) actually turns up an entire page of results related to... the band!

Google, OTOH:
- returns no results for the band on the first page of an unquoted "the the" search
- returns one result -- #5 -- for the band on the first page of a quoted "the the" search. (before last year, you couldn't even get Google to do this)

the "plug in" is less than you think by Avishalom · 2004-10-03 02:49 · Score: 1

a plug-in for a search engine for my mozilla firefox consists of ~13 lines of text that you can write all by yourself (get variable name for the query term etc.)
oh , and a nice icon,
they're just trying to kiss up to mozilla users.

--
-- Avishalom is usually vish

clusty??? by Anonymous Coward · 2004-10-03 02:51 · Score: 0

ok i red it as CRUSTY..... like old crusty girls /shudder /vomit

needs a better name

Vivisimo by Don+Tobin · 2004-10-03 02:58 · Score: 1

Vivisimo was the search engine I liked a few years ago. The reason I kicked them to the curb went something like this:

"Hey Vivisimo guys, we're starting up a new company here and you have a very powerful results format. What would it take to license your engine to help organize and present our data to the public?"

Vivisimo's answer: Well, that depends on how much money you're going to make of course. Exactly how much money do you have and do you expect to make?

Google's answer: We're designing a hardware box that'll let you do exactly that. If you have a large enough interest we may even be able to include you as a case / study to help save some costs.

Wasn't a hard choice. Technology does not in itself win, impractical people running a company can still deep six the best ideas. Vivisimo we didn't even bother calling back.

Re:Vivisimo by Anonymous Coward · 2004-10-03 09:43 · Score: 0

Did you end up buying the Google appliance? What do you think of it? I don't see any search on your site...

Technology? by Anonymous Coward · 2004-10-03 03:04 · Score: 0

Am curious about the technology behind Clusty. It appears they are running Apache/Linux but what have they used to implement the engine logic? Java, C/C++, ...?

Why clustering? by sabNetwork · 2004-10-03 03:06 · Score: 1

Frankly, I don't see what's so great about clustering. It feels non-intuitive to me. And it doesn't appear to improve the quality of my results.

For example:
I search for 2600. I want information on the Atari 2600.
As soon as I realize that 2600 is a hacker magazine, I add the word "atari" to my search.
Why is this harder than clustering?

I do like the Open in Preview Pane feature, though.
--

Definition of irony by ZB+Mowrey · 2004-10-03 03:11 · Score: 1

Irony (n), 1. Learning about a Google competitor from a screen-scraping bot called Google News.

--

Self-referential sigs are rarely entertaining.

Re:Klutsy, but... by vivekg · 2004-10-03 03:19 · Score: 1

I agree with you but..

They are using Linux with Apache .. Good but no site should provide this kind of details open! Look at google and Yahoo, they don't provide accurate information on these issues.

Results are clustered that's the only think I love right now. But I'm dam sure rest of competitor get this stuff soon

--
The important thing is not to stop questioning --Albert Einstein.

The slashdot search doesn't work... by PetoskeyGuy · 2004-10-03 03:22 · Score: 1

If you go to Customize, you can add your own Slashdot tab.

But it doesn't work! A simple search for "Grits" or
"Linux" return 0 results.

Marketing Slip-Up... by One+Childish+N00b · 2004-10-03 03:30 · Score: 2, Funny

"Ewww, Jimmy's got a clusty in his hair!"

- I refuse to use anything that sounds like children's slang for a bogey or some other lump of offensiveness. Whoever thought that name up needs to be drummed out of marketing forever. The layout of the main page is reminiscent of Ask Jeeves (which is a bad thing, it automatically makes me think 'bad searches') and search pages look cluttered and the vivid background against the soft shades of the foreground looks awful. This 'Clustered Searching' is a good idea, badly executed. Next please.

--
Dealing with lawyers would be a lot less tedious if they all looked like Casey Novak.

What's it's name... by karniv0re · 2004-10-03 03:42 · Score: 2, Funny

"Yeah, I use that new search engine. Crusty. Er, Colostomy. Er, Callusy, or whatever."

Which is superior.. my brief take on it by mattr · 2004-10-03 03:57 · Score: 1

If Google is my hands for the web, Clusty is going to be my eye. I can swipe my hand through water in a pond, or sandhills on the beach, and get an idea of what is out there on the web, *in the order of popularity* (because backlinking is so important).

I have no problem with this way Google works, I found backlinking to be tremendously useful when implementing a gigabyte-sized database on htdig, and Google "just works".

Clusty on the other hand works to reduce my information saturation, it will reduce the overload and make me feel better. It seems similar to NorthernLight which did clustering a long time ago, but I believe stopped their public engine and are now going after the enterprise (apparently successfully? They have a linux download too).

I may be biased as I am also very interested now in faceted metadata search engine design, and that seems to be what Clusty is doing. I can't tell if it is the same categories as dmoz.org (which Overture says they leverage), but it seems to work. For example I typed in Northern Light and it gave me the categories of Search Engine, Reviews, Aurora, and even Crude Oil. Crude Oil disappeared when I put quotation marks around the search terms, so I'm impressed, they've taken the trouble to match phrases.

I tried some nonsense words, and discovered connections I didn't know exist (mostly foreign language) - I tried splik, splike, and spli*. Try it yourself in google and clusty. Note Google gives you ten pages for splike while Clusty tells you the knowledge domains they fit into. No more clicking here and there in the google screen list to try and find less-popular links. And Clusty turned spli* into split. And click Details in clusty, and a little yellow information window descends, telling you the different sites (Reuters included) and how many hits from each.

Look at their clustering, it seems good and useful. I searched for something I'm interested in now, the search term was: free bioinformatics tutorials.

Clusty gives me categories like Genomics, some institutes, and the Bioinformatics FAQ. It lets me expand more than one section at a time, and the tiny "More" link at the bottom of the category list continually extends that list each time you click it. That's useful. This leads me to other categories including some C++ libraries, a Computational Biology heading, MDL Chime, and a bunch more. Wow! I haven't studied it much more yet, but I'd like to be able to show a lot of categories the first time (no More button clicking), have more screen width given to the categories column, and display the associations that made it pick certain categories. Also I'd like a yellow popup when I mouse over a category to show the next inner level's category list (at th e moment not too many levels it seems) the way Berkeley's Flamenco does. There is a legend below the category list with a line describing the plus mark as "Expand clusters", but I wanted this to expand all clusters and give me all the categories. About the way I just checked the Flamenco site.. I had to use Google. The first time I typed Flamenco into Clusty and it didn't give me any category called Search Engine, which surprised me. I selected cluster by URL instead of Topic, and when I clicked on berkeley.edu I got a Clusty Error which was reproduceable then but not later. I found it on Google by typing in Flamenco and Berkeley. To be fair, Google took me to an old page that redirected me to the right page. When I went back to Clusty and typde in Berkeley as well and searched by topic, it was fine and took me to the right server the first time. Also the berkeley.edu links worked okay then too, so I want to give them the benefit of the doubt.

About the tech, I'm not sure they went as far as they could even though it works well for me. I thought it seemed to be a faceted metadata engine in some ways because they show the number of hits in each section, th

Mozilla Search Extension by wyldeone · 2004-10-03 04:15 · Score: 1

They have a search extension for Mozilla based browsers, which is certaintly somewhere where they stand out against google.

--
In the beginning the universe was created. This made a lot of people very angry and is widely considered as a bad move.

Re:Mozilla Search Extension by Alric · 2004-10-03 14:54 · Score: 1

I guess their recognition of Mozilla makes them stand out, but it's not bad for Google, as every version of Mozilla I've used has had a Google search extension built into it.

Also, it only takes about 15 minutes to write a search extension. Tons of them exist; just Google for one.

the next test for search engines by 3l1za · 2004-10-03 04:16 · Score: 1

I'd like to see a search engine -- perhaps just for shopping -- that weeds out repeat entries. I.e. I was trying to find a particular pair of "Donald J Pliner" shoes and so punched various combinations of "Pliner" and the name of the shoe into Google.

And from that I got what looked like several *different* websites. The first website was zappos.com so I went there and found that they didn't have my size / the color I wanted. So I tried some of the 80+ other *different-looking* sites (each with unique URLs). And for just about every single one, I would go to the new URL, then find the shoe I was interested in, then when I clicked on the link to "learn more about the shoe" or "see if the shoe was in stock," EVERY single time I got sent back to Zappos.com which I already KNEW didn't have the shoe in my size/the desired color.

Usually you can tell hack websites by the URLs -- the URLs are something like this: donald-j-pliner.urbanclothing.net

You can tell it's a sham website (as contrasted to shoebargains.com, shoedini.com, shoebuy -- which seem as though they could be legitimate operations in and of themselves).

So I want a search engine that will dereference these various URLs and let me know if if there is something truly new there or if these are just front websites for a website I've already visited.

This was one of my most frustrating online shopping experiences ever; zappos.com saturated the search engine results so that only zappos.com proxies were returned. And maybe the answer is that there are no other retailers (besides macys.com nordstrom.com ...) but if that were the case: I'd rather just have that answer than spend hours following links that led me back to the same answer. Total bullshit.

Lose the Butt-Crack Logo by Anonymous Coward · 2004-10-03 04:16 · Score: 0

I do like the functionality, and am impressed that even after slashdotting it seems to be functional. But they really should reconsider the name, and lose the patch logo, which reminds me of nothing more than a plumber's butt crack.

Re:Lose the Butt-Crack Logo by Anonymous Coward · 2004-10-03 04:30 · Score: 0

Concur!

"Clusty" as a name sux; kind of reminds me of Clippy.

But mostly makes me think these guys are probably great on the engineering but only gave the marketing about 2 seconds worth of consideration... just before walking into a mtg with the VCs.

Which is fine but *somebody* should be spending some time trying to capture the beauty / strength / brilliance of this product if the folks who designed it are too busy getting it right. They do themselves a disservice. The Am public does love something "catchy."

Imagine a beowulf cluster of these.... by nerd256 · 2004-10-03 04:17 · Score: 0

1: Get a compile farm going
2: Set up a worldwide database network
3: Start archiving the web
4: ...
5: PROFIT!!!

Sorry, it had to be done

YAGK by louisykarma · 2004-10-03 04:24 · Score: 1

> Cat got your tongue? Yet Another Google Killer, stoopid machine

Re:Klutsy? Clustering is hardly "new" by zapadoo · 2004-10-03 04:38 · Score: 1

Search result clustering is nothing new. Search engine technology companies like Verity, Fulcrum (from U of Waterloo grads among others, now owed by Knowledge Manager vendor Hummingbird), and others have been working in this space for many years.

While I've seen no overt evidence of this from Google, I would be shocked and amazed if Google did not already have this technology in a highly advanced state.

Google's own "Similar Pages" link associated with search results probably in part uses clustering technology in the background to achieve its results.

Dictionary Search by Anonymous Coward · 2004-10-03 04:47 · Score: 0

new adj. Middle English, der. Latin novus

1. : having recently been discovered by the New York Times

2. : having been posted to /. so long ago that the poster has forgotten,

cf. everything old is new again

3. : having recently been renamed

4. : of or pertaining to a recent product launch event (more at song & dance)

Klitsy? by Donny+Smith · 2004-10-03 04:50 · Score: 2, Funny

>I think clusty.com is better, but now makes me think of unclean prostitutes.

And Google makes me think of clean prostitutes!

With .sig: by FooAtWFU · 2004-10-03 04:57 · Score: 2, Funny

What must a new search engine provide to "steal" users from google?

Free iPods? Sure!

Well, I guess that's one way to do it...

--
The World Wide Web is dying. Soon, we shall have only the Internet.

What ? No FTP Search ?! by Taco+Cowboy · 2004-10-03 04:59 · Score: 1

Where's that FTP-Search thing ?

--
Muchas Gracias, Señor Edward Snowden !

When all questions can be answered by Anonymous Coward · 2004-10-03 05:06 · Score: 0

When a search engine can point you to, or better yet, inline, answers to arbitrary queries, it will be done.

WTF is "the English dictionary"? /nt by handslikesnakes · 2004-10-03 06:12 · Score: 1

damn you slashdot and my inability to post a textless comment

i just couldn't care less in such cases by l3v1 · 2004-10-03 06:13 · Score: 3, Insightful

Am I the only one who is fed up reading like "company A developed a new search engine which uses company B's search engine by adding revolutionary and world shaking features like thinking instead of you"...

If some are so revolutionary, then why are they using someone else's engine by adding some stuff most people most probably never find out what to use for. Doesn't A9 ring a bell for anyone, or does it.

I have an idea. Let's make a totally new and ground breaking search engine which will use Google's results, but hey, the main idea: let's have a different logo and paint the site pink !

Geez, I sometimes just can't stop wondering about all the freaky things that money can be earned from these days.

--
I am putting myself to the fullest possible use, which is all I can think that any conscious entity can ever hope to do.

Re:i just couldn't care less in such cases by HyperChicken · 2004-10-03 08:35 · Score: 0

I have an idea. Let's make a totally new and ground breaking search engine which will use Google's results, but hey, the main idea: let's have a different logo and paint the site pink !
Poogle? Hmm....

--
Free of Flash! Free of Flash!
Re:i just couldn't care less in such cases by lordpud · 2004-10-03 13:07 · Score: 1

The thing is why bother inventing a new search engine when that is not the problem you want to solve? I think that Google has a great search engine, but the way that we get results pretty much sucks. Why in the world should I just get a list of the billions of pages that matched my search. In fact if I were to create a company like this I would want to license someone else's search engine, since it would probably be cheaper than developing my own, take less time, and probably give better results than mine.

The idea here is that we have additional useful information about the pages. In particular we know (through some sort of NLP that I will ignore) that their are clusters of documents that are related in your search. It makes a lot of sense to tell the user what kind of additional refinement to their search can help them narrow in on what they really want.

I guess that is what you mean by "thinking instead of you". But I think one of the major goals of computers is to do just that. If not then we should go back to doing all our math by hand and manually typesetting our documents on our printing presses. But just as that seems a bit absurd (unless you happen to be a Luddite of some variety), it seems absurd to dismiss other worthwhile things a computer can do, such as clustering documents.

um, nuthin' beats... by Anonymous Coward · 2004-10-03 06:22 · Score: 0

WebGoggles for those special times when no other search engine will do.

WebGoggles.com by Anonymous Coward · 2004-10-03 06:27 · Score: 0

Um, no Booble sucks. Try WebGoggles.

real headline by dioscaido · 2004-10-03 06:27 · Score: 1

New clustering search engine makes Slashdot front page, falls into obscurity shortly thereafter.

Sponsered results at the top by Snaller · 2004-10-03 06:49 · Score: 1

No thank you.

--
If Google really cared they would fix Android Chrome to reflow text, instead of discriminating

Re:Sponsered results at the top by HyperChicken · 2004-10-03 08:38 · Score: 0

Google puts sponsored results at the top: http://www.google.com/search?hl=en&ie=UTF-8&q=linu x&btnG=Google+Search Wow, one of them for that query is a Microsoft page!!!

--
Free of Flash! Free of Flash!

Clusty is an illness? by Anonymous Coward · 2004-10-03 06:57 · Score: 0

Sounds like something you'd catch while sitting on the John at a dirty truck stop.

Rule number one by Lord_Dweomer · 2004-10-03 07:30 · Score: 1

Rule #1 of making a search engine to take on Google: Make sure the name of your engine sounds good when used in verb form.

Rule #2, innovate the actual searching, not just the organizing.

--
Buy Steampunk Clothing Online!

Impressed by Anonymous Coward · 2004-10-03 07:34 · Score: 0

I did a search for my website, and it actually was listed. Something that, dispite my best efforts, I have been unable to obtain with google yet. More power to 'em!

suprising similarities... by endersdouble · 2004-10-03 07:58 · Score: 1

can be found between this and google. For one thing, it must have a similar algorithm to PageRank. If you search "litigious bastards", not counting the "special" result on the top, the fourth hit is SCO. "Miserable failure" hits GWB #1. Not *quite* the same results, but close....

I'm impressed by Gunark · 2004-10-03 08:28 · Score: 1

I spent the last few days Googling for some obscure Linux SATA driver information... the best Google could give me was some half-coherent mailing list posts on the subject. A quick search on "Clusty" on the other hand came up with more or less exactly the info I'd been looking for.

It looks like you're trolling by Anonymous Coward · 2004-10-03 08:35 · Score: 1, Funny

___ | | Would you like some help? (.)(.) | | / * Search for duplicate articles || || * Scan for bad grammar in TFA ||__|| * Use a cliche from saved templates |____| * Post a hidden link to tubgirl

YASE rant by Anonymous Coward · 2004-10-03 08:40 · Score: 0

Yet Another Search Engine is able to take on Google. So what? There are plenty of search engines and the problem is that the results often suck.

I started using Google because it gave good results -- not because it had a "clean" interface; not because I liked the name; it was because it usually provided good enough results that I could find what I wanted within the first ten results returned.

It seems to me that the media or whoever reports this stuff just doesn't get it: people want results, not a "portal" or a load of features.

In business news, I keep hearing about how AskJeeves stock is going to go up and how it will challenge Google. AskJeeves, Overture, Lycos, Infoseek, Excite, Altavista and all of the other big search engines have had years to do what Google was able to do, but they haven't. Now that Google is public, they finally get it?!?

When a new search engine surfaces that is able to scan a text and pass a sixth grade reading comprehension test about the text then I will be impressed. That would be a search engine worth using. Until then, I'll stay with Google.

This search engine *IS* better than Google by mattdm · 2004-10-03 09:48 · Score: 1

I am the top hit for my own name. Clearly Clusty is superior -- on Google, some lame journalist comes up first, but, c'mon, who cares about him? Clusty properly files him as the second choice. The better results on this very vital search prove that everyone should switch now. Thanks.

cell phones by Anonymous Coward · 2004-10-03 09:58 · Score: 0

Actually, god hates shrimp. People (including me) hate cell phones ringing in church.

Re:cell phones by AndroidCat · 2004-10-04 00:21 · Score: 1

God told you to think that.

--
One line blog. I hear that they're called Twitters now.

linkified by Anonymous Coward · 2004-10-03 10:15 · Score: 0

http://slashdot.org/articles/04/01/05/1839233.shtm l?tid=126&tid=185&tid=95

It's not hard, people. It's not like if you create a link, people are going to slashdot slashdot.

Re:linkified by druhol · 2004-10-03 16:46 · Score: 1

Slashdotting Slashdot? That's deep.

--
WWD4D?

Looks like by Code+Dark · 2004-10-03 11:06 · Score: 0

Looks like they stole the page layout from Ask Jeeves; I see that they have a Mozilla Plugin. If enough people use Clusty with Firefox (since it is booming, after all), or it becomes a standard option, they could really make it big. I think that their biggest opposition is simply people wanted to stay "loyal" to Google- or just have used Google for so long, that they don't want to change.
I'm sad, though- I didn't show up on the Gossip search!

--
- Code Dark

A: How to beat Google [Re:Hrmm...] by Anonymous Coward · 2004-10-03 11:17 · Score: 0

What must a new search engine provide to "steal" users from google?

Index more than 2^32 Web pages and make them accessible as fast as Google?

Forget Clusty. Just fix Google. Like this... by iamcf13 · 2004-10-03 11:40 · Score: 1

It's quite simple really....

Since Google appears to be nearing their 4GB page index limit, do this:

Delete ALL (YES *ALL*) indexed webpages except the homepage.

Example:

Why index:

http://www.example.com/
http://subdomain.exampl e.com/
http://www.example.com/thispage/
http://w ww.example.com/thatpage/

When all you really need to index is just:

http://www.example.com/

Added to that, Google has already been 'spamdexed' by online retailers -- 'about 1,670,000 pages' indexed by Google from one particular online retail giant's domain alone!

This approach will also kill off all pages like this:
http://www.example.com/~ispcustomer/

and make it harder, for example, to find useful info in a particular labyrinthine website I freqent via Google on an ongoing basis as needed.

For the 'ispcustomers': if you truly value your information in such a context, buy a .com domain and point it to your webspace at your ISP or, better yet, host your info at the domain itself. End of story.

Then the next thing that could be done is to make it easy to report 'spamdexed' domains and 'link farms' so they can ALL be automatically purged from the Google database as needed. To avoid 'Joe Jobs', this purging does not extend to the domains listed on the pages hosted at the offending domains.

Problem solved.

what i can do you can too! by Nexcet · 2004-10-03 13:00 · Score: 0

how long till google has all em new features, along with that cool side bar selection :p *rolls dice*

Scum by Stephen+Gilbert · 2004-10-03 13:56 · Score: 1

Now I ask you, why do Slashdotters feel the need to dump on Clusty?

Clusty, you don't want to sit with this scum.

*General murmers and complaints from Slashdot*

But I only consider you scum next to Clusty! That's right, you see how you're scum.

Re: New Clustering Search Engine to battle Google by Anonymous Coward · 2004-10-03 14:29 · Score: 0

The search engine uses Overture for it's results

"its".

Was there ever a standard 'google'? by hooqqa · 2004-10-03 14:39 · Score: 0

For some reason I thought there was a consensus that the 'google' was some arbitrarily large value and at some point somebody decided to define it as 1 followed by a hundred zeros. ...before google - the search engine; I always figured that was why they stretched out the o's like they do.
There wan't any mention of that at the webster place 'tho?
http://www.webster-dictionary.org/definitio n/Googl e

Useful by Anonymous Coward · 2004-10-03 15:18 · Score: 0

All other things being the same, "Cluster by URLs" at least will be massively useful for quickly checking up on who links to certain sites. Just for that this is going into the toolbox.

Given gmail using a .google.com cookie, if it proves speedie enough over the long run to be not annoying, I might even make it my default search engine.

They should have called it Krusty! by ezh · 2004-10-03 15:39 · Score: 1

%subj% - well, here it is... OSQ - Obligatory Simpsons Quote...

clusty already full of spam by mcguyver · 2004-10-03 16:27 · Score: 1

Clusty is already full of spam, much like MSN and yahoo. Google still does the best job filtering spam websites and clusty/msn/yahoo seem to be far behind. Which is ok by me because this makes manipulating clusty and doing SEO much easier, :). Howver I do recognize Clusty is in beta. Go clusty.

educate the users by majid_aldo · 2004-10-03 18:30 · Score: 1

no matter how good search engines become, users will have to be able to define what they want. it's just plain the user's fault if he/she searches for 'apple' and expects 'good' results.

currently, search engines will provide MUCH better results if users used search syntax (-, +, "..", *, ~..et al)
such syntax can be eliminated (thus reducing the need for educated users) if more advanced search engines can effectively interpret natural language queries.
google already helps user's mistakes with their "did you mean ..." phrase.

--
--- widget evolution: enhanced, plus, super, ultra, extreme, exxxtreme, ultra-extreme, ..etc.

I don't see by camcloud1 · 2004-10-03 21:29 · Score: 0

how this is any different to http://mooter.com/

er, did you read the text? by CGameProgrammer · 2004-10-03 22:01 · Score: 1

Below the search results, it says:

Your query linux did not return any result because all the selected sources timed out.

Note: Slashdot.org servers are currently not serving all requests from Clusty.com.

--
~CGameProgrammer( );

Only 4 days late by smooth+wombat · 2004-10-04 00:40 · Score: 1

Way to go guys. I posted this story on September 30th and it was rejected:

Googles newest competitor? Thursday September 30, @08:31AM Rejected

And I didn't even link to a NY Times article but rather an AP story.

You can see that I have this story listed in my journal at this link.

Keep up the good work. This is the kind of nonsense we have come to expect from /.

P.S. I will usually post some of the stories which have been rejected so be sure to check my journal every so often. After all, why wait for three days to go by when you can get the story when it comes out?

P.P.S. Anyone else see the story of the vibrating condom?

--
We will bankrupt ourselves in the vain search for absolute security. -- Dwight D. Eisenhower

Re:Klutsy, but... by Ash-Fox · 2004-10-04 00:47 · Score: 1

I agree with you but..

They are using Linux with Apache .. Good but no site should provide this kind of details open! Look at google and Yahoo, they don't provide accurate information on these issues.

Results are clustered that's the only think I love right now. But I'm dam sure rest of competitor get this stuff soon

Yahoo use Apache and FreeBSD with some custom c CGIs and PHP scripts on the new stuff.

Google use alot of special mini appliances that runs linux and a customised apache server called GWS/2.1

Google also now have a special cacheing server that gives out the server header: GFE/1.3

Google's search software is written primarilly in JAVA.

Enough open details for you?

--
Change is certain; progress is not obligatory.

Slashdot Mirror

New Clustering Search Engine to battle Google

189 comments