Slashdot Mirror


Desktop Search Engines Compared

nutterButter writes "After Google created a stir with its desktop search engine, other engines gained more awareness in the public eye. Slate did a comparison of them and Google was not their top pick; Copernic was. I tried it - and am quite impressed."

24 of 361 comments

  1. Re:Mac version? by Anonymous Coward · · Score: 1, Informative

    Sure, there's one coming out later this year. It'll cost $130, or $70 if you opt for the "educational discount."

  2. history search by FrenZon · · Score: 5, Informative

    The biggest use (and what makes it a necessity for me now) I have for a desktop search tool is searching for a webpage I partially remember visiting a few weeks ago, but need more information from. GDS indexes the content of all pages as you visit them, making finding them relatively easy - as far as I could tell (tested over half an hour), Copernic only indexed title and URL, which was of much less use.

    A minor point for the geekier here - GDS can also be activated using quicksearch URLs from IE or Firefox, which is handy for those used to getting everything from one field.

    1. Re:history search by duncangough · · Score: 3, Informative

      Install your own proxy server and let that do the searching and indexing.

      Like this: Python proxy server - a proxy server, written in Python, that uses Lucene/Lupy to do the indexing and searching.
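      For what "indexing as you browse" amounts to, here is a minimal sketch of an inverted index over page text. It is not the Lupy or Lucene API; the `HistoryIndex` class, its method names, and the example URLs are all hypothetical, and a real proxy would also have to strip HTML and persist the index to disk.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


class HistoryIndex:
    """Toy inverted index: word -> set of URLs whose page text contains it."""

    def __init__(self):
        self.postings = defaultdict(set)

    def add_page(self, url, body):
        # A proxy would call this for each response that passes through it
        # (assumption: body is already plain text, not raw HTML).
        for word in tokenize(body):
            self.postings[word].add(url)

    def search(self, query):
        # Return the URLs that contain every word of the query (AND semantics).
        words = tokenize(query)
        if not words:
            return set()
        result = set(self.postings.get(words[0], set()))
        for word in words[1:]:
            result &= self.postings.get(word, set())
        return result


if __name__ == "__main__":
    index = HistoryIndex()
    index.add_page("http://example.com/a", "Desktop search engines compared")
    index.add_page("http://example.com/b", "Search the web")
    print(sorted(index.search("desktop search")))
```

      Indexing the full body rather than just title and URL is what makes a half-remembered page findable weeks later.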

  3. X1 by Anonymous Coward · · Score: 1, Informative

    Hello? What about the company that invented this category, X1? Yahoo's using them for their desktop search app, and they're considered the standard-bearer by many. Definitely the most feature-rich (250+ file formats, Netscape and Eudora mail support, etc.)

    1. Re:X1 by Anonymous Coward · · Score: 1, Informative

      And calls home on every run. No thanks.

  4. DT Search by Anonymous Coward · · Score: 3, Informative

    I've tried these so-called "Desktop Search" apps like Google and Copernic, but they're all crap. If you want serious desktop search, get something like DTSearch (http://dtsearch.com/PLF_desktop_2.html).

    Only problem is DTSearch is hella expensive at $200.

    But if you've got serious amounts of text that you need to search (I use it to search through 80 GB of text on an external HD), it's the only way to go.

    1. Re:DT Search by Mehtuus · · Score: 2, Informative

      DiskDB is not the most beautiful program, but it works very well. ( http://www2.neweb.ne.jp/wd/morimoto/en/diskdb/index.html ) Or you could try one of the programs listed on this page: ( http://www.snapfiles.com/freeware/system/fwdiskcat.html ).

      --
      http://mehtuus.googlepages.com
  5. Some GNOME folks look to be working on it. by Chuck+Chunder · · Score: 5, Informative

    Beagle is a search tool that ransacks your personal information space to find whatever you're looking for. Beagle can search in many different domains.

    The latest edition of the Beagle newsletter has just been released.

    --
    Boffoonery - downloadable Comedy Benefit for Bletchley Park
  6. Re:Linux anyone? by ken_devon · · Score: 5, Informative

    Wow. The timing on this article is uncanny. I installed Beagle yesterday, and I'm already addicted to it - it indexes documents, mail and web pages as they're accessed, and updates its search results in real time.

  7. Re:Why is desktop search so hot? by mOoZik · · Score: 5, Informative

    Actually, it CAN search inside of files, contrary to your post. The results can then be arranged by size, type, folder, date, etc. Isn't that enough?

  8. Re:Apple's coming out with something like this... by byolinux · · Score: 2, Informative

    Looks pretty sweet too.

    Apparently it's a SQLite DB that stores metadata.

  9. Re:How neccessary is this for home users? by standsolid · · Score: 2, Informative

    As someone who works helpdesk...

    You, sir, are completely wrong :)

    Users HAVE NO CLUE where they put their files... ever.

    Now whether or not a search tool will help them find the files they save is another question...

    --
    WTPOUAWYHTTOTWPA
    What's the point of using acronyms when you have to type out the whole phrase anyways?
  10. Re:Apple's coming out with something like this... by Shanep · · Score: 5, Informative

    It's called Mac OS X Tiger.

    Actually, it is called Spotlight.

    Which will be a part of Tiger, the upcoming version of Mac OS X.

    --
    War crimes, torture, lies, illegal spying... Would someone give Bush a blowjob, already, so he can be impeached?
  11. FYI, Copernic contains adware. by Shanep · · Score: 4, Informative

    Copernic's Privacy Policy reveals that, "Copernic Technologies, Inc. works with third parties that transmit advertisements to the Copernic Agent and Copernic Desktop Search product families and Copernic Meta."

    --
    War crimes, torture, lies, illegal spying... Would someone give Bush a blowjob, already, so he can be impeached?
    1. Re:FYI, Copernic contains adware. by Scutter · · Score: 3, Informative

      Copernic's Privacy Policy reveals that, "Copernic Technologies, Inc. works with third parties that transmit advertisements to the Copernic Agent and Copernic Desktop Search product families and Copernic Meta."

      It also says this:


      # Keywords and result contents processed by Copernic Desktop Search
      Copernic Desktop Search does not allow transmission of keywords or result contents to Copernic Technologies, Inc. or any of its partner for searches conducted by the user on his computer or corporate or home network. If the software ever requires collection and processing of data, such as user's profile, location, search history, fields of interest and tastes, these data should be processed only by the user's computer and not be transmitted deliberately to Copernic Technologies, Inc. or any of its partner.


      I'd like to know how they reconcile the two. CDS does interface to web searches, though, so perhaps that's what they use.

      --

      "Tell me doctor, with all of your defenses, are there any provisions for an attack by killer bees?"
  12. Re:Why is desktop search so hot? by eric_01 · · Score: 3, Informative

    I have about 6 years' worth (10 gigs) of old project files sitting on my hard drive. I use X1 and think it's an absolute godsend. Just type in a few keywords and X1 pulls up the file. I used to have to pore through a dozen levels of directories and rely on my rusty memory to try to find files.

  13. Another free alternative... by Anonymous Coward · · Score: 1, Informative

    I like DocYouMeant Hound http://myradus.com/. But, I know the guy who wrote it, so I'm a bit biased. :-)

  14. Re:the main problem i had with google by thenextpresident · · Score: 2, Informative

    Yes, you can move files with Copernic. You can drag them from the search result to a new location. Of course, it actually moves the file, and doesn't just copy it.

    --
    Jason Lotito
  15. Re:Linux anyone? by dAzED1 · · Score: 3, Informative
    if every document you have is cached, then there are two copies of every document, which is a serious waste of space. I think what you mean to say is that it's indexed, but I'm not going to answer all your questions for you.



    there's no reason to grep your entire damn hard drive for a single phrase. Use some degree of organization. The business world has limited use for someone who can't keep themselves organized.



    finally - egrep will easily find patterns in all sorts of binary files. Creating a tiny little happy gui to search for things in your folders with DOCUMENTS (instead of searching your whole damn hard drive) is easy enough, if typing egrep "Thing I Want" * proves to just be too darn complicated.

  16. Re:Why is desktop search so hot? by WiPEOUT · · Score: 2, Informative

    Way to go, Mr Anonymous Windows Expert. The Indexing Service does everything these desktop search tools do, and has for many years.

  17. Wilbur from Redtree by spywarearcata.com · · Score: 2, Informative

    I've used the free open source Wilbur from redtree.com for ten years now. Now that everybody's doing it, I can tell the secret.

  18. Take your pick by useosx · · Score: 4, Informative

    Please, the Mac shareware developers practically invented this genre:

    Launchbar (the first)

    Quicksilver The current favorite, and free.

    Butler About the same as Quicksilver, more features but not as slick.

  19. Re:Apple's coming out with something like this... by dr.badass · · Score: 2, Informative

    Quicksilver also has the worst interface of any Mac app, ever.

    You're an idiot. There, I said it, and will probably get modded down just for that. But, honestly, Quicksilver having a bad interface? Bullshit. Your description sounds like you just looked at a screenshot and guessed at how it works. It's functionally no different than LaunchBar: hit Cmd+Space and start typing in the box cleverly marked "Type to search".

    Yes, it's a *slightly* different approach than LaunchBar, but if you closed your eyes, you'd be hard pressed to tell the difference between the two.

    --
    Don't become a regular here -- you will become retarded.
  20. Re:Linux anyone? by kirkjobsluder · · Score: 2, Informative
    Locate isn't bad, but for some applications you really need a content-based search that can't be accommodated by variations on grep. The grep family is great when you are dealing with text-based files, but tends to run into problems with content like PDF and OpenOffice.org files.

    So for a practical example, I have about 120 collected pdf files of academic articles under filenames with the primary author and year. (I could put the title in there, but filenames between 16-25 characters seem to be reasonable.)

    If I'm doing reading on a particular topic, I might want, for example, all of the articles related to Barry Wellman's work on social networks on the internet. The obvious way to get that is to list all of the articles that cite Wellman. This is probably not information that I want to put in the filename.

    So, to try a naive example (which, according to others here, should work):
    % time grep -il wellman *.pdf
    grep -il wellman *.pdf 0.65s user 1.27s system 99% cpu 1.939 total
    So in this case, grep spends about two seconds returning no results.

    Now I could write a shell script that runs pdftotext on every file in my library, then greps the output. But pdftotext is expensive for one file, much less a directory of 120 files:
    % time pdftotext postgresql_tutorial.pdf - > /dev/null
    pdftotext postgresql_tutorial.pdf - > /dev/null 1.84s user 0.16s system 99% cpu 2.019 total
    Thankfully, I have a document indexing application that does the work for me. A while back I set up swish-e to index almost everything in my home directory. So...
    % time swish-e -f ~/.swish-e/Web_index -w wellman | grep library
    1000 /home/kirk/www/library/garton_1997.html "STUDYING ONLINE SOCIAL NETWORKS, by Laura Garton, Caroline Haythornthwaite, and Barry Wellman" 103238
    927 /home/kirk/www/library/koku_2003.doc "koku_2003.doc" 306176
    375 /home/kirk/www/library/Cassell_2005.pdf "Cassell_2005.pdf" 615126
    323 /home/kirk/www/library/Qualifying_Exams/onlinecomm.pdf "onlinecomm.pdf" 63894
    255 /home/kirk/www/library/Koehly_1998.pdf "Koehly_1998.pdf" 1410176
    255 /home/kirk/www/library/Qualifying_Exams/methods.pdf "methods.pdf" 72688
    255 /home/kirk/www/library/cho_2003.pdf "cho_2003.pdf" 118267
    161 /home/kirk/www/library/SearchDBDT/INDEX_K.IX "INDEX_K.IX" 294912
    161 /home/kirk/www/library/ICLS_doctoral_consortium_proposal.pdf "ICLS_doctoral_consortium_proposal.pdf" 44923
    161 /home/kirk/www/library/barab_ilf_2002.pdf "barab_ilf_2002.pdf" 280560
    161 /home/kirk/www/library/barab_dvc.pdf "barab_dvc.pdf" 683011
    swish-e -f ~/.swish-e/Web_index -w wellman 0.05s user 0.03s system 95% cpu 0.090 total
    grep library 0.00s user 0.01s system 9% cpu 0.087 total
    The full-text index gives me 11 hits, sorted by score, in about 1/20th of the time the naive grep took. (It missed one, primarily because xpdf respects copy protection while Copernic seems to be able to index through copy protection.)

    Sometimes fulltext searching is useful, and egrep just does not work.
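    For comparison, the convert-then-search approach described above (run pdftotext on every file, then look for the name) might look roughly like this in Python. This is only a sketch: `pdf_text` and `files_matching` are hypothetical names, and it assumes the `pdftotext` command is installed and on the PATH. It pays the full extraction cost on every search, which is exactly what an index avoids.

```python
import subprocess


def pdf_text(path):
    """Extract text by running `pdftotext <path> -`, which writes to stdout."""
    result = subprocess.run(
        ["pdftotext", path, "-"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def files_matching(paths, needle, extract=pdf_text):
    """Return the paths whose extracted text contains needle, ignoring case.

    `extract` is swappable so other converters (or a stub) can be used.
    """
    needle = needle.lower()
    return [path for path in paths if needle in extract(path).lower()]
```

    Usage would be something like `files_matching(glob.glob("*.pdf"), "wellman")`; at roughly two seconds per file, 120 files take minutes per query.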