Slashdot Mirror


What Desktop Search Engine For a Shared Volume?

kriston writes 'Searching data on a shared volume is tedious. If I try to use a Windows desktop search engine on a volume with hundreds of gigabytes the indexing process takes days and the search results are slow and unsatisfying. I'm thinking of an agent that runs on the server that regularly indexes and talks to the desktop machines running the search interface. How do you integrate your desktop search application with your remote file server without forcing each desktop to index the hundred gigabyte volume on its own?'

232 comments

  1. Call the NSA by Anonymous Coward · · Score: 5, Funny

    They already have it indexed for you.

    1. Re:Call the NSA by jo42 · · Score: 1, Informative

      Or Google. They already wire-tap your voice mails, index them and post the results for public consumption.

    2. Re:Call the NSA by mysidia · · Score: 1

      Or the RIAA... (assuming said shared volume contains MP3s).. not only do they have it indexed for you, but they'll conveniently mail the list to you in the form of a letter with a bunch of legal sounding junk at no upfront cost to you.

  2. wow! by tivoKlr · · Score: 5, Funny
    It's been an hour since this story was posted.

    You've stumped Slashdot. Bravo!

    --
    Ocean is land, covered with water.
    1. Re:wow! by beelsebob · · Score: 2, Informative

      I was wondering if "spotlight on OS X does that just fine" was too trollish... But, too late, I said it, and it's true, can't be trollish if it's true can it?

    2. Re:wow! by cyber-dragon.net · · Score: 1

      That's what we use... server version indexes servers, SAN volumes etc and makes them searchable from each desktop ;)

    3. Re:wow! by v1 · · Score: 1

      and it's faaaaast

      It searches by content as well as by filename too for those that didn't know. Incredibly useful.

      --
      I work for the Department of Redundancy Department.
    4. Re:wow! by Anonymous Coward · · Score: 0

      sharepoint does this. (Let's find out how biased slashdot readers are)

    5. Re:wow! by Tubal-Cain · · Score: 1

      And yet it looks like you posted an hour before the article.

    6. Re:wow! by Anonymous Coward · · Score: 0

      I don't really care for spotlight... I find that it doesn't always have a up to date index.. maybe I just have to many files but sometimes its days before a item shows up.. I've seen it between Leopard and Snow Leopard..

      What I don't understand is why isn't there a instant entry into the index when a file is generated.. I don't expect to be able to search inside the document but at least have the file show up (based on file name) when doing a search..

    7. Re:wow! by Anonymous Coward · · Score: 0

      It is instant.

    8. Re:wow! by cbiltcliffe · · Score: 0, Redundant

      Maybe so, but apparently nobody uses Sharepoint.

      --
      "City hall" in German is "Rathaus" Kinda explains a few things......
    9. Re:wow! by stuffeh · · Score: 1

      I can tell you with Mac OS X Server 10.3 (the actual 10.3 server edition) and 10.4 client, the spotlight would be going through the files on the network drive themselves and thus would be SLOW. Which is why I would ssh or vnc into the server and manually initiate a search. I have no idea if it still searches slow with 10.4 or 10.5 server.

    10. Re:wow! by willy_me · · Score: 1

      I believe that the newer versions of OSX server will perform indexing on the volume so the clients do not have to. So searches are fast and the problems you described are gone.

      Ok, just went over to apple.com to find the info page. It is here.

    11. Re:wow! by pinkushun · · Score: 1

      Sorry for the late reply, was busy downloading the internet. anyway a good cheap solution would be to make the drive publicly shared, expose it to the web and search it through google using something like "company user password list site:YOUR_PUBLIC_IP". Post your results here so we can all test your indexing speed :)

    12. Re:wow! by darkpixel2k · · Score: 1

      I believe that the newer versions of OSX server will perform indexing on the volume so the clients do not have to.>

      That's a novel idea. The day after Windows Search was made available through Windows Updates, I spent about 45 minutes trying to figure out why my storage server was totally maxing out the disk and users were complaining about the network crawling. Apparently Windows Search on 45 machines decided it was time to index "My Documents" which is actually stored on the server thanks to folder redirection.

      I spent another hour stopping the Windows Search service on every freaking PC and disabling it. Finally I realized my mistake and I created a group policy that administratively forces the service to be disabled on startup so no new machines would make the same mistake.

      Awesome job Microsoft.

      --
      There's no place like ::1 (I've completed my transition to IPv6)
    13. Re:wow! by Knara · · Score: 1

      Yup yup.

      We've also seen the same issue with Google Desktop, but also, for some reason we've never quite figured out, a number of "my outlook is acting weird" problems solve themselves once we remove Google Desktop.

      As a result, if we get a machine that is "acting weird", the first thing that comes off is Google Desktop. Has saved us a bunch of time.

    14. Re:wow! by tivoKlr · · Score: 1

      I must be prescient then...lucky me!

      --
      Ocean is land, covered with water.
    15. Re:wow! by Zotdogg · · Score: 1

      "Sharepoint does this"....

      COME ON! I can't believe Sharepoint is even being referenced as an even half-way acceptable solution to provide a search method for remote volumes with LESS overhead. Not only would you have to install the broken piece of shit known as Sharepoint and all it's overhead componentry but you would also have to make all kinds of changes to your existing computing system(s) as a whole that would serve no necessary purpose other than sinking MS's (not biased enough to use a $) teeth further in to your system and enabling a system that BARELY works anyhow.

      The best solution looks to be the one Blowdart posted below:
      "Oh well, if we're recommending MS solutions on slashdot (ah karma suicide) then good old Windows Desktop Search works just as well. Since V4.0 came out you can have WDS on other machines, indexing away and it's the remote index that is queried - so no need for local machines to index remote shares. Plus, like sharepoint (spit) indexing, and Index Server before that it uses iFilters, so format aware indexing is available for most of the common formats a business uses."

      See, not necessarily biased against Microsoft. I'm just biased against good-for-nothing, useless, waste of time, expensive, over-hyped, retarded, cock-gurgling, card-tabled, takes-you-12-times-as-long-to-set-up-and-do-half-the-work-as-not-using-it software.

      Seriously, shame on anyone suggesting Sharepoint as a solution to this question! WTF!?!?!!!

  3. Google Enterprise Search by HeavyD14 · · Score: 5, Interesting

    Not that I've ever used it before, but it sounds like it does what you want: http://www.google.com/enterprise/search/gsa.html

    1. Re:Google Enterprise Search by xoundmind · · Score: 1

      It's exactly whats he wants. In a shared Windows environment, it beats the native Outlook search speed handily and covers my organization's shared drive. Actually, the search speed has saved my a few times in being able to reconstruct a problem and react accordingly.

    2. Re:Google Enterprise Search by lorenlal · · Score: 1

      ...it beats the native Outlook search speed handily...

      Honestly? I beat native Outlook search on a fairly regular basis. I cried tears of happiness and joy when Lookout hit the scene. I choked up a bit when Microsoft bought them out, but I recovered when it looked like they integrated the engine into the engine Microsoft uses to do Desktop search on XP.

    3. Re:Google Enterprise Search by mspohr · · Score: 0
      A few years ago I joined a huge organization that uses Outlook/Windows and was forced to come to terms with it. My experience...

      Outlook search is a complete waste of time.

      Lookout is much better and I used it for a while.

      However, I now use Google desktop search which is much better. (YMMV)

      --
      I don't read your sig. Why are you reading mine?
    4. Re:Google Enterprise Search by fast+turtle · · Score: 1

      I agree that Google's Desktop search works wonders on an XP/Vista box but I've discovered that the tweaks to the Win7 search actually seem to be damn solid. I've got a document archive that is 13GB with 300K plus files in it and Win7 was able to find more files (content search) with a specific term then Google was. Of course, this surprised the hell out of me that MS was actually better then Google on Win7.

      --
      Mod me up/Mod me down: I wont frown as I've no crown
  4. solution to hundreds of terabytes of docs by Anonymous Coward · · Score: 3, Interesting

    how about using a program like Documentum? We generate several thousand technical documents and drawing a month, and use it for all our document management needs.

    1. Re:solution to hundreds of terabytes of docs by mu51c10rd · · Score: 1

      I assume you mean the Verity part. Documentum can be a bit pricy for the whole suite of DiskXtender, AppXtender, and Verity, but it does work great for this purpose.

  5. Everything by OttoErotic · · Score: 2, Informative

    How about Everything (assuming the server is Windows & NTFS)? Works well for me (quickest desktop search I've found yet), and can either run locally or connect to an ETP server. The site seems to be down right now, but here's the original Lifehacker article where I found it. Incidentally, I never heard of ETP til I started using it. Anyone know if it's an Everything-specific protocol?

    --
    "Once in Hawaii I had sex with a 102 year old male turtle. It is difficult to argue that it was consensual." - Steve Ma
    1. Re:Everything by The+MAZZTer · · Score: 1

      Well it sounds like it is. I should note that Everything only indexes filenames, so if you want to index file CONTENTS you're out of luck (that sort of thing is GOING to take a long time anyway, since you have to read every file on disk that the indexer knows how to parse, so "quicker" could well translate to "less complete search index").

      But if you don't care about indexing contents then Everything should work fine for you.

    2. Re:Everything by stg · · Score: 1

      Everything only searches for file and folder names. While that is useful too, usually Search Engine presumes searching inside the files too.

    3. Re:Everything by Anonymous Coward · · Score: 0

      you know, there is a Program that uses Everything's search engine to search inside files. it is a lot slower though.

      http://www.donationcoder.com/Forums/bb/index.php?topic=18865.msg170962

    4. Re:Everything by atamido · · Score: 1

      I'm a big fan of Search Everything and use it at both home and at work to search through hundreds of thousands of files with no delay. However, it should be noted that it is not necessarily appropriate for larger environments where not everyone should know about all other files. Search Everything reads the NTFS metadata from the drive, but it ignores the file permissions. So if an HR rep has a file (Sally Jones' official reprimand.doc) in her private share, then it will turn up in the results for other users.

  6. A couple of options by Unhandled · · Score: 4, Informative

    Here's a few options you might want to consider: 1) Use Office SharePoint Server 2007 to index the share 2) Upgrade to Windows Server 2008 (or above) and Windows Vista (or above) and use the Federated search feature: http://trycatch.be/blogs/roggenk/archive/2007/11/05/windows-vista-amp-windows-server-2008-federated-search.aspx

    1. Re:A couple of options by shutdown+-p+now · · Score: 2, Insightful

      Here's a few options you might want to consider: 1) Use Office SharePoint Server 2007 to index the share

      First, MOSS isn't free.

      Second, have you ever actually tried using SharePoint 2007 text search feature? I dunno what it indexes, but finding anything in that afterwards is about as convenient as searching for a needle in a haystack.

      There have been claims of some huge improvements in the upcoming SP2010, which is not surprising in light of Bing, but that's not released yet.

    2. Re:A couple of options by cyber-dragon.net · · Score: 5, Funny

      You are on /. and actually recommending an upgrade to Vista?
      Brave man.

    3. Re:A couple of options by popeyecu · · Score: 3, Interesting

      I got your back. I love Federated Search, and so do my clients. It's way easier than any other solution, because it's in Windows and it "Just Works." Try it before you bash it, /.

    4. Re:A couple of options by Unhandled · · Score: 1

      I know I'm rowing against the current, but when something actually does the job, why not mention it, I think the point is to actually help someone resolve a problem here, but I could be wrong. Maybe I should add some MS bashing comments to blend in. :-p

    5. Re:A couple of options by Unhandled · · Score: 1

      It's interesting to see how people go to great lengths in this thread to avoid using features that are either built in Windows or come with a free add-on. I know MOSS isn't free, but it's a pretty decent collaboration and document management platform if you're looking into that. Depending on what you need, WSS might do the job as well and comes with Windows Server 2003 and above. /. is certainly entertaining!

    6. Re:A couple of options by Anonymous Coward · · Score: 0

      Well, here you read more than anywhere else about microsoft.. Almost as if this was a Microsoft site :rolleyes:

    7. Re:A couple of options by blowdart · · Score: 3, Informative

      Oh well, if we're recommending MS solutions on slashdot (ah karma suicide) then good old Windows Desktop Search works just as well. Since V4.0 came out you can have WDS on other machines, indexing away and it's the remote index that is queried - so no need for local machines to index remote shares. Plus, like sharepoint (spit) indexing, and Index Server before that it uses iFilters, so format aware indexing is available for most of the common formats a business uses.

    8. Re:A couple of options by Tomsk70 · · Score: 1

      Well, he *is* asking for a real-world solution, not a technically superior (but used by nobody) app that somehow allows the sysop a huge sense of superiority whilst removing them even furthur from the aforementioned real world.

      Am I alone in seeing a pattern here? :-)

    9. Re:A couple of options by MrNemesis · · Score: 1

      Coming from a company that made heavy use of sharepoint (2003 and 2007 - to be fair 2007 is aeons ahead of 2003 but inherited out projects' terrible design) I can heartily say that the index tool sucks, and the relevance engine is piss poor; no one could ever, ever find anything on the company intranet, and one of the helldesk's most idiotic tasks was keeping a bookmark of all the important sub-sites and documents on the intranet... and all because devs fell for the idiocy of the "don't bother organising it... it'll all be instantly SEARCHABLE... with LASERS!!!!!!!" spiel.

      It took a google mini box to make our sharepoint cluster anywhere near usuable. Helldesk calls dropped 15% the week the box went online.

      --
      Moderation Total: -1 Troll, +3 Goat
    10. Re:A couple of options by Anonymous Coward · · Score: 0

      It's not unfounded.

      Microsoft *really* does suck that bad.

    11. Re:A couple of options by FlyingBishop · · Score: 1

      Yeah, that sort of thing can get you downvoted. I certainly ain't brave enough to risk Slashdot karma.

    12. Re:A couple of options by atamido · · Score: 1

      WDS is also painfully slow and an insane resource hog. Unfortunately you need it to search emails in Outlook 2007 cached mode. I've also not seen it reliably query WDS installed on a server. And it will do random things like suddenly attempt to index Outlook Express for all users that have logged into a machine, even though none of the users have used OE, and policies have been set up to expressly not index OE.

      In a single user environment, the difference between Windows Desktop Search and Google Desktop Search is night and day. GDS indexes faster, uses less resources, indexes more things (websites you visit in Firefox?), and keeps cached versions of the indexed text. So with GDS you can see the text to a document you deleted 2 years ago, but was indexed then.

      I've never used Google's Enterprise Search box, but if it works anywhere as well as their desktop version, I'd recommend that.

  7. Everything (Search Engine) by dxdisaster · · Score: 2, Informative

    I guess it could work, although you can't index the files directly. You have to run a local copy and one on the server as an EPT Server. www.voidtools.com, although it seems to be down at the moment, so here's a link to the FAQ on Google's Cache: http://74.125.113.132/search?q=cache:fcYHcEJKH3UJ:www.voidtools.com/faq.php

  8. Federated Search by Anonymous Coward · · Score: 5, Informative

    MS does have a solution, it's called Windows Federated Search. Windows 7 with 2008R2 has it .. there might be a way to do with Windows Desktop Search 4.0. Here's some info on it - http://geekswithblogs.net/sdorman/archive/2009/05/14/windows-7-federated-search.aspx

    1. Re:Federated Search by Anonymous Coward · · Score: 0
    2. Re:Federated Search by endr · · Score: 1

      MS does have a solution, it's called Windows Federated Search. Windows 7 with 2008R2 has it .. there might be a way to do with Windows Desktop Search 4.0. Here's some info on it - http://geekswithblogs.net/sdorman/archive/2009/05/14/windows-7-federated-search.aspx

      You don't know what you're talking about, Federated Search has nothing to do with this; it's just a system for search plugins in Explorer. (link: http://msdn.microsoft.com/en-us/library/dd940456(VS.85).aspx )

    3. Re:Federated Search by icepick72 · · Score: 1

      But of course those search plugins in Explorer provide an answer to the question. If the built-in ones don't work, customized providers can be built because it's extensible. Where you think GP doesn't know what is being talked about, rather he does and your comment is short-sighted especially considering the link you gave provides this answer.

    4. Re:Federated Search by bhpaddock · · Score: 1

      Actually it's quite relevant. Windows 7 can federate queries to a SharePoint or Search Server index using OpenSearch.

      Also, Windows Vista and Win7 (and even XP with WS4 to some extent) can query remote Windows Search indexes. I use this functionality along with my Windows Home Server (running WS4) for my personal needs.

    5. Re:Federated Search by endr · · Score: 1

      That's the point, though, it's still running WS4 which is not going to change the speed of indexing at all, and neither is the existence of Federated Search. His indexing isn't running slow because of it and the existence of it will not speed indexing, so Anonymous Coward's answer is incorrect.

    6. Re:Federated Search by PalmKiller · · Score: 1

      It works in vista also, and windows server 2008, not that anyone would care about vista :).

  9. Sharepoint Services by anexkahn · · Score: 2

    If you have a windows server, you can tell Share point to index the file share. See: http://dotnetmafia.sys-con.com/node/1046930

    --
    Curious about Storage and Virtualization? Check out
  10. SSH and locate. by dov_0 · · Score: 1

    I use SSH to access my file server. Because I use it as a music server as well, I use X forwarding. As I'm accessing the actual server instead of just mounting fileshares (which I do also), I do the file searches directly on the server. Usually good old locate. Haven't really found anything that beats it yet, but then again I like the CLI. If you're running windows... Sorry.

    --
    sudo mount --milk --sugar /cup/tea /mouth /etc/init.d/relax start
    1. Re:SSH and locate. by Jesus_666 · · Score: 1

      In fact, shouldn't it be possible to mirror the locate database to the local file system so that local calls to locate will show the proper results on the share? Granted, you lose the ability to index the local file system but depending on the setup that might not actually be a loss.

      --
      USE HOT GRITS WITH STATUE OF NATALIE PORTMAN (NAKED AND PETRIFIED)
    2. Re:SSH and locate. by ls671 · · Score: 1

      Hmmm... locate doesn't allow you to search within files. What about using rgrep or grep -r ?

      find is great too (but slower on the first run before results get cached by the kernel, if you have enough spare memory) when you need to know which files have been modified in a given period of time, which files take more room on the disk, etc..

      I usually disable locate for security reasons, at least use slocate ! ;-)

      So I'd say I use find and rgrep ;-)

      --
      Everything I write is lies, read between the lines.
    3. Re:SSH and locate. by Anonymous Coward · · Score: 1, Informative

      Well, if you are mounting a networked filesystem (e.g. use sshfs), then there's no reason to mirror it locally. But even if you did, GNU locate and slocate understand $LOCATE_PATH as a list of databases, so you can use both..

      Of course, there's the whole mount-point issue, but either a hacked updatedb or a mount --bind lets you build the index with a fictitious prefix. e.g.:
      mntpt=/ssh-$HOSTNAME
      mkdir $mntpt
      mount --rbind /home/music $mntpt
      #for slocate:
      updatedb -U $mntpt -o $mntpt/slocate.db
      #not 100% sure on GNU updatedb syntax, but you get the point...
      umount /ssh-$HOSTNAME

      Now you can:
      scp music@host.net:slocate.db .musicdb
      export LOCATE_PATH=$LOCATE_PATH:$HOME/.musicdb
      locate mytune

      and receive results of the form
      . . .
      /usr/share/some/path/armytune.wav
      /ssh-host.net/myalbum/mytune.ogg
      . . .

      -- that is, local and remote results together. Or just make an alias for slocate -l0 -d $HOME/.musicdb, if you want searches only on the remote volume.

      p.s.

      WRT GP's music server -> X forwarding, I've come to the conclusion there's no one right way to deal with music indexing/databasing/serving/streaming/playing, but for my needs, I've always found mpd the right solution. Just thought I'd throw it out there...

    4. Re:SSH and locate. by hey · · Score: 1

      You could make a web interface to locate.
      (Only searches files names.)

  11. Enterprise Content Management with Alfresco by RicRoc · · Score: 5, Informative

    Yes, Google's Search Appliance (GSA) could be used, I have seen it used with limited success. The main problem was how to respect access control on documents: either you index them or you don't, and if you index them with GSA, sensitive data may show up in search results. Also, we had a lot of trouble "taming" GSA: it would regularly take down servers that were dimensioned for light loads.

    I would suggest using Alfresco http://www.alfresco.com/ as a CIFS (Common Internet File System) or WebDav store for all those documents. This would give you the simplicity of a shared folder and the opportunity to enrich the documents with searchable metadata such as tags, etc. Each folder (or any item, in fact) could have the correct access control that would be respected by the search engine, Lucene. http://lucene.apache.org/java/docs/

    Alfresco comes in both Enterprise and Community Edition, it's very easy to try out -- even our non-techie project manager could install it on his PC within 10 minutes. Try that with Documentum, FileNet or IBM DB2 Content Manager!

    --
    Who?
    1. Re:Enterprise Content Management with Alfresco by Tim+C · · Score: 1

      There is some stuff you can do with GSA to try to implement document security - you can set up separate collections/indexes (I forget the GSA term) for different parts of your document repository, then restrict the search results to specific indexes based on the logged-in user's credentials. (That's assuming you roll your own interface)

      Note that my one use of GSA was a couple of years ago, and we had an extremely simple security model with only 2 user types - one got access to everything, one got access to a simply-defined subset of the document hierarchy.

      I agree about it hammering on servers though, we had trouble with that too - specifically with the app servers returning error 500 response codes to the GSA while normal users could access the same pages just fine. Never did get that one ironed out properly (though it may have been an app server problem of course)

    2. Re:Enterprise Content Management with Alfresco by Anonymous Coward · · Score: 0

      Tosh.

      You can index documents in a variety of security contexts and have them faithfully filtered out when the results are rendered to screen. I've seen and deployed examples using NTLM and a custom security model layered on top of a SAML component.

      If you've got the money for a full enterprise search solution, the GSA is a great way to go.

    3. Re:Enterprise Content Management with Alfresco by spyrochaete · · Score: 1

      The GSA works perfectly well with many security standards (AD, NTLM, Kerberos, SAML, forms, cookies) and it obscures secure search results from users who do not have read permissions to those documents. It's probably easier to configure the GSA's security settings than with any other enterprise search platform.

      Furthermore, the GSA has a self-throttling feature called Host Load Scheduling which allows you to limit the number of connections opened per second. If that's not sufficient you can throttle the VLAN on your switch to limit the throughput in KB/s. Or you can stop crawling during the day and schedule it at night.

      Based on your comments it sounds to me like you haven't used the GSA in a long time. There's a lot of handy new features. Google has really been listening to its customers and configuration partners, and especially the larger ones.

  12. DTSearch by Anonymous Coward · · Score: 0

    DTSearch (http://www.dtsearch.com/). Although not free, you can install it on a server, schedule index updates, and have the client use the indexes (provided they are placed on a shared folder of the server).

  13. Mirror it. by palegray.net · · Score: 4, Funny

    You could just rsync the shared volume to a local drive as frequently as needed and run the search engine on the local copy.

    1. Re:Mirror it. by codepunk · · Score: 1

      So you would rsync hundreds of gig to a local disk to index it? I think I would rethink that strategy.

      --


      Got Code?
    2. Re:Mirror it. by quanticle · · Score: 1

      It seems that the parent wants to merge a remote index with his desktop search so that he doesn't have to do this. Also, wouldn't giving each desktop its own copy of the data defeat the purpose of having a shared server?

      --
      We all know what to do, but we don't know how to get re-elected once we have done it
    3. Re:Mirror it. by palegray.net · · Score: 1

      Whether it defeats the purpose or not depends entirely on the organization's needs. If querying data every few hours in a local app is the objective, that can be met quite effectively with mirroring. Disk space is cheap.

    4. Re:Mirror it. by palegray.net · · Score: 0

      I don't think you understand how rsync works. After the initial mirror, it only transfers changed files, which tends to be a really quick operation. For most organizations, that's going to be a hell of a lot less than the total disk usage on the shared volume.

    5. Re:Mirror it. by quanticle · · Score: 1

      Disk space is cheap when you're outfitting a single server. Outfitting even ten workstations with the same amount of disk can become quite expensive.

      --
      We all know what to do, but we don't know how to get re-elected once we have done it
    6. Re:Mirror it. by Trahloc · · Score: 1

      So I should rsync my 20TB server to my 320gb drive? What was the point of the 20TB file server then?

      --
      The Goal: A long simple life filled with many complex toys.
    7. Re:Mirror it. by palegray.net · · Score: 1

      Okay, so how much do you figure it will cost to outfit a server to be capable of supporting an arbitrary number of users, running extremely IO intestive reports on a shared volume whenever they feel like it, including the network infrastructure required to support this? Oh, and don't forget redundancy for the server.

      Trust me, I've learned from experience that the local disk space works out to be much cheaper for this sort of thing.

    8. Re:Mirror it. by Makoss · · Score: 5, Insightful

      Have you ever actually used rsyng on a decent sized file set? Determining the changed file set requires significant disk activity.

      It's a certain win when compared to just blindly transferring everything. But if you think that rsyncing 20 changed files in a 100 file working set is the same as rsyncing 20 changed files out of a 2,000,000 file working set you are very very wrong.

      Completely aside from the absolute insanity of suggesting that you replicate the full contents of the fileserver to every desktop, which has been covered by others.

      --
      Building a better backup.
      Zettabyte Storage
    9. Re:Mirror it. by cl0s · · Score: 1

      No matter how you put it, rsyncing a share on every persons computer that wants to search it, is not how you do it.

    10. Re:Mirror it. by adamchou · · Score: 1

      LOL, you've been modded funny because your idea seems so ridiculous yet after reading your comments, it becomes apparent you're being serious.

    11. Re:Mirror it. by Trahloc · · Score: 1

      Bingo, I guess Palegray has a much larger budget than most of us when it comes to storage. Afterall why have a single copy when you can have fifty.

      --
      The Goal: A long simple life filled with many complex toys.
    12. Re:Mirror it. by cbiltcliffe · · Score: 1

      Bingo, I guess Palegray has a much larger budget than most of us when it comes to storage. Afterall why have a single copy when you can have fifty.

      Well, at least that way you don't have to screw with tape drives......

      --
      "City hall" in German is "Rathaus" Kinda explains a few things......
    13. Re:Mirror it. by jabuzz · · Score: 1

      Yeah, but what if your file system is 70TB with 20 million files?

    14. Re:Mirror it. by Saba · · Score: 2, Funny

      File modification date.

    15. Re:Mirror it. by L4t3r4lu5 · · Score: 1

      Or checksum database.

      --
      Finally had enough. Come see us over at https://soylentnews.org/
    16. Re:Mirror it. by kriston · · Score: 1

      I nominate this comment for Best of Slashdot.
      I actually had to look at the moderation score to get the joke.

      Kriston

      --

      Kriston

    17. Re:Mirror it. by Makoss · · Score: 1

      File modification date.

      Yes, modification time and file size are the two things that rsync checks by default.

      Now, while reflecting on the funny mod someone gave you, what operations need to be done to retrieve that information for each file? And furthermore, how does that number of operations scale with regard to the number of files in your working set?

      Not that it matters in this case anyhow. Moving the problem to the desktops would multiply the original problem, not resolve it.

      --
      Building a better backup.
      Zettabyte Storage
    18. Re:Mirror it. by jedwidz · · Score: 1

      In all seriousness, this is basically what I'm doing - but only for a moderate amount of data (< 5GB) that's specifically under my control.

      Pros:
      - searching is fast and centralized (i.e. one search covers the lot)
      - content is accessible at local disk speeds
      - network or file server outages aren't a big deal
      - 'local' files are included in file server backups

      The software I'm using is:

      sync: SyncBackSE
      search: Copernic Desktop Search Professional

      My resyncs are scheduled nightly. I'd prefer to have local changes syncing live to the network shares (and not vice-versa), but haven't found sync software that does this efficiently on Windows.

      The only major downside is the mess when I have to use the network shares directly, before my local changes are synced across. To work around this I'll sometimes edit the remote copy directly, or kick off a resync.

    19. Re:Mirror it. by palegray.net · · Score: 1

      Sounds like a pretty decent setup. If you hadn't noticed, I caught a lot of flack in responses to the parent comment, mostly from people who appear to have no clue what they're talking about.

  14. Why even ask? by h4rr4r · · Score: 0

    slocate

    1. Re:Why even ask? by quanticle · · Score: 1

      Well, that runs into the problem the OP has discussed. If the data is present as a network share, it'd take slocate forever to index the data on the remote server. Basically, he or she wants a way to run slocate once on the server and have that index file be merged with all of the individual desktops. That way, each desktop wouldn't have to go through the effort of duplicating work.

      --
      We all know what to do, but we don't know how to get re-elected once we have done it
    2. Re:Why even ask? by h4rr4r · · Score: 1

      So use ssh and run slocate on the server, or share out the slocate.db file.

    3. Re:Why even ask? by quanticle · · Score: 1

      Yeah, that could work, but I don't think it'd be as seamless as the OP wants. The user would still have to select which db file to use. Still, its a solution.

      --
      We all know what to do, but we don't know how to get re-elected once we have done it
    4. Re:Why even ask? by h4rr4r · · Score: 1

      or here is a real smart idea:

      In each users path place a locate_on_server.sh script that just runs "locate -d $PATHTODBONSERVER".

      -d can take multiple database filename arguments, so you could have one locate_on_server that searches all your fileservers.

    5. Re:Why even ask? by Trahloc · · Score: 1

      Well a problem with slocate is that it doesn't track changes live. Its basically a prettier version of a find dump into txt file then grep it. Something that tracks files live on the server end which can be searched remotely. Heck even a web interface or ssh would fit my needs, it doesn't need a pretty popup window thing.

      --
      The Goal: A long simple life filled with many complex toys.
    6. Re:Why even ask? by zippthorne · · Score: 1

      Heck, you could skip the script and just alias locate

      But you still run into the problem that it runs from the command line and the database is byte-order dependent.

      --
      Can you be Even More Awesome?!
  15. the god-awful truth by Anonymous Coward · · Score: 0

    How do you integrate your desktop search application with your remote file server without forcing each desktop to index the hundred gigabyte volume on its own?'

    Really? I ssh to the fileserver, and then do something like

    find . -name "*.php*" -print | xargs grep damnfunctionname

    1. Re:the god-awful truth by simcop2387 · · Score: 1

      you might be interested in ack; http://search.cpan.org/~petdance/ack-1.90/ack-base ; it's written in perl, but it acts like grep for source code (it supports lots of languages); your command there would just be;

      ack damnfunctionname

      thats it, it'll print out the files and lines that match.

    2. Re:the god-awful truth by Anonymous Coward · · Score: 1, Informative

      Don't you mean,

      grep damnfunctionname -R . --include='*.php*'

      I guess if you're skipping perfectly cromulent indexing servers, you might as well needlessly break out the pipes, too.

    3. Re:the god-awful truth by beelsebob · · Score: 1

      And then you sit and wait for ages for find to finish, and then you realise that it only searches in the file names, and not the contents of the file. Of course, what I do, is ssh in, and then use mdfind, but yeh, find doesn't cut it on multi-terrabyte volumes, and especially not when you want to search on more than just the name.

    4. Re:the god-awful truth by kabloom · · Score: 1

      Wrong! The "grep" part of that command searches the contents of the files.
      But if you think you can get away with just grep on large amounts of data, you really ought to learn something about how indexing works and how much faster it can make your searches.

    5. Re:the god-awful truth by Anonymous Coward · · Score: 0

      And then you man xargs, and maybe man grep, and realize that find is indeed being used to search the filenames for *.php*, and letting grep search the contents for damnfunctionname.

      Anyway, any decent locate implementation should work fine for the file names, modulo a few details about mount-points. For searching in files? Good question, but I very rarely need to search inside more than a hundred files, so locate/grep works well enough for me, with find on the rare occasions I need fs metadata...

    6. Re:the god-awful truth by peter · · Score: 2, Informative

      FYI, GNU find has xargs built in these days:

      find -name '*.php*' -exec grep func {} +

      the + instead of ';' makes it collect up multiple arguments to grep
      like xargs instead of the traditional find -exec behaviour which is like xargs -n1. I use -exec {} + all the time, because it's less typing, and safe with
      filenames with punctuation or whitespace, so you don't have to type -print0 | xargs -0 either. (BTW, if you have a list of filenames that you processes with something line oriented, you can use xargs -d'\n')

      --
      #define X(x,y) x##y
      Peter Cordes ; e-mail: X(peter@cordes , .ca)
    7. Re:the god-awful truth by rrohbeck · · Score: 1

      ssh user@fileserver 'cd /shared/myproject;wcfind . -name _\* -prune -o -type f -! -name \*.o -a -! -name \*.a -a -! -name \*.so\* -a -! -name \*.d -print | while read a; do ( file "$a" | fgrep -q 'text' ) &>/dev/null && fgrep -H "" "$a"; done | gzip -c9' >~/index/myproject.gz

    8. Re:the god-awful truth by rrohbeck · · Score: 1

      Oops, forgot to add

      ... and then
      zfgrep goddamnfunctionname ~/index/myproject.gz|less
      That's actually quite quick, for a couple MLoC.

  16. How about Spotlight? That works on shared volumes. by thedbp · · Score: 3, Informative

    *ducks*

  17. Use Microsoft Indexing Service by Icono · · Score: 2

    One way is to set up Microsoft Indexing Service on the server with the shared drive. The MSC console app provides a search capability and one can also use the Indexing Service SDK for client apps.

  18. Hmm by ShooterNeo · · Score: 1

    Basically, you need your desktop search application to look at the index file on the remote file server generated by an instance of the application running on the file server. Technically, incredibly simple but I don't know which application currently available is divided into front and back ends like that. Maybe open source...

    1. Re:Hmm by spyrochaete · · Score: 1

      That's not really a "desktop search engine" anymore, but I agree with your suggestion. Desktop search is not the right tool to index shared content, lest you get a department full of PCs constantly indexing the same repository over slow ethernet wires. Microsoft Enterprise Search Server lets you separate the indexing and serving servers, but unfortunately the free Express version doesn't support this.

  19. Use MSS 2008 Express, SharePoint, FAST by VTBlue · · Score: 3, Interesting

    Use Microsoft Search Server 2008 Express...its free, all you need is a free server box. Also Check out SharePoint Search and FAST enterprise search.
    http://www.microsoft.com/enterprisesearch

  20. Locate32 by EvilXenu · · Score: 1

    It really depends on what you are looking for. Are you wanting to index file names or do you want names and contents? For me, I typically know what I'm looking for based on file name, so Locate32 works out great. It's the Windows equivalent to 'slocate'.

    1. Re:Locate32 by Anonymous Coward · · Score: 0

      Locate32 works great for me too.

  21. Two words... by roc97007 · · Score: 1

    Google appliance...

    --
    Oliver's law of assumed responsibility: If you're seen fixing it, you will be blamed for breaking it.
  22. Your answer in song by Anonymous Coward · · Score: 0

    dtsearch - google it

  23. Re:How about Spotlight? That works on shared volum by Anonymous Coward · · Score: 0

    Final Cut Server, CatDB, Mediabeacon and a number of other asset cataloging management tools could do the job and offload it in a reasonable sensible way .

  24. Autonomy by Anonymous Coward · · Score: 0

    Autonomy search. Check them out. One of the best in the world. Obviously an enterprise solution and not inexpensive.

  25. Lucene is a great foundation for this by sribe · · Score: 1

    So I think I'd start by looking here.

    1. Re:Lucene is a great foundation for this by Thundersnatch · · Score: 1

      Lucene has a horrifying tendecy to corrupt it's indexes frequently. And it doesn't do what the OP wants in any case, as it doesn't understand file formats or respect native OS permissions. Even the Lucene- based Nutch is not a good fit.

  26. ISYS by indulgenc · · Score: 1

    ISYS Search Software (http://www.isys-search.com/) has a variety of enterprise search applications. Web based search interface or a local client depending on your needs.

    -i

  27. Cross-Platform by pinkocommie · · Score: 1

    Wondering if there's anything cross-platform. I'm in the process of setting up an OpenSolaris fileserver (primarily to use ZFS/Raid-Z) and have both linux and windows boxes. It would be great to be able to have an index on each that could be read by a client app or a unified index perhaps.

    1. Re:Cross-Platform by spyrochaete · · Score: 1

      You want an enterprise search service. Microsoft and Google both make web-driven search solutions which work on any end user platform including mobile. There are many other companies that have web-based search services as well but those are the two goliaths.

    2. Re:Cross-Platform by Anonymous Coward · · Score: 0

      Yeah I'm gonna have the same setup and I'd seriously be interested in making that feature available

  28. Re:How about Spotlight? That works on shared volum by Henriok · · Score: 3, Informative

    Yeah, my thought exactly? I wasn't aware that it was a problem searching hundreds of gigabytes on shared volumes. We have a couple of terabytes shared by our Mac servers and I don't think I've had search times longer than ten seconds over a couple of million files.. MS Office files, PDFs, movies, audio, pictures, photographs, text, HTML, source code.. all indexed with metadata and contents.

    Even the days before Spotlight, using AppleShare IP Servers in the 90s, finding stuff on the servers was never an issue. It has always been so fast that I have never even reflected over that it was fast. Maybe I should use some other operating system once in a while to experience what the majority experiences. Or not.. I'd rather stay care free and productive.

    Don't call me when you figure this out.

    --

    - Henrik

    - when the Shadows descend -
  29. BlackBall by Anonymous Coward · · Score: 0

    www.blackball.com

    They do federated indexing/searching without having to import data. Scans the data where it resides...

  30. The XP search assistant dog by Anonymous Coward · · Score: 0

    The dog. Always helps me sniff ouut the files.

    1. Re:The XP search assistant dog by larry+bagina · · Score: 1

      Also useful if you accidentally smear peanut butter all over your balls.

      --
      Do you even lift?

      These aren't the 'roids you're looking for.

    2. Re:The XP search assistant dog by ubrgeek · · Score: 2, Funny

      I see you are trying to write a funny post. Would you like me to help with that?

      --
      Bark less. Wag more.
  31. NO! Try Alfresco by thule · · Score: 4, Informative

    SharePoint is $$$$. Try Alfresco. Alfresco can look like a file share (support SMB, DAV, FTP, etc). The indexing is built is and does not require a separate SQL Server license.

  32. Re:How about Spotlight? That works on shared volum by Anonymous Coward · · Score: 1, Funny

    As usual, Apple and Closed Source to the rescue! Is there anything open sores can do that comes even CLOSE?

  33. RDP + CTRL-F by isochroma · · Score: 0

    RDP into the machine, then CTRL-F on the volume, which is now local. Don't bother with indexing service anyway, it just wastes time. Life's faster without it.

  34. Windows 7? by craenor · · Score: 1

    I am probably missing something obvious here or misunderstanding the question, however, I am very happy with the search integrated in Windows 7. I have about a terabyte of data across two different volumes, and when I use the regular Windows 7 search I get instant, detailed results.

    1. Re:Windows 7? by Anonymous Coward · · Score: 0

      Apart from the first time when it has to index the directory. And the second time, when it forgets that it's already indexed it and does it again.

  35. I do it all the time by krunchyfrog · · Score: 0

    updatedb
    locate [bleh]

    It's easy, really.

    --
    printf($randomline(sigs.txt) \n "-- "$randomline(authors.txt));
    -- myself
  36. Re:NO! Try Alfresco by Orion+Blastar · · Score: 4, Informative

    You mean the Document Management Alfresco and not the CMS software. The Community Edition is free but unsupported, and the Enterprise edition has a free 30 day trial. It looks like it won a government award for document management which is rare for open source document management software.

    --
    Remember, Slashdot does not have a -1 disagree moderation, and no, troll, flamebait, and overrated are not substitutes.
  37. PC Docs Docs Open by Orion+Blastar · · Score: 1

    Docs Open is a commercial document management system but right now their web page doesn't seem to be working. We used it at a law firm I worked at. IIRC it was able to search through the billions of documents that the 300+ lawyers used in their cases.

    --
    Remember, Slashdot does not have a -1 disagree moderation, and no, troll, flamebait, and overrated are not substitutes.
  38. You don't by saleenS281 · · Score: 2, Interesting

    You don't allow every client to index. There's been several suggestions already, but most enterprises intentionally DISABLE desktop search. It absolutely slaughters the share. It's not a big deal when one user is doing it... but when 5,000 are, the I/O load becomes unsustainable.

    1. Re:You don't by amRadioHed · · Score: 1

      You seem to be missing the point. He already knows having clients index the shares is a problem, that's why he's asking how to avoid that. Disabling searching altogether is not a solution because obviously he wants to be able to search.

      --
      We hope your rules and wisdom choke you / Now we are one in everlasting peace
    2. Re:You don't by Anonymous Coward · · Score: 0

      Simple Approach:
      Use a good naming convention and do not index. I do not know what you are indexing on or what is in the files. However, you could pick an appropriate naming convention and folder structure and then let the human brain find the right thing.

      Slightly More Complex Approach:
      You do not have a share anymore. Instead you have a web page with a database backend. Now fast forward that line of thought. Several commercial products do just this and even offer additional features such as desktop integration. One example is Livelink.

    3. Re:You don't by griffinme · · Score: 1

      This became a huge problem here after we started to upgrade to dual cpu machines. On the old machines it would only index when the computer wasn't doing anything which really wasn't that often. With a dual cpu setup one cpu is usually doing nothing so desktop search was indexing _all_ the time on just a few machines in one office. The office kept complaining about their network connection running at a crawl. The few machines indexing the share were conflicting with each other. "Hey somebody looked at this file, we better re-index it." They created a vicious circle of indexing the same files over and over.

      --
      Is he strong? Listen bud, He's got radioactive blood.
  39. Your problem is obvious by Locke2005 · · Score: 1

    If I try to use a Windows desktop search engine

    --
    I've abandoned my search for truth; now I'm just looking for some useful delusions.
  40. Try Earth by theritz · · Score: 2, Interesting

    "Earth allows you to find files across a large network of machines and track disk usage in real time. It consists of a daemon that indexes file systems in real time and reports all the changes back to a central database. This can then be queried through a simple, yet powerful, web interface. Think of it like Spotlight or Beagle but operating system independent with a central database for multiple machines with a web application that allows novel ways of exploring your data." http://open.rsp.com.au/projects/earth

    1. Re:Try Earth by drinkypoo · · Score: 1

      Earth requires a single database of search results. This is stupid because it creates an unnecessary single point of failure. When I execute a search, it should query all the remote machines for their own databases. This creates a few more ACKs but otherwise puts little more load on the system. It would be nice to have the OPTION to store the catalogs on a repository, but being forced to means that I need literally another server on my network; I don't have anything running 24/7 right now that is capable of doing that job. I don't even have room for MySQL on my only 24/7 server, and would have liked to have kept it that way.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  41. Re:How about Spotlight? That works on shared volum by Anonymous Coward · · Score: 0

    While the parent's response is rather snide, it nonetheless highlights an important truth. Granted there was a many-year gap where Spotlight didn't have server integration like good old Classic Mac OS, but it does now as of Leopard.

    To be honest I assumed Windows would have the same thing already, given how obvious it is. Why don't they ever copy the good bits? :P

  42. Re:How about Spotlight? That works on shared volum by RancidPickle · · Score: 2, Funny

    I've tried ducks, but they tend to nibble the occasional one or zero, and they leave an awful mess on the platters when they poop. Try Spotlight instead -- not as cute, but easier on the data, hardware, and the nose.

    --
    "First things first, but not necessarily in that order."
    - Doctor Who
  43. Desktop search is not the way to go by vmxeo · · Score: 1

    Seriously. You're probably going to want a separate server(s) for this job. You didn't specifiy what you're indexing, how often, or where, however I'll make some assumptions and point you towards an enterprise search appliance or product. Many will probably point you to Google Enterprise Search. I've worked with the search functionality withing Microsoft Sharepoint 2007, and it's (ostensibly) free spin-off, Microsoft Search Server. Again, you'll probably need to dedicate some hardware to this. In addition to crawling all the content, the search product will also need to index and present it to the user. This requires a front-end crawling role, back-end indexing role, and a database to keep all the data in. Dealing with several hundred gigs mean you'll want to have separate servers for all 3 (again, basing this off of my knowledge of MS products. YMMV). The nice part is that your users will work through a webpage, and the workstation won't be tied up doing any crunching of it's own.

    Try starting here: http://www.google.com/enterprise/

    or here: http://www.microsoft.com/enterprisesearch/en/us/search-server-express.aspx

  44. we have a winner! by Anonymous Coward · · Score: 0

    Everything is EVERYTHING you could want

    1. it is blazingly fast indexing drives
    2. it is truly instant searching
    3. it can be run in client/server mode

  45. www.X1.com by Anonymous Coward · · Score: 0

    Haven't used it in a couple of years since they went away from their free model, but X1 (www.x1.com) rocked as a desktop search engine. They have federated search and plugins for a variety of server apps (sharepoint, etc).

  46. Re:How about Spotlight? That works on shared volum by Anonymous Coward · · Score: 0

    spotlight doesn't work on my ntfs volumes.
    or am i doing something wrong?

  47. Share searching. by Timmmm · · Score: 1

    I wrote a web site/spider to do this for the whole network at uni. It was beautiful C++ all the way. After I left some silly CS people rewrote it in Python/PHP (ugh) here: http://code.google.com/p/trufflepig/

  48. Re:How about Spotlight? That works on shared volum by Anonymous Coward · · Score: 0

    No you're not seeing anything wrong, you were just distracted by him ducking.

  49. Portfolio Server (Digital Asset Management) by mrnutz · · Score: 2, Informative

    (Disclaimer: I work for Extensis)

    Portfolio Server can continuously index files on SMB/CIFS (and AFP) volumes using a feature called "AutoSync". Web and Desktop (Windows/Mac) clients then search by folder name, file name, document text, or other metadata. Indexing and thumbnail creation takes place on the server, so clients are relieved of any cataloging workload and metadata is centralized.

    http://www.extensis.com/en/products/portfolioserver9/overview.jsp

    1. Re:Portfolio Server (Digital Asset Management) by Anonymous Coward · · Score: 0

      I'd rather look through a pile of floppies from sneakernet than use anything by Extensis.

  50. Look at e-discovery products by Anonymous Coward · · Score: 0

    One of the products in this category will probably meet your needs.

  51. Re:NO! Try Alfresco by Anonymous Coward · · Score: 0

    Sharepoint is $$$$ if you want search across portals. If you have a single "portal" (consisting of multiple "webs"), Sharepoint comes with Server 2003 and above and is included in the price. There is a difference between Sharepoint Portal Server and Sharepoint Portal Services. Portal Server serves multiple portals and is not free.

  52. Use Windows Indexing Service by a.koepke · · Score: 1

    I am just embarking on a project to do exactly what the OP is asking for. Windows Server 2003 has an indexing service you can setup. http://www.windowsnetworking.com/articles_tutorials/Working-With-Windows-Server-2003-Indexing-Service.html It is limited in its own form but provides the back-end tools you need.

    Combine that with the next article from that site and you have a solution: http://www.windowsnetworking.com/articles_tutorials/Making-Windows-Server-2003-Indexing-Service-Useful.html

    This article shows you how to use the Indexing service from an ASP script. The solution I am working on will be done in PHP as it can also link to COM applications. This basically allows you to put a file search tool on your Intranet which is indexed and returns the results very quickly. Best of all, it uses existing software on Windows and doesn't cost any extra.

    --


    (\(\
    (^.^)
    (")")
    *This is the cute bunny virus, please copy this into your sig so it can spread
    1. Re:Use Windows Indexing Service by bhpaddock · · Score: 3, Informative

      For indexing files, you're better off using Windows Search 4, a free download for Windows Server 2003. The old content indexing service is deprecated and a much older technology. It's useful in some particular scenarios but for a smaller (100,000 - 250,000 items*) corpus of file content, WS4 will work much better. And for larger repositories, SharePoint and Microsoft Search Server are almost always better options.

      * = Server 2008 R2 / Win7 has a newer version of the Windows Search indexer that scales better to even larger corpuses.

    2. Re:Use Windows Indexing Service by evel+aka+matt · · Score: 1

      Heh, you said "corpus."

  53. No simple options by benjamindees · · Score: 1

    Since this is a task that benefits from some optimization, there are so many different combinations of file servers/clients out there, and so many use cases to choose from, there are lots of different solutions but not many good ones that will do exactly what you want out of the box.

    So, in order to narrow it down, you need to decide exactly what you're looking for. What server are you using? What clients do you need to support? Are you wanting to just search file names, or contents, ownership and modification times as well? Do you need the index to be completely up-to-date, or not? How long can you stand to wait for results?

    --
    "I assumed blithely that there were no elves out there in the darkness"
  54. Try Xapian Omega by Sarusa · · Score: 1

    I had this same problem not too long ago - we have a shared documentation tree with tens of thousands of documents that I wanted to index. I tried dozens of search engines in my spare time, most of which were just horrible (Beagle), were a nightmare to install for someone like me who's not a full or even part time admin (Apache SOLR), wouldn't allow cross platform access (lots of Windows ones, obviously), store a complete separate copy of every document (Alfresco, which didn't seem to have an option to ) and especially ones that had trouble indexing pdfs and MS Office docs (which we have a lot of). I'm not the IT guy, and have no budget for this, so Google Appliance was right out.

    What I eventually ended up with is Omega on top of Xapian - http://xapian.org/ - it's not too hairy to install, indexes pretty fast, points back to the original files (so it doesn't duplicate everything), and can handle multiple repositories. It will also detect dups and not show them twice, though similar files are treated as completely different (which is probably what you want in the absence of something more sophisticated).

    Two downsides: It can't do incremental update (unless that's changed recently) so you have to rebuild the entire index nightly. And the search is really sparse and ugly, which turned off some of my users, but you can rewrite the templates if you want to.

    1. Re:Try Xapian Omega by socsoc · · Score: 1

      If you weren't the IT guy, why were you doing this?

    2. Re:Try Xapian Omega by Sarusa · · Score: 1

      Good question. I'm doing it for my project team, being the lead software guy. The IT dept. is a small team and kept busy with other stuff - but they gave us hardware for a team Linux server when I asked for it and let me admin it, so I'm just grateful that they're helpful when so many would be obstructionist.

      Rambling: I've also installed a wiki, which is getting rave reviews from the people using it; hoping to make that popular enough that IT will /want/ to take it over.

    3. Re:Try Xapian Omega by socsoc · · Score: 1

      Okay, great answer.

  55. grep and locate by gzipped_tar · · Score: 0, Troll

    If you can't grep or locate, go fsck yourself ;)

    --
    Colorless green Cthulhu waits dreaming furiously.
  56. SPLUNK by Anonymous Coward · · Score: 0

    You can use Splunk if your files are heterogeneous. It is fast and easy to set up. It is pretty good for doing relatively advanced searches against tons of data.
    http://www.splunk.com/

    The Google GSA is quite good, too, and better for non-technical users.

    1. Re:SPLUNK by Anonymous Coward · · Score: 0

      Make that homogeneous. My bad.

  57. There's also an alternative to GSA and Sharepoint by binaryspiral · · Score: 1

    From IBM and Yahoo called OmniFind. It runs on a desktop or server and can index multiple shares... and the basic version is free but offers a lot of functionality.

    Although if your business is booming, a GSA is freakin' sweet.

  58. 3 steps by Sun.Jedi · · Score: 0, Redundant

    1. uninstall windows
    2. install linux
    3. run locate

  59. Sharepoint? by toastar · · Score: 1

    Sounds like a a job for Sharepoint!

  60. ISYS by Anonymous Coward · · Score: 0

    we use ISYS at work.
    and its danged handy. does large scale shared drives very well.

    http://www.isys-search.com/

  61. Use Windows Search 4.0 by timestride · · Score: 1

    If the share is indexed, Windows Search 4.0 will query the index on the remote server. Here is a link to the admin documentation with more information: http://technet.microsoft.com/en-us/library/cc772446(WS.10).aspx

  62. Depends on the server, doesn't it? by GNU(slash)Nickname · · Score: 1

    Novell's QuickFinder (http://www.novell.com/products/openenterpriseserver/quickfinder.html) works well in a Netware or OES2 environment. It even respects file permissions when displaying results.

  63. Seriously, ask slashdot? by GeckoAddict · · Score: 1

    There are a number of decent options, which one to pick depends on specific requirements not included in the original question. Did the OP even search?

    http://en.wikipedia.org/wiki/List_of_enterprise_search_vendors

    1. Re:Seriously, ask slashdot? by cbiltcliffe · · Score: 2, Insightful

      Maybe search, but didn't know what to search for.

      If this is a guy who's used to doing home and small business support, with a handful of machines at best, and kind of got thrown into the deep end by management because he's "good with them thar cmpooterz" then he may not be thinking "enterprise" search.
      After all, why would anybody need a search engine to find a starship?

      Probably searched for "shared drive search engine" or something like that.

      --
      "City hall" in German is "Rathaus" Kinda explains a few things......
  64. Microsoft Search Server 2008 Express by Icyfire0573 · · Score: 1

    Seriously? Microsoft offers a free piece of software that runs on the server that does "exactly" what he needs and we are suggesting Sharepoint/Alfresco, grep/locate and Google Search Server?
    http://www.microsoft.com/enterprisesearch/en/us/search-server-express.aspx

    You don't have to like them, but you should consider that they want to steal the market everywhere enough to give away decent software to get their foot in the door.

  65. Google Search Appliance is the way by wizact · · Score: 1

    GSA is the best way for enterprise search! It crawls, index and servers data. About a concern one of guys reaised above about access control. It authenticate you against the current authentication server (NTLM, Kerberos, LDAP, etc) and only shows you results that you are eligible to see that. Oh wait. There is one hiccup there. Licensing sucks. It is based on number of documents it crawls and after 2 or 3 years you should take back your appliance and buy a new one!

    --
    Our progress as a nation can be no swifter than our progress in education. The human mind is our fundamental resource.
  66. What are you trying to Index? by sabre307 · · Score: 1

    Are you trying to index all files, or just documents, or what? If you are trying to cheap out on indexing documents, I highly recommend Alfresco

    --
    My software never has bugs.
    It just develops random features.
  67. Re:How about Spotlight? That works on shared volum by kuzb · · Score: 2, Interesting

    Except then you have another terrible search solution which isn't meant for the amount of data you'd find on a large server. Worse, you have an operating system that is terrible as a server solution.

    On the other hand, you could just use a unix/linux distro of your choice, and beagle (http://beagle-project.org) - which is meant for indexing large amounts of data and has many clients some of which can remotely access it.

    --
    BeauHD. Worst editor since kdawson.
  68. X1 might be the answer by Anonymous Coward · · Score: 0

    I use X1 locally to index my hard drive, network drives, email attachments, etc. I love the search-as-you type functionality, unobtrusive indexing, and previews. They have an enterprise version available that might meet your needs. http://www.x1.com

  69. never mind by Anonymous Coward · · Score: 0

    i remembered how to use google...

  70. Surely you could just... by Anonymous Coward · · Score: 0

    For Linux/Unix:
    1. Mount the CIFS share/s on a linux/unix box.
    2. run updatedb
    3. use the locate/slocate command to search OR use places->search in gnome?

    For Windows:
    1. Install cygwin+updatedb+locate/slocate
    2. do as above

    Some Other Options:
    Femfind: http://femfind.sourceforge.net/
    pySMBsearch: http://sourceforge.net/projects/pysmbsearch/
    LAN-Crawler: http://code.google.com/p/lan-crawler/

  71. ffsearch by Psykar · · Score: 1

    Used this a while back to index all the samba shares at my res college, works great, and is accessible from a browser by other hosts on the LAN.

    ffsearch.sourceforge.net

  72. Microsoft / Windows Search options by bhpaddock · · Score: 1

    Microsoft has a few solutions you can consider depending on your specific needs.

    With Windows XP/2003, Vista/2008, or Windows 7 - you can install Windows Search 4 (not necessary on Win7, but recommended for Vista) on the server side to index the content, and then if you have WS4 (or Win7) on the client, it will automatically query the remote index when you perform searches against that file share.

    Alternatively, if you run the free Microsoft Search Server (the Express version is free) which is based on SharePoint, you can index files on the server and then set up a Federated Search connector in Windows 7. Windows 7 supports federating to OpenSearch + RSS/Atom enabled sources, and SharePoint / Search Server support this. On current versions there's a bit of manual work to create the right OpenSearch description file, but it's pretty easy. The upcoming 2010 SharePoint version provides those out of the box (as well as some additional enhancements supported by Windows 7).

    I'm actually the developer who built the OpenSearch feature in Windows 7, so if you have questions about the search options in Windows 7, you can visit my blog (brandonlive.com) and/or e-mail me (via my site).

    Hope that helps.

  73. X1 by Anonymous Coward · · Score: 0

    X1 is by far the most robust and accurate client I've used. It's not free but it's a great piece of software. There's even a server application that will allow you to search throughout your entire organization.

  74. How about... by KermitJunior · · Score: 1

    #updatedb

    --
    There is a Universal Life Value Check it
  75. Locate32 for Windows servers by kaffesumpen · · Score: 1

    I have used locate32 on my Windows XP machine for indexing the local and shared drives. My shared volumes server runs Linux, so I tried utilizing its locate-database. Unfortunately I came to the conclusion that locate32 and the Unix/Linux variant of locate are not compatible at all. However, if you are using a Windows server then you can run locate32 on the server and allow the clients machines to access it.

    --
    Do not meddle in the affairs of dragons, for you are crunchy and taste good with catsup.
  76. xfriend can do this. by yooy · · Score: 1

    I use xfriend personal (20 US). For your problem you would need xfriend business: http://www.xfriend.de/de/business/loesungen/

  77. x1 by blokje00 · · Score: 1

    try www.x1.com I think it will solve all your issues

  78. Re:NO! Try Alfresco by thule · · Score: 2, Informative

    My understanding is that you only get the full document text search when the data is backed by a real SQLServer license. The person was looking for a full search solution. This is built into Alfresco.
    SQLServer is per CAL even though the app is a web app.

  79. Microsoft Search Server Express, it's free! by dpm67 · · Score: 1

    The free microsoft search server express does exactly that. Plus it is very extensable if you want to write some code for it. Free download at:
    http://www.microsoft.com/enterprisesearch

  80. Some ideas by nielo · · Score: 1

    Under linux/unix:
    1. mount CIFS share
    2. run updatedb
    3. run locate/slocate

    Under Windows:
    1. Do the above with cygwin
    Other Options:
    Locate32
    Femfind
    Lan Finder

  81. We use mnogosearch to index about 700GB by oernii · · Score: 1

    We use mnogosearch to index about 700GB: search results are on our intranet, but mnogosearch can return an XML I belive, to integrating into some king of web-service should be easy. PS: access restrictions are a little problem though. We now have to implement our own search on mnogosearch which checks permissions first.

  82. Zoom Search is another option - But don't crawl by PassMark · · Score: 1

    We are the developers of the Zoom search engine.
    http://www.wrensoft.com/zoom/

    We have spent some time recently looking at the problem if indexing large amounts of data, for see,
    http://www.wrensoft.com/zoom/support/faq_large_sites.html

    Many people above have recommended using external appliances, or external hardware. This doesn't make sense in our opinion. Using an external indexer that crawls your files means that 1) You are loading up your network, 2) You are limited to network bandwidth speeds (rather than SATA or SCSI data transfer speeds) 3) You have the overhead of the HTTP protocol.

    What makes sense is to run the indexer on the server that is hosting the files and index them directly off the disk. Don't spider them, and don't do it across a network. This can save you many days of indexing time.

    But with this much data, I don't think there is any really quick solution. Whatever you decide to do is going to take some setup effort.

  83. IBM OmniFind by Anonymous Coward · · Score: 0

    IBM OmniFind Yahoo Edition.

    No charge (unless you want support). Install it on your shared fileserver, it will index the files on the server and provide a web interface for searching it.

    It does need a reasonable server, though, if you have a lot of documents.

    1. Re:IBM OmniFind by Anonymous Coward · · Score: 0

      nothing but a packaged version of Apache Lucene.

      Ahh... no... OmniFind Enterprise Edition (the version you pay for) is NOT based on Lucene. So, maybe you should keep your mouth shut if you don't know what you are talking about.

      To the OP, if you want an enterprise class secure search engine that will search across
      - filesystems
      - websites
      - nntp server
      - Content repositories like FileNet, Documentum, Stellant, Quickr, Sharepoint
      - Portal servers
      - UIMA plugins
      - Messaging servers (Exchange & Domino/Notes)
      - and APIs for you to crawl any third party system (SAP, Siebel, anything etc)
      then OmniFind Enterprise Edition is the way to go.

      And IBM has a half price starter edition for smaller implementations.

  84. Re:NO! Try Alfresco by Slashcrap · · Score: 1

    SharePoint is $$$$.

    It's traditional to use **** to imply expletives. Although I don't really see how just four characters covers "unbelievable half-baked cocksucking pile of shit".

  85. Spotlight by Herve5 · · Score: 1

    ... seconded

    --
    Herve S.
  86. Dedicated search by Anonymous Coward · · Score: 0

    Desktop search is by design not fit for indexing large quantity's of data over the network. You should consider switching to a dedicated search appliance or software solution.
    examples are:
    Google Mini: http://www.google.com/enterprise/search/mini.html
    Thunderstone: http://www.thunderstone.com/texis/site/pages
    SophiaOne: http://www.sohpiaone.com
    Microsoft search server: http://www.microsoft.com/enterprisesearch/en/us/search-server-express.aspx

  87. Re:How about Spotlight? That works on shared volum by gd23ka · · Score: 2, Insightful

    Spotlight is the obvious answer if you have OS X. Not everybody in the world is lucky enough to be in that
    position, most are stuck on one of the inferior platforms. Your rubbing it in, is not helping it just
    alienates people who already have been through enough and have it tough.

  88. Autonomy etc by Anonymous Coward · · Score: 0

    Autonomy IDOL with mapped security.

  89. Dedicated search is the way to go. by Anonymous Coward · · Score: 0

    Dedicated search is the way to go.
    Desktop search is not meant for share indexation. Network load of multiple users will kill you.
    Possible solotions are:
    Google Search appliance:http://www.google.com/enterprise/search/mini.html
    Thunderstone: http://www.thunderstone.com/texis/site/pages
    SophiaOne: http://www.sophiaone.com
    MS Search server 2008: http://www.microsoft.com/enterprisesearch/en/us/search-server-express.aspx

  90. Pfff... by ls671 · · Score: 1

    Slackware is at version 13 which makes it much more advanced than a version 7.

    Read at Slackware got to version 13 so quickly at this link:

    http://en.wikipedia.org/wiki/Slackware ;-)))

    --
    Everything I write is lies, read between the lines.
  91. Novell quickfinder by Anonymous Coward · · Score: 0

    http://www.novell.com/products/openenterpriseserver/quickfinder.html

  92. Try regain Desktop Search (server variant) by thtesche · · Score: 1

    You should try regain http://regain.sourceforge.net/?lang=en . There exists a desktop and a server version. And if you are clever you mix them. You can create your indices with the server version (crawler) and copy the to your desktop computer. So you can search without the need of an application server to run the search frontend.

  93. Re:How about Spotlight? That works on shared volum by Anonymous Coward · · Score: 0

    It's Microsoft's fault.

    Or then it's your fault for formatting something with NTFS, and then putting your data there.

    Never blame Apple. All hail Apple.

  94. Copernic Desktop Search by Anonymous Coward · · Score: 0

    i'm a avid user of Copernic Desktop Search. It indexes my shared network drives. Once the index is created,CDS only updates the index when the computer is idle which does NOT slow down your computer. There is a free trial on their web site. www.copernic.com

    Check it out!

  95. Access Rights? by An+anonymous+Frank · · Score: 1

    I had a look at some solutions last year, and ran into one hell of a road block; most solutions I had a look at presume that all the information you're indexing should be searchable and/or available to anyone that can reach your search tool's client.

    Has anyone had experience with something that will search the indexes for items based on your credentials? (Meaning that if you're not in accounting you can't get results for that data set)

  96. Looking at the wrong problem? by thogard · · Score: 1

    Ever consider hiring a librarian? I've worked at 3 small companies that had one and they were far more profitable than most of their competition because there was someone in charge of organizing the data.

  97. Re:NO! Try Alfresco by spyrochaete · · Score: 1

    You could use Microsoft Enterprise Search Server Express which is free (if you have a Windows Server license laying around). It's the same search engine as MOSS without the CMS functionality and it can crawl just about everything either natively or with connectors. You can use MSSQL Express as the database engine which is also free.

    Or you could go completely open source with Apache SOLR, though I hear it's so featureful that it's very difficult to install and configure.

  98. New times Coogi Jean,Bape Hoody,Gucci Sweatert---F by Anonymous Coward · · Score: 0

    Http://www.tntshoes.com

      we are a prefession online store, you can see more photos and price in our website which is show in the photos
    if you are interested in our product, please email me by ,hellow pls see our website in the photos attached attached is our store's website, we are a online shopping store, we are selling large brand new shoes,clothing, handbag,sunglasses,hats etc, our products are all best quality with the cheapest price. You will see the more pictures and the price for our product in our website, we are selling all brand new handbag, please see below some price list of the product. We accept paypal as payment, and give free shipping. Jeans : A&f Armani artful dodger jeans Bape BBC christian audigier COOGI D&G diesel ED HARDY lrg etc $33-50 free shipping. Jersey NBA Jersey MLB NLBM nike puma adidas $12-30 free shiping.

      OUR WEBSITE:

                                                      YAHOO:shoppertrade@yahoo.com.cn

                                                            MSN:shoppertrade@hotmail.com

                                                                  HTTP://www.tntshoes.com

  99. Re:How about Spotlight? That works on shared volum by Anonymous Coward · · Score: 0

    I agree. Ducks are not all they're quacked up to be. You bet I'm posting as Anonymous Coward! :)

  100. Try Wilma? by mcclk · · Score: 1

    I've used this program with some of my users on a large network volume that they needed to do keyword searches on. The initial index build takes time (although not days), but subsequent updates are fast, and searching is super-fast. You can tell it what file types to index, too, which is nice. Everyone I've installed it for loves it, even though we have enterprise content management - they just find this easier to use and faster. http://s3.amazonaws.com/redtree/wilma/en/help/index.html

  101. http://beagle-project.org Linux Search Tool by j0ebaker · · Score: 1

    -----BEGIN PGP SIGNED MESSAGE-----
    Hash: SHA1

    The Beagle tool on Linux can be used. It also has a web based
    interface for running queries over the network. From looking at the
    configuration screens, it appears to be able to cascade to other
    systems to send queries to beagle services there also.

    Dogs have a great sense of smell for finding things, so put a Beagle
    to work databasing your files for instant access in the future.

    I've used beagle for a long time, and I even added the Beagle plugin
    for firefox so I can find content on web pages I've visited.

    Thanks to an open source tool such as Beagle, I don't have to trust
    proprietary code that has the potential of giving away my secrets
    through hidden back doors.

    - -Joe
    -----BEGIN PGP SIGNATURE-----
    Version: GnuPG v1.4.9 (GNU/Linux)

    iEYEARECAAYFAkrd1GMACgkQ7J1dPd3sAmCRiACgieFtKL8IsB1ub4V7zBTEVQ8/
    WQEAoJHh8jO6FFr1LPFfCiqVMSeD78a6
    =3DGr
    -----END PGP SIGNATURE-----

  102. Wrong, read again. by bhpaddock · · Score: 1

    Again you are incorrect on several counts.

    The whole point of Federated Search is that the server is NOT running Windows Search. Instead it can be running SharePoint / Search Server, FAST, or virtually any other indexing solution. Lots of indexing solutions can output RSS or Atom over HTTP, or can be easily extended to do so. Then you get the same local file experience users expect from Explorer, but with whatever remote index you want.

    Also, Windows 7 and Server 2008 R2 do not run WS4, they have a newer version of the indexer that *is* significantly faster and scales significantly better.

  103. No it doesn't. by bhpaddock · · Score: 1

    Federated Search is a Windows 7 feature and did not exist in Vista.

    Vista can query a remote Vista or Windows Search 4 index for a file share, but that's separate from the feature we call Federated Search, where Windows 7 can federate queries to OpenSearch enabled sources such as SharePoint, Search Server, FAST, etc.

  104. IBM OmniFind by mvip · · Score: 1

    If you're able to get a hold of it, IBM OmniFind Yahoo Edition would do the trick. Unfortunately Yahoo pulled the plug when they went into bed with Microsoft (Bing). I'm using it on a local intranet, and it works great. If you have a deep wallet, you can always look into the commercial version IBM offers, but it is really nothing but a packaged version of Apache Lucene.

  105. SearchIn : Distributed Data Management Solution by Anonymous Coward · · Score: 0

    Go to BlackBall,Inc.s Website and download a free trial of SearchIn 7.0:
    http://www.blackball.com/b/nav

    Best Indexing tool I have ever used, for all types of data anywhere on a network, and the new version is cross platform.
    And Im not just saying that because I helped develop it. Haha, but seriously, it does everything you are talking about, and it outperforms other indexing and searching tools on the market.

    It's so good, Toshiba even branded it under their name for distribution with their MFPs, for indexing documents that are scanned in from paper.

    Check it out there too, its the same program, jus different branding;

    http://copiers.toshiba.com/usa/software/document-management/e-bridge-research.html

  106. use KnowwledgeTree document management system by Anonymous Coward · · Score: 0

    Use something like the community version of KnowledgeTree. Searches are done on the server, not on individual desktops. I run it in combination with the open source Amahi server project and really like it.

  107. Solr is Lucene-based and understands Word files by Jan-Pascal · · Score: 1

    The upcoming Solr 1.4 release (http://lucene.apache.org/solr) includes the Tika document parser. You can throw Word, PDF etc files to it, and it will index them for you. Solr is a web service; a few hundred lines of PHP would be enough to use the Solr to index your shared volume, and a again that much for a web-based search tool.

  108. This sounds like a job for cron! by Anonymous Coward · · Score: 0

    A cron job that does a find with file and maybe grep should be a good start. Then you can get clever with command line tools for pulling meta data out of images, videos, audio files, documents, etc.

  109. Re:How about Spotlight? That works on shared volum by KnownIssues · · Score: 1

    "Open sores". Freudian slip?

  110. You can't do it...yet by KnownIssues · · Score: 1

    This is impossible to do with any product now, but you'll be able to do this with Windows Server N+1 as long as all your clients are also Windows N+1, but only if you also buy Microsoft Windows Office SharePoint Enteprise Search Server X. You could try to do it with current technology, but you'd have to remove one of your arbitrarily self-imposed restrictions. I could tell you which one, but I'd be reducing my chance that anything I'm saying is relevant if I guessed wrong.

  111. Re:NO! Try Alfresco by jasonwea · · Score: 2, Informative

    Full text search in Alfresco uses Lucene. Or at least it did when I deployed it on Debian with PostgreSQL.

  112. RSP's Earth by skyphyr · · Score: 1

    Depending on your needs you may find RSP's Earth to be a viable solution. http://open.rsp.com.au/projects/earth/ It's cross-platform and has a web interface (though you could build a desktop one easily enough if you wanted to). Cheers, Alan.

  113. Wait, what kind of files? by Kagetsuki · · Score: 1

    I think you are being a bit too vague on what kind of files you are searching for/indexing. We have 5TB of storage that's 70% full, that includes quite a few smaller files/code files, and I've never had an issue with "find -name ". Of course I rarely use that because our files are organized and we have naming standards. Do you have a shared volume full of random files all mashed together? Are you in a college dorm and someone grabbed an external drive and shared it out for everyone to just throw porn and music and movies into and you want to index it so you can find Beastie Boys songs and XXX Blond Anal videos just that much faster?

  114. X-Friend is the answer by Anonymous Coward · · Score: 0

    have a look at the X-Friend product line of the german company x-dot. (http://www.x-friend.de).
    They offer an enterprise search solution which seems to do everything you want.
    They support a ton of file formats.
    They even obey to any ACL set on the files, so a user only sees results of files which are readable by him.
    You can also aggregate the search results over several distinct servers, so the end user only interfaces with his local client over a browser interface and then transparently can search over several servers at once.

    Disclaimer: I have not tested the enterprise search product, but I am a happy user of their desktop search product.

  115. everything by Mr.Mustard · · Score: 1

    Sorry I'm late, but you might want to look into Everything.
    http://www.voidtools.com/

    --
    fnord
  116. Check searchtool Lookeen with shared index! by Anonymous Coward · · Score: 0

    I recommend the Outlook add-in Lookeen for searching on shared volumes in combination with outlook. This tool has a shared index feature which will be very helpful!!!

  117. Re:How about Spotlight? That works on shared volum by thedbp · · Score: 1

    FYI - Mac OS X is UNIX. Certified and all. It's a fantastic server platform.

    http://beast.bio.ed.ac.uk/BEAGLE_on_Mac_OS_X

    I'm guessing you've never used Spotlight extensively. It works great on large network shares.

  118. Re:How about Spotlight? That works on shared volum by thedbp · · Score: 1

    Whoops, wrong link:

    "The Beagle [1] developers decided to fill this search gap using Apple’s MacOS X search function as basic material."

    http://www.linux-magazine.com/w3/issue/58/Beagle_Search_Tool.pdf

  119. IBM/Yahoo by Ksigpaul · · Score: 1

    There's a free version of the expensive OmniFind product by IBM available with a little bit of Yahoo branding (which can be removed). You can't really integrate it easily with local search but I would assume that your users know whether they want local or shared documents. If you need security/permissions around people being allowed to access certain documents you're going to need to buy something. Otherwise try: http://omnifind.ibm.yahoo.net/

  120. Coogi suit,Afflication Hoody is Season Goods by Anonymous Coward · · Score: 0

    Welcome TO Our Website: Http://www.tntshoes.com

            we are a prefession online store, you can see more photos and price in our website which is show in the photos.
              pls find the more photos and the price for our product in our website,hellow see our website in the photos attached, on line shipping sotre, selling all kinds of brand new shoes,clothing, handbag,sunglasses,hats etc, if interested please email me by we are selling all brand new handbag, we take paypal as payment, . shoes Nike jordan1-23 $23-$28 free shiping.

      OUR WEBSITE:
                                                            YAHOO:shoppertrade@yahoo.com.cn

                                                                    MSN:shoppertrade@hotmail.com

  121. Artful Dodger men jean,Air Yeezy fusion man shoes by Anonymous Coward · · Score: 0

    Http://www.tntshoes.com

        We offer kinds of Newest Style Handbag,Brand Handbag,Fashion Handbags,
    Ladies' Leather Handbag,Replica Handbag--AmmonOnline
    We ship to worldwide by EMS,TNT,DHL,UPS.
    We supply you with smooth and fast services, and do dorp shipping.
    Welcome to visit our factory.
    Please visit our Website:www.tntshoes.com or products Album,

    Contact us now, We can send you more details.

          OUR WEBSITE:
                                                                  YAHOO:shoppertrade@yahoo.com.cn

                                                                    MSN:shoppertrade@hotmail.com

                                                                              HTTP://www.tntshoes.com

  122. How to ask my question (submit my own) by Anonymous Coward · · Score: 0

    Someone know how i can submit my own question..

  123. libFerris federation of "other" indexes by Anonymous Coward · · Score: 0

    Clucene and Postgres are combined, sure you could manage clucene and clucene, via libferris?

    http://www.linuxjournal.com/article/9298

    Surely some clever sort can whip up a quick GUI for you...

  124. Re:NO! Try Alfresco by Anonymous Coward · · Score: 0

    SharePoint is $$$$. Try Alfresco. Alfresco can look like a file share (support SMB, DAV, FTP, etc). The indexing is built is and does not require a separate SQL Server license.

    Windows SharePoint Services (WSS) is free with any copy of Windows 2003 Server R2 or above. It can also use SQL Express, which is also free.