Examining Mac OS X 10.4's Spotlight
Ton writes "Apple has published a discussion of Spotlight, the radical systemwide search technology that will be part of Mac OS X 10.4 'Tiger'. The really interesting part is that metadata will be playing a big role in Spotlight while just a few years ago people were afraid metadata in Mac OS X was going the way of the dodo."
Can someone please explain a little more as to how Spotlight using metadata is a "radical" new thing?
I haven't seen any mainstream implementations (WinFS?) of it, but I didn't know it was a brand new concept.
Error 407 - No creative sig found
I've been using computers through my whole childhood, school, work, etc. I'we been doing all sorts, from playing games to hardware design and real time data analysis on the computer. And never, ever have I had the need to search for my files.
;)
What is with this search thing that everybody is so hot about?
Not that it doesn't look cool though. Anything on a Mac looks cool
stat file.jpg
in linux. Would be nice in linux to beef up on metadata too.
I hope that spotlight will work also, if you have a linux partition exported to the Mac via NFS. Will file information of NFS mounted systems also stored in the database?
Having linux and OS X working together is already now not without issues. If you have a file Test.jpg and test.jpg in your Linux partition and you copy both to the same place in OSX, the finder (on the mac) complains, because the two files are considered the same.
it won't work properly in Longhorn and you have to wait three more OS revisions for Microsoft to get it right.
Sounds funny, but it's more likely to be true.
The kind of metadata that was almost deprecated by Apple isn't quite same thing as the "modern" concept of metadata. The classical HFS metadata covered concepts like file type, file creator, and "Finder bits" that aren't handled at the file system level in other OSes. This, combined, with the Mac OS's historical use of resource forks for storing developer defined data records, made perserving such data difficult or impossible in heterogenous environments like the Internet. It's really a shame; I've always thought this concept was the most elegant attempt to solve the problem of "rich data" associated with data files without requiring the data in the file itself to have some form of universal container format.
The metadata concept used by Spotlight is going to be based in part on a plug-in system that allows the Mac OS to reconstruct metadata information from the data within files themselves, rather than just using the metadata facilities provided by HFS and Mac OS resource forks. That means that each different kind of file, from Word documents to PDFs to Postscript jobs, needs its own special kind of processing to read its own format of storing such data. It's less elegant and more processor intensive that just using the historical HFS system, but it's more likely to to be useful for extracting metadata from files provided by Windows and other Unix variant users.
Those who complain about affect & effect on
Filesystem metadata is great, but "instantly" updated search indexes sounds like a solution to a problem that doesn't really exist.
Doesn't exist *for you* perhaps. Perhaps you don't have a lots of user data, or you have taken time to sort it into useful folders. I'd say it's about as useful as the incremental seach in iTunes is. Sure I could remember what artist did a track, and access a track by scrolling down to that artist, then finding the track. Or I could scroll down the list of thousands of track names, remembering my alphabet ordering, and locate the track that way. Assuming I've remembered the exact wording of track name. But I've always found it easier to type whatever word comes to mind first from artist or track into the search box.
And so it is with documents. Even if I do remember the file name and folder that a particular piece of information is stored in, I still need to navigate there. Most times it will be quicker just to type in whatever it is you remember about the data you want into a search box - even if you know where the data is stored.
The technologies are barely related; Apple is not ripping off QS/LB in the least here. Spotlight is a technology for searching through files based on their conent and metadata. QS/LB are utilities for finding files based on easily typed mnemonics. You are looking at one aspect of Spotlights appearance (the dropdown search pane in the corner) and assuming it's a ripoff based on some similarity to the appearance of the other utilities.
In fact, the Spotlight indexing technology will be a boon to the utilities, as they will be able to leverage this newly available metadata to execute even more powerful searches. Quicksilver is already invaluable to me, and I expect it to just get better.
Clicking the 'X' doesnt actually close the application. This annoyed me to start with, but ive slowly gotton used to it.
LOL. Quitting the app just because I closed a window is one of the things that annoys me the most about Windows. If I'm done working with one document in Word, I have to be sure to open up the next one before I close the first or I have to wait for Word to start up again.
However, from what I've seen, that's not the sort of thing Spotlight is about. The plugins we're talking about make use of intrinsic metadata - information extracted from the datastream itself. Many common file types include some descriptive information: EXIF data in pictures, MP3 tags in audio files, meta tags in HTML files, and so on. Spotlight is a way of extracting and using that data.
The practical differences include, OTTOMH:
- Spotlight's information won't be lost when files get stored on other file systems, sent over email, processed on other platforms, &c.
- Spotlight uses information that's already in the files - you won't have to set it up manually.
- You can use existing tools to see and edit the metadata - MP3 taggers, photo editors, whatever. And you can do so on any machine and OS.
This is probably one of those rare cases when that foul word 'leverage' might be appropriate -- Spotlight should allow you to make much better use of an existing resource. As such, it sounds like a jolly neat idea!Ceterum censeo subscriptionem esse delendam.
There's been reference from the beginning of the computer revolution to this solution we've all been waiting for... and credit to evolutionary steps taken by apps such as,Quicksilver, Launchbar, BeOS, etc... but one application that predates AND which most closely matches the feature set is:
Simson Garfinkle's "Sbook.app" from NeXT in the 90's.
The usefulness of Sbook.app ability to add tokens in a flat file for instantaneous searches enabled people to apply Sbook.app outside its realm of address book that it originally was designed.
Abstracting its functionality and interoperating at the kernel level is pure Apple polish on the brand. Until people start using "Spotlight", the verdict will be out on adoption across the platform.
I will venture it will be one of the defining characteristics of the Mac platform into the future.
Well, let's just throw some figures up into the air. You've got a 200GB hard drive. The index is taking up 1GB. This is half of one percent of the drive space, a couple of dollars worth.
I'd say that 1GB is a lot larger than it will ever be, so it's not a concern for me at all
I'll happily spend a couple of dollars on drive space for instant searches on my local machine.
Kai
Specialist Mac support for creative pros, Melbourne
If you have a mac with a ton of files, various "Previous System Folders" etc...follow along :)
I have smart folders for pdfs, avis, mpgs, and wmvs
I have these sorts of files *all over the place*...movie clips, test files, you name it.
I go to the finder, "open" the Windows Media Files folder, and they are all "there"
Or all the "archive" files (zip, rar, sit/sitx etc) i've collected and not erased in the last year...
or all of the emails i've received from japanese users...
it goes on and on.
To me, its like the whole star trek "Computer..find all of the blah blah blah for sector Whatever"
It concentrates on the "what you want" as opposed to the current paradigm of where did i pit it/what app did i use, etc