Slashdot Mirror


Microsoft's Search Engine Plans

prostoalex writes "Andy Beal from SearchEngineGuide.com interviews Robert Scoble from Microsoft. Scoble tells the audience what current search technologies Microsoft is working on as part of its Longhorn/WinFS development as well as in the field of Internet. Scoble also discusses current problems with local drive and Internet searching, such as absence of metadata for a lot of files out there: "When I take pictures off of my Nikon, they have some metadata (for instance, inside the file is the date it was taken, along with the exposure information) but that metadata isn't useful for most human searches. For instance, how about if I wanted to search for "my wedding photos?" Neither X1, nor Windows XP's built in search would find your wedding photos. Why? Because they have useless names like DSC0001.jpg and there's no metadata that says they are wedding photos.""

92 of 407 comments (clear)

  1. I have a suggestion for em.. by panxerox · · Score: 4, Insightful

    Make it "Micosoft Search powered by GOOGLE". Then "maybe" it might function well. Also metadata needs to be created by the user, I aint gonna be entereing data on a keypad on my camera for every photo.

    --
    "It's so convenient to have a system where everyone is a criminal" - A. Hitler
    1. Re:I have a suggestion for em.. by queen+of+everything · · Score: 2, Interesting

      I think they already are...

      My site for a long time wasn't ranked on Google, MSN, yahoo! search. Then one day I was on the first page for Google. Amazingly enough, I was in exactly the same place on msn and yahoo searches. They all supposedly have their own crawlers, but why was it until I was listed on Google that I was listed on the rest? Just a theory I have...it probably means nothing.

      --
      "Wisdom is not a product of schooling but of the life-long attempt to acquire it." -Albert Einstein
    2. Re:I have a suggestion for em.. by lintux · · Score: 5, Funny

      > Also metadata needs to be created by the user, I aint gonna be entereing data on a keypad on my camera for every photo.

      That's why the next version of Microsoft Photo[tm] will send every photo you make to the Microsoft headquarters so they can add the metadata for you. ;-)

    3. Re:I have a suggestion for em.. by nospam007 · · Score: 3, Funny

      If they claim that they can indentify bitmaps with kiddie porn, then it should be no problem doing it with wedding photos where there is usually a broad with a 19th century white gown on most of them.

    4. Re:I have a suggestion for em.. by nolife · · Score: 5, Informative

      Google can already do a good job of finding wedding photos.. They may not be yours but they are wedding photos ;)

      --
      Bad boys rape our young girls but Violet gives willingly.
    5. Re:I have a suggestion for em.. by mgblst · · Score: 3, Funny

      A Google crawler indexed the net, while a Microsoft crawler will simply index Google.

  2. Search by date by Saven+Marek · · Score: 5, Interesting

    I can get around searching for "wedding photos" because I remember the date. 3 special days, and hundreds of wedding photos appear.

    It's part of being human that we don't necessarily remember the phrase "wedding photos" but we may remember many other tiny pieces of data about a shoot that are unique to us, and the time and date are one of those. I can be certain the post 9pm photos done on those days are pretty embarassing.

    Just concentrating on "Wedding Photos" is useful if someone else is searching my picture archive, but that's not useful to me

    nude geekgrrls

    1. Re:Search by date by TitanBL · · Score: 4, Informative

      The new iPhoto handles this very well - automatically imports the date the photo was taken. Then you can group the images together in albums similar to the way you create 'smart playlists' in iTunes.

    2. Re:Search by date by ThogScully · · Score: 2, Insightful

      MOD PARENT UP. A little common sense goes a long way. Entering all that metadata that would make keyword searches viable would certainly help, but people already have the ability to do that - it's not worth the time or effort. Cataloguing by directory and maybe filename is all I ever do and I'm pretty sure I'm not alone.
      -N

      --
      I've nothing to say here...
    3. Re:Search by date by gl4ss · · Score: 2, Funny

      exactly, whatever microsoft does they can't make the system figure out the proper metadata on their own and the user has already the possibility to enter it anyways.

      like, have the wedding photos in a 'wedding' folder or whatever, porno in 'porno' folder and wedding porno in 'wedding night' folder.

      sure they can make it a nice categorizing system but wtf, why steam so much about it, are they going to bring it "up on your face" so that it's hard to ignore typing the metadata(most people would ignore it anyways)? don't they have real features?

      --
      world was created 5 seconds before this post as it is.
    4. Re:Search by date by higgins · · Score: 3, Interesting

      David Gelernter built a system called Lifestreams that basically claimed that time-ordered series plus some simple search and organization operators was everything you needed. It always seemed like a pretty good idea to me.

      That said, if you do have metadata available, you can do a lot with it.

    5. Re:Search by date by stevens · · Score: 3, Funny
      Entering all that metadata that would make keyword searches viable would certainly help, but people already have the ability to do that - it's not worth the time or effort.

      I'd like a camera that could accept voice metadata, turn it to text, and preserve it when it went onto the filesystem, using whatever metadata the filesystem supports.

      That way I could easily hit the button on the camera, say "vacation with supermodel"[0], and search on that later. Although we'd need something more than ext{2,3} which could hold proper user-defined metadata.

      Note 0: The simple metadata storage system would not have a lie detector. ;P

    6. Re:Search by date by Chess_the_cat · · Score: 2, Insightful

      Who's really dumping all their pics into My Pictures without making subfolders? Subfolders and thumbnail view in XP already make searching for photos from a command line useless.

      --
      Support the First Amendment. Read at -1
    7. Re:Search by date by Flakeloaf · · Score: 4, Funny

      Funny, I get around not being able to find my wedding photos by putting them all in a directory called WEDDING PHOTOS - THE PHOTOS OF YOUR WEDDING ARE HERE YOU TWIT

      --

      Am I the only one who heard Roxette to sing "I'm gonna get blitzed for some sex"?

    8. Re:Search by date by alangmead · · Score: 4, Interesting

      What if a system's search could draw information from all the applications within the system. For example, if your electronic datebook had a day long entry for "wedding" and the photo manager has photos taken on that date, then a search for "wedding photos" would be able to find out when the wedding was, match it up with the date the photos were taken, and come up with the appropriate set.

      To some extent Apple tried this with "Newton Intelligence" on the MessagePad. If you wrote "Thursday, Lunch with Bob at Redbones" It would (after you fixed all of its handwriting recognition mistakes) look up Bob and Redbones in the address book, look in the calendar for the next occurrence of a Thursday, and schedule a noon time appointment.

      Newton Intelligence really only amounted to a small set of interapplication tricks, but it was assumed that as the popularity of the units increased, the functionality would be extended. (which pretty much tells you what happened to it.)

    9. Re:Search by date by Lord_Dweomer · · Score: 2, Interesting
      What if photo meta data could be saved steganographically (spelling?) inside the image? Oh wait.....what if the image got edited, whoops.

      --
      Buy Steampunk Clothing Online!
  3. I'm not buying it by de+Selby · · Score: 4, Insightful

    For instance, how about if I wanted to search for "my wedding photos?" Neither X1, nor Windows XP's built in search would find your wedding photos. Why? Because they have useless names like DSC0001.jpg and there's no metadata that says they are wedding photos."

    That's why you can change filenames and organize things into directories.

    1. Re:I'm not buying it by Saven+Marek · · Score: 4, Interesting

      Even easier than putting into directories is using a portfolio type application, like Picasa (the original version of Apple's iPhoto btw) which allows simple drag and drop library creation. You can have pictures in multiple libraries, it just takes a small few moments to drop photos into their correct places and they are sorted as need be. If you want wedding photos, look in there if you want photos of janine, kate or benson look in their respective folders.

      It doesnt need to be a morass of embedded folder after folder either, as humans have mental acuity unlike a computer. You may have uncle bob who is photographed a lot and auntie beryl who isn't, but all the photos of beryl you may know will contain bob. We can store a surprising amount of information, and perhaps 5 to 10 libraries is all you will need for most peoples collections.

      Special occasions get their own. It just takes moments after downloading the photos.

      nude geekgrrls

    2. Re:I'm not buying it by leomekenkamp · · Score: 3, Funny

      I think the guys at Microsoft are under the impression that it is easier for a user to add metadata to a file than change its filename and put it in a logical place.

      I think they are wrong.

      Even if they put more annoying pop-ups like "Hi, I'm Photi, your annoying photo-organiser! What are the names of the people on this photo?" I think people will go straight for the 'Ok' button. Especially if you take the number of pictures that can be stored on a memory card these days. Even clicking 'Ok' 50 times can be pretty annoying.

      --
      Wenn ist das Nunstueck git und Slotermeyer? Ja! Beiherhund das Oder die Flipperwaldt gersput.
    3. Re:I'm not buying it by tsa · · Score: 4, Insightful

      But, if the industry works together on common WinFS schemas (not just for contacts either, but other types of data too), we'll come away with some really great new capabilities. It really will take getting developers excited about WinFS's promise and getting them to lose their fears about opening up their data types.

      If M$ would work together with the industry and open up its data types we would come away with some really great new capabilities. This is incredible: they want 'the industry' to do what they never do, and I expect they will succeed. Prepare for the even more total domination of Microsoft in the near future.

      --

      -- Cheers!

    4. Re:I'm not buying it by edalytical · · Score: 4, Informative

      I did a quick Google search and what I can tell iPhoto was introduced at MacWorld Expo January 7, 2002. Picasa was introduced in October 2002.

      --
      Win a signed Stephen Carpenter ESP Guitar from the Deftones: http://def-tag.com/?r=0008781
  4. Ouch by Anonymous Coward · · Score: 5, Funny

    X1, nor Windows XP's built in search would find your wedding photos. Why? Because they have useless names like DSC0001.jpg and there's no metadata that says they are wedding photos.

    Right, dude! The camera should automagically recognize that it's taking pictures of your wedding and include that info in the metadata!

    1. Re:Ouch by Otter · · Score: 5, Interesting
      Yeah, I'm sure being a computer science visionary is harder than it looks. But from the outside, all they seem to ever do is to announce that computer use is difficult because software developers aren't as smart as them, and that what we really need is some way for everything to magically sort itself out. Details of implementation to be left to those of less rarified brilliance.

      The closest thing to a workable scheme is Gelerntner's Lifestream stuff -- where your system knows that you got married on a certain date (even if you have trouble remembering it) and that documents (JPEGs, Word files, GNUCash transactions from that time probably pertain to it.

    2. Re:Ouch by ForteTuba · · Score: 3, Insightful

      Imagine a world where we're all broadcasting identity. Say we've got RFID-enabled nametags at the wedding. Now picture a camera that has a (preferably directional) RFID reader. Suddenly all your photos have the names of all the subjects automagically added as metadata. Scary in some contexts, useful in others (like most technologies, I suppose).

    3. Re:Ouch by cybpunks3 · · Score: 2, Interesting

      --
      The closest thing to a workable scheme is Gelerntner's Lifestream stuff -- where your system knows that you got married on a certain date
      --

      That's fine for personal photos, but what about MP3s or other acquired media which has no direct association with personal life events?

  5. Hmmmm... by Bowie+J.+Poag · · Score: 4, Insightful

    I think thats what "organization" is for. You place files like "DSC0001.jpg" in things called "folders", and then name the folder "Wedding" or something.

    I dunno.

    --
    Bowie J. Poag

    1. Re:Hmmmm... by khuber · · Score: 5, Funny

      I'm so stupid! I named all my folders like that too. folder0001, folder0002, ....

    2. Re:Hmmmm... by AndroidCat · · Score: 3, Funny

      But how will you know who they belong to? The OS should force names like MyFolder0001, MyFolder0002, ...

      --
      One line blog. I hear that they're called Twitters now.
    3. Re:Hmmmm... by epsalon · · Score: 3, Funny

      I've actually heard of a guy putting all his data in a heirarchy of folders called "New folder something" because he didn't know how to rename them.

    4. Re:Hmmmm... by jd142 · · Score: 2, Interesting

      And even better, many photo programs allow batch renames. So while you're putting them in the wedding folder, rename them all to wedding####.jpg and let the program automatically append numbers.

      Reminds me of Scotty's line, "The more they overthink the plumbing, the easier it is to stop up the drains." They've developed a complex solution for a simple problem that already had a simple solution.

      While a database driven file system with the ability to let users define their own metadata fields in the database sounds really, really cool, I won't be using Microsoft's first or second version for anything I value.

      So what's the status of the *nix version of a database file system?

    5. Re:Hmmmm... by Anonymous Coward · · Score: 2, Insightful

      I think thats what "organization" is for. You place files like "DSC0001.jpg" in things called "folders", and then name the folder "Wedding" or something.

      The problem with this is an inherently one-dimensional view of the data. If you have placed your wedding photos in the weddings photo folder, you have not got the option of ordering photos by size, of easily finding facial photos or having some other property of your pictures you might want to use to get a subset of your existing picture bank.

    6. Re:Hmmmm... by tomstdenis · · Score: 5, Informative

      you *can* do this to some limited sense with the command shell

      for %a in (DSC*.JPG) do rename %a Wedding_%a

      You just have to know a bit about the command shell...

      Tom

      --
      Someday, I'll have a real sig.
    7. Re:Hmmmm... by jd142 · · Score: 2, Informative

      Yeah, there's a couple of different ways to do it in a dos batch file. I think you can even do a foreach to loop through all the files in a directory. And then there's windows script, another overly complicated solution for a simple problem. And of course, if I use a computer for more than half an hour, I put perl and php on it for command line scripting.

      But for the average user with a digital camera, the software that comes with the camera normally has a batch rename function. I know my Nikon did and the others I looked at when shopping had similar software.

  6. In an Internet far far away... by Anonymous Coward · · Score: 2, Funny

    Darth Ballmer: Commander tear this site apart until you've found those plans and bring me the users I want them alive!!

  7. Thumbnails by sreid · · Score: 5, Insightful

    What happened to thumbnails?

  8. I'm impressed by Anonymous Coward · · Score: 5, Funny

    This whole longhorn winfs thing seems like a big technological advance to me ...

    Manualy adding metadata to each of your 200+ wedding pictures looks so smarter than just creating an old fashioned directory "wedding pics" and moving them into it ...

    I can't wait to start using this wonderful FS

    1. Re:I'm impressed by AndrewHowe · · Score: 2, Insightful

      But that way, things can only be in one folder. The whole point of the WinFS stuff is that you're not forced to store stuff in a rigid hierarchy.
      What if you wanted to find your favourite wedding pictures? Ones with your Mum & Dad in them? Ones with your wife in, but not necessarily from the wedding?
      I'm a game developer. We have a lot of "materials" which describe how to shade surfaces. Our editor allows you to put the materials anywhere under a certain directory. That's nice, as it avoids having thousands of files in the same directory, but in practice it's a bit of a nightmare.
      They're currently grouped in a number of different ways. All of the artists have their own folder. Then there are folders for each different level of the game. Also there are "grass", "metal", "wood" etc., and they're also sometimes grouped by slipperyness. It's a complete mess, as to find a material you have to check all the categories it might fit into.
      Allowing them to be dynamically grouped based on metadata would be a dream. I would implement it myself if I had time.

    2. Re:I'm impressed by SwansonMarpalum · · Score: 3, Insightful

      If your file system supported true symbolic links, your problem could be largely mitigated by using them.

      --
      "Give away the stone, let the oceans take and transmutate this cold and faded anchor." - Maynard James Keenan
    3. Re:I'm impressed by ealar+dlanvuli · · Score: 2, Informative

      With iPhoto I can do that now.

      Congrats, so revolutionary.

      *rolls eyes*

      --
      I live in a giant bucket.
  9. Search results in Micro"Google" by AtariAmarok · · Score: 3, Funny

    search for "best OS" find Microsoft

    search for "viral software" find Linux

    search for "secure" find Windows XP

    search for "handsome smart guy" find Bill Gates

    --
    Don't blame Durga. I voted for Centauri.
  10. Google by linuxci · · Score: 5, Insightful
    The way Google needs to compete is to show their users that there's no need for Microsoft.... Why? Because MS may just do its best to stop the Google toolbar working in IE for Longhorn. Microsoft have already 'innovated' an MSN toolbar that looks very similar to the Google offering.


    So instead of offering their official toolbar for IE only (the one for Mozilla is unofficial), start to slowly phase out the Google Toolbar and replace it with the Google Browser which would basically be a Google branded Mozilla Firebird. With all the features that make Firebird great like Tabbed Browsing, with the addition of the Google Toolbar features such as PageRank, etc. All on a cross platform basis.


    If people get used to downloading better browsers now, then they won't even notice when the next release of IE starts to reject the Google Toolbar.


    Let them know what you think

    1. Re:Google by HoneyBunchesOfGoats · · Score: 4, Insightful

      That would be very interesting. Hardly anybody apart from us geeks has heard of Mozilla or MozillaFirebird, but if Google rebranded it and put a little link on their front page, it would be exposed to millions every day... who wouldn't grab it and try it out? People trust Google.

    2. Re:Google by 1010011010 · · Score: 2, Insightful


      An excellent idea. This could work well for Google and Mozilla. They can pitch "GoogleBrowser" or "GoogleWeb" as more secure, more feature-rich, and easier to use. Luckily, these things are already true, and Google won't have to do all that much development, beyond branding and optionally adding some google-specific enhancements.

      --
      Napster-to-go says "Fill and refill your compatible MP3 player", which is a lie. It's not MP3. It's WMA with DRM.
    3. Re:Google by grahamlee · · Score: 2, Informative

      Appendix B.1 of the HTML 4.01 specification suggests that a conformal browser should have a mechanism for flagging errors; this does not apear to be the case with FB (although there is a 'side-effect mechanism as it is possible to visit validator.w3.org).

    4. Re:Google by linuxci · · Score: 2, Insightful

      In W3C specifications a SHOULD is not a MUST (they usually write them uppercase in the specs), so items labelled SHOULD are more of the nice to have things, not essential for standards compliance.

  11. useful dir names by cr@ckwhore · · Score: 4, Insightful

    If users didn't suck so much, then descriptive dir names would easily solve the problem of trying to locate a wedding photograph on a hard drive.

    So what, the image file is named "DSC0001.JPG" -- who cares. Put it in a folder named "my images" and there's no wonder you can't find it!! Put it in a folder named "wedding photos", and then you've got something there!

    The best way to describe it to the average joe (non)user is that directories/folders are analogous to folders in a filing cabinet. Would you file telephone bills, for example, under "mortgage" or "telephone"?

    Thanks Microsoft for "my photos", and "my documents", and the like. We appreciate it!

    --
    Skiers and Riders -- http://www.snowjournal.com
  12. Check out Phil Greenspun's similar idea by Speed+Racer · · Score: 4, Interesting

    Phil Greenspun has a similar idea and is looking for help on how to accomplish this on a personal level with existing the Windows XP filesystem. Check out his blog post for details. There's already an intersting discussion taking place in the comments for that post.

    --
    Free Mac Mini. Yes, I'm
  13. sharing proprietary formated data by stonebeat.org · · Score: 5, Insightful

    I hope the industry sees the opportunities that Longhorn's WinFS opens up. We can either work together and share data with each other, or we can be afraid and keep data to ourselves.

    Share data? with whom? how can you share data that is in either proprietary format or "patented XML" ???
    It is following the OpenStandard that will help in "working together and sharing data".

  14. What? by djupedal · · Score: 4, Insightful

    Why? Because they have useless names like DSC0001.jpg and there's no metadata that says they are wedding photos.""

    You mean to say you don't know the date you got married? You're in trouble.... iPhoto on OS X at least breaks them out into folders according to either last imported and/or month/year etc.. You're responsible to breaking them down further, in which case you don't search the entire drive later, you simply open iPhoto and take a short trip to your wedding folder, just like having a folder in a drawer in a cabinet in your home.

    It's not really that hard, now is it? if you're dropping any files onto your drive randomly, the issue is with your basic housekeeping, not that a top level search tool seems blind to your target.

    You're talking about EXIF, and the list of data there is long. Why you took the picture isn't part of it, and if you want the camera to interpret which part of the subject matter is root (noses..faces...age...sex...background..,trees...d aylight...horizon?), you've got more issues than just locating a particular photo.

  15. Microsoft Clientside Search:Road Signs for Spyware by NZheretic · · Score: 4, Insightful
    Given the history of Microsoft's security flawed implementation and design, any such client based search engine implementation would inevitably become a set of road signs for spyware, virus and worm malware developers.

    Would you want to trust your private data, gathered from govenment departments, purchases and financial transactions etc, being accessed by such a system run by any old govenmental or business agency?

    How about your private correspondence on friends and acquaintances home computers.

    Microsoft culled the URL name:password@ functionality from Internet Explorer because it claimed it could not create a secure enough fix, yet in the same month, it yet again proposes a privacy nightmare such as this? Madness.

  16. WinFS sounds promising, but... by Teckla · · Score: 4, Insightful

    WinFS sounds promising, but unless Microsoft makes the WinFS specs open and free, it'll be yet another lock-in technology, which would be very dissapointing.

    Adding metadata to all your files would require a lot of time and effort, and if it's a closed technology, it'd be yet another reason people wouldn't want to even attempt switching to another OS. I can almost hear it now...

    "This other OS looks cool, but I've spent so much time adding metadata to all my files, and I can't export that metadata to this other OS because the format is proprietary and patented... I'd better stick with Windows, switching OS's would be too hard..."

    Sorry, someone had to state the blatantly obvious. As usual, all promising technologies coming out of Microsoft are poisoned. And most people don't even realize it. Not even intelligent people. Most .NET developers don't even realize that .NET's so-called "standardization" via ECMA doesn't really make it an open standard (lots of the "standardized" .NET technology is encumbered by patents).

    -Teckla

  17. Irony by TheDarkRogue · · Score: 2, Funny

    I'm watching Invader Zim, and he says "And NOW For my Evil plan" and /. finishes loading with this story is at the top of the page. Second when it loads, it's got a google ad in it.

    --
    (Score:0, Interesting)
  18. Adding metadata is not the way by Flyboy+Connor · · Score: 5, Interesting
    Scoble's idea is that you will add metadata to your files. Can you imagine? You have literally tens of thousands of files you created (photos, documents, etc.) on your hard drive and you are going to add metadata to all of them? Does he really think people are going to do that? If they would be willing to do that, they would just rename those photo files from "DSC00001.JPG" to "MyWedding00001.JPG".

    Judiging from the interview, the "innovative" Longhorn seems to allow you to add metadata in a slightly user-friendly way. But virtually nobody will use it, except maybe to mark a few important files which you have stored in a special place anyway.

    So what would be a better solution then? My idea is that metadata should be added automatically. For instance, a human will recognize most wedding photos for what they are. Getting a computer to recognize this is not trivial, but lots of research is currently invested in this. Already computers can easily recognize general categories ("groups of people", "nature", "animal", "portrait"). My guess is that it is already possible to implement a system that you can train to let the computer recognize your particular brand of photos.

    I don't expect Microsoft to try to go into this way of innovation. They will probably wait until an entrepeneur develops it and then copy it or buy them out.

  19. Thumbnails don't scale! by ka9dgx · · Score: 2, Interesting
    The fact is that thumbnails need to be at least 200x200 pixels before you can really tell what's in the picture. Once you pass the first few thousand photos, it's no longer feasable to visually search through them... your brain starts to hurt!

    I store them by date photographed, using ThumbsPlus to view thumbnails and metadata stored in a database. So far, it's worked out for the 45Gb of photos I've taken in the past 5 years.

    --Mike--

    PS: Yes, I'll chat with and give ideas to anyone who wants to make this better... even Microsoft.

  20. Bad example by Lord+Kano · · Score: 2, Insightful

    For instance, how about if I wanted to search for "my wedding photos?" Neither X1, nor Windows XP's built in search would find your wedding photos. Why? Because they have useless names like DSC0001.jpg and there's no metadata that says they are wedding photos.

    You know the date of your wedding right? If not, don't let your wife find out. You can search for jpegs taken on a certain date.

    As you previously said...

    When I take pictures off of my Nikon, they have some metadata (for instance, inside the file is the date it was taken, along with the exposure information)

    It isn't exactly rocket surgery. :)

    LK

    --
    "Hi. This is my friend, Jack Shit, and you don't know him." - Lord Kano
  21. EXIF headers in .jpg files contain the metadata by Rick+Richardson · · Score: 4, Informative
    The metadata is there in the .jpg file itself...
    $ cd ~/photos/100nikon
    $ exifcomment dscn*.jpg | grep -y birthday
    dscn0503.jpg 2001/06/02 14:40:33 Abby's 6th Bday: The birthday girl
    dscn0713.jpg 2001/10/19 19:38:33 Dylan's 8th birthday
    dscn0714.jpg 2001/10/19 19:38:47 Dylan's 8th birthday cake
    dscn0715.jpg 2001/10/19 19:39:15 Dylan's 8th birthday - making a wish
    dscn0718.jpg 2001/10/20 10:08:56 Dylan's 8th birthday - Lego construction
    dscn0719.jpg 2001/10/20 10:09:20 Dylan's 8th birthday - Lego construction
    [snip]

    I once made the mistake of working with these files under Windoze. After I was done, all the EXIF information had been removed. You can imagine how mad I was.

    So what is Microsoft going to do? Fix this bug and call it a feature?

    -Rick

    1. Re:EXIF headers in .jpg files contain the metadata by ka9dgx · · Score: 4, Insightful
      That's a good start, far better than most.

      If Dylan had a more common name, like, uhm...Mike... the value would go down. What would you do to include the names of the other 4 people in the photo? How do you link it to Dylan's other photos, etc?

      --Mike--

  22. Meta data is seductive, but its a fools method. by Anonymous Coward · · Score: 4, Insightful

    "Why? Because they have useless names like DSC0001.jpg and there's no metadata that says they are wedding photos."

    Metadata is a stupid concept. It puts the cart before the horse. Files should not have to 'know' about themselves, they are not objects.

    You have to treat files as just files, their names are nothing more than identifiers, their contents are nothing more than contents.

    By all means its possible to build a great search capability into a filesystem, but you need to build the 'meta' data _outside_ the file.

    A system built on file metadata is doomed to be incompatible with anything but the latest datatypes designed for it.

  23. MS need to (un)fix their Find program... by ejaw5 · · Score: 4, Insightful

    Used to be if you wanted to find a file real fast under windows, you'd hit WINDOWS+F to pullup the find window, enter in your search query, and Go.

    Now if you're in front of an XP machine and want to find say...all the pictures on the system you can't just enter in "*.JPG" anymore. You have to read what some animated dog is asking you, click on one of the options before you get to the search query window, then enter in the query. doesn't sound like much of a hassle, but it IS an extra step.

    --

    $cat /dev/random > Sig
    1. Re:MS need to (un)fix their Find program... by Photon01 · · Score: 4, Informative

      This dosent need fixing .. it obviously didnt annoy you enough to make you look for the option to turn it off.

      Hit Windows+F, click change preferences, click I want to search without an animated character.

      Next click preferences again go to 'change files and folders search behaviour' then click advanced

      Voilla, the find program is (un)fixed

  24. That's the app you were using, not Windows by blorg · · Score: 3, Informative
    "I once made the mistake of working with these files under Windoze. After I was done, all the EXIF information had been removed."

    I use Windows (duck) and it preserves my metadata fine.

  25. Human entry errors are THE problem by blueworm · · Score: 5, Insightful

    "Neither X1, nor Windows XP's built in search would find your wedding photos. Why? Because they have useless names like DSC0001.jpg and there's no metadata that says they are wedding photos."

    Metadata will NEVER improve searching in this way unless the things that generate the content FORCE you to put it in before they can snap pictures, etc...

    Even if people were forced to put metadata into all their files there is a big chance that typos and other errors in entering the info would occur. This will make the metadata totally useless in a search!

  26. Re:Renaming files by ka9dgx · · Score: 4, Insightful
    Renaming files doesn't work... how many times have you had to search using the "containing text" features of Windows (or some other OS)?

    It's saved my bacon more than once. As we move away from text, we become completely dependent on metadata to find things. Standards for metadata need to be settled soon, or Moore's law means our computers will become less and less useful.

    --Mike--

  27. More feature creep by krygny · · Score: 3, Interesting

    How many people have trouble finding files on their hard drive using the most basic search criteria. People who are so unorganized as to lose files on their hard drive are probably not sophisticated enough to use advanced search methods successfully.

    --
    Research shows that 67% of those who use the term "research shows", are just making shit up.
  28. Re:Renaming files by glpierce · · Score: 4, Insightful

    "As we move away from text, we become completely dependent on metadata..."

    Exactly what do you think metadata is? This system would require more text than current. At present, you can rename the files and put them in folders, which works quite well if you have any organizational ability. Metadata would require dozens of unrelated pieces of info be input, and the a more complex retrieval (search) process would be required. While metadata standards are important, it's only advanced users who will be using them. How many "typical" users do you know that are going to search for a photo by the F-value?

    And for the record, I've never used the "containing text" search, because I name files in unambiguous ways.

    --
    G
  29. Great ! Just what we were missing... by catyoul · · Score: 2, Interesting
    But, contacts in Outlook can't be used by other applications (...) By putting a "contacts" file type into the OS itself, rather than forcing applications developers to come up with their own contacts methodology. What if ALL applications, not just Outlook, could use that new file type? What if we could associate that file type to social software services like Friendster, Tribe, Yahoo's personals, or Google's Orkut? Would that radically change how you would keep track of your contacts? Would that make contacts radically more useful?
    Would that make worms spreading even better ? ;)
  30. Find images like this by unoengborg · · Score: 5, Informative

    In KDE you can allready select an image file and say select "Find similar images". provided you have indexed your images using GIFT (Gnu Image Finding Tool)

    You can search images both in your own GIFT database and databases on the internet.

    So to solve the wedding photo problem you could make a drawing similar to your photos and search for similar images.

    --
    God is REAL! Unless explicitly declared INTEGER
  31. My Bachelor Party by bitflip · · Score: 3, Insightful

    This is more useful than it would seem. I've read a bunch of posts that talk about keeping the wedding pictures in a folder called "Wedding", and that's the extent of the organization.

    Except it doesn't work that way. If I dig around a little, I see that I have the same images in several places: in the folder called "Vacation", another folder called "work" where I did some touchups, another folder called "staging" where I laid things out before putting them on the server, and again on the server, where my family can view them on the web.

    If I follow the suggestion of putting them all into a single folder, then I've created a logistical headache. The _only_ thing I've gained is the ability to find all the files at once. Using metadata, I would no longer have that restriction - I could put files where they made the most sense, and still find all the files at once.

  32. MS already does this and nobody uses it by Pointy_Hair · · Score: 3, Interesting

    Ever look at the properties page of an MS Office file? There's enough metadata tags in there to keep you busy for hours.

    Does anyone really fill those in? Rarely.

    Is there a method to search on them? Never looked.

    Sometimes it's interesting to browse the properties page to see who really created a spreadsheet or document. For example, people who shamelessly "borrow" templates from former employers and either aren't smart enough or too lazy to do just a little clean up. But that's about it.

  33. M$ search technology by stefan999 · · Score: 5, Funny

    Ever tried to search for Xfree86 on search.msn.com?

    Stefan

  34. Re:Search by date? Search by Face! by Giltron · · Score: 2, Informative

    I was recently at a longhor demonstration at my university and Longhorn had facial recognition software built right in. You could specify the name of the person by face and then all the photos with that person were searchable on your computer.

    Also by default at the presentation there was a search bar built right in to the default desktop.

  35. ACDSee has had this for years by 1u3hr · · Score: 3, Insightful
    they have useless names like DSC0001.jpg and there's no metadata that says they are wedding photos.

    ACDsee, a well-known and, at one time, free, image viewing and organising app, supports metadata. It puts it in a "descript.ion" text file in each directory. This is an ancient DOS standard. It's still supported by a few Windows apps, notably the Far manager (a shareware clone of Norton Commander for Win) and ReGet, a downloader; both Russian.

    In fact I find the "descript.ion" metadata so useful I stick with apps that use it. At my last job, a web news site, I organised out image library using ACDsee and this metadata to add notes. ACDsee also has a nice batch rename.

    No need to invent a whole bloody new file system to find your wedding photos.

    1. Re:ACDSee has had this for years by WuphonsReach · · Score: 2, Informative

      ACDsee, a well-known and, at one time, free, image viewing and organising app, supports metadata. It puts it in a "descript.ion" text file in each directory

      Correction, it used to.

      ACDSee 3.1 was quite intelligent, if it found DESCRIPT.ION files, it would automatically load the comments in. When you moved files to another folder, it would automatically create a new DESCRIPT.ION file in the destination folder.

      Nice and simple, ne?

      But, oh no, that's not good enough for the bright folks at ACDSee. Instead with version 6, you have to manually tell ACDSee to import information out of a DESCRIPT.ION file and manually tell it to export information to a DESCRIPT.ION file.

      Not so nice and very complex, ne?

      --
      Wolde you bothe eate your cake, and have your cake?
  36. No, no, no... by maharg · · Score: 2, Insightful

    Cripes. You'd had thought that a company as big as Microsoft would have considered a better way, but no.
    from the article, Microsoft's Robert Scoble:
    But, WinFS goes further than X1 and other file search tools do today. It lets you (and developers of apps you'll use) add metadata to your files. So, even if you don't change the name of your files, you might click on one of the faces in a picture application and get prompted to type a name and occasion. So, you would click on your cousin Joe's face, type in "Joe Smith" and "Wedding."

    So Microsoft, who have sold many more graphical interfaces that anyone else on the planet, require you to "type in" Joe Smith for each and every photo of Joe you have !

    Oh, sure, there'll be a dropdown list, but it'll surely list every last irrelevant person and topic you ever defined in WinFS.

    Instead consider the following scenario: -

    You've uploaded your latest batch of photos from your camera to your PC and have them in thumnails view in a file manager of your choice. -

    Now you want to add your metadata, so you open up your "Meta topics" folder and select a number of graphical icons representing the subject matter of your photos, e.g. "Wedding", "Uncle Jim", "Mary-Jane" and some others. You then drag'n'drop these into a "Scratch" folder and close the "Meta topics" folder. So you now have the freshly-uploaded photos, and the relevant meta topics. -

    Now select all the photos in the folder - they're all wedding photos, so drag'n'drop 'em onto the Wedding topic icon. -

    Now select the photo of Uncle Jim staggering across the reception with a pint of special, and .. you got it .. drop it on the Uncle Jim topic icon. -

    Now the picture of Mary-Jane in her wedding hat - yeah, that's it baby - drop it on the pretty icon.. -

    Now you can access all the Wedding photos by clicking on the wedding icon all the pictues of Uncle Jim by clicking the uncle jim icon and so on. -

    There's even an interface to combine filters, e.g. Wedding AND (Uncle Jim OR Mary-Jane), simply by dragging and dropping the icons onto AND and OR icons in a cumulative fashion.

    Now you can do all of this (bar the interface combine filters interface) TODAY, albeit in a fairly crude way, with a file system that supports symlinks (such as ext3), and a graphical file manager (say, Rox-FILER..). And here is my claim to prior art in respect of this "graphical metadata manipulation" concept. Of course, I had to hold down Shift+Ctrl to make it do the symlinks when I dropped the photos on the relevent icon, which a proper interface wouldn't require. Also, a posix filesystem is not as elegant as say, a relational database for the purposes of storing the metadata. But hey, not bad for 5 minutes work. How long have Microsoft been working on this exactly ?

    --

    $ strings FTP.EXE | grep Copyright
    @(#) Copyright (c) 1983 The Regents of the University of California.
  37. metadata by period3 · · Score: 2, Informative

    I don't see how adding 'metadata' is going to help. If people are unwilling to give their files meaningful filenames or organize them in directories, then how can they be expected to provide properly describe their data?

    An interesting article that addresses this and several other points is here.

  38. 0 Hits for Linux by f0rt0r · · Score: 2, Funny

    Microsoft Search Engine Clippy - "I found 0 matches for 'Linux', maybe you meant to search for 'Microsoft Windows XP'?"

    --
    I can't afford a sig!
  39. Re:Renaming files by Anonymous Coward · · Score: 2, Insightful

    Folders are annoying, though, because (on windows) files can only be in one folder at a time. Unix has links, but even then -crucially- you can't introspect on a file to find out what folders it is in. What is needed is a relational filesystem, where the "directory" structure is _equivalent_ to metadata. I can look for a file in directory mp3, or I can look AT a file and see that it is in directories mp3, kittymonkey.

  40. Good Things are happening with thumbnails by Z-MaxX · · Score: 2, Informative
    PhotoMesa is a program that uses algorithms to automagically lay your images out in categories and, using a zoomable user interface you can zoom in on the collection at various levels. Just rename your images to describe each image, and images with similar key words are laid out near each other.

    I used PhotoMesa before they wanted money for it, but you can still download a free trial. It's written in Java "but" it is well-written and feels very fast.

    There's an article on Sun's Java website about PhotoMesa.

    --
    Dr Superlove 300ml. I use my powers for awesome
  41. Define 'metadata' by inkswamp · · Score: 3, Interesting
    Because they have useless names like DSC0001.jpg and there's no metadata that says they are wedding photos.""

    Doesn't storing your photos in hierarchical folders labeled appropriately count as metadata? I know it's not very flexible or powerful, but it's metadata of a sort. Store your wedding photos in a wedding folder in a photos folder.

    Now, if you're talking about a database of metadata about files, then that's something else.

    --
    --Rick "If it isn't broken, take it apart and find out why."
  42. calendar based metadata by chocolatetrumpet · · Score: 2, Interesting

    Maybe the photo software could check with your calendar, see that a certain date/time was "my wedding," and assign that metadata to photos as they are downloaded. Most photos already have time/date metadata.

    --
    Spoon not. Fork, or fork not. There is no spoon.
  43. meta data by WhatsAProGingrass · · Score: 2, Interesting

    Yeah, I can't wait to download stuff from the internet full of their own meta data. Isn't it true that search engines are not using meta data as much cause of false data? The OS having its own contacts list might seem like a good idea, but i can see many people trying to hack into it and mass mail all your friends.

    --
    Mark
  44. Re:Renaming files by Malcontent · · Score: 2, Insightful

    I have given up even trying to use MS search on my hard drive. First of all it frequently desides that I have nothing on my machine. In other words whenever I search for something it instantly decides that it's not there. The only cure is to turn off indexing and delete the database files. Then it takes an half an hour to search for one filename.

    I have taken to saving things on a samba share just so I can use locate and grep.

    --

    War is necrophilia.

  45. I think Microsoft should... by CeZa · · Score: 2

    concentrate on getting MIME data identified first. The entire operating system relies on ".JPG" to find pictures to begin with. Internet Explorer has no clue what to do with a file unless it has a "proper" extension.

  46. Apple's Solution by JohnsonWax · · Score: 3, Interesting

    Apple has a solution to this, which has trade-offs, but seems pretty functional.

    Essentially, each of their iLife apps is a replacement for the Finder. Do we really need music search integrated with file search? Or is it sufficient to build independent metadata (ID3) and filestructure (playlists) just for music. That's really the brilliance of iTunes in that it never takes you back to your HD filestructure. You can even ask it to maintain the HD filestructure to reflect the metadata structure, so it'll keep everything in an artist/album/song structure, naming things as needed.

    iPhoto is set up the same way, but it's pretty apparent that the iPhoto guys are the 'B' team, since they haven't gotten it nearly as slick as iTunes yet, but it also has the equivalent of content metadata, playlists, and smart playlists. So, yes, I can easily find my wedding photos. The trade-off is that you can't search for 'Wedding' in the Finder and get wedding photos, wedding songs, etc. Maybe that's upcoming, but I'm not totally convinced of the value.

    The iTunes organizational structure does carry into iPhoto, so if you want to select a song for a slideshow in iPhoto, you can see your iTunes playlists, and filter against metadata. It also carries into iMovie, etc.

    Other posters have clearly identified the problems with metadata. File organization is generallly only useful if you are willing to symlink across all of your metadata, otherwise your photos of you mom and your wedding photos are disjoint, since some should be in both places. The single biggest problem with metadata is putting it in to begin with. iPhoto now allows you to do that during photo import - using a slide-show type UI.

    I think MSs tendency to do everything in one place is interesting, but tends to not come off so well. Having everything in SQL could eliminate one of the shortcomings in Apple's implementation which is that they need to maintain an XML intermediate structure for music files, photos, etc. While somewhat handy, it's main function is to join file metadata and the FS, which means that it is somewhat fragile.

  47. Put a GPS in the camera by Animats · · Score: 2, Interesting
    The right solution is to put a GPS receiver in the camera and tag photos with time, date, and location. No user action is required at picture-taking time. Ricoh is already selling such a camera in Japan. Kodak has a camera that plugs into an external GPS, but that's too clunky.

    Pros would love this; often you want to search some big image archive for pictures of a specific location. Tourists would find their photos self-organizing.

    Lookup can then be by address, or using a map or globe. Think MapQuest.

    This offers the possibility of a new (and totally legitimate) peer-to-peer application - location based picture-sharing. See the pictures others took of tourist locations.

  48. Obligatory Dilbert reference by zerofoo · · Score: 2, Funny

    PHB: I need you to make this so simple my mother could use it.

    Alice: It's already so simple a squirel could use it. How much dumber is your mother?

    -ted

  49. iPhoto - The Application Paradigm by Slur · · Score: 3, Interesting

    I think it's cool that Microsoft is taking cues from the iApps - interesting that they want to integrate it so much into the operating system. Whereas so far Apple is stressing an application-centered solution on top of a more general-purpose filesystem, Microsoft is getting deeper into the integration game, getting into file metadata a la BeOS, and tracking files according to thematic relevance a la relational databases.

    If the "smart desktop" idea catches on it will be interesting to see the response from developers on Mac OS X and Linux, as far as offering intelligent activity tracking. Somehow I see a twisty maze of documents and activities, all alike.

    Should operating systems do all the work of organizing users files for them, concealing the filesystem behind a database veneer, or behind a purely task-oriented veneer? Should this kind of thing be left to application developers, like the maker of Path Finder?

    Wouldn't Windows be more useful if it was a truly modular system that could be configured simply by stripping away unwanted components? Isn't that what makes Darwin so healthy in the enterprise market today?

    --
    -- thinkyhead software and media
  50. Re:Slashdot luddites by Crypto+Gnome · · Score: 2, Insightful
    The trick is incorporating it into the file system mean you don't have to reinvent the wheel.

    Actually no, the trick is incorporating it into the filesystem LOCKS YOU INTO USING THAT FILESYSTEM EXCLUSIVELY.

    Your metadata would no longer relate to an image, it relates to a particular file on a particular filesystem. If you copy that file with anything other than Microsoft Magic Filesystem Aware Software (like, I dunno, a 3rd party FTP client) and your precious metadata goes up in smoke faster than "medicinal" marijuana.
    1. Move MetaData from files and into FileSYSTEM
    2. PATENT said concept and technologies
    3. and ONLY permit said FileSYSTEM on "Windows"
    4. refuse technology licenses to software deveopers
    5. Encourage users to meta-encode their files, without explaining the complications
    6. time passes , users get addicted to havng metadata
    7. Users discover their files are almost useless without Windows
    8. Microsoft appoints Billy G President of the Corporate Empire of America.
    --
    Visit CryptoGnome in his home.
  51. PC based Google search application by Twister002 · · Score: 2, Interesting

    What I'd like to see come out of Google, is an add in that will categorize and search my local drives using the Google search algorithm. They have Google appliances that businesses can buy and use internally. I'd like to see a home based, and home priced, version of that application. Maybe have it search the internet as well, present the results separately. So if I'm looking for a file containing the words "efficient search keywords" (or something like that) it shows me files in my local system (including network shares maybe) as well as results on the internet.

    --
    "For a successful technology, honesty must take precedence over public relations for nature cannot be fooled." -Feynman
  52. Doctorow's "MetaCrap" by bruceoberg · · Score: 2, Informative

    Cory Doctorow has a great analysis of why most metadata schemes become less and less useful. This is also where he described the ebay "Plam Pilot" phenomenon that the NYT picked up on a little while back.

  53. file formats by ShadowRage · · Score: 2, Insightful

    what will this meta data do with older windows versions or other operating systems?
    will it lock them out from viewing longhorn only data?

    you better bet the farm.

    microsoft isnt only locking in the bios, they're locking everything in.

    it's like if you read the libjpeg readme, jpeg isnt a concrete format it can be icompatible with itself.
    and microsoft will possibly create data on each file that their software will produce and any other programs on the system, or older windows versions or any other system will be locked out from viewing those file besides let's say....Microsoft photoready. (wouldnt be surprised if they came up with that cliched name)

    personally, this winfs search crap is another front for lockIN, and really, I hope people react the way microsoft expects them not to do, flock to an alternate OS

    not to mention once programs like photoshop need to use this technology to run on windows, and it would hurt their macos support, not to mention they'd have to pay royalties for it, I think Adobe would start shifting away, and many gaming companies would prolly start flocking as well, because they dont want to put up with this shit, I think atm, the way XP crashes (where if you need to poweroff the system, you cant because XP controls the power on and off of the system) is about how far people are gonna take it.

    Teacher next to my third period class has had it with his current XP set up because it just keeps getting viruses that cripple the system, and he spendsmore money and time repairing it than actually being able to use it.

    and microsoft wants to lock us into this sort of bullshit? riight, once I told him about linux, he started sound a bit interested in it.

    You show people something they havent seen before, plus it being stable, and showing general advantages, they will use it. most people are trained to think you have to pay for quality.

    hell, this woman was selling a bunch of computer parts for 5 bucks yesterday (got me a nice monitor, hehe) and she said that these old games she had for her children no longer worked on XP, so she just got new ones.

    reason people havent flocked over to linux is because it's completely new to them and their neighbor doesnt use it, but your typical grandmother or parent or average joe will get it as long as it works garaunteed. to your more geekcentric user, it takes more convincing.

    but to get back onto my original point, microsoft is gambling with this metadata crap.
    it's either going to make or break them.
    not to mention if another manufacturer offers support for older windows systems and linux and alternatives, epople would gladly flock to that instead of this new system which doesnt work well with their older stuff. though then again, they might just flock to it because it's the latest thing and they're willing to change. this is something I think all desktop linux companies need to keep in mind.

  54. Re:Why not use an available metadata standard by harmonica · · Score: 2, Insightful

    These standards are already available, are already being used in cameras and imaging software,
    and are documented well enough that support can be implemented into your open-source imagebrowser or other app.


    Unfortunately, they do not all support the kinds of information you may want to store. EXIF may miss a feature, JPEG2000 another one. The smallest common denominator is probably not desirable.

    Besides, certain formats do not support metadata that well, or at all. But you may be forced to use those formats anyway. So data has to be stored outside of the file.

    The most obvious shortcoming - it's not feasible to search all files for a certain query. The metadata has to be cached in order for the query to be answered fast. That caching can (and should) be done in some sort of database. If it's on the file system level, all applications can profit from it.

    The idea is fine (although not that new, really). It's up to MS now to deliver a good implementation for their OS.