Slashdot Mirror


Microsoft's Search Engine Plans

prostoalex writes "Andy Beal from SearchEngineGuide.com interviews Robert Scoble from Microsoft. Scoble tells the audience what current search technologies Microsoft is working on as part of its Longhorn/WinFS development as well as in the field of Internet. Scoble also discusses current problems with local drive and Internet searching, such as absence of metadata for a lot of files out there: "When I take pictures off of my Nikon, they have some metadata (for instance, inside the file is the date it was taken, along with the exposure information) but that metadata isn't useful for most human searches. For instance, how about if I wanted to search for "my wedding photos?" Neither X1, nor Windows XP's built in search would find your wedding photos. Why? Because they have useless names like DSC0001.jpg and there's no metadata that says they are wedding photos.""

37 of 407 comments (clear)

  1. I have a suggestion for em.. by panxerox · · Score: 4, Insightful

    Make it "Micosoft Search powered by GOOGLE". Then "maybe" it might function well. Also metadata needs to be created by the user, I aint gonna be entereing data on a keypad on my camera for every photo.

    --
    "It's so convenient to have a system where everyone is a criminal" - A. Hitler
    1. Re:I have a suggestion for em.. by lintux · · Score: 5, Funny

      > Also metadata needs to be created by the user, I aint gonna be entereing data on a keypad on my camera for every photo.

      That's why the next version of Microsoft Photo[tm] will send every photo you make to the Microsoft headquarters so they can add the metadata for you. ;-)

    2. Re:I have a suggestion for em.. by nolife · · Score: 5, Informative

      Google can already do a good job of finding wedding photos.. They may not be yours but they are wedding photos ;)

      --
      Bad boys rape our young girls but Violet gives willingly.
  2. Search by date by Saven+Marek · · Score: 5, Interesting

    I can get around searching for "wedding photos" because I remember the date. 3 special days, and hundreds of wedding photos appear.

    It's part of being human that we don't necessarily remember the phrase "wedding photos" but we may remember many other tiny pieces of data about a shoot that are unique to us, and the time and date are one of those. I can be certain the post 9pm photos done on those days are pretty embarassing.

    Just concentrating on "Wedding Photos" is useful if someone else is searching my picture archive, but that's not useful to me

    nude geekgrrls

    1. Re:Search by date by TitanBL · · Score: 4, Informative

      The new iPhoto handles this very well - automatically imports the date the photo was taken. Then you can group the images together in albums similar to the way you create 'smart playlists' in iTunes.

    2. Re:Search by date by Flakeloaf · · Score: 4, Funny

      Funny, I get around not being able to find my wedding photos by putting them all in a directory called WEDDING PHOTOS - THE PHOTOS OF YOUR WEDDING ARE HERE YOU TWIT

      --

      Am I the only one who heard Roxette to sing "I'm gonna get blitzed for some sex"?

    3. Re:Search by date by alangmead · · Score: 4, Interesting

      What if a system's search could draw information from all the applications within the system. For example, if your electronic datebook had a day long entry for "wedding" and the photo manager has photos taken on that date, then a search for "wedding photos" would be able to find out when the wedding was, match it up with the date the photos were taken, and come up with the appropriate set.

      To some extent Apple tried this with "Newton Intelligence" on the MessagePad. If you wrote "Thursday, Lunch with Bob at Redbones" It would (after you fixed all of its handwriting recognition mistakes) look up Bob and Redbones in the address book, look in the calendar for the next occurrence of a Thursday, and schedule a noon time appointment.

      Newton Intelligence really only amounted to a small set of interapplication tricks, but it was assumed that as the popularity of the units increased, the functionality would be extended. (which pretty much tells you what happened to it.)

  3. I'm not buying it by de+Selby · · Score: 4, Insightful

    For instance, how about if I wanted to search for "my wedding photos?" Neither X1, nor Windows XP's built in search would find your wedding photos. Why? Because they have useless names like DSC0001.jpg and there's no metadata that says they are wedding photos."

    That's why you can change filenames and organize things into directories.

    1. Re:I'm not buying it by Saven+Marek · · Score: 4, Interesting

      Even easier than putting into directories is using a portfolio type application, like Picasa (the original version of Apple's iPhoto btw) which allows simple drag and drop library creation. You can have pictures in multiple libraries, it just takes a small few moments to drop photos into their correct places and they are sorted as need be. If you want wedding photos, look in there if you want photos of janine, kate or benson look in their respective folders.

      It doesnt need to be a morass of embedded folder after folder either, as humans have mental acuity unlike a computer. You may have uncle bob who is photographed a lot and auntie beryl who isn't, but all the photos of beryl you may know will contain bob. We can store a surprising amount of information, and perhaps 5 to 10 libraries is all you will need for most peoples collections.

      Special occasions get their own. It just takes moments after downloading the photos.

      nude geekgrrls

    2. Re:I'm not buying it by tsa · · Score: 4, Insightful

      But, if the industry works together on common WinFS schemas (not just for contacts either, but other types of data too), we'll come away with some really great new capabilities. It really will take getting developers excited about WinFS's promise and getting them to lose their fears about opening up their data types.

      If M$ would work together with the industry and open up its data types we would come away with some really great new capabilities. This is incredible: they want 'the industry' to do what they never do, and I expect they will succeed. Prepare for the even more total domination of Microsoft in the near future.

      --

      -- Cheers!

    3. Re:I'm not buying it by edalytical · · Score: 4, Informative

      I did a quick Google search and what I can tell iPhoto was introduced at MacWorld Expo January 7, 2002. Picasa was introduced in October 2002.

      --
      Win a signed Stephen Carpenter ESP Guitar from the Deftones: http://def-tag.com/?r=0008781
  4. Ouch by Anonymous Coward · · Score: 5, Funny

    X1, nor Windows XP's built in search would find your wedding photos. Why? Because they have useless names like DSC0001.jpg and there's no metadata that says they are wedding photos.

    Right, dude! The camera should automagically recognize that it's taking pictures of your wedding and include that info in the metadata!

    1. Re:Ouch by Otter · · Score: 5, Interesting
      Yeah, I'm sure being a computer science visionary is harder than it looks. But from the outside, all they seem to ever do is to announce that computer use is difficult because software developers aren't as smart as them, and that what we really need is some way for everything to magically sort itself out. Details of implementation to be left to those of less rarified brilliance.

      The closest thing to a workable scheme is Gelerntner's Lifestream stuff -- where your system knows that you got married on a certain date (even if you have trouble remembering it) and that documents (JPEGs, Word files, GNUCash transactions from that time probably pertain to it.

  5. Hmmmm... by Bowie+J.+Poag · · Score: 4, Insightful

    I think thats what "organization" is for. You place files like "DSC0001.jpg" in things called "folders", and then name the folder "Wedding" or something.

    I dunno.

    --
    Bowie J. Poag

    1. Re:Hmmmm... by khuber · · Score: 5, Funny

      I'm so stupid! I named all my folders like that too. folder0001, folder0002, ....

    2. Re:Hmmmm... by tomstdenis · · Score: 5, Informative

      you *can* do this to some limited sense with the command shell

      for %a in (DSC*.JPG) do rename %a Wedding_%a

      You just have to know a bit about the command shell...

      Tom

      --
      Someday, I'll have a real sig.
  6. Thumbnails by sreid · · Score: 5, Insightful

    What happened to thumbnails?

  7. I'm impressed by Anonymous Coward · · Score: 5, Funny

    This whole longhorn winfs thing seems like a big technological advance to me ...

    Manualy adding metadata to each of your 200+ wedding pictures looks so smarter than just creating an old fashioned directory "wedding pics" and moving them into it ...

    I can't wait to start using this wonderful FS

  8. Google by linuxci · · Score: 5, Insightful
    The way Google needs to compete is to show their users that there's no need for Microsoft.... Why? Because MS may just do its best to stop the Google toolbar working in IE for Longhorn. Microsoft have already 'innovated' an MSN toolbar that looks very similar to the Google offering.


    So instead of offering their official toolbar for IE only (the one for Mozilla is unofficial), start to slowly phase out the Google Toolbar and replace it with the Google Browser which would basically be a Google branded Mozilla Firebird. With all the features that make Firebird great like Tabbed Browsing, with the addition of the Google Toolbar features such as PageRank, etc. All on a cross platform basis.


    If people get used to downloading better browsers now, then they won't even notice when the next release of IE starts to reject the Google Toolbar.


    Let them know what you think

    1. Re:Google by HoneyBunchesOfGoats · · Score: 4, Insightful

      That would be very interesting. Hardly anybody apart from us geeks has heard of Mozilla or MozillaFirebird, but if Google rebranded it and put a little link on their front page, it would be exposed to millions every day... who wouldn't grab it and try it out? People trust Google.

  9. useful dir names by cr@ckwhore · · Score: 4, Insightful

    If users didn't suck so much, then descriptive dir names would easily solve the problem of trying to locate a wedding photograph on a hard drive.

    So what, the image file is named "DSC0001.JPG" -- who cares. Put it in a folder named "my images" and there's no wonder you can't find it!! Put it in a folder named "wedding photos", and then you've got something there!

    The best way to describe it to the average joe (non)user is that directories/folders are analogous to folders in a filing cabinet. Would you file telephone bills, for example, under "mortgage" or "telephone"?

    Thanks Microsoft for "my photos", and "my documents", and the like. We appreciate it!

    --
    Skiers and Riders -- http://www.snowjournal.com
  10. Check out Phil Greenspun's similar idea by Speed+Racer · · Score: 4, Interesting

    Phil Greenspun has a similar idea and is looking for help on how to accomplish this on a personal level with existing the Windows XP filesystem. Check out his blog post for details. There's already an intersting discussion taking place in the comments for that post.

    --
    Free Mac Mini. Yes, I'm
  11. sharing proprietary formated data by stonebeat.org · · Score: 5, Insightful

    I hope the industry sees the opportunities that Longhorn's WinFS opens up. We can either work together and share data with each other, or we can be afraid and keep data to ourselves.

    Share data? with whom? how can you share data that is in either proprietary format or "patented XML" ???
    It is following the OpenStandard that will help in "working together and sharing data".

  12. What? by djupedal · · Score: 4, Insightful

    Why? Because they have useless names like DSC0001.jpg and there's no metadata that says they are wedding photos.""

    You mean to say you don't know the date you got married? You're in trouble.... iPhoto on OS X at least breaks them out into folders according to either last imported and/or month/year etc.. You're responsible to breaking them down further, in which case you don't search the entire drive later, you simply open iPhoto and take a short trip to your wedding folder, just like having a folder in a drawer in a cabinet in your home.

    It's not really that hard, now is it? if you're dropping any files onto your drive randomly, the issue is with your basic housekeeping, not that a top level search tool seems blind to your target.

    You're talking about EXIF, and the list of data there is long. Why you took the picture isn't part of it, and if you want the camera to interpret which part of the subject matter is root (noses..faces...age...sex...background..,trees...d aylight...horizon?), you've got more issues than just locating a particular photo.

  13. Microsoft Clientside Search:Road Signs for Spyware by NZheretic · · Score: 4, Insightful
    Given the history of Microsoft's security flawed implementation and design, any such client based search engine implementation would inevitably become a set of road signs for spyware, virus and worm malware developers.

    Would you want to trust your private data, gathered from govenment departments, purchases and financial transactions etc, being accessed by such a system run by any old govenmental or business agency?

    How about your private correspondence on friends and acquaintances home computers.

    Microsoft culled the URL name:password@ functionality from Internet Explorer because it claimed it could not create a secure enough fix, yet in the same month, it yet again proposes a privacy nightmare such as this? Madness.

  14. WinFS sounds promising, but... by Teckla · · Score: 4, Insightful

    WinFS sounds promising, but unless Microsoft makes the WinFS specs open and free, it'll be yet another lock-in technology, which would be very dissapointing.

    Adding metadata to all your files would require a lot of time and effort, and if it's a closed technology, it'd be yet another reason people wouldn't want to even attempt switching to another OS. I can almost hear it now...

    "This other OS looks cool, but I've spent so much time adding metadata to all my files, and I can't export that metadata to this other OS because the format is proprietary and patented... I'd better stick with Windows, switching OS's would be too hard..."

    Sorry, someone had to state the blatantly obvious. As usual, all promising technologies coming out of Microsoft are poisoned. And most people don't even realize it. Not even intelligent people. Most .NET developers don't even realize that .NET's so-called "standardization" via ECMA doesn't really make it an open standard (lots of the "standardized" .NET technology is encumbered by patents).

    -Teckla

  15. Adding metadata is not the way by Flyboy+Connor · · Score: 5, Interesting
    Scoble's idea is that you will add metadata to your files. Can you imagine? You have literally tens of thousands of files you created (photos, documents, etc.) on your hard drive and you are going to add metadata to all of them? Does he really think people are going to do that? If they would be willing to do that, they would just rename those photo files from "DSC00001.JPG" to "MyWedding00001.JPG".

    Judiging from the interview, the "innovative" Longhorn seems to allow you to add metadata in a slightly user-friendly way. But virtually nobody will use it, except maybe to mark a few important files which you have stored in a special place anyway.

    So what would be a better solution then? My idea is that metadata should be added automatically. For instance, a human will recognize most wedding photos for what they are. Getting a computer to recognize this is not trivial, but lots of research is currently invested in this. Already computers can easily recognize general categories ("groups of people", "nature", "animal", "portrait"). My guess is that it is already possible to implement a system that you can train to let the computer recognize your particular brand of photos.

    I don't expect Microsoft to try to go into this way of innovation. They will probably wait until an entrepeneur develops it and then copy it or buy them out.

  16. EXIF headers in .jpg files contain the metadata by Rick+Richardson · · Score: 4, Informative
    The metadata is there in the .jpg file itself...
    $ cd ~/photos/100nikon
    $ exifcomment dscn*.jpg | grep -y birthday
    dscn0503.jpg 2001/06/02 14:40:33 Abby's 6th Bday: The birthday girl
    dscn0713.jpg 2001/10/19 19:38:33 Dylan's 8th birthday
    dscn0714.jpg 2001/10/19 19:38:47 Dylan's 8th birthday cake
    dscn0715.jpg 2001/10/19 19:39:15 Dylan's 8th birthday - making a wish
    dscn0718.jpg 2001/10/20 10:08:56 Dylan's 8th birthday - Lego construction
    dscn0719.jpg 2001/10/20 10:09:20 Dylan's 8th birthday - Lego construction
    [snip]

    I once made the mistake of working with these files under Windoze. After I was done, all the EXIF information had been removed. You can imagine how mad I was.

    So what is Microsoft going to do? Fix this bug and call it a feature?

    -Rick

    1. Re:EXIF headers in .jpg files contain the metadata by ka9dgx · · Score: 4, Insightful
      That's a good start, far better than most.

      If Dylan had a more common name, like, uhm...Mike... the value would go down. What would you do to include the names of the other 4 people in the photo? How do you link it to Dylan's other photos, etc?

      --Mike--

  17. Meta data is seductive, but its a fools method. by Anonymous Coward · · Score: 4, Insightful

    "Why? Because they have useless names like DSC0001.jpg and there's no metadata that says they are wedding photos."

    Metadata is a stupid concept. It puts the cart before the horse. Files should not have to 'know' about themselves, they are not objects.

    You have to treat files as just files, their names are nothing more than identifiers, their contents are nothing more than contents.

    By all means its possible to build a great search capability into a filesystem, but you need to build the 'meta' data _outside_ the file.

    A system built on file metadata is doomed to be incompatible with anything but the latest datatypes designed for it.

  18. MS need to (un)fix their Find program... by ejaw5 · · Score: 4, Insightful

    Used to be if you wanted to find a file real fast under windows, you'd hit WINDOWS+F to pullup the find window, enter in your search query, and Go.

    Now if you're in front of an XP machine and want to find say...all the pictures on the system you can't just enter in "*.JPG" anymore. You have to read what some animated dog is asking you, click on one of the options before you get to the search query window, then enter in the query. doesn't sound like much of a hassle, but it IS an extra step.

    --

    $cat /dev/random > Sig
    1. Re:MS need to (un)fix their Find program... by Photon01 · · Score: 4, Informative

      This dosent need fixing .. it obviously didnt annoy you enough to make you look for the option to turn it off.

      Hit Windows+F, click change preferences, click I want to search without an animated character.

      Next click preferences again go to 'change files and folders search behaviour' then click advanced

      Voilla, the find program is (un)fixed

  19. Human entry errors are THE problem by blueworm · · Score: 5, Insightful

    "Neither X1, nor Windows XP's built in search would find your wedding photos. Why? Because they have useless names like DSC0001.jpg and there's no metadata that says they are wedding photos."

    Metadata will NEVER improve searching in this way unless the things that generate the content FORCE you to put it in before they can snap pictures, etc...

    Even if people were forced to put metadata into all their files there is a big chance that typos and other errors in entering the info would occur. This will make the metadata totally useless in a search!

  20. Re:Renaming files by ka9dgx · · Score: 4, Insightful
    Renaming files doesn't work... how many times have you had to search using the "containing text" features of Windows (or some other OS)?

    It's saved my bacon more than once. As we move away from text, we become completely dependent on metadata to find things. Standards for metadata need to be settled soon, or Moore's law means our computers will become less and less useful.

    --Mike--

  21. Re:Renaming files by glpierce · · Score: 4, Insightful

    "As we move away from text, we become completely dependent on metadata..."

    Exactly what do you think metadata is? This system would require more text than current. At present, you can rename the files and put them in folders, which works quite well if you have any organizational ability. Metadata would require dozens of unrelated pieces of info be input, and the a more complex retrieval (search) process would be required. While metadata standards are important, it's only advanced users who will be using them. How many "typical" users do you know that are going to search for a photo by the F-value?

    And for the record, I've never used the "containing text" search, because I name files in unambiguous ways.

    --
    G
  22. Find images like this by unoengborg · · Score: 5, Informative

    In KDE you can allready select an image file and say select "Find similar images". provided you have indexed your images using GIFT (Gnu Image Finding Tool)

    You can search images both in your own GIFT database and databases on the internet.

    So to solve the wedding photo problem you could make a drawing similar to your photos and search for similar images.

    --
    God is REAL! Unless explicitly declared INTEGER
  23. M$ search technology by stefan999 · · Score: 5, Funny

    Ever tried to search for Xfree86 on search.msn.com?

    Stefan