Photosynth Demo
A couple of days ago Microsoft labs released a demo of their new Photosynth software on the web. Photosynth allows the aggregation of social picture networks (a la Flickr) into a completed image in addition to allowing a level of depth to image browsing previously unavailable. There is also a very impressive video of the demo available.
Better Link to the video demo.
http://www.ted.com/index.php/talks/view/id/129 Click Here
The link that the GP posted was to the same video, but it splits the presentation into three parts (Advertisement, Enter Seadragon, The Photosynth Experience) that you can easily skip between. The good part starts at about 30 seconds into the clip.
This lets you take all sorts of pictures of your room, and will automatically assemble them into a 3D environment. It will assemble your photos to look like an RPG, instead of a slideshow.
Using the example in the video...there are hundreds of online collections of people's photos of Notre Dame cathedral. Each photo is of a different part of it, from a slightly different angle.
This software takes all those different photos and assembles them into a 3D representation of Notre Dame cathedral, where you can look at any of the individual photos.
In addition, if someone identifies one of the saints in a statue on the cathedral, when you take a photo of it and your photo is added to the collection with the software, your photo will also have that saint identified--thereby enhancing the data contained in your photo.
- RG>
Hey pal, this isn't a pleasantforest, so don't waste my time with pleasantries!
Here is a better link to the video demo. Pretty Amazing stuff
http://www.ted.com/index.php/talks/view/id/129
This system was demoed a while ago, I think at siggraph. There are some videos on the original university of washington PhotoTourism page.. Also here's a repost of the video on youTube.
Also there's microsoft's page, which has the demo (I don't think that's new either). It seems to have some longer videos
Non-newness and marketing hype aside, this software is frickin' awesome. It lets you view and tag photos organized in a 3D environment that reflects where the photos were taken. It should be particularly useful once cameras have GPS built in.
I imagine the reason the software is still in the demo phase is because it's very difficult to take a large number of photos and reliably figure out where they were all taken from. For the demo purposes, Microsoft probably hand corrected a lot of the placements. Even so, everyone I've shown this too thinks its often (even non-slashdot readers!)
I decided wade through the hype/ads/blah, and came across a really cool piece of software. It takes thousands of flickr images stitches them into a 3-dimensional mosaic, all just through software. No special on-site 3d imaging hardware, just a program compiling everyday images of something. It does this through some very advanced image recognition. If you can brave the ads, it IS worth it.
The project was demonstrated on the Research Channel at the beginning of the year.
Microsoft bought out a company that had written the non 3D part of Photosynth and student(s) at the University of Washington wrote the rest if I remember correctly. At the time they didn't work for Microsoft.
Right here.
Animoog.org
The people responsible for creating the intro (TED) are just the people responsible for giving the presenter a forum to share their ideas/technologies, don't let it color your impression of the rest presentation or the technology itself too much. The same brief advertisement is used across all the videos hosted on the TED web site, for all speakers, some of whom include Al Gore, Bill Clinton, Richard Dawkins, Bono, Peter Gabriel, Jane Goodall, Ray Kurzweil, Sir Martin Rees, Michael Shermer and Craig Venter and in that context the intro isn't as over the top as it may at first seem, if you think TED is just all about showcasing new technological toys.
No, the demo is not rigged (and it's about 11 months old).
t / for a real application using them.
The whole thing is based on SIFT keypoints http://www.cs.ubc.ca/~lowe/keypoints/ . These are very powerful and work indeed as shown in the video/demo. Check autopano-sift http://user.cs.tu-berlin.de/~nowozin/autopano-sif
There is only a little problem, M$ cannot use SIFT commercially. The licence says "for research purposes only" and the US Patent 6,711,293, Asignee: The University of British Columbia protects SIFT.
No, it's far more advanced than that, as its recognition is able to match objects that are not directly from the same set of photos, or even all photos, some can be diagrams or drawings for example.
The part that blew me away is the SeaDragon technology behind the image/information scaling portion of things... now that is just incredible... check out a talk/demo at TED on March of 2007 by Blaise Aguera y Arcas of Microsoft, just amazing stuff.
You must not have seen the whole thing. The cathedral was assembled from images available from the internet taken by hundreds of different people and cameras.
This sig is exactly seventy characters long and a real waste of space!
I was one of thee engineers that worked on the first release Photosynth. It's a great team, and it was a super fun project.
I can tell you that we did not tweak any camera positions by hand. The only real "editing" we did was to eliminate pictures that just didn't correlate well, generally because they didn't have enough feature points in common with the rest of the photos. We didn't tweak any camera positions, but the camera positions (i.e. the locations of the orange camera frusta when you have frusta turned on) are a best estimate, which is subject to some error. Same goes for the projection planes.
What's great about Photosynth is that from the perspective of anyone outside the computer vision community, it appears to be magic. Enough so that lots of the blogosphere was convinced that we somehow "authored" the 3D point clouds. Nope. It's more or less an automatic (albeit somewhat prolonged) process. The hard work is done as a big preproceess, then the client consumes largely precomputed data.
It'll be cool to see Photosynth in action in BBC's upcoming How We Built Britain piece that was announced on Live Labs today.
I did a video interview about Photosynth a while back which is targeted at a non-technical audience but still might be of interest. (And I wrote the music for the original video at Live Labs.)
Microsoft didn't buy Photosynth. It bought Seadragon. The Photosynth client is indeed built on Seadragon's client, but the idea behind Photosynth (which was a joint University of Washington/Microsoft Research project called PhotoTourism) significantly predated the Seadragon acquisition, and there was a working client. When Microsoft decided to reimplement the client as a technology preview, that's when the Seadragon team and client came into the picture.
That said, Seadragon's technology is great. It's a fantastically smooth way to browse arbitrarily large images or collections of images, and it was a good acquisition indeed.
(I was on engineer on the Photosynth team.)
PhotoSynth was previewed and available months and months ago, like a year almost.
6 /06/windows-live-photodoom-alpha-silverlight-power s-new-microsoft-live-labs-project.aspx
The real news story today is about using Silverlight technology in a new Live project.
Today's MS story was about "Windows Live PhotoZoom". A set of features managing photos using Silverlight using some of the original PhotoSynth technologies.
http://www.liveside.net/blogs/main/archive/2007/0
Ya, PhotoSynth is a cool technology, but not exactly new, at least not today.
We know that SlashDot as the 'Faux' news of technology, but at least get the article at least '1%' correct, and skip the links with insane ads not related. People on here actually think they are MS ads.
Gee. Talk about selective reading. Why don't you post the entire thing?
"This demo software is provided for research purposes only. A license must be obtained from the University of British Columbia for any commercial applications. The sofware is protected under a US patent as listed below. This demo software is a research implementation, while the licensed software has been further optimized for speed and to provide a range of other capabilities. See the LICENSE file provided with the demo software."[Emphasis mine]
Dude, did you watch the video? The acquisition the guy mentioned was the first part - the zoom in and out and pan around lots of images. That was the "meh" part.
The cool part... the part where they constructed a 3D model of Notre Dame by using only photos from Flickr, well the Photosynth page says where that came from: "Photosynth is a collaboration between Microsoft and the University of Washington based on the groundbreaking research of Noah Snavely (UW), Steve Seitz (UW), and Richard Szeliski (Microsoft Research)."
The Online Slang Dictionary