Photosynth Team Does It Again

color by catbertscousin · 2008-08-14 01:06 · Score: 4, Interesting

The color matching section was quite impressive given the wide variety of lighting and color temp in the starting photos; if they wrote their own software to do that, it sure counts as R/D.

--
No good deed goes unpunished. - Avon, Blake's 7

Re:color by Gewalt · 2008-08-14 01:35 · Score: 4, Interesting

The color matching section was quite impressive given the wide variety of lighting and color temp in the starting photos; if they wrote their own software to do that, it sure counts as R/D.
AFAIK; adobe created the technology first in response to the needs of automation in the pornography industry. It seriously helped alot of "studios" color match the whole set just by having a wizard scan the pics and correct them all.

--
Modding Trolls +1 inciteful since 1999
Re:color by JCSoRocks · 2008-08-14 03:13 · Score: 1

Lol, why does everything have to be for the porn industry. It couldn't possibly have been because it'd be useful for any other group that uses large numbers of photos... Just about any photographer would find it userful... whether it's a wedding or a fashion or a sports photographer. I'm not attacking you personally, there are just so many insane, "well this format won because the porn people picked it" type urban legends that it gets a bit ridiculous after a while.

--
You are using English. Please learn the difference between loose and lose; they're, there, and their; your and you're.
Re:color by Gewalt · 2008-08-14 03:26 · Score: 4, Interesting

Actually, I was a rabid Adobe Forum troll when some self-declared porno studios started clamoring for the feature. The other people it would be useful for actually dismissed it, as they did not seem to think they wanted that step in their workflow automated. But once the feature was added, everyone seemed to appreciate it. Of course, adobe is not one to normally listen to and assimilate feedback, especially not from their forums, so that could have just been coincidence.

--
Modding Trolls +1 inciteful since 1999
Re:color by Bazman · 2008-08-14 05:11 · Score: 1

I guess someone doing colour matching on a porno movie might get a bit distracted. Computers don't have that problem...
Re:color by jfengel · 2008-08-14 06:25 · Score: 1

Why are porno studios particularly interested in the feature?
Re:color by Gewalt · 2008-08-14 07:48 · Score: 1

Why are porno studios particularly interested in the feature?
It's an extremely efficient (read: cheap) way of making a rather amateur shoot look like it was done by a professional.

--
Modding Trolls +1 inciteful since 1999

No sense to limit how many photos you take... by Max+Romantschuk · 2008-08-14 01:07 · Score: 5, Insightful

And THIS is why I tend to take huge numbers of photos and never delete any... Technology like this will account for easy geotagging, date I already have in the EXIF data, whereas people can be tagged with face recognition soon enough.

That done, I'll be able to navigate my tens of thousands of photos by asking for things like photos taken of the kids while outside at the cottage when they were 3 years old.

Also, remember to backup! :)

--
.: Max Romantschuk :: http://max.romantschuk.fi/

Re:No sense to limit how many photos you take... by Anonymous Coward · 2008-08-14 01:15 · Score: 1, Funny

Why not just use a video camera?
Re:No sense to limit how many photos you take... by fastest+fascist · 2008-08-14 02:48 · Score: 2, Informative

Geotagging is not that hard nowadays, assuming you have a device capable of creating a gps log and are using a digital camera (timestamps). Take the log, load it into gpicsync and let the program tag your photos for you. Just make sure the gps device and your camera have their clocks synchronized. I'm still waiting for a decent way to browse photos on a map, though - pretty much what you're looking forward to, I guess. Picasa lets you view geotagged photos in albums in google earth, but it's not much more than a gimmick. Something that would let you draw a circle on a map to limit your search to that area, then use a time slider to further refine the search and then allow keywords, face recognition once we get that, would be useful.
Re:No sense to limit how many photos you take... by ka9dgx · 2008-08-14 03:33 · Score: 5, Informative
Because a video camera is nowhere near the quality of a still image, still cameras will win for a number of reasons:
- Still Camera - less motion blur, if any
- Bigger sensor - less noise
- Focus mechanism - an SLR has a much better focusing mechanism
- Image Compression - almost all video codecs record a stream of images, and do not optimize the quality of an individual frame
- Exposure time - A still camera can take from 1/8000 second to 5 minute exposures for a single frame, as opposed to a fixed time of about 1/30 second for NTSC
- Aperture - A still camera can control the aperture to get desired depth of field
So, those are the ones I can think of off the top of my head.
Re:No sense to limit how many photos you take... by 3dr · 2008-08-14 04:27 · Score: 1

The Windows program TopoFusion (www.topofusion.com) will merge photos with GPS logs, and place them on a map. As you said, make sure the camera is synced to GPS time.
I've used TopoFusion for a few years and have been pretty happy with it (primarily used for cycling logs and making rough maps of trails).
Re:No sense to limit how many photos you take... by QuestionsNotAnswers · 2008-08-14 12:39 · Score: 2, Informative
If a poor photographer uses a cheap video camera - what can be acheived using image processing? In theory signal processing of a long enough video could:
- Remove blur using deconvolution - multiple frames can be used to reduce deconvolution noise sensitivity and multiple frames help calculate point spread functions. Of couse motion blur is also avoided in video cameras by using Image Stabilisation (adaptive optics are becoming common even in low end cameras - eg. TX-1).
- Multiple frames can be used to remove most forms of noise (and sensor noise in particular), remove specular reflections, remove obscuring objects (if can see around obstruction using different POVs), perhaps even help with removal of some shadows (maybe by moving point lighting while videoing).
- With enough frames and not far out of focus, DOF can be increased. Also normal focus hunting, or by using say CHDK to continually vary focus, can synthetically increase DOF in post processing.
- With varying F-stop post processing of a video stream could recover a higher dynamic range.
- Aperture - I don't understand your point - video cameras have aperture control too.
- Long exposure times - can be simulated via integration of multiple images - don't digital cameras do this internally anyway? Certainly the CHDK firmware does this to simulate exposure times over 65 seconds. Short exposure times can obviously be done with a suitable video camera.
- Compression artifacts - yeah - a real problem.
Image processing is getting better - and I am definitely keen on using software to process videos to synthetically generate high definition images. My time and dollar resources are limited for getting the perfect photo.
--
Happy moony

Generalized, or just well-known, static objects? by pz · 2008-08-14 01:07 · Score: 4, Interesting

Very cool stuff! Does anyone know (are any of the project team members here?) how much foreknowledge of the object being orbited that is required?

For example, is a 3D wireframe model necessary?

Is a filtering of the photos necessary to ensure that they are all of the same subject?

What level of pre-processing is required on the photos, either automated, or manual?

How well does the system fare when the object being photographed isn't absolutely static? A drawbridge, for example, changes shape. Or Niagara Falls. Or a flag. Or a single person.

Anyone know?

--

Put my fist through my alarm clock with its ding-dong death inside my ear. - The Blackjacks.

fascinating by thedonger · 2008-08-14 01:08 · Score: 2, Informative

Science fiction and VR have primed me to believe someday we would all be walking around some imaginary digital world (oh wait, WoW anyone?), but this is "virtualization" of the real world. Like Google street view on crack. I am simultaneously in awe of the technological achievement and embarrassed that my life in computers hasn't yet created anything so cool.

I, for one, welcome our new PhotoSynth overlords.

--
Help fight poverty: Punch a poor person.

Re:fascinating by Tryfen · 2008-08-14 01:25 · Score: 2, Interesting

Read The Light of Other Days by Arthur C Clarke.

--
If a square is really a rhombus, why aren't all triangles purple?

Re:Generalized, or just well-known, static objects by larry+bagina · 2008-08-14 01:10 · Score: 1, Funny

more importantly, does it have an anti-goatse filter?

--
Do you even lift?

These aren't the 'roids you're looking for.

Can't wait..... by BitterOldGUy · 2008-08-14 01:12 · Score: 1

I can't wait for the ability to do a search and replace of all the images of women replacing them with naked Playboy playmate pictures. Then again, it would be embarrassing to walk into a church and be staring a Nun up and down with a woody.

Re:Can't wait..... by Single+GNU+Theory · 2008-08-14 01:45 · Score: 1

You will probably not find any nuns with woodies in Playboy, though they did feature Caroline Cossey. Either your source material's suspect, or you've got the nuns and priests mixed up.

--
Little Debian: America's #1 Snack Distro!

Re:Huh by BitterOldGUy · 2008-08-14 01:14 · Score: 4, Informative

Shut The Fuckup Sonny - it's what the old guys on Photo.net say when you tell them that photography as we know it is dead - especially if you mention film in the same sentence. It'd be like saying that BSD is dead here on /..

Re:Wow by ErroneousBee · 2008-08-14 01:16 · Score: 3, Interesting

Seems a bit simplistic to me, I'd have thought that they'd turn the photos into a virtual world, using the colour corrected photos to create wireframes and bumpmaps and then being able to apply whatever lighting and other effects to the world. That allows you much more freedom to use other methods (e.g. LIDAR) to populate the database.

Creating 3d models also allows you to remove transient objects (people), or add objects to the scene, e.g. what would David look like on the empty plinth in Trafalgar Square.

I suspect the reason they've done it this way is more about the patents than practical application.

--
**TODO** Steal someone elses sig.

Video original better? by BitterOldGUy · 2008-08-14 01:19 · Score: 2, Insightful

It looks like taking a video would be easier. That way, you wouldn't have to spend time stringing all the stills to together - if I understood correctly.

Re:Video original better? by Max+Romantschuk · 2008-08-14 01:46 · Score: 4, Insightful

It looks like taking a video would be easier.
Depending on what you are trying to do... My original point was that technology like this will make it possible to navigate the swamps of data we're accumulating.
I like having a lot of family photos, but traditional albums won't do when we have literally thousands of them. Stuff like this can make it possible to easily call up photos based on suitable criteria. Like I said we need other parts to, like face recognition, but summing it all up we'll eventually have a feasible way to navigate a huge amount of photographic data.

--
.: Max Romantschuk :: http://max.romantschuk.fi/
Re:Video original better? by Tim+C · 2008-08-14 01:51 · Score: 1

Well the tech demo is using photos taken by arbitrary people. While it could be used to similar effect on your own photo collection (if you take enough photos from enough positions), the real power would seem to be when it's used on a large collection of user-submitted photos, or if its fed the contents of Flickr, etc.

--
It's official. Most of you are morons.

Re:Generalized, or just well-known, static objects by oldspewey · 2008-08-14 01:21 · Score: 4, Funny

goatse

Awwwww Christ ... now you've put zooming and panning into my head at the same time as goatse.

Thanks.

--
If libertarians are so opposed to effective government, why don't they all move to Somalia?

Security by robvangelder · 2008-08-14 01:21 · Score: 5, Interesting

I was on an ocean cruise recently, and a little girl was lost... Ship's Security were looking for her.

I later heard she had been found, and as I walked back to my cabin I thought of this software.

Every corridor of the ship has cameras.

The parent could recall the last time she was with the child. An operator could then fly through a 3d map of the ship, from that point in time, with recorded video overlaid, following the girl in fast-forward until the current time was reached.

The flying would be like spectators do in first-person-shooter type computer games.
An observer could even be automatically tethered to the missing person.

Re:Security by Indecision+Bob · 2008-08-14 01:40 · Score: 1

The husband could recall the last time he was with his wife. An operator could then fly through a 3d map of the ship, from that point in time, with recorded video overlaid, following the woman in fast-forward until her new amore was reached.
Ultimate stalker/invasive state tool!
Re:Security by houghi · 2008-08-14 02:02 · Score: 1

Forget the ship, use all of London and any other place with CCTV. Orwell was an optimist.

--
Don't fight for your country, if your country does not fight for you.
Re:Security by ozbird · 2008-08-14 02:36 · Score: 1

An observer could even be automatically tethered to the missing person.

Or you could just tether the kid to the parent with a kiddy leash.

Re:Wow by ttapper04 · 2008-08-14 01:22 · Score: 5, Insightful

Microsoft had better not repeat google's slight miscalculation. The credits given to the flicker accounts tell that they must of had to opt in, unlike streetview. This photosynth system would be incredibly powerful if it used all flicker images or crawled the web. People are clearly visible everywhere in this system, and some may become upset.

I'm confused by all this by MobyDisk · 2008-08-14 01:28 · Score: 2, Interesting

I've seen some of these articles about Photosynth, and there seems to be a lot of hype. But... I don't get it.

I see that Photosynth can glue a series of images together so that you can zoom into and move around a scene and get an epileptic-seizure of correlated viewpoints. This group seems to have made a virtual walk-through using this. But I am unclear:
1) What is the point
2) What is the breakthrough

As for #1, Photosynth is ugly. I would much rather have a few good quality same-lighting photos to look at than to have my eyes torn out trying to make sense of this. So unless my brain works differently from everyone else's, the point is not an aesthetic one. It must be a technological one. Is it the promise that we could one day use this to combine amateur images into a real 3D image? Why would this matter when doing that with professional images is easy to do and looks much better?

As for #2, without reading the entire paper I'm unclear how much of this was done automatically. If someone manually entered the GPS coordinates and direction of these photos and then wrote a program to glue them together, I see a lot of hard work but no science. If this required creating a rough 3D layout and it was able to extract the positions programatically, then that is impressive. If it was able to make this entirely from nothing other than the images, then holy moly that's amazing. But I can't tell from the video which of these it is.

Can someone explain this to me and why I should be interested?

Re:I'm confused by all this by Anonymous Coward · 2008-08-14 01:34 · Score: 4, Insightful

If this was an OSS project, your post would have been rated "flamebait".
Re:I'm confused by all this by Anonymous Coward · 2008-08-14 01:40 · Score: 5, Informative

It needs neither input of coordinates or input of a rough 3d layout. It generates its own 3d model by analyzing the photographs programatically, you do not even need to tell the program they were taking in the same area. The photographs are then automatically applied to the generated 3d model and finally it lets you move freely in the generated 3d world selecting the best photo matching your current viewpoint while applying perspective remapping, color correction and lens correction.
Re:I'm confused by all this by Jellybob · 2008-08-14 01:41 · Score: 1

If it's the same project I think it is, this can do it all using image recognition - correlating photos that appear to be of the same location, and then stitching them together.
It takes a crap load of processor time to do it, but it's largely a hands off process.
Re:I'm confused by all this by hkz · 2008-08-14 01:42 · Score: 5, Informative

From what I took away from the original demo, they were doing everything algorithmically. The original demo showed a wireframe of the Notre Dame generated completely from amateur pictures, then overlaid with those same pictures to give it texture. So yes, it is quite impressive. I'd be surprised if Google wasn't doing anything similar for Google Maps though.
Re:I'm confused by all this by dave420 · 2008-08-14 02:14 · Score: 4, Informative

Because you can, say, search Flickr for a landmark, get the images, run them through this, then you can navigate through the space in 3D, looking at high-res imagery of the subject, from all available angles, without having to previously know anything about the subject. Even the system doesn't need to know anything about the subject, it only needs the photos. It is ENTIRELY automatic, using only images. If you look at the old Notre Dame demo, you can see that it even correctly inserted a photo of a poster of Notre Dame into the 3D model, in the exactly-correct position. 100% automatic. That's the breakthrough.
Re:I'm confused by all this by Catil · 2008-08-14 02:16 · Score: 1

In theory, giving the huge amount of photographs available online, probably already covering every public inch of the populated world, a program capable of automatically crawling, indexing, glueing the images together and autocompleting their geotags could render a fully accessible 3D-'streetview' of the world without any human interaction. Although a final implementation might still be a few years away, this project is most certainly a milestone on the way to achieve it.
Re:I'm confused by all this by will_die · 2008-08-14 02:18 · Score: 1

It does not need input other then a massive amount of photos. The program does all the piecing together and building of the 3D layout. If you goto the microsoft site on it they have more details. For the video all they did was get a bunch of photos from different people of the same object and feed them into the program and you saw what they got.
As for the bad part, it takes days and a powerful computer for anything beyond the very basic set of pictures.
Re:I'm confused by all this by jebrew · 2008-08-14 02:28 · Score: 1

Can someone explain this to me and why I should be interested?
If it works as I believe it works (i.e. the stitching is automagic), then tagging two photos with GPS coordinates should be sufficient to tag all of the photos with fairly accurate (two so you have orientation, or just one with direction as well as location).
Depending on what you want to do with this, you may or may not be excited. I've got a lot of photos that simply do not capture the scale of some of the things I've seen. Not that the photos are bad, just that the medium they're expressed through (i.e. a 2-d photograph) simply isn't enough. Climbing a mountain, being in a huge stadium, and so on. This would allow you to have a fly-through of the area.
Not interested in that reason? How about this one. Do you like GTA? Imagine the realism that could be added to the games if someone with a camera could just drive around your city snapping photos and this could automatically build a level for you.
Maybe you'd like to design a garden for your home, wouldn't it be nice to have a full blown 3d representation of it so you could just drop plants in where ever instead of having to get several photos and photoshop each one?
If you take all the photos from a trip (and if you take the photos frequently enough), you could follow yourself around where you went. It'd be great for explaining some stories (as a good visual aid).
These are just a few things of the top of my head, and I still haven't had my coffee...speaking of which...
Re:I'm confused by all this by Floritard · 2008-08-14 05:09 · Score: 1

I work in an architectural firm. We do a lot of home/commercials remodels. This involves going out to the site and getting measurements of the existing building, and taking lots and lots of pictures. Inside and out. While working on the project our draftsmen page through these pictures one at a time looking for this or that specific detail. As such I try to take the pictures in a specific order, but there's always those minor details you go back to and photograph out of sync. I can tell you that we'd much rather be able to feed the whole lot of images to some magic software that would let us walk through the building and get a sense of the spacial characteristics of the place.

This program would add context and ease of navigation to a collection of otherwise random and disjointed images. If they could determine the correct scale and insert measurements on certain features that would be even better.

If you can't see the wow factor of this then really I don't know what to tell you. Go model me a replica of Notre Dame using only pictures gathered from flickr. See how long it takes you.
Re:I'm confused by all this by MobyDisk · 2008-08-14 05:54 · Score: 1

okay, then I am truly amazed. I guess it just seems so far-fetched that I didn't think it was really doing that.
Even a human brain can't do that. We can't look at an object and determine it's 3D spatial details without additional information. For one thing, we use stereo vision. Then we use our knowledge of scale (cars are about that big, and people are about that big) and light (sun is over there, light bulb is over there). But a computer doesn't know those things.
I figured this might be possible if it knew the positions that the images were taken, and the position of the subject. Then it would essentially be using super-stereo: multiple images from multiple angles to generate depth information. But generating depth information without knowing position or angle or distance seems mathematically impossible to me. At least with a human brain, you can fool someone by doing faux finishing and trompe l'oeil. Maybe I need to make a Parthenon and try to fool this software into creating a 3D image out of it. :)
Also, how can it determine your position without knowing the angle and zoom of the lens? (Maybe it really doesn't. Maybe someone with a 3x zoom lens looks 3x closer in their plot, than they actually were - for what they are doing I guess it doesn't matter. The "apparent" position of the photo versus where the person was actually standing is irrelevant)
So lastly... if it can determine a rough 3D layout, could it not use the images to texture that layout, then generate a 3D model? This would turn sculptors into 3D modelers.

Nice view by Pravetz-82 · 2008-08-14 01:29 · Score: 1

"Error establishing a database connection"
Nice view indeed!

Video by c_g_hills · 2008-08-14 01:35 · Score: 4, Informative

Obligatory link to the youtube video (not a rickroll, I promise!)

Thanks, Network Mirror!

Re:Generalized, or just well-known, static objects by Minwee · 2008-08-14 01:40 · Score: 4, Insightful

This is described in their SIGGRAPH paper, which was prominently linked from the article.

It's a bit dense and involves some cross references, but here's a part which may answer some of your questions. For more detail you oculd always read the paper yourself.

We use our previously developed structure from motion system to recover the camera parameters for each photograph along with a sparse point cloud [Snavely et al. 2006]. The system detects SIFT features in each of the input photos [Lowe 2004], matches features between all pairs of photos, and finally uses the matches to recover the camera positions, orientations, and focal lengths, along with a sparse set of 3D points. For efficiency, we run this system on a subset of the photos for each collection, then use pose estimation techniques to register the remainder of the photos. A more prin- cipled approach to reconstructing large image sets is described in [Snavely et al. 2008].

Convoluted by CmdrGravy · 2008-08-14 01:43 · Score: 5, Funny

Erm, isn't that a bit of a long winded complicated way of doing things ? I mean sure, Computer could do that for you but why not just ask instead ?

"Computer, where is " and that would be that. I mean typically she'd be stowed away in the engine room re-configuring the sensor array for some nefarious purpose but that's just kids nowadays I guess.

Re:Convoluted by mweather · 2008-08-14 02:16 · Score: 1

I tried that. The mic on my mouse must be broken.
Re:Convoluted by dave420 · 2008-08-14 04:33 · Score: 1

She'd be in the jeffries tubes somewhere near engineering. Everyone knows you don't need to actually go into engineering to reconfigure the sensor array.
Re:Convoluted by encoderer · 2008-08-14 05:29 · Score: 1

Oh, the keyboard.
How quaint.

Re:Microsoft stealing from Linux again... by c_g_hills · 2008-08-14 01:48 · Score: 1

If you read the paper you will see that it is the same researchers!

is it even real? by xhrit · 2008-08-14 01:49 · Score: 1

Is anyone else a little underwhelmed by their so called 'demo'? It seems like the program does not contain any ov the technologies that are being 'developed' at microsoft, and is instead a hard coded 'interactive movie' showing how this 'new technology' is supposed to look, if they ever get it working.

Most likely this barley interactive photosynth demo will be used to secure patents on '3D Virtual Reconstruction', and the final product will ship with no such features, instead being marketed as an alternative to adobe lightroom.

Am i wrong? Well, can anyone tell me that photosynth is really a working 3d-photo-viewer? and if the technology is there, why cant i open my own photos in it instead ov just viewing the sample photos? Does photosynth really do anything that can't be faked in flash? And if it cant, they who is to say that it is not being faked in photosynth?

--
Dungeon Tactics : Free Open Source SRPG

Re:is it even real? by yuggler · 2008-08-14 02:15 · Score: 1

I'm just guessing/thinking aloud here. There are two parts to the Photosynth-process. First the computer (allegedly, if you're into the Microsoft conspiracies...) analyses all of the photos and gives them a location. This is the processor-intensive part, that takes both time and processing power. The second part is the viewing. When you already know the position of the photographs it's a small(er) feat to present them in a pleasing way. So I believe that PhotoSynth is real. The reason that you can't upload your own photos? This is cool tech, and Microsoft probably wants to use it comercially. Maybe in Windows 7?
Re:is it even real? by zehaeva · 2008-08-14 02:27 · Score: 1

Blaise Aguera y Arcas on Photosynth
the book thing, i don't think can easily be faked by flash, nor the gigapixel resolution image nor the really neat zooming and zooming and zooming into stuff.
really what photosynth, and by extension seadragon which is what photosynth is built on, promises us is a way to semantically link all those photo's on the web together, to build up those real places into virtual places with little or nor human intervention.
this should enrich the web in a way we are only just beginning to see.
Re:is it even real? by dave420 · 2008-08-14 04:38 · Score: 1

Why don't you see for yourself. It works by intelligently looking at photos and constructing a 3D model out of the photos, entirely automatically. You can't load your own photos yet, as it's still in beta. The demo is the output of their previous tests, as it has not yet been updated with the latest version. And no, it can't be easily faked with flash, as this is 100% automatic, and faking in flash would require a massive amount of work by hundreds of people.
Re:is it even real? by xhrit · 2008-08-14 07:52 · Score: 1

100% automatic? ok. so what you are saying is, you have no idea if photosynth works for real or not. You say that it works by intelligently looking at photos and constructing a 3D model out of the photos, but that is not what the demo does!. I have downloaded the demo, and it does not look at photos, nor does it construct anything! The demo is a static walkthrough that at most shows off what the interface to the front end ov some potentially awesome software might someday look like.

In other news, I have a demo ov 100% intelligent AI software - Just log into #xhrit on freenode and you can have a conversation with me^H^H it.

--
Dungeon Tactics : Free Open Source SRPG

Microsoft by smoker2 · 2008-08-14 01:49 · Score: 1

The Photosynth technology preview runs only on Windows XP SP2 and Windows Vista. nuff said.

Re:Microsoft by TheSeer2 · 2008-08-14 01:57 · Score: 1

They've made an effort to make it run on versions of Firefox up to 3.0 (PS was out before FF3.0, so don't think it of it as a "they did it just to say they did it" thing).
I think it's just more of a they haven't got to it yet as opposed to sheer malice. Although they'll probably aim for FF3.0 compatibility before Linux.
Although, I'll admit... there's a reasonable chance there won't be a Linux version until this thing is a lot more mature (i.e. your own collections can be formed... but that's when it'll be it's best).
Re:Microsoft by dave420 · 2008-08-14 04:40 · Score: 1

It's still in beta. Eventually it will be running in Silverlight, which runs on the OS of your choice. And Linux once Moonlight is finished.

Re:Wow by Anonymous Coward · 2008-08-14 01:51 · Score: 2, Interesting

I imagine that's the ultimate goal. But what they have now is still amazingly impressive...

The next step in that goal would be making it automatically determine what's in the structure, and what's 'in the way' (a tourist, a security guard, a pigeon...). It would be annoying if a tourist was thrown in with the 3-d model if they happened to populate the set with a ton of pictures of them and the object you want modeled.

Still, as it stands now, it's still an amazing way to experience a historical landmark that maybe you can't afford to visit. Imagine showing your kids the Parthenon, the Sphinx, the Great Pyramid, The Statue of Liberty, and the Kremlin. Not some static pictures, but a 3-d experience, photorealistic (Because it's populated by photographs, natch). It's the same kind of thing that, if I saw it in a movie 10 years ago, I'd have laughed at it for being stupid, because computers can't do THAT...

An open source photosynth? by replicant108 · 2008-08-14 01:52 · Score: 4, Interesting

There was some discussion recently about the possibility of building an open source photosynth - and creating an 'open voxel space' map of the planet.

Anyone know if there's been any progress on this?

http://lists.burri.to/pipermail/geowanking/2008-June/005373.html

Re:Microsoft stealing from Linux again... by TheSeer2 · 2008-08-14 01:55 · Score: 1

... yes. Because the platform an app runs on determines its copyright status.

Safari runs on Windows and Linux! SAFARI IS OSS.

http://phototour.cs.washington.edu/ - what's that, a Microsoft logo on the official website. Oh, stolen? Perhaps you should try fact checking.

(You sir are the definition of a zealot).

Re:Microsoft stealing from Linux again... by Ilgaz · 2008-08-14 01:55 · Score: 1

Absurd thing is, they buy out the researchers and at least use a fake multiplatform thing like Silverlight to impress/trick people about its possibilities.

So, Linux thing has become Windows only as result of buyout. Complete Microsoft way of doing things and exactly why people like me says "Stay away from Silverlight, .NET, their open source clones and people involved with them."

That is the "open source loving" Microsoft for you which will transform itself to multiplatform company. If you buy it...

correction/sorry by Ilgaz · 2008-08-14 01:58 · Score: 1

First line should be:
"Absurd thing is, they buy out the researchers and "don't use" a fake multiplatform thing like Silverlight to impress/trick people about its possibilities."
(blame coffee)

Re:Microsoft stealing from Linux again... by TheSeer2 · 2008-08-14 01:58 · Score: 1

I meant to say OS X, isn't this embarrassing... well, it runs under Wine. Regardless, my point is the same.

Re:Microsoft stealing from Linux again... by Paaskonijn · 2008-08-14 02:00 · Score: 1

I'm confused. When was Photo Tourism ever OSS?

Moreover, the Photo Tourism web site seems to suggest that it has been supported by Microsoft Research from the start.

Re:One Step Closer to Blade Runner by Tim+C · 2008-08-14 02:01 · Score: 2, Insightful

Yes, and this is nothing like that. That was apparently creating additional information that simply wasn't in the original photo. This is using a whole bunch of photos of the same scene, taken at different times, angles, etc to automatically build up a 3d model. Nothing is being enhanced, you're "merely" being shown the most appropriate, pre-existing photo based on your location and view direction in the generated 3D model.

Damn cool tech, but not the same as that used in Blade Runner (or CSI, or any other "enhance this photo to make that illegible squiggle that's beyond the resolution of the photo readable" plot device)

--
It's official. Most of you are morons.

Re:Security-indeed by way2trivial · 2008-08-14 02:02 · Score: 1

imagine the salivations of the UK security forces.

there is a book, 'lacey and his friends' which contains a few short stories about a society with such abilities...

not pleasant.

--
every day http://en.wikipedia.org/wiki/Special:Random

Re:Generalized, or just well-known, static objects by dave420 · 2008-08-14 02:11 · Score: 4, Interesting

You just give it the photos - it figures out the rest. It works by stitching them together in 3D, so if there is a photo of one part of the subject that is not overlapped by one other, the photo won't be part of the finished "model". If you download the old demo, you can see the Yosemite demo, which shows what happens with movement (hikers climbing a mountain). If it can match up most of a scene in an image, the image can still be used. I'm sure it'll only get better. Another great example is in the old demo, where they simply searched Flickr for "Notre Dame", and then constructed the entire cathedral. It picked up a photo of a poster in someone's house, and seamlessly integrated it into the model. It recognised what it was from, and where on the cathedral it was positioned, and reflected that by putting that image exactly where it should be in the finished "model". Of course this is just stuff I've gleaned from watching the demo videos, using the demo, and reading as much as I can about it, so I might be wrong on some of it, but that was the impression I got. If I'm far off, I'd appreciate being put right, as this technology is nothing short of stunning.

Re:Wow by CheeseTroll · 2008-08-14 02:11 · Score: 1

They could create some kind of agreement with Flickr & other photo-sharing sites where users could check a box to opt-in to photosynth.

(Or if they want to be sneaky about it, require the users to check the box to opt-out, or just change the Flickr privacy policy to opt-in all new photos.)

--
A post a day keeps productivity at bay.

Re:Windows XP SP2 and Vista Only by entmike · 2008-08-14 02:22 · Score: 1

What do your comments have to do with the technology at all except offer the already abundant rhetoric about the company itself? Oh wait, nothing.

Re:Microsoft stealing from Linux again... by dave420 · 2008-08-14 02:23 · Score: 5, Insightful

Microsoft have turned Photo Tourism into something incredibly more powerful. But don't let that get you off your high horse. Some of us don't play the "them" and "us" game.

Re:One Step Closer to Blade Runner by xerxesVII · 2008-08-14 02:42 · Score: 1

Yes. Along with the program's absurd degree of voice control.

--
"We shall grapple with the ineffable, and see if we may not eff it after all." - Douglas Adams

On the other hand... by Maint_Pgmr_3 · 2008-08-14 02:55 · Score: 1

From a company that is really hot-to-trot on patents, I find it interesting that they would make a product where you can really fake photos. "But your Honor, there is a photo of me watching TV at home, taken on the night of the murder."

But can they take it all the way? by Shamenaught · 2008-08-14 02:56 · Score: 1

This technology is great, but I wonder if they'll actually ever reach the pinnacle of this kind of technology.

What do I mean by this? I mean using multiple photos to form 3d models of the subject, then going back over all of the data using a super resolution [wikipedia] system, thus creating a resulting set of images containing more detail than any individually contributing image.

It's what the human brain does all the time! Then-again, human brains can do all sorts of stuff that we're probably not gonna see computers doing any time soon, like thinking, and loving...

Soon, my Commodore 64, soon...

--
mysql> SELECT * FROM `places` WHERE `place` LIKE 'home`; Empty set (0.00 sec)

Re:Windows XP SP2 and Vista Only by Ilgaz · 2008-08-14 02:57 · Score: 1

http://labs.adobe.com/downloads/flashplayer10.html . See how many platforms supported?

That is how companies work in age of 2008 where people uses 2-3 different operating systems in a day.

I am not your average "anti M$" guy to pick at, I am just telling that kind of actions will result in some kind of reaction, it depends on the money company has and it is not infinite.

I can't comment about the technology since I can't view it!

Re:Generalized, or just well-known, static objects by Fittysix · 2008-08-14 03:03 · Score: 1

Or a single person.

That makes me wonder if this software would be capable of identifying people as the age sortof. If it has enough data it could find a progressive path from one age to another, and you could find pics of yourself all over the place.

--
*.sig

Re:Wow by daedae · 2008-08-14 03:12 · Score: 1

Sure, because that sounds easy... some people in my department are working on with stuff like that I think (construction of 3d models from collection of photos), and it requires a large number of photos (at a non-trivial amount of disk space) and a lot of processing power. Also, in a general-purpose application, you'd have to be able to correct for thing like people, birds, trees, etc...

Such a technique would no doubt be cool, and probably could be useful for constructing models of areas without having to go there yourself with a team of photographers. It's overkill though for navigating your/a photo collection, which seems the main purpose here.

Actually, the color-correcting was interesting, but I have to wonder how novel their automatic correction algorithm is. I haven't had a chance to read the SIGGRAPH paper yet, but at first glance it reminds me of something one of my undergrad professors did several years ago (at least 5 or 6 at this point) to false color B&W photos based on a user-supplied color photo of a similar scene. The main difference here is I guess that the B&W false-coloring used luminance values to match against the color photo, whereas here correcting for different lighting conditions maybe means by definition the luminance is different?

Re:Wow by YrWrstNtmr · 2008-08-14 03:22 · Score: 5, Insightful

Huh? Why not get out there, meet people from those countries, eat the food they eat, get drunk with them, and actually experience the world?

Of course! Because every familiy has the time and resources to visit every possible interesting place on the planet.

Re:Generalized, or just well-known, static objects by pz · 2008-08-14 03:25 · Score: 1

That makes me wonder if this software would be capable of identifying people as the age sortof. If it has enough data it could find a progressive path from one age to another, and you could find pics of yourself all over the place.

Totally cool idea!

--

Put my fist through my alarm clock with its ding-dong death inside my ear. - The Blackjacks.

Boom! by 6Yankee · 2008-08-14 03:39 · Score: 1

Crashed Firefox (3).

Twice.

Re:Wow by swillden · 2008-08-14 03:41 · Score: 4, Insightful

Huh? Why not get out there, meet people from those countries, eat the food they eat, get drunk with them, and actually experience the world?

Ummm, because we can't afford it? Taking six people to Greece would consume our family vacation budget for 3-4 years. I'd rather stay closer to home and spend more time with my kids.

--
Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.

Re:Wow by JeremyBanks · 2008-08-14 04:11 · Score: 1

A large portion of Flickr photos are licensed under Creative Commons licenses. They could easily query for any tagged in that location under a suitable license. By selecting a By-Attribution license a photographer has already made their photos available to anyone doing something like this, legally.

Re:Wow by nmg196 · 2008-08-14 04:13 · Score: 1

I've long since learnt that photographs and video are a totally RUBBISH way of experiencing a new place.

If you want to see something cool, physically go there. No amount of photos or video will ever give you the feeling or atmosphere of most cool things in the world. The biggest problem has to be the lack of perception of scale that photos provide (even with ample references within picture).

Get out there and see stuff.

Remember: the best thing about real life is that it's ALL in glorious high definition with surround sound!

Re:Wow by JasterBobaMereel · 2008-08-14 04:44 · Score: 1

To be useful you have to have access to many pictures of the scene, all of these are likely to be copyrighted, and so any commercial application will not be viable...So who exactly are they doing this for?

Ordinary people do not have nearly enough photos to make it work....Companies will not be able to use it due to the vast majority of the material being copyrighted and unavailable for commercial use ..

--
Puteulanus fenestra mortis

Re:Microsoft stealing from Linux again... by recoiledsnake · 2008-08-14 05:00 · Score: 1

So you mean if something runs on Linux, the Linux developers have made it? What's more important, the underlying algorithm or the OS it runs on? Nice karma whoring though, looks like the mods fell for it.

--
This space for rent.

Re:Wow by Abreu · 2008-08-14 05:01 · Score: 1

"But dear, these slashdotters have not the means, nor the inclination to do the grand tour like any young people of means do. We must allow them their delusions!"

--
No sig for the moment.

Other applications of this tech by RJFerret · 2008-08-14 05:15 · Score: 1

As I watched the video, especially describing "orbits" from where most photographs are taken of a subject, it occurred to me that were those points made available to the user, photographers could then opt to find a more unique and artistic viewpoint beyond the norm!

Regarding the "child lost on a ship" thought to find her. I can also see this really intruding on privacy, particularly via paparazzi and the like.

Could you imagine being Tom Cruise or Paris Hilton (depending on your demographic), now the photos taken of you at random points through the day could be stitched together into essentially a video of your complete activities?

Now imagine the government doing the same, especially with all the images available via video surveillance.

4D support? by Roger+W+Moore · 2008-08-14 05:20 · Score: 2, Insightful

That done, I'll be able to navigate my tens of thousands of photos by asking for things like photos taken of the kids while outside at the cottage when they were 3 years old.

That raises an interesting concept. Could they do a 4D orbit? For example identify pictures of your kids at different ages and then you could watch them grow up in front of your eyes. Or watch how a city street changes over a decade? That would be really interesting...shame it will probably only every be available for Windows.

Re:Wow by religious+freak · 2008-08-14 05:20 · Score: 1

I'd imagine the ultimate goal would be to filter the people out of the images entirely.

--
If you can read this... 01110101 01110010 00100000 01100001 00100000 01100111 01100101 01100101 01101011

Nice by Fri13 · 2008-08-14 05:25 · Score: 1

Everytime when I hear about photosynth, I just remember this http://www.sandcodex.com/

But it is nice that Microsoft is understanding the power of this technology and develop it more. But it is not nice that technology actually went to Microsoft so it is patented..

Re:Wow by JWSmythe · 2008-08-14 05:30 · Score: 4, Interesting

I thought one of the previous stories said it would do that.

What I was curious about is, how? A distinguishable photograph could be associated. But, even with one of the examples in the display, the Statue of Liberty, if this is automated, how would it be able to distinguish the real statue of liberty with say a souvenier sitting on my coffee table? Basing it on size and distinguishing shapes, it would match either one. Basing it on those, and the background objects is impossible. It already has to take into account that there are changes in the foreground (people, extra objects like light poles that are not present in very similar views). Background objects like clouds come and go, and leave entirely different images.

For not quite as distinguishable objects, it would be a lot harder. Say you used the Statue of Liberty as your starting point. If you were to travel into Manhattan, there are many very similar shapes for buildings and storefronts. Sure, unique buildings would be obvious, but for every obvious building, there are dozens of almost identical buildings.

Even then, you would have to know the city. Similar architecture can show up in a variety of cities, and be close enough to match. Cameras may record timestamps embedded in the original image (assuming unedited photos are added to the system), but there is nothing useful like geographic coordinates included.

All the photos were shot from the same perspective. It was as if they were shot by one or more photographers of about the same height. There should have been a more significant change to the view from say a 4' tall child to a 6'8" tall man. I don't claim to be a "great" photographer, but I'm pretty good. One of the essentials between being someone who can take snapshots, and someone who can take photographs, is making the composition of your photograph to illustrate the view. That frequently involves changing height and view. Maybe you want to lay on the ground for one, and climb on a ladder for another.

I took some photographs at the World Trade Center on 9/9/2001. Those photographs aren't just of the skyline, although I did take some snapshots at the time. Some are composed lookup up towards the top of the buildings from the ground, and down while leaning on the glass of an observation deck window. Photography isn't documenting a first person view. It's beautifying and romanticist a view, without necessarily changing anything about what's in the composition of the photograph.

There are other features that I don't see how they're getting, such as the zones where photos were shot from. That takes an awful lot of extrapolation. What's the difference between a photographer 10 feet away, and a photographer 200 feet away with a good zoom lens? Almost nothing, except maybe a little focal distortion at the edge of the photo. That varies with the quality of the camera and lens anyways.

I did a little project once years ago. I was sitting in the hills just under the Hollywood sign. We were sitting on top of a hill, so I had a good panorama view. I tried to keep the horizon centered, and I shot frames the whole way around. When I stitched them together in Gimp, I noticed that each frame had variations in it's color. It wasn't because of AWB, it was because the camera (good for the time) had some weird variance, so there was a difference in color from the left to the right side. So, two shots from the same camera at the same settings were significantly different.

I would be willing to suggest that the demo shown isn't a demonstration of a functional piece of software. It is a good example of what can be generated with a computer. I could do the same thing in Gimp or Photoshop. If my job let me play like this for a few weeks, I could have made a better example of vaporware.

--
Serious? Seriousness is well above my pay grade.

Better technology by Fri13 · 2008-08-14 05:32 · Score: 1

What I found more intresting technology than Photosynth http://www.youtube.com/watch?v=556FvXHLtAo

old tech? by recharged95 · 2008-08-14 06:25 · Score: 1

Honestly, this reminds me of DTED and orthorectification techniques used in the satellite imaging world. Just now applied to tagged photos at the consumer level.

When building a virtual Earth using satellite imaging you basically use the same techniques.

Great idea for the 'consumer', but nothing revolutionary, just exploiting the same tech that's been around for 10ys.

so clearly... by Slur · 2008-08-14 06:55 · Score: 1

...you must be one of them.

--
-- thinkyhead software and media

Re:so clearly... by dave420 · 2008-08-18 03:05 · Score: 1

Nope. I'm just calling out folks who bitch and moan about where a technology came from, as opposed to what said technology actually does. It smacks of bullshit.

Re:Wow by loraksus · 2008-08-14 07:58 · Score: 2, Insightful

There are other features that I don't see how they're getting, such as the zones where photos were shot from. That takes an awful lot of extrapolation.

I suspect it isn't as complex as you think - exif tags usually include focus distance and focal length. Also included with that is sensor size or camera model, which will tell you effective focal length.
When you combine that info with the apparent size of the object in the photo (i.e. statue of liberty is x percentage of the frame high), you should be able to get a reasonable estimation on where the picture was shot from.

For relatively isolated objects (like the statue of liberty), I'd assume you'd need a single shot w/ a known location to act as an anchor (it's possible with cameras that support gps) - but I wouldn't be surprised if a mathematician could get around that. Perhaps angle of the sun at time / date (exif again), but I'd assume that would take significant processing and have all sorts of things that would screw it up.

I know DXO can analyze a jpeg or raw to get the model of a lens - presume it's stored in exif somewhere - and lens distortion can be corrected w/ a x% pincushion adjustment to the photo based on known values - DXO has a fairly huge database and I wouldn't be surprised if they were using some of their tech.
Either that or the guys here fudged it by only using pictures from a specific make / model of camera.

I suspect that they would use the distance estimation in exif to eliminate the statues, etc - although I'm guessing a fair number of statues would be eliminated because they aren't scaled properly. Autofocus distance estimations can be off, but not usually hundreds of meters off.

As for cross photo whitebalance / color gimpiness across the frame, that can be relatively easily corrected - autostich (free) and autopano pro (the "pro version" of autostitch) does it and they've licensed their stuff to a bunch of other companies.

Also... keep in mind they really aren't displaying high res imagery - so they can estimate / tweak a bit. I don't know if that will scale, or if it does, what the processing requirements will be, but it's probably not a huge concern.

It's clearly not as simple as they try to make it (they really only used a small amount of image sets), but I don't think it's a photoshop job.
The question is whether it will scale.

--
1q2w3e4r5t6y7u8i9o0pqawsedrftgthyjukilo;p'azsxdcfv gbhnjmk,l.;/

But... by WormSlayer · 2008-08-14 08:08 · Score: 2, Funny

Could it be used to piece together a massively high resolution, totally nude, 3d model of a famous celebrity from the millions of event photo's and nip slips? ;)

Re:Microsoft stealing from Linux again... by naoursla · 2008-08-14 09:19 · Score: 1

Are you complaining because Microsoft hired people to do work? You know, the corporate charter doesn't create anything itself. It is all people doing the actual work.

Microsoft: It's made of people!

Re:Generalized, or just well-known, static objects by Attackinghobo · 2008-08-14 09:46 · Score: 1

Get someones iPhone or iTouch. Browse for goatse, stretch it using multitouch...

Re: Exposure time by DamnStupidElf · 2008-08-14 11:09 · Score: 3, Informative

NTSC is worse than you described; you have two 1/60th second exposures interlaced together. Utterly worthless for still frames.

Once progressive HD video cameras become cheap, then video will suck slightly less for the average family archive.

Keep the Faith by zuperduperman · 2008-08-14 11:37 · Score: 2, Insightful

Ok folks, don't worry!

Just keep chanting the mantra that Microsoft never innovates anything and everything will be ok.

I'm sure there will be a linux port of this soon and then we can all go back to complaining about how Microsoft copies everything from Apple.

Re:Wow by ozphx · 2008-08-14 12:51 · Score: 1

The handy guide why to not get drunk with those people:

Ancient Egyptians: Too dead.
Americans: Beer too crap.
Greeks: Too likely to bugger you in the arse while you are passed out.

--
3laws: No freebies, no backsies, GTFO.

Re:Wow by complete+loony · 2008-08-14 12:59 · Score: 2, Informative

There are other features that I don't see how they're getting, such as the zones where photos were shot from. That takes an awful lot of extrapolation. What's the difference between a photographer 10 feet away, and a photographer 200 feet away with a good zoom lens? Almost nothing, except maybe a little focal distortion at the edge of the photo. That varies with the quality of the camera and lens anyways.

Perspective changes a lot based on where the camera is, a big zoom lense does nothing to change the perspective it just makes the image larger.

Their process finds machine recognisable points in each photo, then looks for matching points between photos. Once you know that 2 photos are of the same subject you can use the separation between these known points to work out the relative viewing position of each camera. It only takes about 4-5 common points on different planes to pinpoint where each camera is relative to other camera's. I can visualise how this process could be completely automated.

At the end of the process they have a 3D model of where all these identifiable points are relative to each other, and they know where to project the plane of each photo within that model.

--
09F91102 no, 455FE104 nope, F190A1E8 uh-uh, 7A5F8A09 that's not it, C87294CE no. Ah! 452F6E403CDF10714E41DFAA257D313F.

Re:Wow by x2A · 2008-08-14 14:48 · Score: 1

I've long since learnt that communicating with people on internet forums has nothing on socialising with a bunch of people in person... yet here we find ourselves.

There's a lot of life in the grey areas dude, in my experience it's those who go with "gotta be all or nothing" attitudes that miss out more. Something about "is what happens between plans"?

--
The revolution will not be televised... but it will have a page on Wikipedia

Re:Wow by x2A · 2008-08-14 14:54 · Score: 1

"Companies will not be able to use it due to the vast majority of the material being copyrighted"

Oh no! Companies will have to learn how to take photos like people can!

"So who exactly are they doing this for?"

It's a research project. As many cools techs out of MS-research projects seem to just disapear, I'd maybe go with... themselves?

--
The revolution will not be televised... but it will have a page on Wikipedia

Firefox? by lpq · 2008-08-14 21:00 · Score: 1

Wasn't too friendly to my browser.
Even trying setting a downloading it & installing; the content wouldn't activate.

Mexico by StCredZero · 2008-08-15 04:02 · Score: 1

You can go cheaply. Just do your homework first.

Re:Mexico by swillden · 2008-08-15 04:54 · Score: 1

You can go cheaply. Just do your homework first.
Ya visitamos a Mexico, varias partes. Es de ayuda que hable yo el espanol, y los precios si son baratos, pero hay mas del mundo que Mexico y Sudamerica. Tambien hemos viajado por Italia, donde tenemos amigos con quienes quedamos para minimizar el gasto.
Greece, however, is neither cheap nor close, and we don't have friends or relatives there. Virtual tours provide nice opportunities.

--
Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.

Re:Wow by JWSmythe · 2008-08-15 05:43 · Score: 1

They've been talking about it for so long, but haven't released a preview, beta, or the full product to market, so that's a sure sign it's vaporware.

I do have full faith that it will be released. Two weeks after Duke Nukem Forever, and simultaniously with the first 100% bug free stable version of Windows.

I can't figure out how to write my own version of Photoshop either, but I can get my hands on it to see that it does exist.

--
Serious? Seriousness is well above my pay grade.

video enhancement by whatever3003 · 2008-08-15 20:26 · Score: 1

I wonder how far this could be taken in association with this type of work: http://grail.cs.washington.edu/projects/videoenhancement/videoEnhancement.htm

--
"Those who do not want to imitate anything, produce nothing." -- Salvador Dali

Slashdot Mirror

Photosynth Team Does It Again

109 of 144 comments (clear)