Rome, Built In a Day

Posted by timothy on Wednesday September 16, 2009 @08:37AM from the what-about-compilation dept.

spmallick writes "Researchers at the University of Washington, in collaboration with Microsoft, have recreated the city of Rome in 3D using images obtained from Flickr. The data set consists of 150,000 images from Flickr.com associated with the tags 'Rome' or 'Roma,' and it took 21 hours on 496 compute cores to create a 3D digital model. Unlike Photosynth / Photo Tourism, the goal was to reconstruct an entire city and not just individual landmarks. Previous versions of the Photo Tourism software matched each photo to every other photo in the set. But as the number of photos increases the number of matches explodes, increasing with the square of the number of photos. A set of 250,000 images would take at least a year for 500 computers to process... A million photos would take more than a decade! The newly developed code works more than a hundred times faster than the previous version. It first establishes likely matches and then concentrates on those parts."
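The quadratic growth described in the summary can be checked with a little arithmetic. This is a minimal sketch that only counts unordered photo pairs; the function name `pair_count` is made up for illustration, and nothing here reflects how the actual software measures its work.

```python
def pair_count(n: int) -> int:
    """Number of unordered photo pairs among n photos: n * (n - 1) / 2."""
    return n * (n - 1) // 2


# Photo counts mentioned in the summary.
for n in (150_000, 250_000, 1_000_000):
    print(f"{n:>9,} photos -> {pair_count(n):>15,} pairs")

# Four times as many photos means roughly sixteen times as many pairs.
ratio = pair_count(1_000_000) / pair_count(250_000)
print(f"1,000,000 vs 250,000 photos: about {ratio:.1f}x the pairs")
```

If 250,000 photos really took about a year under all-pairs matching, a factor of roughly 16 is consistent with the "more than a decade" figure quoted for a million photos.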

legality by timpdx · 2009-09-16 08:41 · Score: 3, Insightful

IANAL, but is this legal? I somehow think that Microsoft doesn't have 150K photographer releases in their paws.
As far as I can tell... by KingSkippus · 2009-09-16 08:44 · Score: 4, Interesting

As far as I can tell, after skimming TFA and watching the little demo video, they weren't actually copying the pictures, but using them to build a 3D model.
It would be kind of like aggregating a bunch of books in the library to come up with a letter distribution chart. You're not violating the copyrights of the authors, just compiling information from raw data.
1. Re:As far as I can tell... by Tim4444 · 2009-09-16 09:02 · Score: 4, Informative
  
  No, it's designed to help you find images of a particular location and then it shows you the original photos. The 3d model part is kinda misleading as they're just using it to calculate the relative positions of where the pictures were taken and then browse it like a giant 3d menu. The summary gave me the impression that they built a photo realistic 3d model of the city, but it's just a glorified image browser. You could argue it's like Google image search, but it seems that they did actually copy the pictures instead of just linking to the originals on Flickr. Still, it's some pretty neat photo processing.
2. Re:As far as I can tell... by TooMuchToDo · 2009-09-16 09:16 · Score: 3, Informative
  
  Also, their import app is most likely checking the Creative Commons license on the photos they're pulling from Flickr.
3. Re:As far as I can tell... by harlows_monkeys · 2009-09-16 09:37 · Score: 3, Insightful
  
  How is the image of the Coliseum shown in either of the linked articles not a 3D model of said building?
Re:Cool, but... by westlake · 2009-09-16 08:51 · Score: 4, Insightful

I'm certain the faculty at UW are completely familiar enough with free software that they could have made this work without MS's help.
150,000 photos. 21 hours. 496 cores. That makes it a labor-intensive, computation-intensive project. None of that comes "free as in beer."
Re:Cool, but... by Anonymous Coward · 2009-09-16 08:52 · Score: 5, Funny

Hell yeah! I come to Slashdot for the slightly outdated stories, stay to read the comments of disillusioned Linux fanboys.
TED talk with a 2007 version by jhsiao · 2009-09-16 09:00 · Score: 4, Informative

Photosynth was showcased in a mid 2007 TED talk. You can find it here.

It would be nice to have photosynths of monuments, art, or architecture that have been damaged or destroyed (e.g. the Buddhas dynamited in Afghanistan, the churches that collapsed in the 2009 Italy earthquake) from tourist photos that may be floating out in the interwebs.
Why O(n squared)? by clone53421 · 2009-09-16 09:01 · Score: 3, Interesting

Previous versions of the Photo Tourism software matched each photo to every other photo in the set.
If you're building an entire digital model, wouldn't there be some point at which it would be more efficient to match each new photo to the digital model itself (instead of all the other individual photos)? At that point, the 3D model would be nearly complete, and matching new photos would be closer to O(n), as I see it. Additional photos would primarily only increase the detail/resolution of the existing model.
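A toy count of the two strategies, as a rough sketch of the idea above. It assumes a photo can be matched against the whole model in one step, which is a strong simplification: the "model" here is just a hypothetical set of feature ids, and real feature matching against a growing 3D model would cost more per photo.

```python
from itertools import combinations


def pairwise_match_count(photos):
    """Photo-to-photo matches when every photo is compared with every other."""
    return sum(1 for _ in combinations(photos, 2))


def incremental_match_count(photos):
    """Photo-to-model matches when each photo is compared once with a growing model."""
    model = set()  # hypothetical model: the set of feature ids seen so far
    matches = 0
    for features in photos:
        if model:
            matches += 1  # one photo-versus-model comparison
        model |= features  # new features refine the model
    return matches


# 1000 made-up photos, each described by three integer feature ids.
photos = [{i, i + 1, i + 2} for i in range(1000)]
print(pairwise_match_count(photos))     # 499500
print(incremental_match_count(photos))  # 999
```

The counts only show the shape of the argument: quadratic versus linear in the number of photos, provided the per-photo model lookup stays cheap.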

--
Alexander Peter Kristopeit bought his basement from his mommy for one dollar.
UW website by guido1 · 2009-09-16 09:05 · Score: 5, Informative

The team's actual site has more pics and videos, including St. Peter's Basilica, Trevi Fountain, and info on Venice.
http://grail.cs.washington.edu/rome/
Puzzle solving techniques by chord.wav · 2009-09-16 09:05 · Score: 5, Funny

The newly developed code works more than a hundred times faster than the previous version. It first establishes likely matches and then concentrates on those parts.
It would have been even faster if they had started with the edges and left the sky for the end, like in any other puzzle.
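For what "establish likely matches, then concentrate on those" might look like in the simplest case, here is a hedged sketch. It assumes each photo already has some coarse signature (the landmark labels and file names below are invented) and only pairs photos that share one. The summary does not say how the real code picks its likely matches, so this is an illustration of candidate pruning in general, not the researchers' method.

```python
from collections import defaultdict
from itertools import combinations


def candidate_pairs(signatures):
    """Yield only pairs of photos that fall into the same coarse signature bucket."""
    buckets = defaultdict(list)
    for photo_id, sig in signatures.items():
        buckets[sig].append(photo_id)
    for ids in buckets.values():
        yield from combinations(sorted(ids), 2)


# Hypothetical photos with made-up coarse signatures.
signatures = {
    "a.jpg": "colosseum", "b.jpg": "colosseum", "c.jpg": "trevi",
    "d.jpg": "trevi", "e.jpg": "colosseum", "f.jpg": "pantheon",
}
pairs = list(candidate_pairs(signatures))
all_pairs = len(signatures) * (len(signatures) - 1) // 2
print(len(pairs), "candidate pairs instead of", all_pairs)  # 4 candidate pairs instead of 15
```

The expensive detailed comparison would then run only on the candidate pairs, which is where the savings over all-pairs matching would come from.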
Video games by VinylRecords · 2009-09-16 09:07 · Score: 5, Interesting

Imagine if the God of War team could instantly recreate entire cities like this. Or the Fallout 3 team could snap a few thousand photos of Las Vegas and then digitize an entire city within a day and then work out the kinks. Or the Grand Theft Auto developers could recreate New Yo...ahem, Liberty City and then build a perfect 3D model and just slap textures on the buildings.
Sure it's not a perfect system but this has so much potential to help recreate cities or terrain within video games.
1. Re:Video games by shutdown+-p+now · 2009-09-16 11:32 · Score: 3, Interesting
  
  The way you get pics isn't really a big deal, the interesting part is software that takes them and makes a 3D model out of it.
  But yeah, combining Street View with Photosynth is an obvious thing that comes to mind.
Re:Cool, but... by afidel · 2009-09-16 09:11 · Score: 3, Informative

496 cores isn't all that much. With HT enabled, a 1U server can hold 16 cores, so a 42U rack can hold 672 cores; blade servers are even denser. The budget for most midsized IT departments probably has room for a few compute clusters of that size.
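The rack arithmetic above, spelled out as a quick sketch. The 16 cores per 1U server and the 42U rack are the figures from this comment, and the 21 hours on 496 cores come from the summary; the variable names are just for illustration.

```python
import math

cores_per_server = 16   # per 1U server with HT enabled, as stated above
rack_units = 42         # a full 42U rack of 1U servers
job_cores = 496         # cores used for the Rome reconstruction
job_hours = 21          # wall-clock hours reported in the summary

cores_per_rack = cores_per_server * rack_units
servers_needed = math.ceil(job_cores / cores_per_server)
core_hours = job_cores * job_hours

print(cores_per_rack)   # 672
print(servers_needed)   # 31
print(core_hours)       # 10416
```

So the whole job would fit in 31 such servers, less than one rack, and amounts to a bit over ten thousand core-hours.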

--
There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
I don't know what else to say... by Monkeedude1212 · 2009-09-16 09:17 · Score: 4, Funny

Aren't humans just awesome?
We build amazing structures that last over a thousand years of constant wear, and we invent photography to capture the awe-inspiring moments that such marvelous creations cast upon us. Then we create computers to recreate their 3D dimensions almost perfectly in a virtual environment, using nothing but the pictures we've taken and our impressive ingenuity.
If you can read this: Pat yourself on the back.
Re:Cool, but... by SuperBigGulp · 2009-09-16 09:20 · Score: 4, Funny

Well, now that Microsoft has done it, somebody will try to copy them by driving around Rome in a car that takes pictures of everything around it. Oh wait, http://maps.google.com/maps?f=q&hl=en&g=colosseo,+roma&ie=UTF8&layer=c&cbll=41.891293,12.49059&panoid=haogKvGCLWGZlNYPmGLLPA&cbp=11,130.48,,0,-7.13&ll=41.891294,12.490585&spn=0.002588,0.009645&t=h&z=17

--
Someday a Slashdot ID of 177180 will mean something.
Re:Cool, but... by Anonymous Coward · 2009-09-16 09:20 · Score: 3, Informative

There are two open source projects aiming to do similar 3D reconstructions:
http://code.google.com/p/libmv/
http://insight3d.sourceforge.net/
So while getting those 496 cores would still be a task for you, open source software _is_ nearly there too.
Still just a point cloud? by grumbel · 2009-09-16 09:20 · Score: 5, Insightful

It is nice to see that they have optimized the algorithm, but what about the presentation? It looks like it is still just a point cloud, just as it was two years ago. Why isn't it a fully textured 3d model? It shouldn't be that hard to do that when you already have the points in 3d.
1. Re:Still just a point cloud? by mrchaotica · 2009-09-16 18:56 · Score: 3, Insightful
  
  Why isn't it a fully textured 3d model? It shouldn't be that hard to do that when you already have the points in 3d.
  You might have answered your own question: since developing an algorithm like marching cubes is a solved problem, slapping it on as a post-processing step wouldn't really count as research. These academics are trying to make a cool demo to show off their research, not create a finished product. If they waste too much time polishing it, they risk not getting enough real research done and losing their funding.
  
  --
  "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz
Sure it does by SuperKendall · 2009-09-16 10:32 · Score: 3, Insightful

...None of that comes "free as in beer."...
150,000 photos.
From Flickr. It's not like some poor bastard was paid to be out there photographing for weeks.
21 Hours. 496 Cores.
Don't recall folding@home or seti@home paying me anything.
In short - who wouldn't pony up a few days of computing power to have a fully open 3D model of some of Earth's greatest landmarks? We only need someone to write the code to distribute, but the basic framework for distributed computation is already in place.

--
"There is more worth loving than we have strength to love." - Brian Jay Stanley