Slashdot Mirror


Google Programming Contest Winner

asqui writes "The First Annual Google Programming Contest, announced about 4 months ago has ended. The winner is Daniel Egnor, a former Microsoft employee. His project converted addresses found in documents to latitude-longitude coordinates and built a two-dimensional index of these coordinates, thereby allowing you to limit your query to a certain radius from a geographical location. Good for difficult questions like "Where is the nearest all-night pizza place that will deliver at this hour?". Unfortunately there is no mention whether this technology is on its way to the google labs yet. There are also details of 5 other excellent project submissions that didn't quite make it."

12 of 214 comments (clear)

  1. more details by Alien54 · · Score: 5, Informative
    Daniel's project adds the ability to search for web pages within a particular geographic locale to traditional keyword searching. To accomplish this, Daniel converted street addresses found within a large corpus of documents to latitude-longitude-based coordinates using the freely available TIGER and FIPS data sources, and built a two-dimensional index of these coordinates. Daniel's system provides an interface that allows the user to augment a keyword search with the ability to restrict matches to within a certain radius of a specified address (useful for queries that are difficult to answer using just keyword searching, such as "find me all bookstores near my house"). We selected Daniel's project because it combined an interesting and useful idea with a clean and robust implementation.

    This is impressive bit of database manipulation. Somehow I didn't think that all of the datatypes, etc would be so easily parsed.

    Although I do recall telephone directories that used to give you results for a specified radius for certain types of businesses

    --
    "It is a greater offense to steal men's labor, than their clothes"
    1. Re:more details by Chester+K · · Score: 4, Informative

      This is impressive bit of database manipulation. Somehow I didn't think that all of the datatypes, etc would be so easily parsed.

      Although I do recall telephone directories that used to give you results for a specified radius for certain types of businesses


      That's just a standard spatial query. It's easy to implement an R-Tree to be able to do (relatively) quick "give me points within x meters of this one" type of searches on a database. There's nothing extremely revolutionary about Daniel's project, anyone with some basic geometry knowledge and the patience to download the 33GB of TIGER data could have done it within the course of a few weeks. (Ironically enough I've been doing the same thing with 1.2 million addresses against TIGER data for the past month.)

      But that's the true genius and beauty of it. Now that it's been said, it's such a mindbogglingly obvious and useful application of web search and spatial search technology that it's hard to believe nobody thought of it before.

      I'd be honestly surprised if Google doesn't run with the ball and fold it into their main search engine. The only thing standing in the way is the storage space and CPU time to do it.

      --

      NO CARRIER
  2. Re:404 Page Not Found ? by rmohr02 · · Score: 3, Informative

    Yea, I've realized that, but then you realize that Google caches most of the web and nearly all of the links produced in search results. So if you get a 404 error you go back and click on the cache link.

  3. Re:About the "Former Microsoft Employee" bit.. by Anonymous Coward · · Score: 1, Informative

    Gates didn't write DOS.

  4. Not earth shattering, but useful by f00zbll · · Score: 2, Informative

    Credit to the guy for thinking of it. It could save a person the hassle of looking up all the address in mapquest. I've never had the need to do such a search on google, since it's easier to just do a yellowpage search. Most yellow page sites like superpages and switchboard already provide that kind of functionality. Google's directory search doesn't have search by distance yet, but I'm guessing it will be added in the future. They kinda have to considering the other directory sites have those features.

  5. Re:if i'd only known by Indras · · Score: 4, Informative
    like free development for google

    Let me quote from the homepage of the annual contest:

    "Grand Prize

    $10,000 in cash

    VIP visit to Google Inc. in Mountain View, California

    Potentially run your prize-winning code on Google's multi-billion document repository (circumstances permitting)"

    --
    The speed of time is one second per second.
  6. NetGeo by *xpenguin* · · Score: 5, Informative

    There's a public database called NetGeo which will convert IP addresses to latitude and longitude locations. I created a script called IP-Atlas to get a visual location of the lat and lon coords.

  7. Re:I see one being implemented soon by Anonymous Coward · · Score: 2, Informative

    Try the Google Glossary to find definitions of words or phrases.

    Markovian Dependece- The condition where observations in a time series are dependent on previous observations in the near term. Markovian dependence dies quickly, while long-memory effects like Hurst dependence, decay over very long time periods.

  8. Markov processes by dukethug · · Score: 3, Informative

    A Markov process is basically a series of random variables where the value of random variable X^(i+1) only depends on X^i. The idea is that if you want to predict the value of X^(i+1), all of the information you could possibly use is in the value of X^i.

    Lots of processes are Markovian- for instance, a random walk. If you're at point x at time t, then you know that there's a fifty-fifty chance you will be at x-1 or x+1 at time t+1. Knowing all of the previous points along the random walk won't help you predict the next point any better than that.

  9. Re:404 Page Not Found ? by General+Wesc · · Score: 3, Informative

    And of course there's the Mozilla Google Toolbar for people who don't use IE.

  10. More Information About the Winner by td · · Score: 5, Informative

    I've met Dan Egnor, and this isn't the only cool thing he's done. He's the author of Iocaine powder, the world champion rock-paper-scissors program. He's also the proprieter of sweetcode a web log devoted to innovative open source projects (i.e. projects that don't just clone or tweak existing software.) But his best hack (not described on line, as far as I know) is a version of Pac Man that runs on a PDA and uses a GPS for a user interface -- if you run around an open field carrying the GPS+PDA, the pacman correspondingly runs around the maze chasing Blinky, Stinky and Dinky (or whatever their names are.)

    --
    -Tom Duff
  11. Re:About the "Former Microsoft Employee" bit.. by asqui · · Score: 2, Informative

    The reason I included Microsoft Corp. as a former employer and not XYZFind Corp. is becasue I wanted to point out that despite what most of you like to think, intelligent people do work at Microsoft.

    Yes really, it's not a large room full of monkeys!