Slashdot Mirror


Google Programming Contest Winner

asqui writes "The First Annual Google Programming Contest, announced about 4 months ago has ended. The winner is Daniel Egnor, a former Microsoft employee. His project converted addresses found in documents to latitude-longitude coordinates and built a two-dimensional index of these coordinates, thereby allowing you to limit your query to a certain radius from a geographical location. Good for difficult questions like "Where is the nearest all-night pizza place that will deliver at this hour?". Unfortunately there is no mention whether this technology is on its way to the google labs yet. There are also details of 5 other excellent project submissions that didn't quite make it."

27 of 214 comments (clear)

  1. I see one being implemented soon by Masem · · Score: 5, Interesting
    From the hon. mentions:
    Laird Breyer, for his project, Markovian Page Ranking Distributions: Some Theory and Simulations. This project examined various properties of the Markovian process behind Google's PageRank algorithm, and suggested some modifications to take into account the "age" of each link to reduce Pagerank's tendency to bias against newly-created pages.

    This may help to defeat the current practice of overloading the PageRank results of a given key word as to point to a given page by having people link to that page with a link containing that keyword, aka "Googlebombing". I do think that the winner is a very interesting and useful project, this latter one will probably be implemented ASAP.

    --
    "Pinky, you've left the lens cap of your mind on again." - P&TB
    "I can see my house from here!" - ST:
  2. I sent something into the contest. by thedanceman · · Score: 4, Funny

    But I guess they thought there was no need for -thedanceman- on the google site.

  3. What a great idea by TedCheshireAcad · · Score: 4, Funny

    If only more pizza restaurants in my area had web sites. Soon enough, I won't even have to pick up the phone to make my food come to me! I wonder if the delivery guy will bring the pizza up to me at my computer. Hmm...

  4. Google Search by iramkumar · · Score: 4, Funny

    Search => Osama Bin Laden
    Latitude/Longitude => 37/180, Pak
    Capture ...

    If this would have come out before we could have saved a country ...

  5. Idea for a Google Query..... by ReelOddeeo · · Score: 5, Funny
    Where is the nearest server in my jurisdiction where I can download....
    • MP3's
    • Warez
    • Pr0n
    • Explosives making instructions
    And worst of all....
    • DeCSS
    We've got to stop all of the terrorists in the categories mentioned above!
    --

    Those who would give up liberty in exchange for security and DRM should switch to Microsoft Palladium!
  6. more details by Alien54 · · Score: 5, Informative
    Daniel's project adds the ability to search for web pages within a particular geographic locale to traditional keyword searching. To accomplish this, Daniel converted street addresses found within a large corpus of documents to latitude-longitude-based coordinates using the freely available TIGER and FIPS data sources, and built a two-dimensional index of these coordinates. Daniel's system provides an interface that allows the user to augment a keyword search with the ability to restrict matches to within a certain radius of a specified address (useful for queries that are difficult to answer using just keyword searching, such as "find me all bookstores near my house"). We selected Daniel's project because it combined an interesting and useful idea with a clean and robust implementation.

    This is impressive bit of database manipulation. Somehow I didn't think that all of the datatypes, etc would be so easily parsed.

    Although I do recall telephone directories that used to give you results for a specified radius for certain types of businesses

    --
    "It is a greater offense to steal men's labor, than their clothes"
    1. Re:more details by Matey-O · · Score: 3, Insightful

      There's code available now that does this for zipcodes. see http://www.zipmath.com/ (And using Mapquest as the black box, street addresses too.) Tieing it into google is a nifty bit o kit tho'.

      --
      "Draco dormiens nunquam titillandus."
    2. Re:more details by Lars+T. · · Score: 5, Insightful

      Sounds like this improvement isn't much use outside the US.

      --

      Lars T.

      To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

    3. Re:more details by Chester+K · · Score: 4, Informative

      This is impressive bit of database manipulation. Somehow I didn't think that all of the datatypes, etc would be so easily parsed.

      Although I do recall telephone directories that used to give you results for a specified radius for certain types of businesses


      That's just a standard spatial query. It's easy to implement an R-Tree to be able to do (relatively) quick "give me points within x meters of this one" type of searches on a database. There's nothing extremely revolutionary about Daniel's project, anyone with some basic geometry knowledge and the patience to download the 33GB of TIGER data could have done it within the course of a few weeks. (Ironically enough I've been doing the same thing with 1.2 million addresses against TIGER data for the past month.)

      But that's the true genius and beauty of it. Now that it's been said, it's such a mindbogglingly obvious and useful application of web search and spatial search technology that it's hard to believe nobody thought of it before.

      I'd be honestly surprised if Google doesn't run with the ball and fold it into their main search engine. The only thing standing in the way is the storage space and CPU time to do it.

      --

      NO CARRIER
  7. 404 Page Not Found ? by bigmouth_strikes · · Score: 5, Interesting

    I'm surprised that there are so many 404 Page Not Found errors in Google's search results, even on the top hits.

    Shouldn't Google automatically check results that a user follows and flag those that cannot be displayed ?

    --
    Oh, I can't help quoting you because everything that you said rings true
    1. Re:404 Page Not Found ? by DrSkwid · · Score: 3, Insightful

      #

      Thomas Phelps and Robert Wilensky, for their project, Robust Hyperlinks. Traditional hyperlinks are very brittle, in that they are useless if the page later moves to a different URL. This project improves upon traditional hyperlinks by creating a signature of the target page, selecting a set of very rare words that uniquely identify the page, and relying on a search engine query for those rare words to find the page in the future. For example, the Google programming contest can be found using this link.

      --
      There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
    2. Re:404 Page Not Found ? by rmohr02 · · Score: 3, Informative

      Yea, I've realized that, but then you realize that Google caches most of the web and nearly all of the links produced in search results. So if you get a 404 error you go back and click on the cache link.

    3. Re:404 Page Not Found ? by PhilHibbs · · Score: 3, Interesting

      Google's links are not redirected via their server, and a lot of people would object to them "gathering data on their users' browsing activities". However, automatically checking the top link after each search (or scheduling it for checking) should be possible.

      What should they do if a page is unavailable, though? What if it's only down for a few seconds?

    4. Re:404 Page Not Found ? by LinuxHam · · Score: 5, Insightful

      Shouldn't Google automatically check results

      I would much prefer to see them improve the ease of browsing their cache. Specifically, if a cached site is 404, then present a cached version of the site where all clicks within the site simply link to the cached version, unlike today where all clicks are native (and therefore lead to more 404's). Granted that wouldn't be of any use for links to dynamic pages, but anything is better than what they have today.

      --
      Intelligent Life on Earth
    5. Re:404 Page Not Found ? by General+Wesc · · Score: 3, Informative

      And of course there's the Mozilla Google Toolbar for people who don't use IE.

  8. Re:Geographical Approximation by Peyna · · Score: 3, Interesting

    Sounds like it wasn't doing IP Addresses or hostnames , but addresses found in text on pages. Using enough rules, and a funky algorithm, you could probably get pretty accurate for a number of pages, enough to produce good results on searches at least.

    --
    What?
  9. Re:About the "Former Microsoft Employee" bit.. by Mononoke · · Score: 3, Funny
    And we certainly dont believe he got better at coding while he worked at MS.
    You've never learned from the mistakes of others?

    Being a former M$ employee tells me he learned quite a bit.

    --
    NetInfo connection failed for server 127.0.0.1/local
  10. Re:if i'd only known by Indras · · Score: 4, Informative
    like free development for google

    Let me quote from the homepage of the annual contest:

    "Grand Prize

    $10,000 in cash

    VIP visit to Google Inc. in Mountain View, California

    Potentially run your prize-winning code on Google's multi-billion document repository (circumstances permitting)"

    --
    The speed of time is one second per second.
  11. Nice by Mr_Silver · · Score: 3, Interesting
    Whilst I'm very impressed with the winner the entry "Robust Hyperlinks" is something that do like a lot.

    What would be cool, would be the option to right click on the hyperlink and have the option "Find alternative location".

    Or even cooler, have IE (or your favourite browser) on putting up the 404 message have a hyperlink which does the same. Hell, easy enough to do with apache.

    --
    Avantslash - View Slashdot cleanly on your mobile phone.
  12. NetGeo by *xpenguin* · · Score: 5, Informative

    There's a public database called NetGeo which will convert IP addresses to latitude and longitude locations. I created a script called IP-Atlas to get a visual location of the lat and lon coords.

  13. Re:About the "Former Microsoft Employee" bit.. by PrimeEnd · · Score: 5, Funny
    If the guy had been employed at XYZ University, I'm sure it would have still shown up.

    Actually he was employed by XYZFind Corp. Literally. And it didn't show up.

  14. Re:Service already exists by MullerMn · · Score: 3, Insightful

    The difference is that the service you're thinking of probably works from a pre-specified list of locations for the businesses it covers.

    The cool thing about the winning google entry is that it actually deduces the location of the search result by finding and parsing any address information that appears on the site!

    I think that's pretty clever. - Does anyone know if it's limited to the US?

    --
    Andy

  15. Re:if i'd only known by Indras · · Score: 3, Funny

    I would ask for royalties! A buck for every time someone viewed a page containing my code :o).

    --
    The speed of time is one second per second.
  16. Markov processes by dukethug · · Score: 3, Informative

    A Markov process is basically a series of random variables where the value of random variable X^(i+1) only depends on X^i. The idea is that if you want to predict the value of X^(i+1), all of the information you could possibly use is in the value of X^i.

    Lots of processes are Markovian- for instance, a random walk. If you're at point x at time t, then you know that there's a fifty-fifty chance you will be at x-1 or x+1 at time t+1. Knowing all of the previous points along the random walk won't help you predict the next point any better than that.

  17. Daniel Egnor's "Iocaine Powder" by po8 · · Score: 4, Interesting

    In a weird coincidence, I just spent a half-hour last night lecturing about Daniel Egnor's Iocaine Powder , winner of the First International RoShamBo Programming Competition. Credit this guy with two award-winning pieces of extreme programming cleverness!

  18. More Information About the Winner by td · · Score: 5, Informative

    I've met Dan Egnor, and this isn't the only cool thing he's done. He's the author of Iocaine powder, the world champion rock-paper-scissors program. He's also the proprieter of sweetcode a web log devoted to innovative open source projects (i.e. projects that don't just clone or tweak existing software.) But his best hack (not described on line, as far as I know) is a version of Pac Man that runs on a PDA and uses a GPS for a user interface -- if you run around an open field carrying the GPS+PDA, the pacman correspondingly runs around the maze chasing Blinky, Stinky and Dinky (or whatever their names are.)

    --
    -Tom Duff
  19. Smooth Move Google by Uttles · · Score: 3, Insightful

    Did you all read the honorable mentions? Google stands to make some good money off of the ideas and implementations these folks have come up with. I'm assuming that all entries now are owned by Google, and man they might have some really cool new features after seeing the projects that were submitted. I only hope that they give at least some royalties to the developers.

    --

    ~ now you know