Slashdot Mirror


Google Programming Contest

AccordionGuy writes: "Google has just announced its first annual programming contest! The objective is to write a program that will do something "interesting" with the about 900,000 Web pages' worth data that's Google provides. In addition to writing the program, contestants also have to convince the judges why their program is interesting (or useful) and why it will scale (that is, handle a constantly increasing load of data that grows as the Web grows). The prize is US$10,000 in cash, a V.I.P. tour of the Google facility in Mountain View, California and possibly a chance to run their program on Google's complete billion-Web-page store."

28 of 629 comments (clear)

  1. The average color of the WWW by I+am+the+blob · · Score: 5, Interesting

    Much like the recent discovery of the average color of the universe, this would be a pointless, but fun, use of the data. Of course, I'm not sure exactly what to average. Do you take into account browser real-estate a particular color occupies? Do you simply average each color= and stylesheet instance?

    Ideas?

    --

    All sweeping generalizations suck.
  2. Some Inspiration by Eloquence · · Score: 5, Insightful
    A lot of implicit rating data can be gathered from the links pointing to a page. Google is already doing this when sorting the search results (frequently linked-to pages rank higher). It would be interesting to see how this could be used to detect very popular new sites. I sent this mail to Google a while ago:

    Hi,

    it occurred to me, since you are evaluating the number of links pointing to a page anyway, that it would be a very nice thing to have a sort of "Top 40 Links of the Day" page, regularly updated to include only new and unique stuff. You could use an algorithm similar to the one used by

    http://blogdex.media.mit.edu/

    or

    http://www.daypop.com/

    Both of these sites have become immensely popular through this feature (in the case of Daypop, I find http://www.daypop.com/top.htm very valuable), and I think it would also be a great addition to Google. I don't think inappropriate content would be much of a problem since it would hardly show up high on the list, and besides, a top 40 list can be looked through by a human.

    What do you think?

    Of course this could be spammed, but as I said, a human could filter the results every day; besides, it would be hard to create a very large number of unique links from different servers pointing to a page. I'm sure Google is already doing some of this to prevent spamming their search-order algorithm anyway.

  3. Re:So basically... by anthony_dipierro · · Score: 5, Informative

    That's assuming that any contest entries automatically become the property of Google.

    With regard to an entry you submit as part of the Contest, you grant Google a worldwide, perpetual, fully paid-up, non-exclusive license to make, sell, or use the technology related thereto, including but not limited to the software, algorithms, techniques, concepts, etc., associated with the entry

    So basically, google doesn't own your code, only the right to use it. GPLing your code would satisfy the worldwide, perptual non-exclusive license grant.

  4. Google Press Release by wizarddc · · Score: 5, Funny

    Google Contest Winner Offers Better Porn Searches

    Winner of the First annual Google Programming Contest creates greatest porn spider ever.

    MOUNTAIN VIEW, Calif. - December 11, 2001 - Google Inc., developer of the award-winning Google search engine, today announced it's first winner of the Annual Google Programming Contest. Winner I. C. Porno has created a program to help catalog and organize google cache of the Internet, also refered to as the World Wide Web of Porn.

    "This announcement is an important step in Google's ongoing effort to provide search services that are fast, easy to use, and that help people find the information they need," said Larry Page, Google's co-founder and president of Products. "To search our collection of 3 billion documents for porn by hand, it would take 5,707 years, searching twenty-four hours per day, at one minute per document. With I. C.'s new program, it takes less than a second."

    World's Largest Collection of Porn
    Google users now have the world's largest and most comprehensive collection of porn right at their fingertips and can immediately primal urges using the following services:

    Google Web Porn Search: The company's newest search service now offers more than 2 billion documents - 25 percent of which are non-English language web pages. Google Web Search also offers users the ability to search for numerous non-HTML files such as PDF, Microsoft Office, and Corel documents. Google's powerful and scalable technology searches this comprehensive set of information and delivers a list of relevant porno in less than half-a-second.

    Google Porn Groups: This 20-year archive of Usenet porn conversations is the largest of its kind and can serve as a powerful reference tool, while offering more porno than the Internet. Google Groups was released from beta today with 700 million postings in more than 35,000 topical porno categories.

    Google Image Search: Comprising more than 330 million nude images, Google Image Search enables users to quickly and easily find porn images relevant to a wide variety of topics, including pictures of celebrities and popular travel destinations. Advanced features include search by image size, format (JPEG and/or GIF), coloration, and the ability to restrict searches to specific genre's of porn.

    About Google Inc.
    With the largest index of websites available on the World Wide Web and the industry's most advanced search technology, Google Inc. delivers the fastest and easiest way to find relevant information on the Internet. Google's technological innovations have earned the company numerous industry awards and citations, including two Webby Awards; two WIRED magazine Readers Raves Awards; Best Internet Innovation and Technical Excellence Award from PC Magazine; Best Search Engine on the Internet from Yahoo! Internet Life; Top Ten Best Cybertech from TIME magazine; and Editor's Pick from CNET. A growing number of companies worldwide, including Yahoo! and its international properties, Sony Corporation and its global affiliates, AOL/Netscape, and Cisco Systems, rely on Google to power search on their websites. A privately held company based in Mountain View, Calif., Google's investors include Kleiner Perkins Caufield & Byers and Sequoia Capital. More information about Google can be found on the Google site at http://www.google.com.

    --
    Th
  5. Re:Notice their contest agreement? (was Re:Well th by benwb · · Score: 5, Insightful

    Notice that they don't say exclusive license. You should be able to release it as GPL yourself.

  6. Restoring meta-tags by Charles+Dodgeson · · Score: 5, Interesting
    I've been kicking around an idea for a scheme to end meta-tag (keyword, description) abuse so that they can actually become useful again. But it would require the cooperation and effort of google (and others) do do this.

    The idea is roughly to refuse to index sites which engage in keyword/description abuse.

    1. index keywords and description data
    2. Allow users to search with keywords on or off
    3. If users search with keywords on, provide a mechanism for users to nominate a site as engaging in keyword abuse.
    4. semi-automatically, and then manusually review nominations.
    5. Refuse to index sites which have engaged in keyword abuse.
    This isn't so much a system that meets the specs of the contest. And there is a scaling issue, but it is on my wish-list for google (and others) to do.
    --
    Prime numbers are exactly what Alan Greenspan says they are -S. Minsky
  7. Re:This is brilliant by saint10 · · Score: 5, Funny

    Better yet, post a story to slashdot about a contest with a prize of 10k, read all the responses moded at 4 and above, spend a weekending coding a few of em up, and cash in!

    Now that's evil!

  8. Don't post them or they'll be Googlewhackwhacked by clary · · Score: 5, Funny

    The Googlewhacking site lists reader-submitted Googlewhacks...which of course causes Google to pick up a second site for the search. And so the Googlewhack is whacked!

    --

    "Rub her feet." -- L.L.

  9. jargon watcher by MbM · · Score: 5, Interesting

    Write an application to track keyword usage over time, when a keyword goes from only 10 hits to several thousand then flag it for jargon. The jargon can then be presented as a webpage of the top whatever with various statistics over popularity and suspected origin urls.

    --
    - MbM
  10. six degrees of google-ation by anthony_dipierro · · Score: 5, Interesting

    Connect any two pages on the web to each other with the minimum number of hyperlinks.

  11. Free Labor - Tom Sawyer Effect by Embedded+Geek · · Score: 5, Insightful
    Many posters have commented on how Google will essentially get free labor out of this (by having thousands of man hours expended for that $10K prize). The only thing that surprises me is that people think this is innovative/new/evil/dastardly or otherwise unique. Fact is, it's old hat.

    I mean, how many contests have you seen on the back of a cereal box to "create a new slogan!" or "write an essay"? Just a cheap way to create some buzz and get your customers to write your advertising copy for you. Heck, the most blatant scams in memory are HBO's Project Greenlight (trolling for scripts - you don't even want to know what the Writers' Guild thought of this) and the Lego Film Contest (trolling for complete commercials).

    Hardly new stuff. Remember Mark Twain's Tom Sawyer? There's a bit where he holds a "contest" to see which kid can whitewash the fence he's supposed to paint fastest. I'm sure that even as Twain wrote that bit, even he thought "I better be sure to give the fence painting thing a unique spin so it works. After all, it's an awfully old idea..."

    --

    "Prepare for the worst - hope for the best."

  12. Riiight... by jonr · · Score: 5, Insightful

    When did you last donate to Google? How many times have you used Google on your job, saving your self and your company money? Where is the friggin' "Do it for the love of coding" thinking now? I would be happy to enter (I just need the right idea ;)) and if Google gets better because of my code, so be it!
    J.

  13. Useful or interesting? by Mr.+Sketch · · Score: 5, Interesting

    It seems like it would be very easy to come up with something interesting, and only a small fraction of those interesting things are actually useful.

    Examples of a few interesting non-useful things I can come up with just off the top of my head:
    Google Poet: Generate rhyming poetry from randomly rhyming sentances on the webpages in the database.
    Googlesaic: Input a picture and scavenge the webpages for pictures from which to create a large mosaic of the input picture.
    Google Map: Create a picture/graph of all the website connections (links) in the webpage list, perhaps add 3d/naviations. Perhaps perform graph opererations and maybe find the longest path one can travel through the links and still stay within the Google search results/database.

    These are just a few, I'm sure plenty of other people can find much more exciting/interesting things to do, but they won't always be useful to the google company.

  14. Search Engine Wars by Van+Halen · · Score: 5, Interesting
    I already made a game last year I called Search Engine Wars. I wonder if it would qualify?

    It's a party game. The basic idea is that a bunch of people are in the game, and it goes around in turns. On your turn, you type in a few words to search for. The game goes and queries google for the first hit on that search, and sends everyone's browser to that page. Then the other players get 100 seconds to guess which words you searched for. The first player to guess correctly gets points for the amount of time remaining.

    It's written using BYOND, which you'll have to download if you want to play.

  15. Re:This is brilliant by epsalon · · Score: 5, Informative

    If you read the rules, you will see that you don't even have to assign copyrights to Google. You only have to give them a license. This means you can GPL your code or even BSD it. Sounds fair to me.

  16. JWZ Has the winner, and the runner up... by thehossman · · Score: 5, Interesting
    JWZ allready wrote the coolest apps I've ever seen that harvest the power of Internet search engines...

    Webcollage -- slowly builds a random collage of images from the net.

    DadaDodo -- generates random sentences based on word probabilities in pages on the net.

    --
    -- The Hoss Man
  17. Well, here's an idea.. by shayne321 · · Score: 5, Interesting
    Here's a free idea to anyone who has the time/initiative to code it (i.e. Not Me): a program that scans a page and rates it with an annoyance rating (x out of 100?) based on annoying things you'll find on the page if you open it: webbugs, cookies sent back to doubleclick, pop-unders, banner ads, java applets, BLINK tags, poorly formed HTML/CSS, broken images, sql/asp/php errors, etc. The higher the number the more annoying the page, and therefore the more likely the user is to click a different search result. Google could also tie it in to their ranking system to rank annoying pages lower in the results. Seems to me like it'd make the web a better place.

    Shayne

    --
    Today I didn't even have to use my AK; I got to say it was a good day -- Icecube
    1. Re:Well, here's an idea.. by shayne321 · · Score: 5, Interesting

      Another idea is to just count the number of HTML errors as the annoyance factor.

      That's not really what I had in mind... HTML errors are nowhere NEAR as annoying as pr0n sites that pop open ads all over the place, resize your browser, bookmark themselves, etc, etc. That's what I mean by annoyance, the kind of site that makes Joe Sixpack (as well as me) get upset when he gets stuck in a loop that for every window he closes two pop open. I'm more worried about discouraging sites from using bad behavior than I am encouraging them to use proper html. Of course, malformed html should ADD to the annoyance factor, but not be the only thing counted. That's my opinion anyway.

      Shayne

      --
      Today I didn't even have to use my AK; I got to say it was a good day -- Icecube
  18. My program by Anonymous Coward · · Score: 5, Funny

    s/www\.microsoft\.com/www\.goatse\.cx/g

  19. Re:Very good by ichimunki · · Score: 5, Insightful

    $10,000. 8 weeks til deadline. 40 hours per week.

    That's 10000/(8*40) = $31.25 per hour.

    Annualized that would be a salary of $65,000.

    Even in IT, that's nothing to sneeze at. But I'd say the benefits of winning a contest like this go beyond the money.

    --
    I do not have a signature
  20. How about... by jjeffries · · Score: 5, Funny

    ...something that looks through that data and finds the interesting bits based on a set of terms that the user provides?

    Or has someone done that already?

  21. Finding Programmers! by rbeattie · · Score: 5, Interesting

    Sure beats hiring programmers.

    No, that's it!

    According to this article Google is getting deluged by resumes, this is just a way for them to weed out the 600+ resumes they get a day.

    The winner of this contest (and maybe a few of the runner ups) will most likely get a job offer as well. Beats having to weed through 4200 greatly exagerated CVs every week...

    -Russ

    --
    Me
  22. Re:Free ideas and free code development for Google by WNight · · Score: 5, Interesting

    The problem is that ideas aren't worth a lot without a way to use them. I've had a lot of neat thoughts about mapping connectivity and so on, but without something like Google to run it on I'd have to spider the whole web myself on my cable.

    They might get a good idea, but if you don't win the contest they don't really have much of a legal leg to take your idea, so you're pretty safe unless you're the winner, in which case you get $10k for hacking together a script that you never could have afforded to run anyways. (It's only concept they want, not the polished results of a 2-month dev process.)

    It honestly sounds like a good deal to me. I hack for a night or two on a project that I find interesting. If I lose, no big deal. If I win I get 10k USD (3 months wages for me, I get paid in Canadian $s) and I'd be famous in exactly the circles who are looking to hire a coder with good ideas...

    People go on about the value of ideas all the time, but really, without proper backing ideas are a dime a dozen. I've said many time "Hey, how about a ..." and seen it advertised a few years later. That doesn't mean I lost out on it, because I didn't have the cash to develop it let alone market it.

    This is why patents on wide ideas are so damaging. Any idiot can have a good idea every now and then, but it takes more work (and funding unfortunately) to make them fly. If you let someone with an undeveloped idea block off a whole field it does a great disservice to the people with the ability to follow through, who likely had the idea independently.

  23. Strange but true.. by dr_labrat · · Score: 5, Funny

    A friend of mine accidentally typed:

    fat misgets fucking

    into google....

    Google knew exactly what he meant....

    --
    The secret of success is honesty and fair dealing. If you can fake those, you've got it made. (Marx)
    1. Re:Strange but true.. by Mignon · · Score: 5, Funny
      A friend of mine

      I've had friends like that too.

  24. Re:This is brilliant by Tom7 · · Score: 5, Interesting

    Unfortunately, all the comments at 4 and above are complaining about how Google intends to rip people's ideas off.

  25. Re:Free ideas and free code development for Google by Ragin'Cajun · · Score: 5, Interesting

    For once, I just might agree with a binary only submission. That way if Google is truly interested they can license the code from the developer or have some sort of other agreement / arrangement.

    It isn't like Google is offering up their source to the rest of the world, so I don't see why it is unreasonable to only offer up a binary to them.

    Well, they *have* been running the best search engine on the web FOR FREE for the past 3 years. They don't clutter their main page with flashing X10 ads, or the the irritating news+sports+weather+financialnews+email combo that everybody seems to think people want. This might not be a bad way to give something back to the company that's saved us so much time and effort finding information.

    And to the guys out there who wouldn't bother with this contest for less than $100K: if your idea is so good, go develop it yourself! Get a lawyer, and work out a deal with Google that suits you better.

    --
    --It's all fun and games, 'till someone loses an eye. Then it's one-eyed fun!--
  26. Sort results by W3C standards conformance by chrysalis · · Score: 5, Interesting

    So that pages that can properly be read by any browser comes first.
    Then, maybe webmasters will stop doing IE-only pages.

    --
    {{.sig}}