Google Experiments
gafferted writes "The boffins at google have been experimenting with new toys, such as Keyboard Shortcuts and glossary, but most fun is Google Sets. Try "green, purple, red" to get a set of 40 different colours. Try a set that contains both Richard Stallman and Bill Gates, see what google associates with Slashdot or ask for a set of rude words."
...but you've Slashdotted google!#*(@Q$#^$
wow.
Janie took my gun...
It looks like the only machine that's melted is the lab1.google.com one. I'm just dreading what the Slashdot effect is going to do to that poor Voice Search phoneline!
"Why did they cancel my favorite Sci-Fi show? I downloaded ALL the episodes!"
... "Google" and "Jumps the Shark"?
- DDT
So long, michael. Don't let the door hit you...
If you're a huge Google fan (and aren't we all) check out Google Weblog. They had this story 2 days ago, plus they keep you up to date on other cool Google happenings.
And no, it's not my site. I just think it's cool.
daed si luap
The one time I actually want to read what the story's about, and the webserver is down...
See here. Who wouldn't want to be a Forward Observation Officer in the navy? :)
I work at a place that is kind of touchy about content served up to those who signed the agreement to be allowed online, and that link to rude words I think needs one of the fark "Not Safe For Work" things after it.
Yeah, the thing doesn't link to boobies, but grepping for incoming text vs. grepping for inbound boobies is a tad easier for log generation.
Besides, I thought rude words just involved being insensitive, not foul.
Wheeeee
I am in awe.
I fed it Hugh, Pugh and Barney Mcgrew - and it gave the right answer.
It can't be far from becoming self-aware.
george
/. and penisbird would have been more appropriate. I'm waiting for poor lil lab1 to come back up so I can try natalie portman and hot grits myself.
What?
No wonder it is still beta, it associates Rob Malda with RMS and Bill Gates!
Richard Stallman
Bill Gates
Linus Torvalds
Larry Wall
Bruce Perens
Eric Raymond
Steve Jobs
Brian Behlendorf
Chris Dibona
Larry Augustin
Rob Malda
Michael Tiemann
Randal Schwartz
Jamie Zawinski
I've had enough abrasive sigs. Kittens are cute and fuzzy.
Google assiciates rude words with Slashdot, but i think the sysadmins were typing them in manually for each query i made while their servers were grinding to a slasdotted halt.
It seems that in addition to labs1
there is also labs.google.com
labs2.google.com
labs3.google.com
labs front page works(right now) the rest don't, and links off of labs, try to go to labs1 and crash and burn...
Dilbert Google Doodle
Web Images Groups Directory
______________________
| | <- Google search
|______________________|
o Advanced Search
o Preferences
o Language Tools
New! Advertise with Google's cost-per-click AdWords Select.
Advertise with Us - Search Solutions - News and Resources - Jobs, Press, Cool Stuff...
©2002 Google - Searching 2,073,418,204 web pages
Damn lameness filter won't let you have a joke so forgive the junk:
# Important Stuff: Please try to keep posts on topic.
# Try to reply to other people comments instead of starting new threads.
# Read other people's messages before posting your own to avoid simply duplicating what has already been said.
# Use a clear subject that describes what your message is about.
# Offtopic, Inflammatory, Inappropriate, Illegal, or Offensive comments might be moderated. (You can read everything, even moderated posts, by adjusting your threshold on the User Preferences Page)
Problems regarding accounts or comment posting should be sent to CowboyNeal.
--
Reverse outsourcing: it's the future
I spent a couple hours playing with google labs last night and one of the most impressiev things (to me) was how I put in 3 bands:
Nirvana
Alice in Chains
Pearl Jam
and received one of the most accurate lists of other grunge and hard rock/heavy bands back that I've seen. To be able to build such a list on something as subjective as music is very impressive, and shows you just how good the quality of google's algorithms are.
$45 per U Colocation Special
Also try the beta for Google's Catalog Search
http://www.kubuntu.org/
5. What happened to that cool thing I was playing around with last week?
The prototypes on Google Labs are meant to be low maintenance experiments. If one disappears it may be because no one was interested enough to use it, it wasn't stable enough for users to try it out, or it was so wildly successful that heavy usage brought the server to its knees. While that particular application may not reappear, there should be something equally interesting to replace it shortly.
So after today's /.ing, are they going to replace the entire lab site?
I discovered the voice search yesterday (and submitted it but was rejected... but that's not the point). It was pretty fun - since it's slashdotted, though, I'll mention that it worked for me: when I said "The Simpsons," it gave results for "The Simpsons" and "The Sims," which is understandable. Somebody else did a search for "ISDN" and got results for "ISDN" as well as "ISBN." The last search was for "Corvette," which gave a lot of results that contained "Court of" in the title, but the sidebar on the right (the paid sponsors) had links to Corvette sites.
So, bookmark that site and someday in the future, when it is not slashdotted, try the voice search! It's not a toll-free number, but the coolness factor is well worth it. I don't know where it would be really useful (you still need a web browser to view the results), except in the case where you know how to pronounce a word but not how to spell it.
I really hate signatures, but go to my website.
If Google were smart they'd use the w, a, s, d keys instead of i, j, k, l. My fingers automatically go to wasd after many, many games of Quake 3 -- and many other games using the same engine. :)
mbbac
Google caches sites that don't cache themselves, but Google doesn't cache itself.
Shouldn't Google cache itself?
I am Russell.
Assembly is the reverse of disassembly.
In the spirit of automated classification and machine learning, I
tried searching for a set containing "boosting", "SVM" and "bagging"
(without "bagging", nothing new is found; another problem).
Results: "SVM", "Bagging", "Boosting", "stacking" and "Other methods".
Clicking on either of the 2 new links ("stacking" or "Other methods")
takes me to the normal Google search on the term. This is of course
not useful -- I need the terms in the machine learning context, which I
cannot get.
Especially in the case of "Other methods", it would be nice to be able
to get to the page Google had in mind...
2 dashes and a space, or just 2 dashes?
For example, try entering "Frank Welker", "Don Messick", and "Maurice LaMarche" (all cartoon voice actors). Under "small set", you get back nothing but a failure page that implies you need to change the terms to get results. The failure page doesn't even have the "expand your search" link that successful small set searches have. But if you use "large set", you get back 3 additional items (well under the 15 item set limit of a small set).
Looks like an easter egg to me. :)
Google's cache of www.google.com
Beware: In C++, your friends can see your privates!
Wow I entered Natalie Portman into google sets and i got 'hot grits' back!
If you're talking about the Dilbert thing, it is by design.
;) (You know, the &hl=sv&ie=UTF8 part)
If you're talking about the language thing, it is your mother tongue.
Now you can connect him with everything in the universe.
I can only hope that if AOL switches to Mozilla as their core code, and Google starts seeing millions of Mozilla hits, they might rethink this decision.
sPh
I was talking about the Dilbert thing.
:)
:)
Yeah, I read all about that Dilbert thing afterwards by clicking on the logo. But it looked funny as I hadn't seen the logo before even if I'm a frequent Google visitor.
Hmm... So you're saying I speak UTF8? Nah, that would be inefficient, requiring twice as long time to... Umm... Sorry, now I confused with Unicode again. *hides*
Beware: In C++, your friends can see your privates!
Oh, it's on the International Google! doh!
:)
:(
Well, it got +4 Funny currently so someone must be amused.
Or perhaps they're just amused by my stupidity.
Beware: In C++, your friends can see your privates!
I'd be pretty damn mad if my project got the slashdotting that this has brought down on labs1.google.com...
Wah!
You have clearly generated bad karma in millions of your past lives.
Come on, give it up, that's
Go to www.google.com. Click "Preferences". Change the drop-down box next to "Number of Results" from its default (10) to anything up to 100.
This feature has been there for years.
We who were living are now dying
With a little patience
The ultimate test: Jenna Jameson, Sylvia Saint, Houston, Chasey Lain, Christy Canyon
...and it passes with flying colors.
A speech...
This makes me wonder what Google will be like 10 years from now. Will they accidentally release a Lawnmower-Man-like entity onto the Internet and then categorically deny everything? Will they have to contribute to a special SuperFund for Internet pollution? Will we see cartoons of Men In Black spray-burning suspicious goo off the Google logo?
Here's a funny list of misspelled Google searches for "Britney Spears". Google was able to automatically spell-correct all of them.
Britney Spears
cpeterso
From the Google Labs page:
Thanks for your interest in Google Labs.
The lab is temporarily closed as we deal with an experiment that got slightly out of hand. Nothing to be concerned about, really. All of our engineers are perfectly safe and there was never any real danger of it escaping into the wild.
Please check back in a few hours. Everything should be back to normal then and science will march on once again. We appreciate your patience.
I'm convinced that Google will become a giant AI. Google Sets seems like a small step towards machine understanding. The problem with older AI was bootstrapping their knowledge base. The Google AI systems will use the entire internet as an encyclopedia of self-correcting, peer-reviewed, continually-updated "facts". Suddenly, the problem of manual data entry for a AI system like Open Cyc is massively parallelized to the entire population of web users! Of course, the web is full of lies and self-promotion, but the web contains multiple voices, multiple "truths", that will create a general consensus using Google's PageRank algorithm.
cpeterso
Speaking of boobies (and I do like to speak of boobies), I was impressed by Google's completion of the set "hooters, knockers, fun bags":
Google on boobies
That's pretty impressive! I think it could pass the Turing test!
That's it. I'm no longer part of Team Sanity.
Each query phrase produces a set of documents, i.e. web pages. The intersection of those sets gives a small set of docs which is pretty much the same thing that a normal google query (or any search engine) will return, if all the queries are ANDed. Then the new feature is to find the intersection of all the terms from all the docs in the doc-intersection set. That is, return all the terms that are common to all the docs.
e.g. in pseudo-code: Assume ...}. ... ...; // so docSets contains the URLs of the docs that have all the query terms // ws will contain the running intersection of the set of words in all the docs
- G is the normal google search engine.
- G.query("search phrase") returns a set of references (URLs) to docs, e.g. {u1, u2, u3,
- u.terms() returns a set of all the words contained in the doc referenced by u, e.g. if u=="http://slashdot.org", then u.terms() == {"news", "for", "nerds", "slashdot", etc.}.
- * is a set intersection operator.
s1 = G.query(q1); s2=G.query(q2); s3=G.query(q3);
docSets = s1 * s2 * s3 *
ws = docSets[0].terms();
forall url in docSets { ws = ws * url.terms(); }
return ws;
So my guess is that ws is the final set of terms returned by the google set. Of course, the words should be sorted by some meaningful metric, e.g. frequency. This is all very easy to implement and can be done very quickly, because finding the document set intersection and the word set intersections can be done very quickly using sparse vectors to represent word or document vectors.