Google Patents Search Algorithm
blastedtokyo writes "Google gets the first web search patent. According to this News.com.com article, Google was able to patent how they crawl and rank web pages. They claim "an improved search engine that refines a document's relevance score based on interconnectivity of the document within a set of relevant documents.""
Let's start screaming about how evil patents are and... oh wait, it's Google (and /. loves Google), so we'll get "Thank God they're this innovative and patented it before someone else stole it."
It's not really their Search algorithm, it's their method of comprehensive PageRanking.
They basically measure Web pages as either 1) portals, or 2) authorities.
Sites like Kuro5hin and *nix have a lot of "Google juice" (i.e. weight in their ranking system) because they have so many links to other sites, while also garnering a slew of links to their main page.
Reply or e-mail; don't vaguely moderate. Ex-O'Reilly/MIT employee, now a full-time Google employee.
They thought of a way to improve upon an existing invention. They were the first to do it. They want to make money from their idea. It's only logical for them to seek a patent. I guess congratulations are in order!
Patents are a tool for creating temporary, artificial monopolies.
With that said, aren't you glad Google might be able to stay on top and profitable, instead of having to resort to banner ad revenue, etc?
Google didn't invent the concept behind PageRank, just its name. See my E2 writeup on citation analysis for more.
I am not quite sure of the purpose of this article since most patent articles are intended to point out the ridiculousness of the patent system, but this seems like a pretty legit patent to me. They developed a technology that is superior to their peers, that they developed completely in house w/out ripping anyone off. This passes my shadiness test. If anything, we should all be happy now that Google will be publishing some of the details for their system.
Google's way of doing thing was certainly not the first way to search, it is not the most obvious way to search, it is not the only way to search, and it might not be the best way to search (something better likely will come along). In other words, I don't think this patent will harass many others at all.
This is nothing near as bad as Amazon patenting message boards attached to sale items, or "one-click shopping" being patented.
Wow.. an internet patent that might actually make sense. It's not "A method to search through an index of web pages for relevant links to a user request for specific information." But the improvement on it. And it's generally accepted that Google DID improve web searching tremendously and have a unique method of doing it. Of course, this means it will be struck down immediately by some small company that gets a broader patent (see above) and sues them.
- In hell, treason is the work of angels.
at least we can still use
At least the patent is not so about a "common sense" technology (at least, not was in '96), and I don't think that google will sue the other search engines that refines a little the PageRank concept (like i.e. <a href="http://teoma.com">Teoma</a>) but to avoid someone else patent this or something very related.
Back when Page and his Stanford pal created Google, they had planned to just simply create a really snazzy and useful research project. From day one and for a couple years, they assured everyone that they would never sell-out and their algorithms and code would remain in the public view.
However, things changed, and they quickly hopped onto the dot-com bandwagon. With this privatization, they closed all their notebooks and journals and stopped teaching others how to implement a great webcrawler and search ranking system.
They made out well, but I feel that the CS community lost a great number of resources. I'm proud of Google and I use it a lot, but I just wish they'd have remained a bit more loyal to the open source community that they started off with.
If it weren't for open BSD code and free database software, Google wouldn't exist today. Don't forget that.
Reply or e-mail; don't vaguely moderate. Ex-O'Reilly/MIT employee, now a full-time Google employee.
Aaaagh! Patents are bad! Patents are bad!
(Psst - hey, Google's getting one.)
Uh, well, (grumble) I guess that's okay then, er...
Bring on the wave of apologists.
--riney, Karmakaze
..google, you will feel their wrath
Live web cams
I'm in two minds about this. Should Google get a patent for this? Google have innovated here, and thus the patent is a valid way to reward the effort they put in to designing the system, in exchange for the idea entering the public domain after the patent expires. While the duration of patents in IT related areas needs to be drastically shortened if they're to serve their original purpose, I'm not inherrently opposed to patents like this. The question then becomes, is it sufficiently obvious to anyone in the field that it shouldn't be patentable? Well, it's a tough call. The fact is that no one had done anything like that before Google. If it was so obvious, why not? My personal view is that it's obvious enough that if Google hadn't done it, someone else would have done within a couple of years. So while I don't think the patent should have been granted, I don't think it's as cut and dry a matter as it may at first appear...
"The invisible and the non-existent look very much alike." -- Delos B. McKown
What's wrong with what Google is doing? They're simply trying to keep an "edge" on the market. The reason why they're the best search engine out there is because they figured out how to make a better way to rank pages. They deserve to reap the benefits of that invention without anyone else cutting in on their business.
As for the "googling" incident, I just think they're attempting to defend their trademark. If you don't do that kind of stuff, you lose your trademark. Kinda like how Kleenex and Xerox lost theirs (everyone says "may I have a kleenex?" or "could you xerox this?" and so it became colloquial and no longer a trademark).
All Google is trying to do is cover their ass. If they decide one day to try to patent the search engine, then there'll be reason to get up in arms.
I find it interesting that because it's google, some /.-ers are saying essentially "good for them!" But at the heart of it, it makes no difference who it is or what their intention is.
Kids, software patents are bad, mm-kay...
...because they're Google. But if it were Microsoft patenting "an improved method for giving help to users", say maybe the help files vs. man pages, people would flame about prior art, talk endlessly out of their anuses about how Bill Gates is trying to wrest control of the tinfoil hat co-op from Mac users, and generally be nuisances.
/.ing while in class, but honestly, people. Google gives a C&D letter, we all golf clap and say "way to defend your IP!" Someone else does it, and we all run to chillingeffects to boycott / whine / gripe / whatever.
I love
Here's a thought... get off your hobbyhorse, and start evaluating things based on FACTS, not the general feeling of techno-elitism you get from pretending you're cool because you get jokes written in PERL.
And mod me -5 Troll, if you want. But it's the damned truth, and you know it.
-theGreater.
Now that they've patented their technology, surely that means that it's open to public scrutiny and therefore abuse as people exploit it's shortcomings.
Like tinyurl, but one letter less! http://qurl.co.uk/
Do you realise that the Google search you link to, shows your comment as the top result? Its a Google loop!
Patents aren't technically evil. It's just the way they're used.
.smell my feet.
There's a reason I only ever use their search engine now. Well two reasons. One is that about half the time I run searches there, what I'm looking for is the first thing on the list. The second is they are very not obnoxious about their advertising. And I've probably clicked through more google ads than any other banner ads on the net. That's right, I'm much more likely to follow information that looks like it pertains to what I'm looking for right now over some obnoxious Javascript ad (Which usually make me turn Javascript off and reload.)
Very similar to Google's method, I've seen CNN and USA Today run ads disguised as news stories in their tech sections. Unlike google, which clearly marks the ads, CNN and USA Today are simply compromising their journalistic integrity. As if those two words have been put together in a single sentence since Cronkite left the industry.
Where was I? Oh yes. In principle, software patents offend me. Well... and most of the rest of the slashdot population apparently. Being able to patent something that doesn't have a physical presence (Be it programs or math or business processes) is counter-productive. Especially since the patent office seems to rubber stamp every application that hits their desk. Hey. If you don't like it, write a civil nastygram to your congresscritter. Do NOT use the word "Fuck." That tends to turn them off. And in extreme case, get you visits from very grumpy people who seem to have something against doors. We're starting to see some technologically clueful folks in office, so the more people who write, the higher the chance that someone in the know might get the message.
I'm trying to teach myself to set people on fire with my mind... Is it hot in here?
Having a patent on it means that Google will be the only viable search engine for the next twenty years if it chooses not to license the patent. Is that what we really want? I could see four or five years, but twenty years is a good percentage of my lifetime. Google is an innovative company, but who's to say somebody couldn't do it better after a few years by building on the idea. The first implementation almost always sucks compared to clones.
Think fuel injectors, for example, which are made by several suppliers, but have a patent holder who gets license revenue.
Stop by my site where I write about ERP systems & more
So, the bright side of this patent is that perhaps it will keep others from focusing on Google's obsession -- the reference popularity contest. But like any patent, it is subject to abuse, not that we know at all how Google intends to enforce it.
I have requested improvements to Google's algorithms for years to make it more possible to search for a specific thing, rather than just a popular thing, but they don't have engineers, apparently, who understand these basic needs.
AltaVista lets you wildcard, search for one word NEAR another word, use common words as part of a phrase, and construct a variety of very useful filters that are impossible with Google's popularity engine.
AltaVista used to be the best out there, but compromised their own usefulness. If AV indexed more pages and had not dropped their usenet coverage, it would still be the most useful engine by far to an advanced searcher -- one looking for very specific things. I still go there often. Just because the masses use Google does not make it quality or best for advanced users. They have stagnated for years now. The masses use a lot of things produced by monopolists who are no longer required to innovate or even improve to the level of the competition.
Kinda like how Kleenex and Xerox lost theirs (everyone says "may I have a kleenex?" or "could you xerox this?" and so it became colloquial and no longer a trademark).
?!?!?
Both are still very much trademarks. You may use the words in conversation, but I can't make a photocopier named "Xerox."
What Google is doing is stupid. Mostly because, "to google" refers only at this point to USING GOOGLE!. (Not search)
So close and yet so far from the world's perfect ID number
That is not the patent for PageRank.
PageRank had already been patented by Stanford University, just before Google was created, when it was a community effort.
This new patent is a patent over an improvement of PageRank, what they call now "LocalRank" and "NewRank". It is designed to stop competitor from developing pagerank-like technologies. Armed with that kind of patent, they can stop open-sorce Aspseek, Teoma and others from developing similar technologies.
What they are tryng to do is extend patents over citation ranking and peer-review, something that has been around since the creation of the first libraries. This is NOT good.
Basically, this means no more money from the suits to any citation-ranking related effor in any start-up, fearing litigation. It could mean also no more installations of open-source Aspseek (Google Appliance's competitor )in corporate environments, because of fear of litigation.
This is sad.
read the patent
A patent is like a baseball bat made out of rubber. When you're competing against other companies - you both get out your set of rubber baseball bats and hand them over to your lawyers who proceed to pummel eachother. After a while - one of the sides will tell their lawyers to stop since they're running out of money (lawyer batsmen are rather expensive). The side who gives up looses - the winning side buys the looser for 5cents (since they're bankrupt).
/m
These are the current rules (in the US, but also to a varying degree in the EU) of the "game" called free enterprise. They are quite senseless and arbitrary - but you have to adapt since lawyers equipped with rubber baseball bats exist whether you want them to or not.
Ultimately, every people has the responsibility for its government (if you cant handle this responsibility - then you become a refugee) - and subsequently also for the laws passed. If the rules of the game are ignorant, they are so for a reason. And since ignorance is usually expensive - in the long run, someone will allways have to foot the bill. Bad policies allways have a monetary cost.
Amazon, Google are naturally doing the right thing since their primary task is to generate profits for their owners - it is not to make policy or specify the rules of the game (the government is supposed to do this on behalf of the voters).
The politicians are obviously doing the right thing since they're basically excercising their mandate of doing what the average joe has given them authority to do.
- so I guess that makes the average joe the bad guy/gal. At least the responsibility lies with the same people who are going to pay the price.
It kinda reminds me of a favourite quote: "if you think education is expensive - you should try ignorance!"
There seems to be a lack of understanding about the original purpose of the patent system. The the distant past, knowledge was transferred from artisan to apprentice and through guilds. Back then, as now, people were very protective of their intellectual property, as it was their livelihood, so it would not be stored anywhere. If the person were to die without passing on the knowledge, it would be lost forever (like Damascus steel).
To try and stop knowledge from being lost, governments introduced a patent system (first patent recorded in 1449) so that the creator of the knowledge would still get a fair financial reward for the item.
IMHO there are 2 problems with the existing patent system implementations.
1) As the technology becomes more complicated, those who verify patents are not skilled enough to accurately judge their validity.
2) The time limit of patents is too inflexible. Many technology patents should have valid lengths of 5-10 years.
First, that was the nicest C&D in the history of them, if you can even call it that. They politely *asked*, not demanded, webspy to change their definition to mention Google's trademark. Had that been M$, they would have sent over Vincent and Jules to go midieval on their asses. Ezekiel 25:17 would have rained its vengence upon them. Nah, Google did that nice. I agree, they might not have had to do it, but it was the kind of grey area that makes lawyers nervous. Overall, they did OK.
Second, Google patented more than an "improved method of helping users." This isn't like Amazon, where they basically patented efficiency (thanks, USPTO). Google didn't patent *all* ways of serving up better results. They patented their fairly specific method, which they were in fact the first to practice. There were a lot of search engine companies at the time - if it were obvious, someone would have been doing it. So I think it passes muster there.
I do agree that people tend to kneejerk on this site, but there were a lot of people during the C&D discussion who kneejerked against google too - so I don't think this blind acceptance of google is really a problem here. Blind hatred of M$ is more likely.
-Looking for a job as a materials chemist or multivariat
What probably makes it special and innovative is that it has the words using a computer somewhere in the definition.
Don't forget _automatically_.
Any human can play Go. But if you come up with an algorhythm to let a computer play Go by itself, then that's a patentable invention.
A) The algorithm is highly useful.
B) It required a significant amount of risk and technical effort to make it worthwhile.
C) The scope of the patent really just covers what it is that they've added, i.e., the ideas that they are supposedly deriving from are not being locked up.
What more do you really need to know? Regardless of what language you wish to put your claims in, that they've just made a "context shift" or what have you, it is a worthwhile effort and it is the kind of effort that requires the potential for substantial profits to secure continued efforts. People don't take risk without at least the potential to profit and the greater the potential reward the greater risks people are willing to take. Are you really going to argue that the idea was obvious or easy? If so, then explain why no one did it before, when billions of dollars and many years were (and are) being spent on such internet technology. There was a considerable lag time between the appreciation of the need for a good search engine (and the resources to develop them) and google's appearance. What's more, keep in mind that:
a) Google's core methodology is no secret now
b) The patent's life is limited.
c) The ideas that they presumedly derived from a still as open as they were prior to this patent
d) This country produces far more than any country despite the fact that we arguably "share our toys" less than most countries, even more than countries with much larger populations (even technically educated ones)....
Now I agree that there are dangers in allowing people to patent any and everything, e.g., well known sorting algorithms and other fundamental building blocks, but this clearly is not happening here.