Google Patents Search Algorithm
blastedtokyo writes "Google gets the first web search patent. According to this News.com.com article, Google was able to patent how they crawl and rank web pages. They claim "an improved search engine that refines a document's relevance score based on interconnectivity of the document within a set of relevant documents.""
It's not really their Search algorithm, it's their method of comprehensive PageRanking.
They basically measure Web pages as either 1) portals, or 2) authorities.
Sites like Kuro5hin and *nix have a lot of "Google juice" (i.e. weight in their ranking system) because they have so many links to other sites, while also garnering a slew of links to their main page.
Reply or e-mail; don't vaguely moderate. Ex-O'Reilly/MIT employee, now a full-time Google employee.
Google didn't invent the concept behind PageRank, just its name. See my E2 writeup on citation analysis for more.
..google, you will feel their wrath
Live web cams
I find it interesting that because it's google, some /.-ers are saying essentially "good for them!" But at the heart of it, it makes no difference who it is or what their intention is.
Kids, software patents are bad, mm-kay...
Think fuel injectors, for example, which are made by several suppliers, but have a patent holder who gets license revenue.
Stop by my site where I write about ERP systems & more
What a coincidence. Today's UF topic covers patent obsession. Check it out. Although amazon.com is the target of the joke, it shows how patent-obsessed software companies can be. I'd say it sure does a good job satirizing it. Who knows? Maybe Google will be targeted in tomorrow's strip.
May be, may be not.
a ci je/F/F3/F3_f.pdf
Some have been talking about similar techniques since before this patent was filed:
http://www.carnet.hr/cuc/cuc2000/radovi/prezent
http://citeseer.nj.nec.com/context/856618/0
I didn't pay a search engine optimization service to make this happen. I didn't use any tricks like "doors" either. It cost me no money, but it did take time and hard work to achieve it.
I explain everything I did in How To Promote Your Business On the Internet.
What's my secret? No secret at all:
- Put stuff on your site that people find interesting and useful
- Ask people for links, and give them reciprocal links in return.
That's it. But read my article for the full discussion, as well as an explanation of why I'm telling everyone my secret.Other pages I have that you may find helpful are:
-
Market Yourself - Tips for High-Tech Consultants
-
Search Engine Submission Form Index
My most popular page is a C++ style guide called Pointers, References and Values.and finally, from my K5 diary, A Webmaster's Strange But True Tale.
Thank you for your attention.
Request your free CD of my piano music.
read the patent
TWW
"Encyclopedia" is to "Wikipedia" what "Library" is to "Some people at a bus stop"
More worrying is that software patents are sometimes granted using such general language that the entity getting the patent *doesn't* really have to disclose anything, enabling them to get both protection while keeping their invention secret, which is exactlty the opposite effect of what patents were intended for -- to get duplicable knowledge into the public domain after a period of protection for the original inventor.
Dude, I think I can see my house from here.
A) The algorithm is highly useful.
B) It required a significant amount of risk and technical effort to make it worthwhile.
C) The scope of the patent really just covers what it is that they've added, i.e., the ideas that they are supposedly deriving from are not being locked up.
What more do you really need to know? Regardless of what language you wish to put your claims in, that they've just made a "context shift" or what have you, it is a worthwhile effort and it is the kind of effort that requires the potential for substantial profits to secure continued efforts. People don't take risk without at least the potential to profit and the greater the potential reward the greater risks people are willing to take. Are you really going to argue that the idea was obvious or easy? If so, then explain why no one did it before, when billions of dollars and many years were (and are) being spent on such internet technology. There was a considerable lag time between the appreciation of the need for a good search engine (and the resources to develop them) and google's appearance. What's more, keep in mind that:
a) Google's core methodology is no secret now
b) The patent's life is limited.
c) The ideas that they presumedly derived from a still as open as they were prior to this patent
d) This country produces far more than any country despite the fact that we arguably "share our toys" less than most countries, even more than countries with much larger populations (even technically educated ones)....
Now I agree that there are dangers in allowing people to patent any and everything, e.g., well known sorting algorithms and other fundamental building blocks, but this clearly is not happening here.
The patent notice contains a U.S. patent number. When entered into the USPTO search engine, a patent number calls forth a complete description of how to implement an invention.
Will I retire or break 10K?
But since USPTO considers "find a common knowlegde algorithm and patend a way to do it with computers" a valid patenting method, they probably would not consider it a prior art.
MSDOS: 20+ years without remote hole in the default install
Well, actually...
See J.M. Kleinberg, "Authoritative Sources in a Hyperlinked Environment", Proceedings of the 9th ACM-SIAM Symposium on Discrete Algorithms, ACM Press, New York and SIAM Press, Philadelphia, 1998, pp.668-677
That discusses the HITS algorithm, which is the core of PageRank (which is a simplified version of HITS). Sergey Brin and Lawrence Page in fact developed [1] Google from HITS [2].
References:
[1] S. Brin and L. Page, "The Anatomy of a Large Scale Hypertextual Web Search Engine", Proceedings of the 7th World Wide Web Converence, Elsevier Science, Amsterdam, 1998. pp. 107-117
[2] Chakrabarti, Dom, et. al., "Mining the Web's Link Structure", Computer, August 1999. pp. 60-66
The Signal/Noise ratio can be improved in two ways. Remaining silent is the OTHER way.