Slashdot Mirror


Google Patents Search Algorithm

blastedtokyo writes "Google gets the first web search patent. According to this News.com.com article, Google was able to patent how they crawl and rank web pages. They claim "an improved search engine that refines a document's relevance score based on interconnectivity of the document within a set of relevant documents.""

29 of 362 comments (clear)

  1. OMG MORE PATENTS!!! by govtcheez · · Score: 4, Insightful

    Let's start screaming about how evil patents are and... oh wait, it's Google (and /. loves Google), so we'll get "Thank God they're this innovative and patented it before someone else stole it."

    1. Re:OMG MORE PATENTS!!! by Bonker · · Score: 5, Insightful

      While I personally think that patents are repugnant, Google has fallen down on the 'just' side of using the patent laws the way they were intending to be used. They're not trying to bilk people out of vast sums of money ala British Telecom's hyperlink patent or Amazon's 1-click buy patent. They have a unique process that they've carefully guarded and have built a business around.

      Now that they've been awarded a patent for page-rank, it's required for them to make it public so that people can license it. You can't patent a trade secret and still have it be secret. People now have the opportunity to build new methods and innovate with Pagerank as a basis for that innovation. (Real innovation, not MS innovation.)

      Again, I think that patents are a misstep. I think they allow too many Amazon and BT events to happen. Despite the fact that the patent system is horribly broken, Google is using patent laws responsibly here. Wait until they announce a patent on 'all search technology that lists search results on a web page' or something like that. *Then* you can start complaining about how broken the patent system is.

      --
      The next Slashdot story will be ready soon, but subscribers can beat the rush and slashdot the links early!
    2. Re:OMG MORE PATENTS!!! by cascadefx · · Score: 4, Insightful
      Software patents are a bad idea.

      [Rant On]
      Really.

      What G**gle is doing is basically quantifying word of mouth. Everyone knows the best restaurant in town is the one that everyone talks about. We "link" to the restaurant of our choice. Someone new to the office, and town, is looking to go to the best restaurant there is. They ask around. 5 people say it is the Puerto Vallarta Mexican restaurant 2 say it is White River Landing, and 8 others say it is Vince's. Vince's it is.

      Simple word-of-mouth ranking right there. If the new person throws in variables like, "I don't eat Mexican." That skews the results. Nothing fancy about this.

      In fact, I bet a few hours of research into Sociology, Psychology, and Linuquistics papers will turn up generic proofs and observations of the very same things that page rank takes care of in a different context. A context shift shouldn't be patentable. Much software (but not all) involves making these logical leaps. Many times they are leaps from pure science that is copyrighted (on the one hand) but (increasingly less so) open on the other. This is human knowledge we are dealing with. The Scientific Method... all that crap. It doesn't work unless everyone shares their toys. Start locking them up and you stifle innnovation (at the least) or become dictatorial master of (increasingly more of) everyone's lives.
      [/Rant Off]

    3. Re:OMG MORE PATENTS!!! by ergo98 · · Score: 5, Insightful

      Now that they've been awarded a patent for page-rank, it's required for them to make it public so that people can license it

      I had made this mistake before, confusing trade groups with patents. AFAIK patents do not force you to license it whatsoever. Instead they can be used to hunt down anyone who intrudes into your patent and sue them out of existence.

      In any case this isn't about PageRank, but is about a revised search technique: In a nutshell it is PageRank by resultset -> i.e. Say you searched for "Scooby Doo" : It gets the result set of Scooby Doo hits, and THEN it derives a pagerank within that set of Scooby Doo hits (versus the basic PageRank which derives the ranking for the whole net). It's funny because I had actually investigated the initial steps of a patent several years ago for something which I called a "combined corpus" (which in a similar light groups items by topic of discussion-i.e. a page on Crickets would get a good score for cricket searches by being referenced by lots of Cricket pages : It wouldn't benefit them to put a nude picture of Britney Spears to get a lot of links and boost their generic pagerank) because of the general ridiculousness of something like the basic PageRank, but I knew that against a giant machine like Google I wouldn't stand a chance so I just forgot about it (which is the problem with patents: How many people think of a great idea but then let it rot because of the almost certain patent overlaps). I've had that same thought process with a lot of, in my opinion, great ideas.

      People now have the opportunity to build new methods and innovate with Pagerank as a basis for that innovation. (Real innovation, not MS innovation.)

      Ooooh, nice use of the obligatory MS slam for mod points (ignore the fact that MS has been a fantastic patent citizen and has never, to my knowledge, enforced dubious patents). In any case how is it "innovation" for others to now use something existing (if Google allowed it)? Sounds like counter-innovation because everyone who might possibly overlap with this patent will now just dump the project lest they cross paths with Google.

    4. Re:OMG MORE PATENTS!!! by Com2Kid · · Score: 5, Insightful
      • What G**gle is doing is basically quantifying word of mouth.


      Which is a pretty impressive proccess. Making a set of mathmatical formulas out of an otherwise very much fluid and etheral concept. Not half bad.

      • Everyone knows the best restaurant in town is the one that everyone talks about.


      Oh? I think it is the one that everybody with a good sense of taste talks about? What is a good sense of taste? Welllll, now we are getting down to the nitty gritty. What defines a "trustable" website?

      • We "link" to the restaurant of our choice. Someone new to the office, and town, is looking to go to the best restaurant there is. They ask around. 5 people say it is the Puerto Vallarta Mexican restaurant 2 say it is White River Landing [muncienews.com], and 8 others say it is Vince's [muncienews.com]. Vince's it is.


      You must define the weight, if person a and b say Vinces but they both say that person C has "better taste" in Mexican Food, and person C says Puerto Vallarta, and enough of that goes on, than the decision base upon the results can be changes signifigently.

      • Simple word-of-mouth ranking right there.


      Yes, sounds like a good alpha-level project. :)

      • If the new person throws in variables like, "I don't eat Mexican." That skews the results. Nothing fancy about this.


      True, but what if resturant X has a style of Mexican that is mixed with, say, Soul Food, and the person REALLY loves Soul Food. Then what? Life gets confusing. :)


      • In fact, I bet a few hours of research into Sociology, Psychology, and Linuquistics papers will turn up generic proofs and observations of the very same things that page rank takes care of in a different context. A context shift shouldn't be patentable.


      Oh? If it is so obvious, why did search engines for so long, well, heh, suck. I remember using insanly complicated regexps with those "other" search engines to do what are now trivial searches on Google.

    5. Re:OMG MORE PATENTS!!! by Thavius · · Score: 5, Insightful

      There's two ways patents can be used: as a sword, and as a shield.

      IBM holds many interesting patents. One that caused a former employer of mine to take notice is one that covered anything that used templates to generate HTML files. This patent basically covers almost all WYSIWYG HTML creation tools (we were in the middle of creating one when it was issued). I haven't seen any breaking stories on how IBM is beating down small companies with it, and our company didn't get served a C&D order because of it.

      It appears that IBM is using the patent as a shield, to protect themselves against another company saying, "I invented that, give me money." It will protect them from being the target of an infringement suit.

      Other companies, such as BT, and Amazon, and others, are using their patents as a sword to exthort money out of companies. This is what I disagree with, because most often they target small companies first. They never seem to go after companies with resources, because they know their sword is not as sharp or strong as it could be.

      I'm not patents as an idea, but patents of some tech innovations have been abused. The side-swinging patent, that guy will never try to enforce his patent, because it was for fun. But just like anything else, patents can be abused to the detriment of everyone.

      Google's patent can be used in two ways. Let's see how they use it.

  2. Patents creating artificial monopolies by taumeson · · Score: 5, Insightful

    Patents are a tool for creating temporary, artificial monopolies.

    With that said, aren't you glad Google might be able to stay on top and profitable, instead of having to resort to banner ad revenue, etc?

    1. Re:Patents creating artificial monopolies by WolfWithoutAClause · · Score: 3, Insightful
      Patents are a tool for creating temporary, artificial monopolies.

      Yeah, I agree. Still not all monopolies are bad; I guess it depends on the character of the company that has the monopoly; or what they do with it, or don't do with it. In the US, monopolies aren't illegal, although some of the freedoms that you have if you aren't a monopoly are removed. This seemed to upset Microsoft no end: "we've done nothing wrong, we aren't really a monopoly. Yeah right. Or actually, no, wrong.

      --

      -WolfWithoutAClause

      "Gravity is only a theory, not a fact!"
  3. hmmm.... by Kevin+Stevens · · Score: 4, Insightful

    I am not quite sure of the purpose of this article since most patent articles are intended to point out the ridiculousness of the patent system, but this seems like a pretty legit patent to me. They developed a technology that is superior to their peers, that they developed completely in house w/out ripping anyone off. This passes my shadiness test. If anything, we should all be happy now that Google will be publishing some of the details for their system.

  4. Not that outrageous by Nikk+Name · · Score: 5, Insightful
    Compared to other "patent the fork" icon'ed items, this one is not that outrageous.

    Google's way of doing thing was certainly not the first way to search, it is not the most obvious way to search, it is not the only way to search, and it might not be the best way to search (something better likely will come along). In other words, I don't think this patent will harass many others at all.

    This is nothing near as bad as Amazon patenting message boards attached to sale items, or "one-click shopping" being patented.

  5. Hmm.. by Astin · · Score: 5, Insightful

    Wow.. an internet patent that might actually make sense. It's not "A method to search through an index of web pages for relevant links to a user request for specific information." But the improvement on it. And it's generally accepted that Google DID improve web searching tremendously and have a unique method of doing it. Of course, this means it will be struck down immediately by some small company that gets a broader patent (see above) and sues them.

    --
    - In hell, treason is the work of angels.
  6. Sad day for computer scientists by Amsterdam+Vallon · · Score: 3, Insightful

    Back when Page and his Stanford pal created Google, they had planned to just simply create a really snazzy and useful research project. From day one and for a couple years, they assured everyone that they would never sell-out and their algorithms and code would remain in the public view.

    However, things changed, and they quickly hopped onto the dot-com bandwagon. With this privatization, they closed all their notebooks and journals and stopped teaching others how to implement a great webcrawler and search ranking system.

    They made out well, but I feel that the CS community lost a great number of resources. I'm proud of Google and I use it a lot, but I just wish they'd have remained a bit more loyal to the open source community that they started off with.

    If it weren't for open BSD code and free database software, Google wouldn't exist today. Don't forget that.

    --

    Reply or e-mail; don't vaguely moderate. Ex-O'Reilly/MIT employee, now a full-time Google employee.
    1. Re:Sad day for computer scientists by s20451 · · Score: 4, Insightful

      With this privatization, they closed all their notebooks and journals and stopped teaching others how to implement a great webcrawler and search ranking system.

      Troll.

      The upside to patenting (at least in theory) is that Google no longer has to keep its IP secret, in fear that someone else will copy them. If you're so curious, why don't you request a copy of their patent yourself, and review it.

      --
      Toronto-area transit rider? Rate your ride.
  7. Slashdot hypocrites... by jwriney · · Score: 4, Insightful

    Aaaagh! Patents are bad! Patents are bad!

    (Psst - hey, Google's getting one.)

    Uh, well, (grumble) I guess that's okay then, er...

    Bring on the wave of apologists.

    --riney, Karmakaze

  8. Re:Good for them... by Ed+Avis · · Score: 5, Insightful

    It's not a question of whether Google is 'good' or 'evil'. It's a question of whether the patent office was right to grant this patent, and whether a patent system that includes software is of greater economic benefit to society than one that does not.

    You can ask: if patents on computer programs were not available, would Google have developed their idea anyway?

    --
    -- Ed Avis ed@membled.com
  9. Not entirely unexpected, but... by Tet · · Score: 4, Insightful

    I'm in two minds about this. Should Google get a patent for this? Google have innovated here, and thus the patent is a valid way to reward the effort they put in to designing the system, in exchange for the idea entering the public domain after the patent expires. While the duration of patents in IT related areas needs to be drastically shortened if they're to serve their original purpose, I'm not inherrently opposed to patents like this. The question then becomes, is it sufficiently obvious to anyone in the field that it shouldn't be patentable? Well, it's a tough call. The fact is that no one had done anything like that before Google. If it was so obvious, why not? My personal view is that it's obvious enough that if Google hadn't done it, someone else would have done within a couple of years. So while I don't think the patent should have been granted, I don't think it's as cut and dry a matter as it may at first appear...

    --
    "The invisible and the non-existent look very much alike." -- Delos B. McKown
  10. Re:Good for them... by goldspider · · Score: 3, Insightful
    "They thought of a way to improve upon an existing invention. They were the first to do it. They want to make money from their idea."

    (I've got some kharma to burn, so why not...)

    Couldn't it be argued, then, that Amazon improved upon online purchasing with their one-click process?

    --
    "Ask not what your country can do for you." --John F. Kennedy
  11. Re:watch out by DCowern · · Score: 5, Insightful

    What's wrong with what Google is doing? They're simply trying to keep an "edge" on the market. The reason why they're the best search engine out there is because they figured out how to make a better way to rank pages. They deserve to reap the benefits of that invention without anyone else cutting in on their business.

    As for the "googling" incident, I just think they're attempting to defend their trademark. If you don't do that kind of stuff, you lose your trademark. Kinda like how Kleenex and Xerox lost theirs (everyone says "may I have a kleenex?" or "could you xerox this?" and so it became colloquial and no longer a trademark).

    All Google is trying to do is cover their ass. If they decide one day to try to patent the search engine, then there'll be reason to get up in arms.

  12. Good for them... by theGreater · · Score: 5, Insightful

    ...because they're Google. But if it were Microsoft patenting "an improved method for giving help to users", say maybe the help files vs. man pages, people would flame about prior art, talk endlessly out of their anuses about how Bill Gates is trying to wrest control of the tinfoil hat co-op from Mac users, and generally be nuisances.

    I love /.ing while in class, but honestly, people. Google gives a C&D letter, we all golf clap and say "way to defend your IP!" Someone else does it, and we all run to chillingeffects to boycott / whine / gripe / whatever.

    Here's a thought... get off your hobbyhorse, and start evaluating things based on FACTS, not the general feeling of techno-elitism you get from pretending you're cool because you get jokes written in PERL.

    And mod me -5 Troll, if you want. But it's the damned truth, and you know it.

    -theGreater.

  13. Re:Good for them... by DCowern · · Score: 4, Insightful

    I'm sorry I came off trollish but I just don't see why every patent is seen as evil on Slashdot. I agree wholeheartedly that the patent system has gotten out of control. I just don't agree that every patent is evil. In a lot of cases, businesses need patents to exist. For example, what would happen if Microsoft figured out how to implement Google's page rank system and implemented it on MSN? Google would have no recourse and Microsoft has approximately 80 bajillion times the resources of Google and could easily out market them.

    And by the way... the difference between patents and RFCs is that with RFCs, there's no expectations of profit. They're made in cases where, as a previous poster pointed out, the greater societal benefit outweighs potential profits. Many RFCs and IEEE standards are based on corporate IP anyway, especially ones dealing with network protocols. Token Ring, FDDI, and Ethernet were all proprietary standards back in the day...

  14. Re:The nature of the beast by tetro · · Score: 4, Insightful

    Patents aren't technically evil. It's just the way they're used.

    --
    .smell my feet.
  15. Its a non-obvious invention by DeadSea · · Score: 3, Insightful
    This is one patent that should be granted. It's no one-click shopping patent, it's no "put an e- in front of it" patent. If Google weren't doing it, then I wouldn't have thought to do it myself.

    Having a patent on it means that Google will be the only viable search engine for the next twenty years if it chooses not to license the patent. Is that what we really want? I could see four or five years, but twenty years is a good percentage of my lifetime. Google is an innovative company, but who's to say somebody couldn't do it better after a few years by building on the idea. The first implementation almost always sucks compared to clones.

  16. Re:Good for them... by Lumpy · · Score: 4, Insightful

    ...because they're Google. But if it were Microsoft patenting "an improved method for giving help to users", say maybe the help files vs. man pages, people would flame about prior art, talk endlessly out of their anuses about how Bill Gates is trying to wrest control of the tinfoil hat co-op from Mac users, and generally be nuisances.


    so you got any prior art on google? They patentented something that IS ACTUALLY INNOVATIVE and took time to patent.. other than patenting an icon that says "start" or patents on clicking on an object to cause an action.

    most patents are completely absurd and need to have an angry mob impale the patent-er and the clerk at the patent office that approved it on a large spike somewhere very public as a lesson to others.

    google's patent is actually something that is worthy of a patent (like the wankle engine, or using an exploding device to inflate a teflon/cloth bag to lessen the impact on the people in a vehicle during a crash... you know something innovative.. It's hard to recognize today in the sea of non-innovatives.

    --
    Do not look at laser with remaining good eye.
  17. Re:watch out by aengblom · · Score: 3, Insightful

    Kinda like how Kleenex and Xerox lost theirs (everyone says "may I have a kleenex?" or "could you xerox this?" and so it became colloquial and no longer a trademark).

    ?!?!?

    Both are still very much trademarks. You may use the words in conversation, but I can't make a photocopier named "Xerox."

    What Google is doing is stupid. Mostly because, "to google" refers only at this point to USING GOOGLE!. (Not search)

    --


    So close and yet so far from the world's perfect ID number
  18. This patent is for NewRank/LocalRank, no PageRank by registro · · Score: 5, Insightful

    That is not the patent for PageRank.
    PageRank had already been patented by Stanford University, just before Google was created, when it was a community effort.
    This new patent is a patent over an improvement of PageRank, what they call now "LocalRank" and "NewRank". It is designed to stop competitor from developing pagerank-like technologies. Armed with that kind of patent, they can stop open-sorce Aspseek, Teoma and others from developing similar technologies.
    What they are tryng to do is extend patents over citation ranking and peer-review, something that has been around since the creation of the first libraries. This is NOT good.

    Basically, this means no more money from the suits to any citation-ranking related effor in any start-up, fearing litigation. It could mean also no more installations of open-source Aspseek (Google Appliance's competitor )in corporate environments, because of fear of litigation.

    This is sad.

  19. ..regarding patents. by THEbwana · · Score: 3, Insightful

    A patent is like a baseball bat made out of rubber. When you're competing against other companies - you both get out your set of rubber baseball bats and hand them over to your lawyers who proceed to pummel eachother. After a while - one of the sides will tell their lawyers to stop since they're running out of money (lawyer batsmen are rather expensive). The side who gives up looses - the winning side buys the looser for 5cents (since they're bankrupt).

    These are the current rules (in the US, but also to a varying degree in the EU) of the "game" called free enterprise. They are quite senseless and arbitrary - but you have to adapt since lawyers equipped with rubber baseball bats exist whether you want them to or not.

    Ultimately, every people has the responsibility for its government (if you cant handle this responsibility - then you become a refugee) - and subsequently also for the laws passed. If the rules of the game are ignorant, they are so for a reason. And since ignorance is usually expensive - in the long run, someone will allways have to foot the bill. Bad policies allways have a monetary cost.

    Amazon, Google are naturally doing the right thing since their primary task is to generate profits for their owners - it is not to make policy or specify the rules of the game (the government is supposed to do this on behalf of the voters).
    The politicians are obviously doing the right thing since they're basically excercising their mandate of doing what the average joe has given them authority to do.
    - so I guess that makes the average joe the bad guy/gal. At least the responsibility lies with the same people who are going to pay the price.

    It kinda reminds me of a favourite quote: "if you think education is expensive - you should try ignorance!" /m

  20. You READ that C&D? by siskbc · · Score: 3, Insightful
    I love /.ing while in class, but honestly, people. Google gives a C&D letter, we all golf clap and say "way to defend your IP!" Someone else does it, and we all run to chillingeffects to boycott / whine / gripe / whatever.

    First, that was the nicest C&D in the history of them, if you can even call it that. They politely *asked*, not demanded, webspy to change their definition to mention Google's trademark. Had that been M$, they would have sent over Vincent and Jules to go midieval on their asses. Ezekiel 25:17 would have rained its vengence upon them. Nah, Google did that nice. I agree, they might not have had to do it, but it was the kind of grey area that makes lawyers nervous. Overall, they did OK.

    Second, Google patented more than an "improved method of helping users." This isn't like Amazon, where they basically patented efficiency (thanks, USPTO). Google didn't patent *all* ways of serving up better results. They patented their fairly specific method, which they were in fact the first to practice. There were a lot of search engine companies at the time - if it were obvious, someone would have been doing it. So I think it passes muster there.

    I do agree that people tend to kneejerk on this site, but there were a lot of people during the C&D discussion who kneejerked against google too - so I don't think this blind acceptance of google is really a problem here. Blind hatred of M$ is more likely.

    --

    -Looking for a job as a materials chemist or multivariat

  21. Re:Oh, but... by Planesdragon · · Score: 3, Insightful

    What probably makes it special and innovative is that it has the words using a computer somewhere in the definition.

    Don't forget _automatically_.

    Any human can play Go. But if you come up with an algorhythm to let a computer play Go by itself, then that's a patentable invention.