Slashdot Mirror


Google TrustRank

Philipp Lenssen writes "Google registered a trademark for the word "TrustRank", as Search Engine Watch reveals. Is this a sign we can expect a follow-up to Google's PageRank? An earlier, possibly related paper on TrustRank is available; it proposes techniques to semi-automatically separate good pages from spam by the use of a small selection of reputable seed pages."

20 of 176 comments (clear)

  1. Potential abuse? by mferrier · · Score: 4, Interesting

    This is a step in the right direction conceptually, but giving a smaller number of "seed sites" more rank influence increases the potential fallout from any rank cheats that may be found in the future (see Google Bomb and Google 302 exploit.

    Google may be better off as they are currently leaving all sites initally equal in influence before the Pagerank calculation.

    Then again, Google has a great track record for testing their ideas before committing them to general service...

    1. Re:Potential abuse? by ciroknight · · Score: 5, Interesting

      With Google's "portal system" they're developing, the trust comes from within; the company trusts its users because they are clicking into an agreement of terms. That being said, hacks that would make this new TrustRank unreliable would probably just lead to the termination of services of the account.

      This to me keys that Google's trying to become a more involved company; instead of just sitting back, caching and searching the internet, they are now trying to serve you best and give you the results you are looking for. I would imagine with TrustRank, you will see a little star or something near a link on Google's home page, and the star would indicate if it is something in your field that you would be looking for. For example, if you were a Biologist and searched for a certain kind of fish, say "Blue Tuna", it would put stars next to sites with the fish's breeding habits, etc., but if you were a general consumer, it would provide links to the local fishery.

      The internet is an extremely powerful tool, and search engines have simply evolved to the point that they are now "dumb technology". Without more user invervention (and not simply by throwing in more keywords and praying), they will continue to be as they are now. Once the company better knows what we'll be looking for, they can better serve us. And that's all I see this new tech as being.

      --
      "Victory means exit strategy, and it's important for the President to explain to us what the exit strategy is." G.W.Bush
  2. Re:more censorship, unimpressed by penguinoid · · Score: 5, Insightful

    Its not censorship. Google couldn't censor even if they wanted to. Rather than explaining to you what censorship means, let me just tell you that what Google is doing is siply doing their job better. I don't want to find spam when searching for anything, and neither does anyone else. Ergo, eliminating spam from the search results makes everyone (except spammers) happier.

    --
    Don't waste your vote! Vote for whoever you want, unless you live in a swing state it won't matter anyways
  3. Re:more censorship, unimpressed by Molly+Lipton · · Score: 5, Informative

    Yes, this is always a problem. How can you possibly know whether or not a site is spam just by looking at who's linked to it? A lot of great sites have very few external links to them and often they're from blogs and other sites that will likely be identified as spam prone.

    This is a basic problem of filtering web-content. How do you avoid throwing out the baby with the bath water? I'm running into that problem in designing a custom filter to keep my son from inadvertently seeing pornography as he looks for his "r0mz," but that's peanuts compared to Google's dilemma.

    The fact is, spam filtering is inherently censorship. This kind of interference will always have a negative impact on the marketplace of ideas that is the modern internet. On the other hand, as a side effect, removing blogs from search results (as this trust metric very likely will) may increase the usability of Google overall. I suspect there will be some people who are not as happy about that as I am.

    --


    -- Molly Lipton, Born Again Technologist.
  4. Re:Cheeseh... by penguinoid · · Score: 4, Funny

    That's why us open source programmers always throw out and completely rewrite our programs from version 2.6 to version 2.8

    --
    Don't waste your vote! Vote for whoever you want, unless you live in a swing state it won't matter anyways
  5. Re:more censorship, unimpressed by ciroknight · · Score: 4, Insightful

    Two points: 1) Any new system Google implements will run along the side of PageRank; they've invested too much to completely switch all of Google running to TrustRank. The system might even augment current PageRank by running an algorithm over the data that PageRank returns. We can only speculate as of now. But I can assure you that one will not replace the other, and there will probably be a way to use both systems in the future if you like. Hell, using your Gmail account, you may even be able to specifically tune PageRank, making certain pages more relevant to you appear higher in search results.

    2) You have the option of not using Google. Yahoo is a completely independent search engine now.

    --
    "Victory means exit strategy, and it's important for the President to explain to us what the exit strategy is." G.W.Bush
  6. Re:Cheeseh... by TheViciousOverWind · · Score: 4, Insightful

    You know, new solutions are most often patched old ones.

    Should Google just throw away their many years of research, and start from scratch?

    I find this trust-based approach interesting, but I wonder how it's gonna work for smaller sites (Which the few trusted seeds will not ever link to), but I guess the smaller sites don't really have a problem as it is, because only specific search-terms are targeted.

    There's also the problem of allowing new websites into the game, but I guess that's for the Google developers to figure out. :)

    --
    My <1000 UID is with a hot chick
  7. Re:more censorship, unimpressed by telecsan · · Score: 5, Insightful

    You fail to understand that google is incapable of actually censoring anything. Them not displaying a webpage in their results does not, indeed, remove it from the web.

    Google's primary responsibility now is to it's shareholders, which means increasing the chance that you and I find exactly what we are trying to look for, and not to unabashedly display every peddler that serves up content over http.

  8. Re:Cheeseh... by ciroknight · · Score: 4, Informative

    The fact is, we really don't have enough information as of yet to conclude whether this is a patch to PageRank, or a secondary system, running along side PageRank. One can assume it to be the former, but the latter could work just as well with Google's new corporate concept.

    Imagine going into your Gmail account settings, adding a string of a few websites you deem to be "superior" or of better quality, and then let TrustRank grab the collection of all of these, note where the highest votes go, and use these as more "Trustworthy" search results. Or, using PageRank, it simply adds an option "Vote these sites higher because they are linked to the user defined site settings."

    Both schema make Search Engine spamming more controllable by Google (Simply by terminating accounts linked to spammers), and could have an interesting effect. Can't wait to see what happens with TrustRank.

    --
    "Victory means exit strategy, and it's important for the President to explain to us what the exit strategy is." G.W.Bush
  9. Similar to Advogato's? by OblongPlatypus · · Score: 4, Interesting

    This sounds very similar to Advogato's trust metric, which uses a "seed" of trusted accounts to filter out trolls/spammers. The difference might be that it should be even easier to implement in the case of web pages, because they already have links to each other, avoiding the reliance on users to manually "certify" other user accounts in order to build the graph.

    --
    -- If no truths are spoken then no lies can hide --
  10. TrustRank link broken, session expired by mferrier · · Score: 5, Informative

    To see Google's TrustRank Trademark info on the USPTO site, click here , click "New User Form Search (Basic)", and search for "TrustRank".

  11. A good sign by treff89 · · Score: 4, Insightful

    Google, as we all know, is a reputable service provider; they get the job done efficiently and innovatively. Now they are continuing their attack on the ails of the internet which was started by Gmail spam filtering. By developing this tool, Google is helping to clean the Internet up and enable it to become the massive source of pure information it has such potential to be. The "negative" sites on the Internet, such as keyword sites with no real content which invade search results, and the like are a bane to the community and by helping get rid of them, Google is yet again doing us all a favour. Google, I salute you.

    1. Re:A good sign by pjrc · · Score: 4, Insightful
      Yes, this is a good thing. It might result in wiping out search engine spam, maybe. If the "search engine optimizers" don't find creative ways to cheat.

      Let's not get overly optimistic about what this is going to do for the web... such as:

      By developing this tool, Google is helping to clean the Internet up and enable it to become the massive source of pure information it has such potential to be.

      What exactly is "pure information" anyway?

      Consider my little website. Lots of pages about how to design electronic stuff. But we sell components that support those activities, so it's not 100% "pure", is it? You could consider all those pages as a giant ad for the stuff on the store section of the site. But most people would consider my pages on the more informational side (and the vast majority really are).

      About once every 2 or 3 weeks, I get a call from one of these search engine optimiztion companies. Not sure if it's the same couple companies... I usually just say "no" and ask to be on their do-not-call list. They're mostly a bunch of slimey people and probably don't honour such requests.

      But sometimes, the idea is tempting. I resist because I believe it's unethical, and ultimately a bad long-term investment. Still, to anyone selling via the web, even a tiny little 2-person company like me, the sales pitch is quite compelling. Pay some fee, traffic goes up, more sales, increase in revenue offsets the cost for the SEO's work. Maybe it's not so bad if they don't stupe to cheating.

      Still, I resist because I know it's not a black and white distinction. It's a fuzzy line between the obviously good techniques (improving site structure, rewording page titles, etc) and the obviously bad (cloaked pages). I also just don't trust them.

      But even the distinction between "pure information" and "spam" is fuzzy. I'd like to think I'm leaning towards the "pure information" side, but we do indeed sell products. It wasn't always that way... in the mid-90's, the site was smaller and hosted at a university and no products were sold. I had several people begging me to sell them a few of the parts needed for a project. Eventually, a friend started selling some stuff (prices were high, service poor), and so I took it over. Satisfaction with the site has improved dramatically since then!

      Still, it's a fuzzy area between pure information and purely commercial, or advertising or spam.

      I can tell you it's a lot more work crafting really good web pages than just writing a check to a seedy SEO company. But if these ranking algorithms really do improve to perfection, the response is probably going to be more and more pages appearing in that gray region. Increasing sales can pay for a lot of man hours to author more material that's compelling for visitors and truely does help them to solve their products (especially if they buy the described products).

      So, in a best case scenario, these algoriths reaching perfection (seems unlikely) is probably going to lead to a lot more very good content, but content that revolves around pitching products (eg, infomercials), and not "pure information".

  12. Is the paper even the same thing? by protoshoggoth · · Score: 5, Interesting

    Given that one of the authors of the referenced paper is an employee of Yahoo, I have to wonder if whatever Google has in mind has anything whatsoever to do with the trustrank scheme we're talking about here. I mean, all we know is they trademarked the word, nothing more.

  13. Re:more censorship, unimpressed by generic-man · · Score: 4, Interesting

    Considering how much market share Google has, them not displaying a web page in their results (or dropping it a few hundred places) effectively removes it from the web.

    Google's primary responsibility now is to its shareholders. Google makes money from advertising. If Google can encourage you to patronize its advertisers instead of trusting its index for everything (which right now is pretty easily gamed), then Google makes more ad revenue and shareholders are happy.

    --
    For more information, click here.
  14. This info is not intended to be read by a human. by mathmatt · · Score: 5, Interesting

    This is wierd. It is the 19th hit (on the second page) of a google search for "trustrank" It requires a login from google's results page, but a google's cache reveals a directory including the paper linked to by /.

    I guess we weren't supposed to read this. And you shouldn't have read *this*!

  15. Personalised trust metrics by tdvaughan · · Score: 5, Interesting

    It would be amazing if Google gave us the ability to assign trust values to sites that we ourselves trust. This way, for example, I might give Wikipedia or the BBC a 10/10 trust rating for all their off-site links (and set it so that links off the linked sites are at 50% of their parent trust rating etc.). If we could also subscribe to someone else's trust ratings then technically illiterate people could hand over the responsibility of managing their trust database to someone else. From first thoughts, this looks like it could solve the problem of malicious SEO.

  16. Re:more censorship, unimpressed by telecsan · · Score: 5, Insightful

    "Freedom of Speech"

    First of all, the only protection that is guaranteed you here is that the gov't will make no law abridging the freedom of speech.

    Google, as influential as they might be, are not the government (insert 'Do No Evil' joke here). Therefore, they are not bound to this "Freedom of Speech" argument.

    Secondly, "Freedom of Speech" is not this universal, higher-being ordained preserve at all cost idea that we have transformed it into.

    Freedom of speech does not give you the right to spray-paint your slogan all over my front door, nor, in this case, does it give you a 'right' to be listed on Google. Nor do you have a 'right' to have your name printed on the front page of your local paper in 36pt font.

    Not being listed in Google does not amount to censorship in any definition of the word. The net existed before google, and people still managed to find web-sites. Google gives (through PageRank or whatever mechanism they choose) free advertisement to 'good' sites. They have every right to only display sites that pay money, if they so desired. You have absolutely zero (0) 'rights' to be listed for free on Google.

    Trotting out the Freedom of SPEECH argument is nothing more than whining about Big Brother coming to get you because what you have to say isn't worth hearing. Guess what? If you want to be heard, say something that's worth listening to. All that glitters is not gold, and much that is said (or printed) is worthless drivel. Much like this post.

  17. Re:Questions by pjrc · · Score: 4, Informative
    I just finished reading the paper. All these questios are pretty well answered by the text. To save you and others the trouble of reading it, I'm gonna take a stab at these. Feel free to actually read the paper and tell me if I misunderstood.

    How is this different from applying a weighting to PageRank?

    It attempts to detect clusters of pages which have few inbound links, which also propagating "trust" scores to all other sites by using their linking structure. For sites that have many inbound links (high scroring in pagerank), the authors claim this modification tends to classify spam and reputable sites differently.

    Will the owners of the pages / sites deemed to fall within the set of trusted seed sites get any money for all their hard work (i.e. hand-maintaining pages of links)?

    No.

    However, they will get better search engine visibility, which is quite valuable.

    What if such an owner decides to link to a page of commercial or spam links - will they get any money from the owner of the linked site?

    The paper suggests using only highly reputable organizations with long-term stability for the seed pages. Government organizations, universities, very well known companies.

    The analysis in the paper is based on a per-site graph, not per-page, by the way. They lacked the resources to try these computations on such a large data set.

    Is this a possible method of abuse?

    Presumably, the small set of seed pages/sites will need to be monitored by staff employed by the search engine company. If one of the trusted seed sites "went bad", they would need to be removed from the list.

    Will that cool poster of links between websites now become 3D to give trusted links more prominence?

    Probably not.

  18. Re:Conjuction? by camusflage · · Score: 4, Informative

    It should be noted that the Slashdot user No More Free Stuff catalogs such links, and by adding this user as a friend and assigning a negative bonus to foes of friends, you can lower the moderated value of any such posts.

    --
    The truth about Scientology, Xenu, and you: Operation Clambake