The Un-Google - The Search Competition
WinEveryGame writes "The Economist is running an article on the state of the competition for Internet Search. While Google clearly dominates, and continues to have positive momentum, its leadership is still vulnerable. The search-engine battle is not over yet." From the article: "In terms of momentum — mass times velocity — Google's lead indeed looks daunting. It has by far the most mass, with an American market share of 43% as of April, which reaches 50% counting AOL, an internet property that uses Google's search technology. This compares with 28% for Yahoo!; 13% for MSN, which belongs to Microsoft; and 6% for Ask, which is owned by IAC/Interactive Corp, a conglomerate of about 60 online media brands. Google also has velocity: its market share grew by 17% in the four quarters to this spring, whereas Yahoo! and MSN both lost share. Only Ask has more velocity — its share grew by 35% — but then again it has little mass."
There are some customers (government/military included) that are aware of the two concepts of precision and recall. Before you groan and skip this post because you recall those words from all classifying algorithms, you should take note that there are two stages we have yet to meet in this respect.
One is simply improving precision without sacrificing recall. When I search for 'horn' in Google, how many of those searches are relevant? I was thinking about a French horn (instrument) and the first link brings me to a society about them. The next three links, however, do not. You might say, "Well, gee, you should have put 'French' in your search" but is this really necessary? So there is some money to be made in "learning" search engines that tailor themselves to the user or perhaps the results could be displayed intuitively in domains of knowledge (a la Clusty). So that I can select a node that applies to the correct searching term and see all results returned below that. Have you ever wished to view your search results in a format other than a linear display of ranked results? The documents are related in more than one dimension, you know. As computing power increases, I suspect there will be room to display them in two dimensions (heat/area mapping, nodes & vertices on a plane) and three dimensions (spatial 3D engines with nodes & vertices in space).
The second stage is giving the user the power to adjust precision versus recall. Even a graphical interface that shows the F-measure relationship between precision and recall would be helpful to consider in the search engine wars. Say you give the user some control through a slider AJAX interface of a threshold ß. But the threshold isn't simply the "Google score cut off" or even a term frequency cutoff. Instead, it's applied to be a "relevance" threshold. You would score relevance by fingerprinting frequency, specificity, clustering and other useful tools by using a domain ontology or taxonomy.
Another big thing that is missing is identifying what kind of data you are searching. Social data? Scientific data? Historical data? etc. Perhaps I'm only interested in who's who to Stephen Hawking. I'd search for him and flip through nodes of separation from him to other people.
The current search sites also only tend to favor key-word regular expressions. What about searching with raw text or entire paragraphs? If you want to see an interesting demo of this, visit Collexis' Demo Site which alludes to a whole new kind of searching.
The key to entering the market as a competitor with Google is to pick up Google's slack and to try to pose yourself as a complimentary service to Google. Google is terrible at closed domain searches but amazingly efficient at open domain searches. You don't want to compete with them so fill a different part of the market. Google benefits from simple design, so go to an advanced flashy complex design. Most people aren't looking for that but the people that are have nowhere to go.
The Economist is alluding to potential leadership problems inside Google. Who cares? That's not going to be Google's downfall. Google's downfall will be an new intuitive way to search and the only thing that will prevent their downfall is if they buyout the company or bone up on the technology.
The search-engine battle hasn't even hit its stride.
My work here is dung.
It should really be m*v^2. (m*v is a vector equation, m*v^2 is a scalar)
It seems more and more when I try to find something on google all I get are a bunch of link farms. This morning I was trying to find a bike jersey for a friend of mine and on the first page of results, it took getting to the second page to find any actual results. I did much better using Yahoo and found what I was looking for on the first page of results.
This is just one example, but it happens constantly...
The reason I mainly uses Google is Google groups; whenever I have a computer related problem, the Usenet archive is often helpful. I do not think that there is any other archive of Usenet like this out there (available for free). If there are, please share the links.
And how many slashdotters find Google Groups useful?
Remember the year 2000? They promised us flying cars. They delivered the PT Cruiser...
Actually, I have as small personal page for people who are looking for me to find me. It is here. I am probably the only person in the world with a page that lists my name and my elementary school name. If you do a search on these terms "JOSEPH COTTON SEABREEZE" in google, you will not find my page. If you do a search in yahoo, then there it is at the top. So Google is not king, by any marker other than market share.
Well, actually, almost all current search engines suck. There is waaaaaay too much noise in the results they return. Let's say I'm doing a search for "product X", search for it in Google and what do you get? Several links to ebay (which may or may not be current), tons of links to various "rate it" sites such as epinions/nextag/msn/etc, and maybe a few smatterings of other sites mixed in. Typically the manufacturers own site won't even appear in the first couple of dozen results!
So basically, I agree with the general position of the article, that there is still a TON (actually several tons) of work to be done and room for someone else to move in with a truely superiour solution. While it's great that Google is tinkering with lots of other technologies, I wish they'd actually make some real advances in their core business (and actually, I'm slowly starting to come to grips with the fact that their core competency may not be searching, but really it's in creating low latency widely distributed computing infrastructures). For all the years and the massive sums of money, my search experience is not significantly better than it was 5 years ago.
I was on the phone with some engineers at MS the other day and even they admitted that they use Google. It's just better... for now.
http://religiousfreaks.com/I have found some interesting information about google... I will share it with you now before
Friends don't let Friends use Internet Explorer.
Does anyone know how they calculate these market share values? AFAIK they don't all publish traffic statistics.
Developers: We can use your help.
So Ask--which used to be called Ask Jeeves but dropped Jeeves, a knowing butler, from its logo in February--is taking a different tack. It has come up with ExpertRank, an algorithm that also ranks web pages by incoming links, but is different from Google's PageRank in that it first groups, or "clusters", pages and links by theme. So instead of using a web page's overall popularity to calculate its ranking, it finds the pages that are most popular among experts on a particular subject, a method that often returns better results than Google's. Ask also uses these thematic clusters to suggest the best ways to narrow or expand a search, a feature called "zoom" that is very popular.
Which is the trouble I have with Google; their search results are like a shotgun blast too many times, getting far too wide a spread of sites having anything at all to do with the subject I type in, instead of being more narrowly focused. The problem I see with Ask's method is just how do you define who the experts are and what field they are experts in? Web sites can contain all sorts of content and people will reproduce links at a whim, just because they like what they see. Would they use a system similar to Amazon, where people are ranked by how many people use their recommendation?
GetOuttaMySpace - The Anti-Social Network
Which world do these numbers come out of? This month on my private site so far I got 1400 incoming links vom Google and 30 from MSN (the next runner up), 27 from Yahoo. Maybe it's just that Google loves my site for some strange reason, but I can't imagine my own little sample of web hits is statistically so "off" from their numbers. Other sites I admin for have similar numbers.
The numbers of pagehits by spiders from those search companies are much more on an equal basis. Sometimes one of them is on top, sometimes the other, but they all spider like crazy.
Much more interesting are little search engines like gigabot, which never ever gave me one incoming link but still spider like it's going out of style. Somehow makes me think they must live either off warm air or spam. What reason to be do they have?
I don't know how they measured the dominance of these search engines, but I know how its supposed to be done: by the number of hits it gets on Google. In that respect, Google: 9,630,000,000 Yahoo: 5,240,000,000 MSN: 4,220,000,000 Ask: 2,140,000,000 Clearly, though, the most dominant search is the word "search" itself. It gets 16,670,000,000 hits.
The real question is what these figures mean if anything, other than that the cartoon on the Economist page should not be interpreted as being at all representative of reality! I suppose it does mean that, for the field of search if nothing else, nobody's going to be knocking Google off their post for a while; they're utterly kings of the search hill.
"Little does he know, but there is no 'I' in 'Idiot'!"
"This compares with 28% for Yahoo!; 13% for MSN, which belongs to Microsoft; and 6% for Ask, which is owned by IAC/Interactive Corp, a conglomerate of about 60 online media brands"
This isn't over, simply due to lack of certainty in net neutrality. If media companies get leverage to control bandwidth to the big search companies (Google), it goes without saying that that these figures will change significantly. For Google, it could be death by a thousand cuts...
Why not just 'dot' it with a unit vector in the positive direction? Your suggestion gives twice the kinetic energy -- something completely different.
It's not a war, it's a market. Markets change. That's a good thing. Right now most people are happy with google. If something better comes along, I am sure nobody would stop embracing it just because they love Google so much. For shareholders and Google's owners it's probably different, as they want to keep on top and gain maximum market share to increase profits (or whatever). But as a consumer I couldn't care less if the best search engine is called google yahoo or msn. If a search engine searches well, I'll use it. If it doesn't - well, game over.
The article totally fails to take Lorentz-FitzGerald Contraction into account.
Intron: the portion of DNA which expresses nothing useful.
This is were Google truely dominates, in cult of personality.
It seems more and more when I try to find something on google all I get are a bunch of link farms. This morning I was trying to find a bike jersey for a friend of mine and on the first page of results, it took getting to the second page to find any actual results. I did much better using Yahoo and found what I was looking for on the first page of results.
I speak of the same experience in another thread. Simple fact is, Google appears to be doing nothing to make searching truely better. I give them some benefit of the doubt by saying "appears" since they may have "the next great thing" in the wings and we just haven't seen it yet. But lets face it, searching today is a time wasting experience. What's worse is that several years ago, it was time wasting due to the large number of seemingly random links. Well, those links aren't so random anymore, AAMOF, those links are geared to simply make Google money. I've said it before and I'll say it again, anyone who buys into their "do no evil" marketing fluff can give me a ring, because I have a nice bridge to sell them. How are they any different that M$ in this regard? They have tons of money, some of the brightest minds in the valley, and yet a simply search for a product gives me pages of utter crap. Hell, if a manufacturer makes the mistake of naming their company after their flagship product, you won't even be able to find their main website in a search until the third page!?! But Google still collects their bucks. How is this helping? How is this not evil?
Anyway, off my soap box. Here's to hoping someone can come along and actually do some good here (and yes, it may even be Google themselves).
Search can always be better (as other people have pointed out). This is a good thing. Competition is good and we benefit.
(This is also why Microsoft is so threatened by the Internet... once you move everything to the Internet, the desktop OS becomes irrelevant and all you need is a browser... this can be provided quite nicely by "free" software running on minimal hardware.)
I don't read your sig. Why are you reading mine?
Since when is delta M equal to v in kinematics?
Were that I say, pancakes?
Gp wasn't suggesting computing energy, just pointing out that a bare mv results in a vector, not a scalar that can be easily compared. What we want is m|v|, which can be computed as m * sqrt(v^2). And anway, for "energy" computation purposes mv^2 will get you double the "true" figure (assuming you were interpreting it as kinetic energy), since K.E. = 0.5 * m * v^2.
((lambda x ((x))) (lambda x ((x))))
Obwan: that search engine is our last hope
Yoda: no, there is another
(later)
Yoda: google... page rank is strong with you... pass on what you have indexed... there... is... another... search engi(ugh)
This way you can see how much energy a company has, and thus it's importance (m*v*v/2), growth can end *very* fast.
The energy a company gains is m*a*v, that shows that google is the best growing company...
etc.
The Google Killer will not arise from the current stable of search engine competitors. Like Google's arise from seemingly nowhere in the late 90's amidst then formidable competitors like NorthernLights, Altavista, Goto, Hotbot, Yahoo!, etc., to its position of dominance today, the "next big thing" in search will arise either from a currently unknown startup or from academic research.
The reason I mainly uses Google is Google groups;
When google acquired the archive from dejanews (can't bring myself to call them deja) is when I started using google. before then I was a metacrawler guy, but it's just easier to search groups first, and then simply click the web tab. I think it's safe to say that if google didn't have groups, I wouldn't be using it as my primary search engine today.
... you can't even install it.
I know I'm a little off-topic, but (from my own slashgeo website):
The Ogle Earth blog indicates that if you use Google Earth at work (the free version), you're in illegality. From the site: "1. USE OF SOFTWARE The Software is made available to you for your personal, non-commercial use only. You may not use the Software or the geographical information made available for display using the Software, or any prints or screen outputs generated with the Software in any commercial or business environment or for any commercial or business purposes for yourself or any third parties."
The EULA changed with the new version launched this week. You simply have to buy one of the other 3 (paying) available versions of GE.
Animoog.org
Google isn't just a search engine anymore, they have moved into so many other areas it's going to be almost impossible for any one search engine which is soley a search engine to compete in any way, and if they branch into the same areas then the argument 'Google did that first' will come out. If anyone wants to be any threat to google at this point, they will have to do something completly original, and get a large market because of it.
About a month back I did a comparison of some searches, based on ask's claims of understanding concepts. They were able to outperform Google in this functionality: http://www.nirajsanghvi.com/stories.php?sid=318&ti d=55
Look at technorati for example and look at the effect (or lack of) on technorati traffic when google launched their own blogsearch.. nothing at all.. it failed to make an impression despite technorati having growing pains of its own that probably annoy many users and send them elsewhere. Google tries to apply the same methods used on their main search system to the blog search and its not really working partly because its more about people than data.
Blog search is fairly saturated already and most of the big search players have a blog search of some kind and 'independants' are still popping up daily but what about board/forum search? None of the majors do it at all!
Boards are not seen as quite so trendy as blogs and so have been mostly overlooked despite them being hugely popular and showing no signs of stagnating.. the board world is still growing fast and the only real board search engine to date is boardtracker which has many of the features that google and the rest of GYMA lack including..
Persistant search.. they all offer alerts but what use are these when they are alerting on content they just found which was created years ago? Only the specialist search engines like boardtracker offer real persistant search at the moment.
Categorized searching (helps with the problem someone mentioned above when searching for 'horn' since you can restrict search to the 'music' category or whatever you want) - again boardtracker has an effective implementation of this and a few others may also but where is Google?
Searchable rss feeds.. rss is good and google does have it on their blogsearch but what about the main search? Its very widely used these days and not having it is like having a three wheeled car.. it will still go, but corners are tricky and you'll feel a prat driving around in one. ;)
Tagging systems with tag clouds etc and other 'social search' features. Yahoo leads the way with these through various aquisitions.. boardtracker offers tagging for boards, technorati for blogs.. but where is Google? They seem to prefer complex automated systems rather than letting the wisdom and power of the masses help out with organizing the worlds data.. its their loss.. yahoo knows it, the verticals know it. You can't really replace one with the other but they work well together, its smart to integrate both, its smart to enhance one with the other.
Google is certainly still the best and fastest general search engine around but they still have some learning to do and either they should get out there and do some smart aquisitions to fill the chinks in their armour or they should start building what the people want because the times they are a changing.
and that's a good thing.
The barrier to entry in search is moderately high (you have to be able to afford the hardware to do your indexing) but there will always be people willing to invest in search. It's easily monetized (love that word, eh?), there's no cost for users to switch to new competitive product, and there's no magic bullet that gives you both accurate results and the ability to weed out aggregators and shady SEOs. As long as developers can come up with new search algorithms that give better results, there's the chance that the "next Google" could be launched.
I like and use Google, but that's because the results are usually valuable to me and the ads are minimally intrusive. Currently, the one issue I have is Google's inability to prevent aggregators from showing up in search results. I've never found anything useful through aggregator pages, and I'd like Google to filter them out.
Anyway, the ability of new companies to explore search is something that's good for SE users. New search startups can be launched and attempt to improve search. Google is forced to innovate. Where's the downside?
Interested in a Flash-based MAME front end? Visit mame.danzbb.com
Dude, you should invent some sort of GoogleFight system, where you can see which search term gets more hits on Google. That would be clever and effective.
At some point within the last few months, Google removed the "Remove this site from search results" button. I made heavy use of it, perma-banning resultspammer sites like ExpertSexchange.com and its ilk.
Sadly, this button is now gone from Personalized Search, and the resultspammer sites are steadily reducing Google's usefulness to me. Where I could once search for specific tech terms and get a good batch of reference resources, now I'm getting junk portal pages for the top five results.
Sure, I could report a link as spam, but that's a lot more time-consuming than the button, and it doesn't appear to have any immediate results for my searches.
This makes me sad. I've loved Google since I first met her, but I can't be with her if she's going to continue mainlining spam.
http://unxmaal.com
The reason everyone still uses Google isn't because they have the best ranking of results anymore, which is usually encrusted with spam sites designed to beat Google, but because they have the most COMPLETE results. When you search for something rare, Google most likely will return results no other basic engine has. So people have gotten accustomed to checking Google first out of habit more than anything else.
To me, I think the future of search isn't necessarily a better Google, but something different. The problem with Google is the same as its strength - its simplicity. There is very little control on Google for more complicated searches, such as searching only company websites, or searching only encyclopedia content. It's just a big kludge for them to add stuff like travel info or weather or movie info without knowing the intent of the searcher beforehand. Searchers have to get savvier, not just the algorithms. I think search aggregating sites like Seaurch.com which has 200 engines but still uses a simple interface, is a great idea. Sites like Clusty.com also take an interesting approach towards understanding the searcher's intentions.
While it's true that their search algorithm is important, the other very important thing which contributed to the success of Google is that they have always offered honest results. They do have sponsored links but they are clearly marked as such. You know that the main search results are not influenced by advertizer money. Other search engines before Google typically did not make it clear when a site had paid for higher rankings.
Google's motto of "Do no evil" is really just a modernized version of "Honesty is the best policy." Any search new search engine needs not only to have a good algorithm but they also need to build trust with users.
...alternatives would be nice.
Any geek, or normal person who is for freedom of expression, free society, freedom from oppression should be actively boycotting Google until they eschew any relationship with RED China.
FREE TIBET
Re-establish recognition of Taiwan.
Punish Google and Yahoo! (especially Yahoo!)
Politics is the art of looking for trouble, finding it everywhere, diagnosing it incorrectly and applying the wrong fix.
Of course classically F=ma and W=E=int(F dot ds). If the article simply used acceleration instead of velocity (I can't believe they associated something that doesn't change with growth. *sigh*.) then instead of momentum they'd have force. You could then of course measure "energy" by quantizing their growth in a specific direction(s).
If you're going to make an analogy of something with classical physics you could at LEAST get the "units"/equations right.
We need a Neo Google to replace the monstrosity Google has become. Google is an ad driven monster on the road to become another ABC, CBS, or NBC.
Bring back the honest, untainted search organization. There needs to be a search engine with proactive privacy (no analysis, no data retention). One that's fast and bullshit free. No cookies, no javascript, no "sponsored" links.
Just like ebay has become addicted to shills and gray market goods, Google will become addicted to the ad revenue from shady rank fraudsters and porn companies.
Appropriately, the verification word for this post is "liberate."
I think it will be hard (or at least take a loooooong time) to over throw Google as a search engine for the masses even if a better engine comes along. There is really something to be said for people recognizing the phrose "Google it". This is similar to the reason that msn.com is one of the top 10 (or 20?) visited sites, simply because it's the default homepage in Windows IE. There is a huge number of people who don't care enough about a "better product" and will just stick with what is standard. The phrase "Google it" is one of their biggest assests as a company. For a lot of people Google = search and thats invaluable.
Steal my band's record! Seriously,
Thought this was going to be a story about lastgoogle.com.
Article totally ignores emerging fields of LRS (Last Result Search) and SEdO (Search Engine de-Optimization).
But then what do you expect of The Economist? Typical center-right-wing media bias.
tidokoro
what turns a man's karma neutral? lust for gold? power? or just a heart born full of neutrality?
What has made Google from a nifty tool to a worldwide phenomenon is the breadth of their searches--they are no longer about searching the web only. You can search also Usenet, maps, scholarly research, online vendors, books, email, and even your own files and images on your PC. You can also use Google to search the web for images, videos, stocks, catalogs, blogs, and news articles. Simply beating Google in web search won't dethrone them, you'd need to beat them everywhere (or at least make their other search services irrelevant).
As we all know, searching is becoming a main focus not only on the Web, but also within businesses in general. It seems that there are several large companies (namely Google, Yahoo, etc, etc) that are going head to head to take over the market. However, as this article notes these powers are also becoming distracted not only with the fight amongst themselves to become the number one search engine in the world, but also with venturing into other areas of interest (i.e. Google going after the Microsoft software industry). This only leaves the doors open for up and coming smaller firms to offer something that the world has yet to see. While these search superpowers are struggling with each other, small operations are rising up everywhere with new and innovative ideas - the type of ideas that got Google where they are today. It is only a matter of time before the giant falls...before the next big thing comes to the search community. It shouldn't be long now if these power don't come up with something new and innovative.
"Ironically, by posting this on slashdot, I will get a higher rating on google."
Slashdot puts nofollow tag in links, so Google ignores those links.
interesting notes, but until most people can't find what they are looking for, there is no ROI in doing massive relevance engine upgrades, in fact such upgrades may be ROI negative if it breaks expected behavior. this is why i don't expect search to move much in the next five years. there will be tweaks and improvements, but i don't sense a sea-change until there is a large pile of unclaimed money available.
I find it hard to take anyone who quotes Wikipedia seriously. Just because it's popular, doesn't mean it's accurate or in depth. Wikipedia is dominated by lobbyists, government officials, marketing firms, and anyone with a political or social agenda.
It just makes you look amateurish when Wikipedia is your "go to" source.
I'd gladly give up Google for a decent search engine that:
1. Doesn't keep logs of my searches forever,
2. Doesn't turn over my records without a warrant,
3. Doesn't attempt to cross-reference my searches with my activity on non-related services,
4. Isn't interested in data-mining me, and
1. Provides an ssl tunnel so my ISP doesn't have logs of my searches.
The fact is that Yahoo has never had a viable search product. They started out as a shared bookmark file, then became a directory web site. It was never the best way to search the web — the web always grew too fast for a directory to be up to date. But because Yahoo managed to get a lot of visibility early on, they created a mindshare they've never lost. It's the ultimate triumph of branding over substance.
The next one will:
1) Not have the unsightly "Ivory Tower" philosophy that Google has inherited from Stanford - sorry, but Orkut "Circlejerk Bukakke" needs to lay off the Animal House.
2) Not go to countries such as China or India, but taking the greater challenge of developing in our own backyards. There's a bunch of people here that can do the work at the original rate and easily come up ahead in total cost savings versus the "rubber stamp CMM" countries. Slapping them in the face is only going to determine if your trial will be a mock one or a real one.
3) Not considering the Midwest as just "flyover country" due to lack of exclusive colleges and exclusive places. We're all not just country hicks. The ground's been good enough for farmland, manufacturing, missile silos, why not put your datacenter and other things in here as well? Or is it that you're just too busy with writing offshoring loophole law?
Twitter supports and protects racists - by smearing their critics with the "Hate Speech" label.
I wish Google purchased Clusty... it'd be great because then people would see that query results are 'ordered information'... which, after all, is what Google is all about (first sentence). That way, Google would show that it "knows".
The parent AC is correct.
.edu sites who are there principally to educate rather than sell you something. If you have to do general search, add
Big business turned the Internet (the WWW part principaly) into little more than 'online TV'. Because of all the $$$ at stake, we have probably the best search engine around, Google, drowning in ad driven/cash driven search engine spam ('spamdexing').
If you are searching for something in Google, add site:.edu to your search. By doing that. that should lead you to
-shipping -visa -mc -amex and the like to your search terms to block the 'ad pages'
Good Luck!
P.S. The best search engine would just index the single homepage page served up by the webserver at all 4,294,967,296 possible IPv4 addresses (minus the reserved/private/unused ones). The rationale is that if you are paying for webspace and a unique IP address, chances are good you have worthwhile content there. This would eliminate 'spamdexing' in all its forms in one bold stroke. The drawback is that 'online communities' would only be listed by the main IP webserver address and not by the URL which is why Google is in the 'mess' they are in.
If you are doing worthwhile research and want to avoid as much e-commerce as possible in the process, here again are some helpful tips....
add site:.edu in Google searches (mentioned earlier - restrict search to non-profit educational sites)
add -shipping -visa -mc -amex and the like to Google searches (mentioned earlier - helps eliminate product 'ad pages' and leave behind product review pages
search well-known 'info sites' like http://citeseer.ist.psu.edu/ or http://www.wikipedia.org/ If you can't find the info you want at those two places for free, the content you seek is either not on the Web or (more likely) you have to pay for it to get access. We all know how Net-savvy people hate to pay for anything they find on the Web....
As a last resort, search Google's Usenet interface at http://groups.google.com/ Helpful info can be found there but be prepared to really dig for it! =/
I don't see Google dropping off, in fact, I see them gaining more momentum in search. They have other interesting services too that will get more and more popular as the casual Google user starts playing around with the other things have to offer. They truly are a innovative company that can even the score with MS and make it less of a MS world which is good for the consumer.
That's definitely not what we want. Unless you want to ignore the difference between growing and shrinking marketshare, of course...
Yeah, anything hosted on a virtual host is worthless. Where is the rolly-eye emoticon when you need it?
The more you know, the less you understand.
Except we do want to compare (one-dimensional) vectors. Unless you don't care whether a company is gaining or losing marketshare?
It appears to me that Google (or a 3rd party organization) needs to start a new "trust" index.
Kind of like what Yahoo used to look like, where sites that are known to be useful and are actively updated are maintained in some sort of internal database that is updated by Google staff or a non-profit organization.
This site or internal database should be used to inflate the relevance of documents PR/relevance on the trusted domains. The editorial policies of the site might influence these decision.
There could be two kinds of trusted skew. End-skew and transverse-skew.
Sites like Wikipedia might only get an end-skew because it's too easy to subtly modify a page to point to a site of your own (and thus get a PR boost by proxy).
Tightly controlled sites like MathWorld, on the other hand, might be granted transverse skew.
It could also be used to blacklist sites (like Digital Point) and skew the PR down when it traverses links from there. I would also give a small negative skew, to say, Digg.
So maybe what I'm getting at is content "trustability" metadata. The lower it is, the less likely Google might consider a document from that domain to be authoritative or relevant. If it is very low, it would assign negative skew to any linked sites as well. The higher it is, the more highly it would score it, and at a certain point it pass this bias on to linked sites.
And it would be nice if Google could determine user content from article content on syndicated sites that allow comments. Links and terms contained therein should be weighted differently.
Voluntarily added classes that googlebot can parse for divs?
THIS THING CAN TURN ON A DIME, MACROSSZERO STYLE ALSO FUCK BETA, ~NYORON
Google isn't going to place a datacenter anywhere unless it can easily get at least an OC-48 from two different backbone providers.
That rules out much of the midwest except for the areas around Chicago and Denver.
THIS THING CAN TURN ON A DIME, MACROSSZERO STYLE ALSO FUCK BETA, ~NYORON
I just think it's cool that there are marketing geeks who know that mass x velocity = momentum.
Currently hooked on AMP
Yeah, the metasites & link farms are sure gaining in annoyance factor - in direct proportion to their page rank. Which reminds me... has the "remove this result" feature disappeared for any other users of Google Personalized?
Anyway... Rollyo + Furl + Foxmarks = Crazy Delicious.
Any hypothetical Google Killer would be well served by implementing a union of the features of the above utils. Sync my bookmarks (the ones in my menu bar, not just on a page you own), archive the pages for me, make keywords out of the bookmarks' directory structure, make keyword:domain associations out of those relationships in my bookmarks, and let me search only the relevant domains by using a simple "#_categoryname_#:" operator in the search. Then show me domains with rising representation in peoples' bookmarks and which have strong associations with my category of interest. How hard is that? Nobody's rolled it all together in a do-it-all-for-ya fashion, but the features are all out there piecemeal.
Pi Ran Out
You should be able to hit a button that says such and such a site is relevant for the given search, rerank all other sites accordingly. Sites for musical horns aren't going to highly crosslink with sites using horn to mean "a high pommel of a Western saddle (usually metal covered with leather)". It's also annoying that there isn't a way to sort by pages with major recent content changes.
I don't think the parent is really arguing this per se, I suspect that they simply feel it's impossible in the long run to separate garbage content from real content on virtual hosts. He has a vaguely interesting idea but it's obviously not practical.
bad math alert. 'nuff said.