Computer Scientist Calls For Web Search Shake-Up
alphadogg writes "Given the seemingly non-stop battle between Google, Microsoft and others in Web search, you might think this is a pretty fertile area for new ideas. But a University of Washington computer science professor thinks otherwise and is calling on academia and industry to get way more creative. Timed to coincide with this year's 20th anniversary of Tim Berners-Lee springing the World Wide Web upon us, Oren Etzioni Thursday will have a commentary titled 'Search needs a shake-up' published in the journal Nature. The main obstacle to progress 'seems to be a curious lack of ambition and imagination,' Etzioni writes in the piece, which he acknowledges 'is meant to be provocative.'"
I think this idea is DOA. A startup vs Bing vs Google. Enough said.
C'mon.
This dude's name is just a scrabble draw of the 6 most common letters in the English language...and z.
And it's over web search?
That's odd, I thought it was just Google.
I'm sorry, but I Bing doesn't really count as a serious search engine, at least for me.
Search engines aren't about finding stuff, they are about shoving stuff into you in a way that maximizes ad-revenue. And as far as I can tell, they couldn't do that any better than they currently do.
But I need access to a data center with thousands of servers, petabytes of storage, and gigabits/s of bandwidth to demonstrate it.
make imaginary.friends COUNT=100 VISIBLE=false
Maybe Search needs a Problem shake up. Innovation is great, but when I search now, more than ever, I quickly find what I am looking for. Spam results remain an issue, but for the most part, I have what I need in seconds.
So, what's the problem? Why does search need a shake up? Do we need to manufacture new problems? The drinking straw has been around for a little over 100 years. Get creative beverage engineers, make a better straw. One that doesn't suck. ... Oh.
I understand that the piece is admittedly supposed to be provocative and start a dialog. So, what search problems do you have today? Is the fact that most search is text based something to overcome? Is there a frontier of a search engine that helps us ask better questions? How about analytical engines like wolfram alpha? Where can these technologies go? What else IS there that we need?
Shenanigans!
http://www.prnewswire.com/news-releases/decidecom-launches---helps-consumers-purchase-electronics-with-no-regrets-124179959.html
"No other team has the technology, talent and experience in predictive systems to solve this problem," says Oren Etzioni, Decide co-founder and computer science professor at the University of Washington. "We've built the only broad-scale model lineage, text and data mining systems that predict future price and model releases to address this complex consumer problem."
The dude is just plugging his shopping-search engine, and astroturfing a computing conference as part of his marketing campaign.
What a cock.
Innovate. That's an order.
Perhaps the lack of creativity and ambition is from the threat of lawsuits....patents, trademarks, and copyright.
All the search engines now generally available use tag-based methods. Among librarians -- who are the real professionals at information searches -- that's a method for quick superficial results.
Promoting deeper research and understanding is best done with subject-based methods. E.g. the way libraries are organized, Library of Congress cataloguing system, etc.
Problem is, AI is nowhere near good enough to do that yet. So you need to hire humans, lots of humans, to actually think about the information. Which is way too expensive.
Another tag-based search engine, even if it's Etzioni's own, is just one more look-alike in the crowd.
Web search has stagnated badly. Or rather it has regressed actually. The databases are getting larger, but the search possibilities are significantly worse than they were in the past. For example, AltaVista had a "near" keyword that could be used to specify that two search terms need to be within 40 words of each other. It also had full boolean logic. What Google offers is pathetic in comparison and is a massive step backwards. Especially with the far larger number of web-pages today, I find myself regularly wading through pages and pages of irrelevant results. (Yes, I am using all that Google offers in semantics, thank you.)
I know this is due to Googles technical architecture. But that itself is outdated. They could do AltaVista (classic) like search. Instead they basically have the same design as 10 years ago and PageRank now fails regularly to performm not that it was that good before. Maybe this is also because Google has very low incentives to do better, as they are a) King of the hill and b) make their money with advertising, i.e. as long as they are noticeably better than the competition, they are fine. And other seem to have trouble reaching even the (relatively low) quality of Google search results.
Truly a sad state of affairs.
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
Google is doing an outstanding job getting information collected. But at the end of the day it's just a "dumb" search that matches keywords with content trying to hide results that are just junk. Google is all about quantity of information with some basic garbage filters. There is no real relational information being gathered between content sources.
The next step in search should be to start analyzing information based on themes and returning results that may not have the same keywords, but are related in themes. Also, ranking results by reading level, length, etc would probably go a long way to getting rid of junk and help in research. Real text analysis. That would help meaningful content get to the top of results.
Work Safe Porn
I've been steadily using Blekko more and more as an alternative to Google. Its back to simple, with the addition of slashtags to refine searching. The complete banning of content farms caught my attention. its still not ideal but it is growing.
... who's hoping to monetize his public-funded research and make even more money for himself.
Seriously, have any decent and successful academic search engines NOT been sold off to the highest bidder? I remember Metacrawler came out of UW as well - it became popular and was "commercialized". As was WebCrawler, for that matter. And we all know about Google, of course - those Stanford guys did pretty well for themselves...
#DeleteChrome
There are ads on Google's pages? Can't say I noticed since I use ad blockers.
How does it pay for itself? Business model? Ads are a proven revenue stream for search. Do you have an alternate?
Nobody will license the tech, Lucerne, etc already give you local site search.
You could do a paywall. Could work but wouldn't compete with Google directly, "free" would still dominate the public space. Otoh you could be sustaining if storage and hardware are cheap, scale somewhat with traffic and of course bandwidth and power use would scale just fine (as long as results don't need to be too current).
A fool throws a stone into a well and a thousand sages can not remove it.
Viva Gopher Space!
"Promoting deeper research and understanding is best done with subject-based methods. E.g. the way libraries are organized, Library of Congress cataloguing system, etc.
Problem is, AI is nowhere near good enough to do that yet. So you need to hire humans, lots of humans, to actually think about the information. "
This is not true. Automatic document-topic induction techniques (look up Latent Dirichlet Allocation, for example) are fairly sophisticated these days. It's not strong AI, but it does human-useful soft-clustering from text alone in a way quite useful for search. Roughly it attempts to figure out the intrinsic subject(s) of your search.
It means that you can search for documents using keywords A,B,C and it can return documents which have none of those words and yet are likely to be related because they share many words with the set of documents which tend to have A,B,C frequently. A search engine can also remember recent searches in a session whose desired topics are likely to be correlated.
All the search companies use close to state of the art versions of these methods.
(Google is finding that good weak AI combined with Big Data can do remarkable things --- statistical language translation is much better than I expected, and that's substantially harder than document clustering).
First of all, there's an opportunity for other search systems even within Google - as TFA says, Classic Google Search isn't really designed for the constraints of a cellphone screen (much less for voice-based searches, where the keyword model might not even be the right engine to put underneath the user interface, unlike mobile-phone search where it probably is.) A good mobile-phone search UI would be a real improvement, whether or not you end up selling your startup to Google, or marketing your search tool as an Android App..
Second, any time you're talking about Design as a strategic business tool in an even vaguely computer-related area, you've got to think about Apple. They may not want to eat your lunch today, but if they ever do, they'll come out with a product that's insanely great, paradigm-shifting, and shiny, and you'll have to deal with them. Maybe it'll look a better product in your current space, like iPods taking over the MP3 market, but then you find out that iTunes is at least as important as the hardware itself, or maybe it'll be something even more subversive, like an App Store.
And then there are companies like Tellme or Genesys, which have been in the IVR / dumb-phone voice search space for decades, and probably have cool things to do in the smartphone space as well, or Nuance (who own Dragon Dictate and a few other technologies.) (And Tellme got bought by Microsoft a few years back, so there are some hooks into other large product sets.)
Bill Stewart
New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
Yahoo may have been in more things, but Altavista was really the search engine to beat, and Google beat them. And of course there were other search/portal companies (like Excite, which @Home unfortunately decided to buy/merge for $Nbillion.)
Bill Stewart
New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
Science fair gold medalist, 17, invents better way to search Internet
You can find the actual paper online if you'd like. Being as it is through Nature, and he is not writing it as part of an NIH-sponsored project, the paper is behind a paywall.
Fortunately it is in the main Nature journal, which is possibly the most subscribed-to journal in science; hence if you don't work for a place that subscribes, you can probably get it at your local library.
And no that is not an endorsement of putting academic research papers behind paywalls.
Damn_registrars has no butt-hole. Damn_registrars has no use for a butt-hole.
There is no shortage of software products many of us use online that are rooted in university CS projects. Instead of a generic call to arms, perhaps this CS professor could do something about it. He has the means and the resources at his disposal, after all.
It means to take a bribe to ruin a good product by integrating an inferior search function. "Verizon binged my Droid".
Help stamp out iliturcy.
Yeah, but if I started coming up with content that Google couldn't read, it'd be bad for SEO. My rankings would drop and nobody would come and click on my AdSense ads!
Not since Multics has academia showed much ambition and imagination in computer science.
Peer review is a convergent process viewed over decades and centuries. It's not a particularly good local filter. For my taste, it gets far too much credit for being an effective gatekeeper (most of the time) against errant nonsense. Our pathetic current generation search technology supplements the stem "brog" with a potent example, almost as if it could read my mind.
We don't think that peer review in software amounts to accreditation of bug-free perfection. In a good setting, it might be a fairly effective defense against architectural howlers. Is scientific peer review somehow magically better? As Einstein once said, "the secret to creativity is knowing how to hide your sources". His corollary is less often quoted: "the secret to authority is to hide your scowlers".
In the rare case when we peek behind the scientific curtain, it's an unseemly mess of data collusion, warranting special commissions to assure us that nothing was actually as bad as it looked on virgin grep: political cabals to render suspect data in the best light are business as usual. (We did learn that a paper making sophisticated statistical claims doesn't necessarily number a statistician among the secretive panel; as per established custom, the rubber stamp is equally valid either way.) The commission is right in their verdict as viewed against actual scientific norms if you shed the idea that peer review is worth a pot to piss in divorced from it's established track record for convergence to scientific sanity after a decade or two or ten.
This new-age search guy missed the entire AI memo. The field of AI had all the creativity in the known universe (and then some) back in the 1960s when pronouncing imminent breakthroughs.
And besides, Wolfram Alpha has already staked a huge position in search as-we-don't-yet-know-it, with a whole lot more credibility behind it than anything I expect to see delivered from the hands of this puff-piece pendulum.