Slashdot Mirror


Seven Search Engine Evolutions for '07

eldavojohn writes "I found a short but interesting list of predicted evolutions of search engines that will most likely be implemented in 2007. While some are vague and obvious like a better human interactive experience, there are others that are worth looking into like alternative means of indexing and using semantics — not keywords — for matching documents. The author of this list is Dr. Riza Berkan, also the author of 'Fuzzy Systems Design Principles.'"

72 comments

  1. Comment removed by account_deleted · · Score: 3, Interesting

    Comment removed based on user account deletion

  2. cant be wrong by bedonnant · · Score: 2, Funny

    an article beginning with a bill gates quote cant be wrong.

    --
    ~~~ Paf. Le chien.
  3. Slashvertisement by TodMinuit · · Score: 3, Insightful

    Most of the things listed seem to make results more ambigious, not narrow them down.

    Even if you could radically change the way a search engine works, you then face an even bigger task: Forcing users to radically change their searching habits to fit your search engine.

    And what the hell is "QDEXing"? Google reveals nothing, therefore we can conclude it does not, in fact, exist.

    --
    I wonder if I use bold in my signature, people will notice my posts.
    1. Re:Slashvertisement by Mostly+a+lurker · · Score: 3, Insightful

      Yes, if you search for "QDEX search", you find a few links, but is is basically a plug for a specific search engine: Hakia Search Engine

    2. Re:Slashvertisement by Evangelion · · Score: 2, Interesting

      Google reveals nothing, therefore we can conclude it does not, in fact, exist.

      God, that is so beautiful.

    3. Re:Slashvertisement by Anonymous Coward · · Score: 1, Informative
      Just tried searching on Hakia - this thing is truly a piece of crap. Altavista circa 1995 would have returned better results than this. I tried searching for "Java 1.5 API Documentation," and a link to Sun's documentation does not even appear on the front page. The top results are not even related to Java. Indeed, the top results seems to be gibberish:


      The glimpse will appear him whether it was at him; and they to men who are seemed a goodly mild reserve in the the impression continents except java api documentation my own breast, to could be at first a quiet, then a sthenic, vnknowen, or the Constables and be ...


      Let's all hope 2007 is not the year we see Google implement QDEXing.
  4. In 2007? by cp.tar · · Score: 3, Informative

    Ah, let me just tag this article 'semanticweb'... there, much better now...

    As early as 2007? Now I don't really believe that.

    It may get partially implemented, and probably only in English.
    Maybe Chinese as well.

    Most of the other languages will have to wait for quite a while beforehand...

    Not to say semantic search is a bad idea or anything... I, for one, would like to see some image-, audio- and video-search based on some kind of semantics, not tags and names... but that'll just have to wait.

    --
    Ignore this signature. By order.
    1. Re:In 2007? by Rakshasa+Taisab · · Score: 1

      Ah, let me just tag this article 'semanticweb'... there, much better now...

      Perhaps in 2007 we'll get semantically correct posts too.

      --
      - These characters were randomly selected.
    2. Re:In 2007? by cp.tar · · Score: 1
      Perhaps in 2007 we'll get semantically correct posts too.

      Evolution of computers has, alas, proven to be much speedier than evolution of human mind.

      Semantically correct posts will have to wait until computers start writing them.

      --
      Ignore this signature. By order.
    3. Re:In 2007? by patro · · Score: 1

      I'd be satisfied with partial (prefix) search for a start. Searching for "eco*" should return economy, economies, economist, etc.

      This alone would help a lot.

  5. Re:Still waiting for the following paradigm shift. by bedonnant · · Score: 1

    unfortunately, the explosion in number of blogs is likely to make that goal impossible.

    --
    ~~~ Paf. Le chien.
  6. So important, MS put it into Vista by ZahnRosen · · Score: 1

    Putting this kind of emphasis on search is wise. MS knows this and put a vastly improved searching into Vista. Some say its better than other desktop searches. This is similiar to having a good memory in a human being, quite useful in practice.

  7. Let me be the first to say... by Overzeetop · · Score: 3, Funny

    ...that I was unimpressed when "fussy logic" was a buzzword a decade ago. I do not look forward to it's resurgence in the marketing lexicon.

    --
    Is it just my observation, or are there way too many stupid people in the world?
    1. Re:Let me be the first to say... by GeffDE · · Score: 0

      That might be my favorite typo ever. Mod it funny, coz it is!

      --
      It has been a nervous year, with people beginning to feel like Christian Scientists with appendicitis.
  8. Seven Spam Evolutions for '07.. by Anonymous Coward · · Score: 1, Informative

    Hmmm.... and these predictions are in a press release from someone with a new search engine which (surprise!) opens shop in '07.

    Can someone tag the article "spam"?

  9. Change number 6: done (almost) by littleark · · Score: 1

    Yoople! has already introduced a more engaging human-like search experience and let the people collaborate in order to create a better indexing.

    Ok, someone could say it's the perfect way to permit abuses and lot of work has still do be done, but it's a smart proposal to start from. Don't you think?

    http://www.yoople.net/

    1. Re:Change number 6: done (almost) by Anonymous Coward · · Score: 0

      Don't you think?

      Actually, I think PreFound has you beat.

    2. Re:Change number 6: done (almost) by littleark · · Score: 1

      Very nice indeed! Never seen it before!

  10. Re:Still waiting for the following paradigm shift. by RealSurreal · · Score: 1

    Fortunately that's going to peak in 2007 as well. At least according to Gartner for what that's worth http://news.bbc.co.uk/1/hi/technology/6178611.stm

  11. Re:Still waiting for the following paradigm shift. by cp.tar · · Score: 2, Funny
    The most relevant search results are the ones I've seen before, else it's called "exploring" not searching.

    But then you have a browser called Internet Explorer... Confused yet?

    --
    Ignore this signature. By order.
  12. Get your own house in order by RealSurreal · · Score: 4, Interesting

    Let's hope it's better than the author's search engine, hakia.com. Just used it to search for "nike stores in the uk". First result is an etailer in the US, all the others are spam sites. Looks like we've got a long way to go before search engines actually understand what I'm looking for.

  13. i.e., all search engines will look like hakia by Anonymous Coward · · Score: 0

    This article is a promo for hakia by its CEO. For example, his #1 point is:

    1. The first time a search engine will have an alternative to indexing; new technology like QDEXing will be developed.

    Unsurprisingly, QDEXing is a term invented by hakia; see http://www.hakia.com/technology.html

    3. The first time that search results will include highlighted best sentences as a result of semantic analysis rather than bolded keywords as a result of finding incidences.

    Unsurprisingly, this is what hakia does. And so on down the list.

    The part that is surprising to me is how BADLY hakia does these things. For example, I searched for a bird I saw yesterday, the "eared grebe". The results are weak relative to MSN, Ask, Google, and Yahoo, but-- aside from that-- here's an example where they highlight the "best sentence" in a snippet:

    Page 3 of 3 | Page 1 | Page 2 | Eared Grebe Podiceps nigricollis Albino. Antelope Island Causeway. Davis County, Utah. October 2003 : by Nicky Davis

    The fact that they're not even finding sentence BOUNDARIES properly pretty much destroys the claim that they're extracting meanings from text. This isn't to say that someone won't do it someday, but... not these guys, not today.

  14. Semantic Searches? by eebra82 · · Score: 2, Funny

    Can somebody please explain what a semantic search would look like? I'm not sure if I understand the meaning.

    1. Re:Semantic Searches? by bronzey214 · · Score: 1

      It means that if you wanted to know the score of the a certain sporting event.

      We'll go with Steelers vs. Bengals for now.

      You could type in score Steelers and Bengals and it would return something like...

      20 - 17 Steelers
      11:23 3rd Quarter
      Bengals Ball on Steelers 47

    2. Re:Semantic Searches? by alexhard · · Score: 1

      Since I don't really understand semantics either...would that happen automatically or would it have to be a pre-programmed function like google's currency exchange, or calculator? I'm guessing not, but wouldn't that be amazingly hard to implement?

      --
      Infinite time means everything that can happen, will. You being you is absolutely incidental. You do not exist.
    3. Re:Semantic Searches? by a.d.trick · · Score: 1

      The term "semantic" is very poorly used. Almost, every search engine employs some level of semantics. For example, text that is inside page headers (like <h1> elements) has more importance than other text. Things like meta tags are seached for information. The people who talk about semantic seach engines are really asking for more semantics than currently exist. It's something of a pipe dream though, because it takes a long time to change the way that websites talk to each other.

  15. Google is here to stay by bronzey214 · · Score: 1

    Honestly, I can't see myself NOT using Google in the years ahead. It's become too ingrained in my lifestyle. If I don't know what something means, I google it. In fact, in the rare times that Google is down, I find myself lost and constantly clicking "home" (www.google.com) only to find it doesn't work.

    1. Re:Google is here to stay by robogun · · Score: 2, Insightful

      Google is a part of your lifestyle? You can't be for real unless you're some marketing droid.
      But the fact is there are people out there who say "I won't change, it is teh easy," but they need to realize Google records every search they make.
      http://www.dummysoftware.com/gzapper.html

    2. Re:Google is here to stay by bronzey214 · · Score: 1

      No, not a marketing droid. I look up basically everything I do on Google.

  16. The way forward by rumplet · · Score: 1

    Clearly the way forward for search is to make an algorithmic search engine, and have it scrape information from a dead human edited directory.

    Google directory. Bringing you the future today.

    1. Re:The way forward by Kensai7 · · Score: 1

      What about a "learning" search engine (and directory)? I prefer to spend 15-minutes-per-week to tune this engine by feeding it with words, sentences, documents, etc. and then leave heuristics do the rest. Most of the things I search are similar to my work, hobbies, and interests. Ranking the results is critical for not wasting time. Of course, a great and intelligent algorithm is imperative. My faith is in Google. Again...

      --
      "Sum Ergo Cogito"
  17. Evolution, you say? by __aaclcg7560 · · Score: 1

    Why not some "intelligent design" when it comes to search engines? "Spore" looks like a nice game but I wouldn't base a search engine on it since the result set would be too Darwinian. :P

  18. the rare times google is down by way2trivial · · Score: 1

    I find it to be a DNS error local to me.

    next time, try the IP
    (currently, 64.233.187.99)

    http://64.233.187.99/

    --
    every day http://en.wikipedia.org/wiki/Special:Random
  19. At least one is already being done by jorghis · · Score: 1

    "The first time a search engine will let users evaluate answers on the spot by displaying uninterrupted and coherent text snippets, often letting searchers forgo having to click through to links and saving time."

    Doesnt ask.com give you this functionality already?

    1. Re:At least one is already being done by Anonymous Coward · · Score: 0
      But what good is the front end of ASK.COM (or any search engine) if it strikes out, 9 times out of 10? Style over substance for results doesn't cut it.

      I sure hope LIVE.COM is going through some sort of rapid learning curve because it, too, has gone way backwards in the last few months.

      Or did I miss the PR release that both have just given up like Excite did years ago -- and gone the route of blatant consumer shopping and ads?

  20. This is Great by MrCrassic · · Score: 1

    That was a pretty good article, even though most of the stuff on there was pretty obvious (for most of us /.'ers) to begin with.

    I think it was only inevitable that internet searching focuses more on the "type as you speak" initiative rather than the older term-by-term searching of the past. This would be great for us, but I really see that the benefits would cater more to the average man/woman who already has a difficult time searching because they are using "the wrong terms."

    I really think that Google will be the first search engine to implement most of these changes, since their user-base and R&D is already above the roof. I think that Microsoft will also implement this soon with Live, since a sizable portion of their research teams are testing searching based on semantics as well.

  21. A lot of this is available now by saddino · · Score: 3, Informative

    4. The first time that a single query will bring a gallery of
                results equivalent to running multiple queries about the
                meaningful variations of the same topic.

      5. The first time a search engine will let users evaluate answers
                on the spot by displaying uninterrupted and coherent text
                snippets, often letting searchers forgo having to click through
                to links and saving time.


    Both of these have been available for a couple of years: e.g. searching on the single query "semantic web" using CQ web, reveals clusters such as these:


    fuzzy sets
    fuzzy systems
    neural networks
    set theory
    soft computing
    aritifical intelligence
    control systems
    expert systems


    And each one of which is linked to a specific page of results using sentences instead of snippets, e.g. for artificial intelligence:


    1. This paper will present the foundations of fuzzy systems...noteworthy objections to its use with examples drawn from current research in the field of artificial intelligence.
    Fuzzy Systems - A Tutorial
    2. The most obvious implementation for the fuzzy logic is the field of artificial intelligence.
    Fuzzy Logic
    3. Ultimately it will be demonstrated...fuzzy systems makes a viable addition to the field of artificial intelligence and perhaps more generally to formal mathematics.
    Fuzzy Systems - A Tutorial
    4. The paper gives examples of the fuzzy logic applications with emphasis on the field of artificial intelligence.
    Fuzzy Logic
    5. A collection of articles and other technical resources for artificial intelligence.
    PC AI - Fuzzy Logic


    1. Re:A lot of this is available now by cain · · Score: 1
      The first time that a single query will bring a gallery of
      results equivalent to running multiple queries about the
      meaningful variations of the same topic

      That's the worst haiku I've ever read.

  22. advances in image search by Anonymous Coward · · Score: 0

    most imagesearches give you a filtered and nonfiltered option but where is the porn only search from a big company? you can trick alexa into giving you porn only i wonder if they know but i bet they got the rankings wrong! rofl come on google, give us what we want!

  23. Re:Still waiting for the following paradigm shift. by Anonymous Coward · · Score: 0

    You want to do a search that only covers pages you've seen before. Doesn't Google Desktop do this?

  24. 7 Things Hakia Will Promise but Fail to Deliver by billstr78 · · Score: 1

    Some of these things (1,6) sound pretty specific to technology that the author's company: Hakia is promising to produce this year. Some of these items (2,4,5) are already being performed by major search engines, but are done behind the scenes and are not immediatly obvious to the user. #2 Will continue to be perfected over thext 20 years, not the next 12 months. #3 Sounds like a reasonable extension to the traditional practice of bolding keywords. I'd like to see this implemented, though I think it will only come with good progress in the area of #2. #7 Is actually a pretty good insight into the way top engines will use thier computing power after they've already crawled and indexed in the standard fashion, most of the 15B good pages on the current web.

  25. Re:Still waiting for the following paradigm shift. by dodongo · · Score: 1

    Yeah, there is some Google thing you can do that tracks your search history and promotes links you frequent.

    Personally, that freaks the shit out of me, so I don't have it enabled -- but you're right, the basics of that technology are at least partially implemented by at least one major search engine.

  26. Wildcard search , will it ever be available??? by Anonymous Coward · · Score: 0

    :-((((. I know it requires enormous resources, but I need it badly. It would be like heaven if I could wildcard search, because you could show exactly what you are searching for.

    1. Re:Wildcard search , will it ever be available??? by FormOfActionBanana · · Score: 1

      like "cat*" to get cat AND cats!

      --
      Take off every 'sig' !!
    2. Re:Wildcard search , will it ever be available??? by amodm · · Score: 1

      At the risk of sounding redundant/irrelevant, thats called stemming. It brings any word into its "root" form. See http://www.comp.lancs.ac.uk/computing/research/ste mming/general/

      Also, regex queries would be very difficult to implement on search engines (if not impossible), coz the nature of search is different. In every regex query, the pattern is the one which is pre-processed, while in search engines, the text is pre-processed. Because of the different nature of pre-processing involved, regex on search engines might not be possible.

    3. Re:Wildcard search , will it ever be available??? by Anonymous Coward · · Score: 0

      You already get "cat" AND "cats" when you search for "cat".

    4. Re:Wildcard search , will it ever be available??? by SpectralDesign · · Score: 1

      My understanding is that google already does something to search queries so that a search for either cat or cats will actually draw results from both sets... I think they call it "smart search" or something like that, and they've been doing it for quite a while.

      Although I agree, it'd be nice to be able to do wildcard searching... talk about handy when crosswording!

      non sequitor: you ought to check out kartoo http://www.kartoo.com for an interesting twist on searching!

      --
      Be who you are and say what you feel, because those who mind don't matter and those who matter don't mind. - Dr. Seuss
    5. Re:Wildcard search , will it ever be available??? by bobwyman · · Score: 1

      The smallest unit of data indexed by most search engines is the "word" not the "character." Thus, it isn't possible to create regular expressions that work at the character level, although something approximating "regex" on strings of words has often been implemented and there is no reason why full regex couldn't be supported fairly easily -- as long as you accept that "words" are the base symbols in the regex, not characters. For instance, "phrase search" and proximity search operators like "near," "within," "before," "after," "same sentence," have been implemented in many search engines and provide some of the capabilities of regex -- with words as the base symbol.

      Many search engines implement "stemming" as a means of covering the most common requirement for sub-word query predicates. Stemming allows both "cat" and "cats" to be treated as the same word. A system that provides stemming will often use "stems" as the smallest unit of data indexed rather than words. Thus, in the index, there would only be one entry for "cat" which represents both the words "cat" and "cats."

      Of course, character-based search exists today for small corpora -- grep is the most well know system, I think. But character-based search on a large corpus is a very difficult problem that, using today's algorithms, would require massive computing capacity to provide to even a small number of users. However, in some specialized applications, this can be justified. One technique for reducing the resource cost of character-based regex on large collections is to index bigrams, trigrams, quads, etc. (i.e. use multi-character sequences as the base "symbol" or smallest unit indexed. ) However, such ngram based systems still don't deliver the serving capacity that one would expect from something like a Google or Yahoo!.

      One "mixed" strategy that has sometimes been employed is to support a more complex query that has a "first part" which is word-based (as in traditional search engines). This first-part is used to select a subset of all the documents in a collection. Once the candidate-result-set is produced, one would run a character-based regex (similar to grep) over the results. Of course, you can probably see that you can easily hack up such a two-phase search yourself by sending a word-based query to a search engine and then doing the character-based regex on the result pages.

      I fear we are stuck with word-based search for the foreseeable future. While folk often ask for character based regex, the reality is that it is simply too hard to implement today. Also, the size of the market that has a real need, as opposed to a desire, for this capability is much smaller than one might think....

      bob wyman

    6. Re:Wildcard search , will it ever be available??? by FormOfActionBanana · · Score: 1

      Stemming or lemmatisation are only irrelevant to the fact that I was making a JOKE. Cat* will match all sorts of words that have nothing to do with cats.

      I actually had a customer requesting wildcard search, and used "cat* for cat(s) as an example.

      --
      Take off every 'sig' !!
  27. Re:Still waiting for the following paradigm shift. by foniksonik · · Score: 1

    Use Delicious and search your bookmarks... easy-peasy and you can do it now.

    But you're right. Search Engines should be keeping a list of sites you visit and associating them with your user account and IP address... so you can get a list of previously visited sites that meet your keywords at the top of the page or in a sidebar... OR SHOULD THEY?

    --
    A fool throws a stone into a well and a thousand sages can not remove it.
  28. So maybe they'll implement decades-old tech? by whitroth · · Score: 4, Insightful

    I know from an ex-wife, a librarian, that librarians have been doing searches for fifteen or twenty years using such constuctions as NEAR . None of the popular search engines, from google on down, do this. It would *certainly* make my life easier, and result in relevant hits, rather than 100k hits because some asshole advertisers have thrown a laundry list into their META tag.

                    mark

    1. Re:So maybe they'll implement decades-old tech? by Anonymous Coward · · Score: 0

      Yahoo actually does support NEAR...

  29. Weighted sorting is all I want by foniksonik · · Score: 5, Interesting

    I just want to tell the engine that keyword 1 is 5 times as important as keyword 2

    Give me a slider control that instantly filters the results... ie: have the first 100 results waiting for me with 20 showing, then let me adjust the weight of my keywords until I get the list I am looking for with individual items falling off or being added to the list as I adjust the controls.

    --
    A fool throws a stone into a well and a thousand sages can not remove it.
    1. Re:Weighted sorting is all I want by Anonymous Coward · · Score: 0

      That is a really good idea - I hope someone from Google is reading this :)

  30. It Is About Time by BoRegardless · · Score: 3, Insightful

    I am sick of getting 100,000 irrelevnt hits & then doing dozens of narrowing searches, only to find that the word & phrase hits are all in different paragraphs or even unrelated articles on the same page.

    Bring it on NOW.

    1. Re:It Is About Time by Anonymous Coward · · Score: 0

      It would be nice to have check marks on search result for "no more of these" to filter off the junk links. Also would be nice if these are kept as user's preferences as well eventually fed back to the search engine.
      As it is right now, there are too much junk (compared to a few years ago) in the search results.

    2. Re:It Is About Time by robogun · · Score: 1

      Also, they need to dump the useless "supplemental results" or at least let us turn that crap off.

  31. Press Release by BobGregg · · Score: 1

    Good grief... it's not a scientific paper, it's not a journalist's article, it isn't any meaningful content at all - it's a press release. Right off of BusinessWire. What's next, Ron Popeil's predictions for 2007?

  32. maybe a search octopus instead of a puppy by victorvodka · · Score: 1

    When I go to do a search on my computer, I'm comforted by that little doggy. I wish Google would follow Microsoft's example and replace the little box to type in a search query with an animated animal, something with more limbs for going out on those internets and finding stuff.

    --

    The flag just makes more sense than the constitution. - Judas Gutenberg

  33. About freakin' time. by (Robo_Bro) · · Score: 1
    People have been futzing with the concept through data-mining for years...it's about time it went into a search engine. And what an engine! I tried it out, and compared it to Google (I had to re-enter the search query in a different syntax to get relevant results) - and I found it more useful! Who knew? Two bones to pick: 1) "7 search evolutions" seems like one idea rehashed into seven bullet points. A little redundant. 2) FTFA:

    "However, heavy users of search already understand that the average search takes 11 minutes and 50 percent of searches are abandoned." Um... no, I didn't know that. Where did you collect the data? How did you calculate that percentage and average?
    --
    "It's never the things that happen to us that upset us, it's our view of them." -Epictetus
  34. Re:Still waiting for the following paradigm shift. by LoonyMike · · Score: 1

    The search history wouldn't be enough. There's lots of pages you visit through links in other pages (or typed-in urls, for that matter), not just pages you searched for and followed in the search results.

  35. Re:Still waiting for the following paradigm shift. by dodongo · · Score: 1

    That's even scarier.

  36. Research topic for decades by Beryllium+Sphere(tm) · · Score: 1

    It's been extraordinarily difficult to get the kind of results this guy is talking about, and that was in a research environment that was free of SEO spammers deliberately attacking the algorithms.

  37. Hakia results are less relevant than Google by nektra · · Score: 1

    Well, I have tried some keywords and Hakia has objectively less relevant sites when Google not.
    Don't make Adversing with Slashdot guys help when you have nothing new to offer.

  38. "Answer Engines" are doomed by porneL · · Score: 1
    There's a problem with über-smart semanting search engines - if they provide answer right away (by understanding semantics and/or choosing very relevant snippet), there will be little incentive for users to visit web sites that provided the information. This means that search engines will steal ad revenue from content providers and content providers will revolt agains such engines.

    This is already a problem to some extent - Nielsen wrote about this in 2k4.

  39. Re:Still waiting for the following paradigm shift. by Mex · · Score: 1

    As am I. That's why I switched my homepage from Google to the Wikipedia. It's a faster gateway into real information.

  40. Re:Still waiting for the following paradigm shift. by Anonymous Coward · · Score: 0

    You do realize that it's not possible to put pages you've seen before first without knowing which pages you've seen before right?

  41. Seven predictions in Web search '07 commented by j.leidner · · Score: 1

    Predictions are always difficult. Here are some comment from somebody
    working in the field:

    > 1. The first time a search engine will have an alternative to
    > indexing; new technology like QDEXing will be developed.

    Indexing is a pre-requisite for fast access of retrieval results.
    Even distributed peer-to-peer indices that are a very attractive
    idea suffer from bad performance due to the absence of a monolithic
    index owned by an organization with huge bandwith.

    > 2. The first time ontological semantics will be used that will
    > enable a search engine to perceive concepts beyond words and
    > retrieve results with meaningful equivalents.

    The problem with applying ontology based search is the disambiguation,
    i.e. the mapping from natural language words (terms) to the unambiguous
    nodes of the ontology (concepts). Automatic disambiguation needs to be
    pretty good in order to help in search, but this is unfortunately still
    an open research problem.

    > 3. The first time that search results will include highlighted best
    > sentences as a result of semantic analysis rather than bolded
    > keywords as a result of finding incidences.

    This prediction seems to mix presentational issues (bold layout) with
    processing issues. The problem with the former is that flagging a whole
    sentence bold perhaps isn't well liked, as it could already have been used
    with current technology. The problem with the latter is what exactly is
    meant by "semantic analysis" - "deep" automatic natural language processing is
    still a very costly operation and may not be an option as early as 2007 to
    be applied to a whole Web index.

    > 4. The first time that a single query will bring a gallery of
    > results equivalent to running multiple queries about the
    > meaningful variations of the same topic.

    We would not notice this, since it would be carried out internally.
    However, this processing intensive step could be (preferredly) replaced
    by result-equivalent change in the ranking algorithm.

    > 5. The first time a search engine will let users evaluate answers
    > on the spot by displaying uninterrupted and coherent text
    > snippets, often letting searchers forgo having to click through
    > to links and saving time.

    Giving answers is certainly an emerging trend, cf.
    http://www.infonortics.com/searchengines/sh05/slid es/leidner.pdf
    But it may last longer than one year to become pervasive.

    The repeated mention of snippets seems to suggest that the author of
    this set of predictions has found fault with snippets and considers
    this a priority, whereas most people - at least on desktop PCs - seem
    to be okay with the way results are summarized today.

    > 6. The first time a search engine will have a dialogue utility that
    > will help point out best answers or suggest a Gallery for a more
    > engaging human-like search experience.

    Further work in interactive search is certainly ongoing, in some sense
    a dialog feature is already operative, as real search engine logs show
    that users keep re-phrasing and refining their searches all the time
    to converge to the results they desire.

    > 7. The first time a search engine's data will grow by detection of
    > new knowledge rather than by detection of new pages. Search
    > engine growth by knowledge will be the new direction for the
    > industry for 2007.

    This depends on a universally accepted notion of knowledge, and how to
    measure/acquire it automatically. Perhaps one of the strengths of modern
    search engines is that NO commitment to any kind of theory of knowledge
    has to be made, it works - for better or worse - because all it needs
    are strings.

  42. Kozoru did semantic search and no one cared. by jdp816 · · Score: 0

    I'm good friends with a guy who worked for Kozoru just before they folded up. I've got a couple of their cluster machines running right here next to me, heck I've got one for sale on Craigslist right now... Anyway, they had natural language search. It worked. It wasn't vaporware. Want to know what the #1 problem with their system was? The users. Most people are so mentally tied into keyword searching that they wouldn't believe that they could actually *ask the system a question* and get a meaningful result. They even built a chat-based system, called BYOMS, where you cold define what particular sites you wanted to search for various topics. The tech was great, it worked, and no one with money cared. http://www.google.com/search?q=kozoru http://www.google.com/search?q=byoms

  43. Dragging Results by Siker · · Score: 1

    I think human input will definitely come into play in the future of search. Ultimately you can make machines very good at recognizing spam content, but how can you possibly identify what people really want to see without asking them?

    The way forward is to allow people to reorder their results and to delete spam results. This way we'll have a search engine that actually learns what people want and acts appropriately. Sites like Digg and Reddit are on to something in this sense. They use 'swarm' technologies to determine what is most relevant in a certain narrow category at a certain time.

    Just like another commenter mentioned there is already something like this: Yoople. A couple of months back I wrote that Google's Searshmash secretly was playing around with something like that too.

  44. Re:Still waiting for the following paradigm shift. by dodongo · · Score: 1

    Right, but there's a difference between the pages I've searched for from Google and clicked out on vs. all pages I've seen before, period.

    My first comment in this thread was about being uneasy with Google tracking all out-clicks to your user name. I was then reminded that for ultimate efficacy, this system would have to record all pages you've visited, to which I responded that it was scarier still.

    But I guess expected an AC to go back through parent comments is far too much, so I'll just pat you on the head and send you on your way.

  45. In Googlis non est, ergo non est. by MikeFM · · Score: 1

    Don't you hate when marketing BS is written about as if it were something significant. If it ain't on Google then it isn't even on a website that knows how to get itself properly indexed.

    --
    At what price learning? At what cost wisdom? The price is a man's peace of mind, and the cost is his life.