Slashdot Mirror


New Search Engines

An anonymous reader wrote in to say "It seems that there's a new company out there called Fast Search and Transfer which is competing with Inktomi. They have a demo online at www.alltheweb.com and their engine seems to be ultra-fast. News about this is available here. Try out the demo, it is awesome what these guys have done." It is fast, but so far I've not had as good luck searching as with other engines. And the speed is probably largely due to the sparse HTML. But its not bad.

27 of 54 comments (clear)

  1. Re:Wow by Madoc · · Score: 1
    ``Size isn't everything; but if you can have both size and relevancy, then you are able to please all types of users''
    It's fast, but I found that it's relevance (and coverage) leaves a lot to be desired. It really pales in comparison to something like www.google.com or www.altavista.com. The similarities in setup to the latter are interesting as well (is it going to be a theme that hardware companies showcase their talents using search engines?).

    As soon as they get much closer to their "all the web" idea, I'll take another look. But for now my default engines will stay.

    --
    Anonymous Cowards: Proving daily that human beings are innately jerks.
  2. Holy crap! by gavinhall · · Score: 1

    Posted by FascDot Killed My Previous Use:

    I clicked the button and I had results before the mouse had fully un-clicked.

    That's damn fast!

  3. How could it get so fast? by gavinhall · · Score: 1
    Posted by sirtwist:

    There's gotta be a reason apart from a fast machine. I have this feeling that the database ain't so big, making the searches incredibly speedy.

    Still, it's a great speedy thing to see.

    1. Re:How could it get so fast? by gavinhall · · Score: 2
      Posted by sirtwist:

      Dammit.

      I just read the article. At 80 million documents, I don't think my assertion is quite as valid.

      Wow!

    2. Re:How could it get so fast? by dblock · · Score: 1

      I guess it's purely cache methods. No database queries any more, pure hash hits in RAM.

      cheers

      dB.

  4. Prone to spamdexing by Kurt+Gray · · Score: 1

    Try searching for "shareware" and check out the
    results -- seems they don't discourage keyword
    spammers!

  5. Re:ftpsearch.lycos.com? by knuth · · Score: 1

    It looks to be by the same guys who created Fast FTP Search... which went downhill when it was bought by Lycos.

  6. Not so FAST... by martian · · Score: 4


    As someone else commented, these people wrote the search engine behind ftpsearch.lycos.com. Which is fast. There are a few more reasons, apart from sparse HTML.

    This engine asks for no cookies

    The output is not in a table, so you see the results as they arrive in your browser, without having to wait for the whole table (in lynx it makes no difference :-)

    The load is not very high yet, probably.

    But having seen FTPsearch in action for the last 4-5 years, and having seen it always return results quickly, it wouldn't surprise me if alltheweb stayed fast.

    --
    "There are two major products that come out of Berkeley: LSD and UNIX. We don't believe this to be a coincidence."
  7. Fast = FTPSearch, MP3Search et al by hta · · Score: 2

    This technology isn't very new.
    Parts of it at least were developed at the Norwegian Institute of Technology, and used in the search engines FTPSEARCH and MP3Search.
    But it's really nice to see them get the publicity they deserve.

    Harald

  8. Google is still better. by stevenj · · Score: 3
    I don't care how fast the search engine is, if it still returns 10,000 useless links. When are the other search engines going to realize that intelligent relevancy ranking is the single most important feature of a search engine?

    The only search engine I know that does a good job at this is Google. It is so good at finding relevant sites, I don't care if the response time is occasionally a little slow.

    (Google uses a nice algorithm where they gauge the "importance" of a page by how many other sites link to it.)

    --
    If a thing is not diminished by being shared, it is not rightly owned if it is only owned & not shared. S. Augustine
    1. Re:Google is still better. by LucaL · · Score: 1

      The only gripe I have with google is that it's not always up to date. It seems a couple of month behind current events.

      On the other hand, it's so good it's uncanny. If you search for "Linux", the first links it returns are:

      Linux.org, Redhat, Debian.org, Linuxhq.com, ...

      Try it on Altavista :)

  9. Beautiful! by jonbrewer · · Score: 1

    Damn this engine is fast! And I love how the searching syntax is spelled out on the first page. This is really phenomenal. Certainly my new default search engine.

  10. Wow by pluteus_larva · · Score: 1

    that alltheweb.com is *fast*. good coverage too.

  11. Duplicates? by Bob+McKenzie · · Score: 1

    Egosurfing, I found it quite refreshing to NOT see my old web pages that died 4 months ago, yet found a large number of hits to people I'm apparantly related to. (gasp) Still no sign of my current web page tho. Curious. More time perhaps.

    But did anyone else find there's number of identical links? (I'm used to metacrawler, so maybe this is normal for search engines?)

    --
    Friends help you move. Real friends help you move bodies.
  12. Re:FreeBSD by Jae · · Score: 1

    I wonder if these means that Dell is going to start supporting FreeBSD on their machines as well.

    --
    -Jae
  13. FreeBSD by Agnomen · · Score: 1

    Interesting to notice, their HTTP headers proclaim the site is running FreeBSD. It is nice to see FreeBSD and Linux being used in large commercial applications.

    1. Re:FreeBSD by DonkPunch · · Score: 1

      "...have constructed an advanced search capability using a high-performance, low-cost software/hardware combination...."

      That little quote made me wonder what they were running. Why didn't they just say? They didn't seem at all shy about mentioning the Dell 4300s. :)

      If the web servers are running FreeBSD, I wonder if it's also powering the database.

      Maybe I'm reading too much into their vagueness here, but I tend to think:

      #ifdef PRESS_RELEASE
      #define HIGH_PERFORMANCE_LOW_COST Free_OS
      #endif

      (Sorry about the double-post. This is the one I meant to send.)

      --

      Save the whales. Feed the hungry. Free the mallocs.
  14. CNET Spamdexing??? by breser · · Score: 1

    I find this really hard to believe but CNET is doing spamdexing.

  15. Re:ftpsearch.lycos.com? by cce · · Score: 1

    yeah, off the bat i recognized the logo from the old ftpsearch.ntnu.no, now ftpsearch.lycos.com. are they a part of lycos, is lycos partnered with them, or what?

  16. Hardware-Based Searching by scottsevertson · · Score: 5

    The specs on the search engine are available at http://web.fast.no/product/search/d et.asp?id=34.

    The press release doesn't exactly scream it out, but the search engine is actually just a little bit of software stuck on top of some pretty neat custom hardware. They call their chip the FAST PMC (Pattern Matching Chip), and their server is just your average (well, sort of average) high end server, with a buttload of those chips stuck on PCI cards.

    The specs on the PMCs are available at http://web.fast.no/product/PMC/det.asp ?id=52.

    FAST claims 100 MB/sec throughput on each chip, and each card has its own RAM (from 8 MB to 2 GB). The chips actually run at 100 mHz each, and even have support for RegEx matching (slightly limited).

    From the specs:
    A typical configuration will contain 4 to 8 plug-in cards per search node, and 16 or 32 chips on each card.


    Overall, I'm pretty impressed - putting search capabilities into hardware is a pretty good idea, especially since so much of a modern processor is geared toward things like Floating Point calculations, which doesn't help text searching at all.


    Scott Severtson
    Software Developer
    Auragen Communications
    scotty@auragen.com

    --


    Scott Severtson
    Senior Architect, Digital Measures
  17. Re:Lightning by oyvindmo · · Score: 1

    If a search takes exactly 0.0050 seconds, it's because it's cached. The engine caches all searches for some time, and if someone does the same search again, the results are simply fetched from the cache.

  18. Database Servers? by DonkPunch · · Score: 1

    "...have constructed an advanced search capability using a high-performance, low-cost software/hardware combination...."

    That little quote made me wonder what they were running. Why didn't they just say? They didn't seem at all shy about mentioning the Dell 4300s. :)

    If the web servers are running FreeBSD, I wonder if it's also powering the database servers.

    Maybe I'm reading too much into their vagueness here. It seems to me that companies which have close ties to MS (like Dell) are a little reluctant to trumpet the virtues of other operating systems too loudly. As a result, I tend to think:

    #ifdef PRESS_RELEASE

    --

    Save the whales. Feed the hungry. Free the mallocs.
  19. cool by 13th+seer · · Score: 1

    I found unique info to plagarize for the paper I'm writing.

  20. FAST search and transfer is what? by Hagbart+Celine · · Score: 1

    FAST is a norvegian company that specialize in fast algorithms for search and also transfer of video and images. Since FAST is a research company they are interested in selling their technology, not productifying it themselves. All the web is a marketing site for FAST technology and is probably going to be closed down after an eventual sale (according to norvegian newspapers).

    The company has a couple of strange fellows working for them, there is former archeologist, medical doctors and statisticans. Guess a combination of their speciality fields spurred a really strange, but fast, algorithm.

    Hope that will answer som questions.

    By the way, the URL to fast is www.fast.no.

  21. Narcissist's dream engine by Tofuhead · · Score: 1

    I just did a search for my real name and came up with a ton of accurate links, more than any other engine in the past. Hot damn, ain't that fancy? =)

    What can I say? I've bookmarked this engine.

    --
    It is still the dark of night.
  22. Nice, but I just want a search engine! by dblock · · Score: 1

    Nice, fast powerful, whatever... yet another Altavista with maybe one day more relevant results.

    Still, for us, common mortal beings and small company sites (50'000 to half a million pages), maybe a decent accessible search engine like Alkaline is more than enough! It's free!

    cheers

    dB.

  23. Re:database in memory? by dblock · · Score: 1

    That's how Alkaline works. But for now an index of 179734 distinct word forms and 21309 pages eats ~40 MB together with the engine and serveral fast search structures. It grows slowly, so for 40'000 pages it will not get over 50-60 MB.

    Anyhow, that means your next purchase of the first terabyte of RAM will make the shop happy :)

    cheers

    dB.