Domain: alltheweb.com
Stories and comments across the archive that link to alltheweb.com.
Comments · 236
-
Re:Penalities for Violations?
-
NYC plane crash -- cnn.com down
Just saw a press release about the News search feature on Alltheweb. Very handy, now that cnn.com is down (at least it has been for me during the better part of this day...)
Check out this news search for 'queens plane crash'. Hit #1 was updated 27 minutes ago. A tragic day with another tragic event, but a nice discovery that I wanted to share...
Sandman -
NYC plane crash -- cnn.com down
Just saw a press release about the News search feature on Alltheweb. Very handy, now that cnn.com is down (at least it has been for me during the better part of this day...)
Check out this news search for 'queens plane crash'. Hit #1 was updated 27 minutes ago. A tragic day with another tragic event, but a nice discovery that I wanted to share...
Sandman -
Some google alternatives
alltheweb.com Let's you search the web, ftp files, images, MP3s and Videos. The results are quite good though not as good as in google. It has support for more languages than google and I use it exclusively to search pages written in my native language(which isn't supported in google). Has the best ftp search.
ResearchIndex Nice scientific literature search engine. Lets you search not only documents but also citations. Keeps cached copies of the documents in multiple formats. Can show related documents or other documents viewed by users that viewed the current document.
vivisimo.com Groups the found documents by topics and subtopics. Nice interface and the sudgested topics are quite reasonable.
www.wisenut.com Similar to vivisimo, but vivisimo(IMO) is beter.
www.searchshots.com Lets you see screenshot of the found pages. Too bad has a content filter and the results are not very good.
www.teoma.com Simple interface. Can group the results by their topic.
ditto.com An image search engine.
webshots.com Not exactly an image search engine. But I've had much better luck finding images there than in any image search engine. Requires you to download a program (windows only) that puts the images as a wallpaper. -
All The Web
All the web does a good job of picking up Google's loose ends. (Everyone's gotten those completely random sites google sometimes throws your way) When I use it in conjunction with Google, I can almost find anything. (It's ad free and really fast too...)
-
AllTheWeb
Yesterday, I posted a reply telling about a Norwegian company (Fast), which have developed the search engine called AllTheWeb. AllTheWeb seems superior to any other search engine on the net when it comes to the hardware being used, as they are using specially developed hardware for searching through huge amounts of text (and other media).
digitoday.no (Norwegian only) today reported about further enhancements of the AllTheWeb search engine. I have tried to do my best in translating some of the article into English;
- Fast will soon change to a new and improved crawler which will find three times as many web pages. That way, Fast will soon cover the whole Internet. ...
Fast estimates that the web today consists of billions of page, but by removing duplicates and "garbage" the number will decrease dramatically. They estimate that their search engine will cover 1.8 billion web pages before christmas. ...
One of the biggest improvements is the ability to index dynamic pages. Dynamic pages are web pages you can only access by pushing a button, choosing something from a menu, or filling out information in a form. ...
The whole article (in Norwegian) can be read here. I'm not a translator, and my English is pretty bad, so you are warned. :-) -
Better hardware than GoogleI really like Google. Their search engine is fast, and it covers a lot;
- Cache: Means that we are able to visit a site after it's been slashdotted.
- Relevance: Google's "relevance technology" is great. Find related sites, and find only pages related to your query.
:-) - Not only web pages: Google doesn't only search for web pages, but also PDF files and images. More search engines should have had features like that.
Why? Well. They have developed special hardware to do their search. And it's damn fast (that's where they got the name, I guess). However, the software running on their hardware isn't as good as Google, and I really wonder why...
My conclusion: The software Google is using should have run on AllTheWeb's hardware. That would have been one hell of a search engine.
No I don't like it, either... -
All The Web?
Anyone try http://www.alltheweb.com??
I do not know how it stacks up to google but I know that it is pretty darn fast. -
www.alltheweb.com Works for MeI tend to use the Advance Search feature at www.alltheweb.com, as it brings up more hits than Google.
Of course, Google is now the only player in town for Usenet Searches since they bought Deja (and if they're reading this, I want them to bring back Deja's hierarchical nesting features...)
-
www.alltheweb.com Works for MeI tend to use the Advance Search feature at www.alltheweb.com, as it brings up more hits than Google.
Of course, Google is now the only player in town for Usenet Searches since they bought Deja (and if they're reading this, I want them to bring back Deja's hierarchical nesting features...)
-
fast.no has had one for a long time
There's been one available for a long time at multimedia.alltheweb.com, brought to you by the fine folks at Fast Search and Transfer ASA.
-
alltheweb.com has better image searchAllTheWeb.com has an image search engine feature that is just as good and in fact the interface is much cleaner.
Asim
-
Google Isn't That Good
Google isn't that good. Try alltheweb, which is slightly better in searching capabilities and about a gazillion times faster.
-
Re:doesn't everyone use Google these days?
Alltheweb. That's a new one to me - never heard of it. I guess I'll try it for a while.
-
Take the US spin off...Again, a US centric
/. article - if Katz would take his USian specs off and done some digging, it would have been nice to see where the rest of the world is on this. And Katz neatly ignores the issue if a company has incorrect data on you.
Here in the UK we _do_ have rights of data access. For once our Government has done something quite well. The Data Protection Act Was originally drafted in 1984 (the irony!) and has been amended a few times since. In action, it works - I have requested credit profiles for myself for the nominal charge of UK £1, and received the information quickly and without quibble.
Pity about the mess that is RIP, though... Stand has a good writeup about it...
Strong data typing is for those with weak minds.
-
On a related note...I noticed last night on my dialup account that clicking on search results from alltheweb.com first sends an http:// request back to that server, before connecting to the server that has the hit you want to see.
Also, alltheweb doesn't seem to have any privacy statement on the site at all....
Now, I am not naive enough to have an expectation of privacy online, and i don't think that anyone else should, either. And, I know that - legally, at any rate - there isn't much that a company has to tell us about what they do. So, where does that leave us? If you have concerns about your privacy then you should either get and stay offline or collaborate with others to identify and combat the worst offenders.
Rant off.
-
other search engines comparedTerm : Phoebe Cats Nude
Search Engines : Quite a few
Synopsis : Nobody's perfect, don't always go off #1-3 and remember, anomolies exist in all things. I'm fairly certain such an article could be written about any search engine and the fact that this is so noted that google is not perfect, it must be pretty close. If we saw such an occurrence with other search engines we wouldn't think twice - it is expected. Thank you google for raising the standards!google
http://www.google.com/searc h?q =phoebe%20cates%20nude
altavista
http://www.altavista.com/cgi-bin/query?q=phoebe%20 cates%20nude&kl=XX&pg=q&a mp;a mp;a mp;a mp;a mp;a mp;Translate=on
yahoo
http://search.yahoo.c om/ bin/search?p=phoebe%20cates%20nude
raging
http://r agi ngsearch.altavista.com/cgi-bin/query?q=phoebe%20ca tes%20nude
lycos
http://www. lycos.c om/srch/?loc=searchbox&query=phoebe+cates+nude
hotbot
h ttp://hotbot.lycos.com/?MT=phoebe+cates+nude&SM=MC &DV=0&LG=any&DC=10&D E=2&AM1=MC
go
http://www.go .com/Split? pat=go&col=WW&qt=phoebe+cates+nude
excite
http://search.excite .co m/search.gw?search=phoebe+cates+nude
askjeeves
h ttp://w ww.ask.com/main/askjeeves.asp?ask=phoebe+cates+nud e&metasearch=yes
alltheweb
http://www.alltheweb.com/cgi-bin/search?exec=FAST+ Search&type=all&query= phoebe+cate s+nude
goto
http://ww w.goto. com/d/search/?type=home&Keywords=phoebe+cates+nude
-- .sig -- -
A better way?The last sentence says it all...
"Otherwise we won't be able to find anything on the web"
I know the gags about all web searchs returning at least one pr0n link, and they are exaggerated to a certain extent. There is some truth in it, though.
But the lack of honesty on the internet appalls me. As an old schooler (I remember gopher, before this new fangled www thingy), I'm used to being able to find things quickly, and only find things that are relevant.
With the proliferation of personal home pages, unscrupulous company webpages exporting lying meta tags, I'm disappointed in the web as it has turned out.
Pr0n sites are they best and worst thing on the internet. Best, not for the reasons you'd think, but because the pr0n sites drove the need for high quality image and moving picture compression, and provided the need for higher bandwidth availability. But now, they are ruining our internet, along with the spammers. That said, AllTheWeb does an excellent job at stopping "adult" links being returned from a search, and it's bastard quick, too.
Why not _force_ all pr0n sites to have a .xxx TLD? Easily spotted by search engines, web filters and all. So all of us who can handle the internet in it's glorious tangle can find what we want, and avoid what we don't. And segregate other sites into more stringent TLD's - like .tec for tech sites, .sel for ecommerce, .hmr for funny sites etc etc etc. Everyone benefits that has a legitimate site, effectively killing these ad-laden dross sites that popup 10,000 more windows when you try and close them. The only problem is policing this effectively - maybe have an optional membership to a regulatory agency, which allows you to publish a "certified" tag, which can be a criterion for web search engines. The only problem again, is how this agency supports itself financially.
What do you guys think?
Strong data typing is for those with weak minds.
-
AllTheWeb
After your advice, I just visited AllTheWeb (funny joke
:-) and launched a query on Google (891000 docs found, 0.13 seconds).
This therefore lacks a functionality (that Google also lacks BTW):
av.com usually gave a last checked/changed date for each URLs, I just loved this.
BTW, I am now back from av.com and it seems they also got rid of it...
What a pity.
Is this an implicit way to explain it was too much data to handle? :-(
Could somebody tell me how to display dates in Google ?
-- -
Re:New Google addictIf you like Google because it has a lite image load and a few adds you should try All The Web. Working as a programer for Lexis-Nexis as a programer I have become VERY familiar with search engines.
Some benifits and cool things of All the Web are:The low number of images(two small ones)
FAST search time ( my average search time is
.12 sec(it tells you)All the web tells you how many documents/web pages it finds
I have found that All the web is superior to google. -
It's A Very Common English Idiom
Check th is out, if you doubt it.
-
Point of reference..
Google being my primary search (and the beauty that it is)... here's another for those looking for a backup:
FAST Search
.......should the need arise.
-
Computer noise, it's a matter of cost and tech
The power supply and cpu cooling systems are usually the main problem with respect to noise. The fans themselves, and the necessary openings for airflow are what makes the computer noisy.
You could make a much quieter computer if you sealed the case and instead used peltier plates, heat pumps and heat sinks. This method is just more expensive than using cooling fans, which I expect is why off-the-shelf computers aren't made like this already.
-
Re:slashdot guys
-
Re:Got your chocolate chip cookie thingJoking aside, I think that it is probably a typo issue, for example the following search on alltheweb.com brings up plenty of porn inside the first 50 with the search terms:
choc chip cooky
It also brings up plenty of recipes as well, but a relatively understandable misspelling lets the porn noise floor through as it does with all searches as you get past the default matches. -
Re:Got your chocolate chip cookie thingJoking aside, I think that it is probably a typo issue, for example the following search on alltheweb.com brings up plenty of porn inside the first 50 with the search terms:
choc chip cooky
It also brings up plenty of recipes as well, but a relatively understandable misspelling lets the porn noise floor through as it does with all searches as you get past the default matches. -
Re:c. all of the above
Apparently people are a little on edge around here lately.... Legality issues aside, there are a growing number of search sites that are "endorsing" pirated music by way of having huge indexes of MP3s, many of which, undoubtedly are pirated. One that sticks out in my mind is the MP3 search on Lycos, powered by Fast Search and Transfer ASA, the same people that brought us AllTheWeb. Then there is Napster, an MP3 lovers dream (sorry to keep referencing this ad nauseam.) There was much publicity, at least among the
/. Community about the suit filed against our buddies over at Napster, no doubt it'll get thrown out (it may have already, I'm not sure) because there is the concept that Napster is simply providing a vehicle for search, location and subsequent download of MP3s, and never takes physical, or even legal possession of the data. Basically, they, and other such sites, services and software, are conduits and, in acting as such, are not violating any law. I think there is a distinct possibility that we might begin to see the tides turn on these types of issues. After all, how many times in the past year or two have we been hearing about class action suits against other types of "conduits" for other activities. Gun manufacturers, tobacco companies, automobile manufacturers. Although there is a substantial difference between manufacturing a weapon used in a deadly crime and providing an avenue to procure a music track that you do not own and possibly do not intend to pay for, where is the line drawn. I am not predicting that there will be huge class action suits by large numbers of music artists or record concerns, but noting a potential for some legal actions to at least by subject to closer scrutiny by the judicial system. Until then, there is going to be a lot of back and forth on issues such as encryption technology, legal liability for sites linking to and referencing certain MP3s and other media files, and there's going to be a lot of money to be made as an intellectual property and copyright attorney. The industry is too new for self regulation, whether this means that the government will step up its battle or technological advances on protection technology will speed up to stop the gap remains to be seen. All of this of course, IMHO -
Re: Technology thieves.
> (noone else has done it yet, and the big search
> engines have been out long enough to come up
> with it on their own).
That is not true. Inktomi has been doing something similar for many months, and now Fast is doing it too. Click on "web pages" at yahoo's search results for terms like "mp3" or "games" and you will see for yourself. -
New method?
I was beginning to think this as well - Yahoo, Infoseek, Hotbot and the like just don't seem to find the good stuff anymore. If there's content held in a database and a page is generated on demand by an active server page or CGI script for example then the page doesn't come into existence until the user requests the information.
Perhaps it's time for search engines to search by topic and direct to a site related to the enquiry. The individual sites could then have their own search utilities to trawl through their databases? Not sure if this is feasible or not though.
In terms of good search engines though - Google and AllTheWeb.com seem to find good content whenever I use them. The problem I guess is that you don't know what you're missing until you find it by some other means, and neither do the search engines. -
Re:Why, I much prefer Google, except for babelfish
I use a combination of northernlight, google, and alltheweb. They work great for finding all that I need.
-
Re:Lycos is a Bullshit Scam
But Norwegian scientists and Open Source heroes will fight the good fight! They won't put us back in the tin!
-
interestingAccording to my favourite search engine, doing a search for Pez:
53579 documents found - 0.0040 seconds search time
... which is obviously a non-starter for these guys. My guess is that they're covering their back as such, for a future time when either some online candy merchant tries to latch onto pez traffic or if they find porn sites that use pez to misdirect children. I'm no lawyer, but that seems like a good thing to me. -
The Elvis IndexI did a whole mess o' these a few years back, using 'elvis' as the calibration standard: the Elvis Index.
I just recently started doublechecking those old numbers using FAST's 200M-page index... so far the rankings are still pretty comparable.
-
Re:all and then some?>Ok so repeating an effort to find the various purported etymology of the word "strawberry" I
>searched with +etymology +strawberry +origin on both yahoo (my standard) and alltheweb.
>Yahoo found 60 while alltheweb found 117, but a number of allthewebs' finds were xxx sites!?
>How many xxx sites actually use the word etymology and if this is more do we really want more?I tried "CalTrans Bridge Design Manual" in Google!, Inference Find, and All The Web. Google gave me many links to CalTrans sites and some associated ones. Inference Find found the CalTrans sites and a bunch of tangentially related sites. All the Web found a bunch of CalTrans sites and related sites, but numbers 19 and 21 were porn sites, and putting CalTrans at the end of the string got me more porn sites.
Not a terribly useful site, IMO.
-
Re:all and then some?
>Ok so repeating an effort to find the various purported etymology of the word "strawberry" I
>searched with +etymology +strawberry +origin on both yahoo (my standard) and alltheweb.
>Yahoo found 60 while alltheweb found 117, but a number of allthewebs' finds were xxx sites!?
>How many xxx sites actually use the word etymology and if this is more do we really want more?
I tried "CalTrans Bridge Design Manual" in Google!, Inference Find, and All The Web. Google gave me many links to CalTrans sites and some associated ones. Inference Find found the CalTrans sites and a bunch of tangentially related sites. All the Web found a bunch of CalTrans sites and related sites, but numbers 19 and 21 were porn sites, and putting CalTrans at the end of the string got me more porn sites.
Not a terribly useful site, IMO.
-
5031765 [non-unique] documents found
I searched, on a whim, for debian cd images (try it!) and the 2nd, 3rd, and 4th hits were all the same! Doesn't instill the greatest confidence in their searching algorithms, does it? No thank you, I'll go back to www.google.com, hit "I'm feeling lucky", and get sent directly to what I was looking for...