Slashdot Mirror


A Search Engine For The Slower Net

Makarand writes "According to this BBC News article researchers at MIT are developing a search engine for people using the web on slower net connections. The software will e-mail queries to a central server and receive the most relevant webpages from the search results by e-mail in a compressed form. Since the program is too big to download over a poor net connection it will be mailed on CDs to libraries for people to borrow and install. They are also considering trying to persuade computer sellers in developing countries to install the program on machines."

309 comments

  1. Them Modem Linkers by inertia187 · · Score: 4, Funny

    About them Modem Linkers,
    ain't they kinda odd?
    Goin' on the net,
    with they little baud.
    Look at all those Modem Linkers,
    what a thing to see.
    Web sites come up really slow,
    get's lousy Voice/IP.
    Internet at low bit rates,
    what a dawgon mess.
    Load a web site, take a break,
    while 'pache mods compress.
    How to be a Modem Linker,
    don't need a ticket.
    Get a local ISP,
    dial up and link it.

    --
    A programmer is a machine for converting coffee into code.
    1. Re:Them Modem Linkers by Anonymous Coward · · Score: 0

      You've gotta love Google.

    2. Re:Them Modem Linkers by Anonymous Coward · · Score: 0

      Yeah they've been doing something similar to this for a while now. The technology was actually a precursor to that google labs thing where you can call up on the phone and search via voice recognition. Interesting stuff.

    3. Re:Them Modem Linkers by ncc74656 · · Score: 1
      That's funny. However, there are some valid reasons for doing this, and it's not just MIT who is looking at it.

      FOAD, troll. Take your goatse.cx link (verified with Lynx) and shove it where the sun don't shine.

      --
      20 January 2017: the End of an Error.
    4. Re:Them Modem Linkers by inertia187 · · Score: 2, Informative

      This is in reference to "Them Poems" by Mason Williams, circa 1960's. In and of themselves, they have little if anything to do with geekness.

      Personally, I'm tired of "In Soviet Russia ..." form of humor. You might know I've even tried it. Always a let down in the karma department.

      Yes, it is highly excessive. There's probably something wrong with me. However, I'll get bored of this very soon, and I'll move on to other methods of Karma gathering.

      --
      A programmer is a machine for converting coffee into code.
  2. Bah to that. by grub · · Score: 1, Funny


    Use lynx or w3m and search on google like the rest of us, ya Nancy-boys..

    --
    Trolling is a art,
    1. Re:Bah to that. by tommertron · · Score: 0
      Funny, but true.

      Google is (mostly) text-only. And they actually have a mandate to make sure that their pages will load quickly, even on slow modem connections. Is it really necessary to send CDs out for people to install when the Google flagship page is less than 30K?

      --
      Random rants about technology: http://technorants.blogspot.com
  3. Slow people have rights too by Anonymous Coward · · Score: 2, Funny

    I know the Internet is complicated - but there's no need to pick on slow people.

    1. Re:Slow people have rights too by Anonymous Coward · · Score: 0

      Maybe we should have a user interface that these slow people could lick to move around. Instead of "GUI" it would be called the "Licky". And when they want to go to a specific website they could laugh like a horse. Actually, I think I'll chair a committee to develop this API under the Java Community Process. A lot of Sun employees could benefit from this technology.

    2. Re:Slow people have rights too by rmadmin · · Score: 1

      I was on 56K untill 2 months ago when I picked up 128K DSL for the same price.. Google is fast enough for me on that. I do sit on a T1 at work, and use google alot there. And I have no problem going home and using Google on my "slower connection" People just need to learn fscking wait.

      Anyways.. We should pick on the people with slow sites that need to move their webserver off of their momma's 14.4 and get it on something a bit faster. :-)

    3. Re:Slow people have rights too by shellbeach · · Score: 1
      Yeah? Well, until less that two years ago I was using google from a 14.4. And it worked just fine! I'm currently using a 56kbps modem, but I only got that a few months ago, upgrading from a 33.6 :) I assure you, the faster speed is appreciated, but the net's perfectly usable with slower modems (unless all you care about is flash animations and other such hideous abominations ... :)

      My big question is - are there really people in poorer countries who are using less than a 14.4??

    4. Re:Slow people have rights too by evalhalla · · Score: 1
      are there really people in poorer countries who are using less than a 14.4

      Connection through an handphone is still around 9 kbps or somethink like that, at least in some cases.

      Anyway I don't think that the big problem in third world connections is the fact that it is really slow (i.e. under 14.4), probably it is more a matter of unreliable connection, condivided by quite some people.

      What I don't understand is why the program sends compressed pages via e-mail, when in this way their size grows (comparing with compressed pages via http 1.1), and you can't resume if the download fails.

      Wouldn't it be better to make use of google, maybe the google cache, and a proxy-like thing that simply receives the request, in the night searches google, downloads the required pages - compressed - and allows people to see them on the next morning?

  4. A better idea by Anonymous Coward · · Score: 5, Funny

    Maybe we could have all webpages categorized by a number, something like 800 for science or whatever, and then we could have a filing cabinet with index cards in it. Then, people could open the filing cabinet, see a number for the page they want and then go directly to the page.

    1. Re:A better idea by Anonymous Coward · · Score: 0

      I'm not sure I want to flip through the 3-billion or so odd pages that google has indexed in the "google card catalog"

    2. Re:A better idea by Anonymous Coward · · Score: 0

      You might as well because you're too slow to get a simple sarcastic joke.

    3. Re:A better idea by nsideops · · Score: 1, Interesting

      It's modded funny, but there is a good idea deep in there. I've basically been thinking about this idea for years. Not quite like a card catalog system as he described, but more like an ever changing directory. Of course it would require quite a bit of man hours to oranize/update/arrange, not to mention linking to the many many different categories and sections one page could belong to. Every day I run into something that I need to search for, but don't really know what exactlly to search for or where to go. I get lots of similar links but it usualy takes about an hour to narow them down to something usefull. I admit, this is part of the greatness of the internet, bucause quite often in this hour search I find valuable information that I need that I wasn't even looking for or stuble upon something that I didn't even know was out there. All of the sites that attempt to do this now are very limited and usually laid out quite poorly. It would be a massive project, but I think it would be quite usefull.

      --
      Teach someone to use the net and they won't bother you for weeks; show them Slashdot and you may never see them again.
    4. Re:A better idea by dago · · Score: 0

      It has already been imaginated and done partially
      about 100 years ago ! (at least the basic idea)

      If you go to the mundaneum, in Mons(.be), you'll have access to the card catalog system.

      --
      #include "coucou.h"
    5. Re:A better idea by Anonymous Coward · · Score: 0

      Then of course the obvious optimization, the file cabinet could contain a copy of the web page that could be e-mailed to the user directly. We could call it Apaper web caching.

    6. Re:A better idea by muffel · · Score: 2, Funny

      Funny how there always seems to be a relevant Dilbert.

      --

      bla
    7. Re:A better idea by Uber+Banker · · Score: 1

      The problem I have with that is where something affects both chemistry and biochemistry, or example. Or another thing affects economics, accountancy IT and finance.

      In the end... the http:... is the unique field and the search engine is the relational database these fit in. It is a good solution.

    8. Re:A better idea by An+Onerous+Coward · · Score: 1

      The cartoon is taking way too long to load up. Could you please e-mail it to me?

      --

      You want the truthiness? You can't handle the truthiness!

    9. Re:A better idea by Anonymous Coward · · Score: 0

      No, but I'll put it on a CD so that you can access it at your local library.

  5. Because Google is too slow? by Anonymous Coward · · Score: 0

    Works fine on my dial-up at home.

    1. Re:Because Google is too slow? by akiaki007 · · Score: 1

      Yes, but can you view all the results offline after the search is done?

      I didn't think so.

      --
      "Time is long and life is short, so begin to live while you still can." -EV
  6. How slow are their connections? by Santos+L.+Halper · · Score: 0, Interesting

    I mean really. I use dial up occasionally, and I can get my search results in 20 seconds instead of 2. What point does it serve emailing your search query off and waiting much longer for the results?

    --

    "Ask not for whom the bone bones. It bones for thee." --Bender
    1. Re:How slow are their connections? by jason0000042 · · Score: 1

      Well, if you read the article, you would know that people in developing countries don't have even constant access to dial up.

      And it can cost way more than you would expect for a few minutes on line in developing countries. My business has dealings with some people in Africa occasionally. Sometimes it's even too expensive to send an email.

      Plus, the lag between, say, Nairobi, and servers in the US, over dial up and steatite, can be way more than 20 seconds. I spent nine months on a team that duplicated functionality of some old crappy system that everybody in my office used, because some people in china were trying to use it over telnet, and it was too slow. Something like 6 seconds a key press.

      --
      i don't like my old sig.
    2. Re:How slow are their connections? by garcia · · Score: 1

      Not only that, but unless the information returned via CD includes the research FOUND that related to that search query how would this be beneficial? You would still have to search through the results CD and then download the research.

      I'm sorry but google.com usually gives me what I want (to the T) on the first page of results.

      This doesn't sound like something that would come out of the brains at MIT.

    3. Re:How slow are their connections? by Anonymous Coward · · Score: 0

      Yep, from brains that actually READ THE FSCKING ARTICLE, it comes.

  7. oh come on... by Anonymous Coward · · Score: 0

    I've been on a 28.8 modem and it wasn't that bad. This is just a way to get publicity.

    But if you're on something like a .5kbps connection then it might be worth it.

  8. And what timing! by mtrupe · · Score: 1

    Gotta love how software catches up about 10 years too late...

  9. What program? by drdale · · Score: 1, Interesting

    Could anyone else figure out why this requires a program on the user's end that is too large to be downloaded? Seems like all you need is an e-mail client, and instructions on how to format the information request.

    --
    This post is dedicated to all of those /.ers who do not dedicate their posts to themselves.
    1. Re:What program? by Anonymous Coward · · Score: 0

      Because it's going to be part of Mozilla 1.5

    2. Re:What program? by Thoguth · · Score: 1

      I was thinking the same thing! The only thing I can figure is, maybe it is part of some fancy all-in-one mail gateway and web caching system that gets installed on the server.

      --
      The requested URL /iframe/sig.html was not found on this server.
    3. Re:What program? by Anonymous Coward · · Score: 0

      Because the included version of gator has to have all the ads available locally.

    4. Re:What program? by kalidasa · · Score: 1

      Because they're users. The average user doesn't understand a listserv command, and just sends a message "Hey, could somebody unsubscribe me from this list?" to the posting (and not the request) address.

    5. Re:What program? by Anonymous Coward · · Score: 0

      You actually see messages that articulate? In my experience, they are more like "stop spamming me!!!1 if u dont stop i will report u!!!!!!!11 k thx bye"

    6. Re:What program? by BillThies · · Score: 4, Informative
      Hi, I'm a graduate student working on the TEK project.

      There are several benefits of having a TEK Client program instead of just using email. But first off, the client isn't that big -- the JAR file with the TEK classes is 125 KB. When we package it up with third-party libraries and an installer, it comes to 2 MB, and with Java included, it's 10 MB. It would be interesting to try to prune down this distribution to the minimal size -- for the prototype version, we have focussed primarily on the software's functionality.

      The TEK Client program is useful because it provides a seamless interface to browsing the downloaded pages. It operates as a web proxy: users adjust their browser to talk to TEK instead of the web, and then they can view pages just as if they were connected. The URL's appear as usual in the browser's "location" toolbar, and links on the page are functional. If a URL has been downloaded before, then it is loaded out of the local cache; if it has not yet been downloaded, then the user is queried to submit a request for that URL.

      The TEK Client includes a local search utility for searching the cache of downloaded pages. In this way, the user can build up a local library of information that is relevant to their community; for example, in a school setting, many searches could be satisfied using only the local cache due to overlapping interests of students.

      Also, the TEK Client is useful for tracking searches. In settings where connectivity is intermittent, searches can be enqueued during the day and sent at night (or when a connection is available.) The client also provides basic user management so that multiple people can share a public installation (perhaps using a single email address, which they might not own themselves) and still keep track of their own queries.

      In the future, we think there are a lot of features that could be added to the client. For instance, we could seed the client with other open-source resources, such as an atlas or encyclopedia, that could be used in conjunction with web searches. There could also be an "intelligent query builder" that helps construct Internet searches (for example, by checking spelling) before going through the time and expense of connecting and sending them off.

      Many more details about TEK are available from the TEK Homepage. We are currently moving our CVS source tree to SourceForge, so if you're interested in helping to improve the software, it'd be great to hear from you!

  10. Huh? by Anonymous Coward · · Score: 0

    what a really really dumb idea. really. i mean, WTF? what's the point?

  11. Perfect for those people... by fpp · · Score: 3, Funny

    ...still surfing the internet with their Commodore 64s and 300 baud modems!

    1. Re:Perfect for those people... by Anonymous Coward · · Score: 0

      It's perfect for me too! I'm using my Commodore 64 and a 300 baud modem!

      Oh wait... you post finally finished downloading. Nevermind...

    2. Re:Perfect for those people... by spacefrog · · Score: 1
  12. Hmmm. by teamhasnoi · · Score: 5, Funny
    Seems like a long time to wait for porn.

    http://www.mshiltonj.com/sr/

    1. Re:Hmmm. by Mysticalfruit · · Score: 2, Insightful

      Seriously... something tells me that the serious push for broadband wasn't so grandma could do her geneology research faster... Someone once said that every advancement in media technology only came of age because it could be used to transmit porn.

      Now, take a good hard look at your cubemate. You know what they do when they get home... and it's not BF1942... probably gives you good insight into their test bench naming conventions...

      --
      Yes Francis, the world has gone crazy.
    2. Re:Hmmm. by Anonymous Coward · · Score: 0

      That's OK, I've got a big bottle of benzocaine!

    3. Re:Hmmm. by Xerithane · · Score: 1

      Now, take a good hard look at your cubemate. You know what they do when they get home... and it's not BF1942... probably gives you good insight into their test bench naming conventions...

      You have to share a cube with someone? And it's most likely they spend their nights downloading porn? I really don't want to ever end up working wherever you work.

      On a more seriously joking matter, where the fuck are these people? I've worked in quite a few places and I've only met a few people with the neon signs hovering above their heads that say "Pornmonger." Yet, they seem like they are all over the place and I've just had the fortune to avoid them.

      At my job, we all work in a fairly large room. We have individual cubicles but it's "our group." Out of the 7 people in this room, not one could be thought of in that manner. Are these comments just a joke or do you people actually have to put up with those folks?

      --
      Dacels Jewelers can't be trusted.
    4. Re:Hmmm. by JDevers · · Score: 1

      I would think that most "pornmonger" types DON'T have the neon signs...more likely they don't tell anyone in the real world and possibly not even anyone in the virtual world...

    5. Re:Hmmm. by Anonymous Coward · · Score: 0

      Everyone is a pornmonger. You are a pornmonger. All men are pornmonger, and if you deny it then...you're not a man.

    6. Re:Hmmm. by jez9999 · · Score: 1

      So? I've got a big bottle of benzodiazepines!

    7. Re:Hmmm. by humming · · Score: 2, Funny

      I remember how long it could take to get home porn through ftp-mail back in my university days... But it was worth it, dammit.

      Ah, the time when we got USENET access... All those hours sitting and waiting for the 'You have new mail.' message was over and porn could be reached almost instantly at alt.pictures.celebs[*].

      --H
      [*] Name may not be accurate.

      --
      I'm too stupid to preview.
    8. Re:Hmmm. by Mysticalfruit · · Score: 1

      Actually, I have a real office with a door that closes with a little window and everything... My company has a policy against cubicles.

      --
      Yes Francis, the world has gone crazy.
    9. Re:Hmmm. by p3d0 · · Score: 1
      Seriously... something tells me that the serious push for broadband wasn't so grandma could do her geneology research faster...
      Funny you'd choose that example, because according to MacLeans magazine, "Genealogy has become second to pornography as the most popular use of the World Wide Web, with two million sites and counting."
      --
      Patrick Doyle
      I mod down every jackass who puts his moderation policy in his sig. Oh, wait a sec....
    10. Re:Hmmm. by chrylis · · Score: 1

      more likely they don't tell anyone in the real world and possibly not even anyone in the virtual world...

      Not sure about the "real world", but I heard from about eight of them in the virtual world this morning...

    11. Re:Hmmm. by crisco · · Score: 1

      using the scientifically proven method of typing a word into google and seeing how many pages it reports.

      Genealogy - 9,000,000
      porn - 44,000,000
      linux - 66,000,000
      windows - 66,000,000

      so OS are more popular than porn?

      sex - 144,000,000

      maybe not

      YMMV

      --

      Bleh!

  13. Re:Because... by slimak · · Score: 2, Insightful

    I believe it is the pages themselves, not Google, that this is an attempt to deliver.

  14. Cached searches by T5 · · Score: 4, Interesting

    Might be a nice way to preserve searches for later perusal. Unlike bookmarking, the returned search results are stored in an email.

    This would be a good way to preserve stuff that may be the subject of removal due to court order, like xenu.net and other similar de-Googlings.

  15. Hey! by Quasar1999 · · Score: 2, Funny

    MIT guys! Why don't you put your brain into better compression technology? So we can deliver higher bandwidth to those still on crappy 56K lines?

    And don't say it isn't doable... If I had the time, I could do it, and I'm a mere highschool graduate...

    --

    ---
    Programming is like sex... Make one mistake and support it the rest of your life.
    1. Re:Hey! by jayhawk88 · · Score: 1

      If I had the time, I could do it...

      Yeah, but those Tivo'd American Idols aren't watching themselves, are they?

    2. Re:Hey! by Indomitus · · Score: 1

      We do have higher bandwidth on phone lines. It's called DSL.

    3. Re:Hey! by Quasar1999 · · Score: 1

      We do have higher bandwidth on phone lines. It's called DSL.

      SHHH!!! Don't tell... I was planning on making millions! You damn near let the cat outta the bag... now how much do I have to pay you to keep you quiet? ;)

      --

      ---
      Programming is like sex... Make one mistake and support it the rest of your life.
    4. Re:Hey! by ncc74656 · · Score: 1
      MIT guys! Why don't you put your brain into better compression technology?

      And don't say it isn't doable... If I had the time, I could do it, and I'm a mere highschool graduate...

      Spoken like somebody who doesn't have a clue what he's talking about. I'm sure there'd be a fair chunk of money involved if somebody manages to deliver substantially better than 2:1 lossless compression on text, photos, and/or sound...hell, it'd probably be good for a Turing Award, if it's verifiable. It's something that the best minds in computer science (people a good bit brighter than most of us) have been grinding away at for decades to yield little more than incremental improvements. It's highly unlikely that some random schmuck is going to figure it out between Quake deathmatches.

      --
      20 January 2017: the End of an Error.
    5. Re:Hey! by VCAGuy · · Score: 1
      MIT guys! Why don't you put your brain into better compression technology? So we can deliver higher bandwidth to those still on crappy 56K lines?

      Well, herein lies the problem. Compression on modems (now the V.90 and V.92 standards) must satisfy three things: a) low-latency; b) low-processor demand; c) linear. A modem can only look at a very little snippet of data (usually what's in the FIFO buffer) and compress that. Even when it compresses it, it has to do so quickly and serially. The reason compression schemes like RAR work way better is because they aren't real-time. The challenge in communications is always how much compression you can get away with and still satisfy linearity of data flow and time-constraints.

      --
      Q: "Why do sound techs say 'check 1, 2'?"
      A: "Cause if they could count any higher they'd be lighting techs."
    6. Re:Hey! by Dolly_Llama · · Score: 1

      You owe me a new keyboard you bastard...*mops up coffee

      --

      Somewhere, something incredible is waiting to be known. -- Carl Sagan

    7. Re:Hey! by Quasar1999 · · Score: 1

      Well, herein lies the problem. Compression on modems (now the V.90 and V.92 standards) must satisfy three things: a) low-latency; b) low-processor demand; c) linear. A modem can only look at a very little snippet of data (usually what's in the FIFO buffer) and compress that. Even when it compresses it, it has to do so quickly and serially. The reason compression schemes like RAR work way better is because they aren't real-time. The challenge in communications is always how much compression you can get away with and still satisfy linearity of data flow and time-constraints.

      You make a very valid point, but why limit yourself to only using the modem? If we have an ISP at one end, and an end user with a PC (using a modem) at the other, could you not simply grab the data from the web at the ISP side, compress the entire file (be it html, jpeg, gif, zip) and then send the compressed version over the phone lines to the end user where the PC does the decompression (not the modem)??? Sure some things compress better than others, and streaming media such as audio and video could not use this technique, but the speed at which modem users could browse the web could be vastly improved.

      Why has this not been done? What key peice of the puzzle am I missing?

      --

      ---
      Programming is like sex... Make one mistake and support it the rest of your life.
    8. Re:Hey! by MojoRilla · · Score: 1

      You make a very valid point, but why limit yourself to only using the modem? If we have an ISP at one end, and an end user with a PC (using a modem) at the other, could you not simply grab the data from the web at the ISP side, compress the entire file (be it html, jpeg, gif, zip) and then send the compressed version over the phone lines to the end user where the PC does the decompression (not the modem)??? Sure some things compress better than others, and streaming media such as audio and video could not use this technique, but the speed at which modem users could browse the web could be vastly improved.

      This has been done. By many people. See here for one example.

    9. Re:Hey! by Natalie's+Hot+Grits · · Score: 2, Informative

      Ever heard of PNG,GIF,Animated PNG/GIF? these are formats for both images and video, that are 100% lossless, and better than a 2:1 compression ratio on most video and images (High resolution images with many colors are not great compression ratio but in the average case 2:1 is very very very very doable)

      As for video, animated PNG is a PNG compression of the Diff's of the second frame to the first, third to the second, etc etc.. In the case of video, compression ratios are on the order of 100:1 and audio is usually around 2:1 for lossless (FLAC,shn)

      Huffyuv is a codec that video editors use that is lossless and better than 2:1 in the average case

      Text, for sure on the average case, is WAY WAY WAY over a 2:1 compression ratio (Ever heard of gzip,zip,bz2?). Especially considering they are zipping up code, and not written english, it is very very compressable because of so much redundancy.

      I think someone forgot to do their compression homework before posting and insulting his parent (which he should respect, for he has insight)...

      --
      Two infinite things: your stupidity and mine. But I'm not sure about the latter. If my sig offends you, I'm sorry.
    10. Re:Hey! by ncc74656 · · Score: 1
      Ever heard of PNG,GIF,Animated PNG/GIF? these are formats for both images and video, that are 100% lossless, and better than a 2:1 compression ratio on most video and images

      I said photos, not images. The picture of your dog that you scanned in and saved to PNG won't compress nearly as much as the pie chart image on your website. Images with large areas of solid color will indeed compress down by a large amount. (A picture I scanned in of my truck compressed with PNG got about 1.63:1. Lena got about 1.65:1. Both sources were BMPs where the source data had never been subjected to lossy compression...they were compressed with bmptoppm foo.bmp | pnmtopng -comp 9 >foo.png.)

      In the case of video, compression ratios are on the order of 100:1

      Lossy compression can do that, but there's no lossless video compression that routinely gets anywhere near that. Reread my original assertion...it pertains to lossless compression of text, video, and audio and the likelihood of getting something substantially better than 2:1 compression.

      Huffyuv is a codec that video editors use that is lossless and better than 2:1 in the average case

      Not in my experience...720x480 captures from an All-In-Wonder Radeon usually get somewhere around 2:1, give or take a bit. (It's also worth noting that Huffyuv is lossless only for YUV sources; RGB video will experience a slight loss from colorspace conversion. For analog video capture, this isn't an issue...composite/S-video/component video are all variations of YCbCr, which is close enough to YUV for government work. CG video will most likely start out as RGB, so there will be some slight loss introduced by the RGB-to-YUV conversion...but it'll also likely see greater-than-typical compression for the same reason that a PowerPoint slide will compress more than a scanned photo.)

      Text, for sure on the average case, is WAY WAY WAY over a 2:1 compression ratio (Ever heard of gzip,zip,bz2?). Especially considering they are zipping up code, and not written english, it is very very compressable because of so much redundancy.

      Code is a special case. Compressing natural-language text is a more likely scenario. Taking Project Gutenberg's Hamlet, stripping out excess whitespace, and compressing it with gzip -9 yielded 2.32:1 compression. Doing the same with the text on /.'s main page (just the text, not the raw HTML) as I type this message yielded 2.10:1 compression. (Use strings foo.txt | wc; strings foo.txt | gzip -9 | wc to get before-and-after sizes.) Substituting bzip2 -9 got 2.68:1 and 2.22:1, respectively. While those numbers are better than 2:1, they're not "WAY WAY WAY over a 2:1 compression ratio." Substantial improvement, I'd think, would mean at least 4:1 compression on any of these examples. If you can do that, you'll make a killing in the market.

      --
      20 January 2017: the End of an Error.
    11. Re:Hey! by Quelain · · Score: 1

      http://www.compression.ca/act-calgary.html

      The leading compressor there gets 1.8226 bits per byte on a large archive consisting of several different data types (text, images, code).

      If you're just talking plain english text, then a dedicated text compressor can do a bit better through the use of a large dictionary file. Not much better though, general compression algorithms are amazingly good.

      --
      Cthulhu loves you.
  16. Re:Because... by jtw123 · · Score: 1

    I think the point isn't that Google's returned index is too slow but actually clicking through and loading the pages themselves are prohibitively slow.

    I think this would be more appropriate in cases where access was unreliable and intermittent. If you could get on long enough to submit a query, you'd get results back (albeit slowly) even if your access was cut off.

  17. Why? by connsmythe96 · · Score: 1
    How is this a good idea? What's faster:
    • Emailing a search, waiting for a response, then downloading a zip of the webpages that might be relevant

    • or...

    • Searching in a webform, seeing a LIST of results with context and clicking only the links you want (i.e., downloading what you need only)

    Why download a zip of multiple results when in many cases just the first result is needed?
    --
    if(!cool) exit(-1);
    1. Re:Why? by anttik · · Score: 0

      Read the article. It isn't for slow connections in our standards, but for slow connections in developing countries. It really matters if you have several students wanting to get an answer and you only have one computer and maybe the phone line can't be busy all the time.

    2. Re:Why? by cloudmaster · · Score: 1

      I'm not sure what the problem with just turning off image display and using google is, anyway...

  18. Re:Because... by TopShelf · · Score: 4, Informative

    I had the same initial reaction, but after RTFA (I know, shame on me), it seems that the limitation isn't so much time, but continuous time hogging the phone line accessing Google, checking out pages, etc.

    Instead, this service would package together selected results of the search, for overnight download into the PC's cache. The user can then browse through the material at their leisure without needing to use the internet connection (which is the scarce resource).

    --
    Stop by my site where I write about ERP systems & more
  19. Repackaging of simple tools under a pretty name by TylerL82 · · Score: 1

    Isn't mod_gzip already used with popular search engines?
    It's nice that MIT has the processing power and bandwidth to receive data from search engines, uncompress them, REcompress them, and send back to the "queryer".
    Sounds like more overhead and more trouble that what's already in place that does the same damn thing! ...Am I missing something?

    1. Re:Repackaging of simple tools under a pretty name by localghost · · Score: 1

      The idea is that in places where people can not get a persistant connection, you queue up queries while offline, send them when you get connected, and recieve the results the next day. As usual, /.'s summary is misleading. The article explains the logic, and it does make some sense, but I don't see this as something that will take off.

    2. Re:Repackaging of simple tools under a pretty name by Jad+LaFields · · Score: 1

      Well, it's never going to "take off" in a traditional sense, but if it helps a couple hundred people in disadvantaged parts of the world, possibly turning a few kids onto computing and a pontentially lucrative career, then that's good. This is not an earth-shattering development, but it doesn't merit the amount of uninformed vitrol I see all around here. This has got to be the most annoying display of the /.'s aversion to RTFAing that I've seen in a while.

      Nice to see that at least someone didn't knee-jerk reply to the /. article. Good post.

      --
      [SIG] It's like putting a moose in the blender -- a recipe for disaster!
  20. Finally! I can use Google... by Anonymous Coward · · Score: 1, Funny

    with my 300 baud modem.

  21. Re:Because... by rocket97 · · Score: 0, Interesting

    I agree 100%, I just moved to a new place where I am forced to use dial-up because I am in an area that does not have access to DSL, cable, or anything else that is still decently priced... anyway when I run a search on google it takes at most 5 seconds to get the listing back of the results. If this program needs to send an email to the user letting them know the search results this will take at least 2 maybe 3 times longer (in the fastest instance) for the user to get the results. I think that MIT needs to re-think their ideas and come up with something more useful rather than coming up with something that will just cause more headaches to the users forced to use the slow connections like myself.

    --
    "The two most abundant elements in the universe are hydrogen and stupidity." -Harlan Ellison
  22. The big problem by Lord_Dweomer · · Score: 1
    Is that someone else has judgement over what is to be considered most relevant. Sure you can pick from the sources they give you, but who's to say that the information they give you isn't filtered in some way?

    --
    Buy Steampunk Clothing Online!
    1. Re:The big problem by Anonymous Coward · · Score: 1, Insightful

      Please explain how that is different than any other search engine.

    2. Re:The big problem by Lord_Dweomer · · Score: 1
      With other search engines, its much more feasible to cross reference your results based on the results other people are getting. I don't know the address offhand, but there is a website that was made to track these 'filterings'. Obviously that becomes next to impossible to do for a snailmail search engine.

      --
      Buy Steampunk Clothing Online!
  23. Laos, maybe? by Daengbo · · Score: 1

    I read this story a couple of days ago, and thought it was rather a strange idea. I don't know where access is that slow. At the time, I thought maybe it might be used with the ham radio internet they're going to get in Laos as part of a "empower the farmers" program.

    1. Re:Laos, maybe? by winmutt · · Score: 1

      http://www.arrl.org/hsmm/

      packet radio is not slow.

      --
      :winmutt:
    2. Re:Laos, maybe? by Ramadog · · Score: 1
      packet radio is not slow.

      That depends on where you live. Where I am in Australia part of the backbone is still 1200 baud.

      Using anything irc like is ok for example the talk mode on the local X1J4 node or a convers server on a NOS system is useable. Reading your mail or bulletins on the other hand is not something that you would to download and try reading in real time. It is more like download them and read later when the forward session has finally finished. Browsing web pages at this speed is not fun.

      At these kind of speeds something like what is mentioned in the article could be usefull. But by the time you get up to 56k or so my preference would be just use google.

  24. Great!! I so need that... by SlimFastForYou · · Score: 2, Interesting

    for my cable Internet connection at home.

    Yes, I am dead serious... Lets just say Charter's cable Internet in my area lately really stinks. I would almost rather be on a 14.4k modem - no joke. I am not the only user... I get lag spikes of over 3000ms when not doing anything, and almost dropped connections. Good thing DSL recently became available in my area =D. One less Charter Pipeline subscriber.

  25. In other news... by sw155kn1f3 · · Score: 2, Funny

    They are developing the program which will replace web forums - you post a message to predetermined mail account and everybody subscribed will receive it very soon (patent pending).
    File transfers and weather forecasts are planned in 2006.
    This will make a difference.

    --
    - Arwen, I'm your father, Agent Smith.
    - Well, you're just Smith, but my father is Aerosmith!
  26. Re:Because... by jasoncart · · Score: 1

    So its a Google API... and a program that zips the pages? Wow, heavy development.

  27. Slow Net by Kris+Thalamus · · Score: 0

    Who needs the internet when we have a perfectly functional postal service

  28. It's been done before. by Anonymous Coward · · Score: 0

    Sounds exactly like the agora email services from way back in the day.

    http://www.bellanet.org/email.htm

    http://scout.wisc.edu/addserv/NH/95-11/95-11-21/ 00 26.html

  29. I got this idea for a great product. Want in? by packethead · · Score: 1

    Rocks baby! Pet rocks! Get it? Get it?

    Or how 'bout this? Seven minute abs? Get it? Get it?

    Or, or, or, .....

    --
    .sig
    1. Re:I got this idea for a great product. Want in? by Anonymous Coward · · Score: 0

      Hey, I still have my pet rock

  30. RTFA. by Kickasso · · Score: 1

    It explains. Honestly.

    1. Re:RTFA. by CmdrTostado · · Score: 1

      Unable too RTFA, already using product, article not included with querry. Please e-mail me the article and I will respond tomorrow, if I can get a phone line tonight to download. Gotta go this will probably cost me $400 US.

  31. RTFA by DrewMIT · · Score: 5, Informative

    For those of you wondering why someone would do this, how about reading the damn article?

    The program doesn't e-mail back with a mere mirror of a google / yahoo results page. It actually filters through the individual results compressing the entire page. e.g. my search turns up a CNN page and a blurb on MSNBC and I get, e-mailed to me, compressed versions of those actual sites, not just links to them.

    As far the "my 28.8 modem is just fast enough" crowd -- read the article! Some of these locations the software is being developed for don't even have access to a phone line on a regular basis. And the lines they do have access to are more likely than not to be noisy as hell and not able to support a 28.8 connection.

    1. Re:RTFA by bongoras · · Score: 2, Insightful

      "Let us assume you are in Boston, and you have tons of research money in a fund and no clue what to do with it. If you want to find a new way to piss away the money, you've got to be really creative, especially if you want to do it AND claim to be socially conscious at the same time. So what you do is invent a problem. And then invent a solution. Doesn't matter if it works or not."

    2. Re:RTFA by sunking2 · · Score: 1

      Oh, how I wish I had some mod points. Is there an Insightfully Funny mod?

    3. Re:RTFA by Anonymous Coward · · Score: 1, Insightful

      I agree.

      Slashdot would actually be a fairly intellectual place (with some humor) if people would read the article AND all the previous people's two cents before contributing some rehashed crap.

      As of now, I swear 2/3 of the comments are "28.8 is fast enough" Makes me want to drill my eyes out for making the mistake of reading the comments.

    4. Re:RTFA by dfj225 · · Score: 1

      And the lines they do have access to are more likely than not to be noisy as hell and not able to support a 28.8 connection. I live in New Jersey and my line was so noisy that it couldn't support a 28.8 connection...

      --
      SIGFAULT
    5. Re:RTFA by joepeg · · Score: 1

      Computer: "Enter Search Terms:"

      Teacher: "mischevious students"

      Computer: "we are going to send a query through e-mail, it is OK?"

      Teacher: "OK"

      *1 day later...*

      Computer: "Did you mean: mischievous students?"

      Teacher: "crap"

      --

      ZEN is a prime number in base-36

  32. A Whole Country? by jetkust · · Score: 1

    They are also considering trying to persuade computer sellers in developing countries to install the program on machines.

    They are going to develop countries to install the program on people's machines?

  33. Still in use today in the US. by _Sambo · · Score: 4, Insightful

    TODAY
    I am reminded of the Prepaid Legal system of doing business. You call up and ask a question, and the next day, an attorney familiar with the area you are asking about calls you back to answer your questions and advise you. So maybe this isn't all that outdated of an idea after

    IN REGARD TO THE SYSTEM IN THE ARTICLE:
    To have this capability back in 1973 would have been unbelievable. In 1983, to have this available to every library in the US would have been an unbelievable achievement. To have it now is so slow that I start to go google eyed even thinking about it.
    BUT
    This is great for countries that are 20-30 years behind in technology. It will revolutionize the search for information for areas that are not as connected as the US.

    1. Re:Still in use today in the US. by jez9999 · · Score: 1

      So maybe this isn't all that outdated of an idea after

      OK, I'm not usually a grammar Nazi, but do you realise you made FOUR errors in that single fucking sentance??

      1) You DO NOT start a sentance with 'So...'.
      2) You DO NOT say 'that outdated of an idea', you say 'that outdated an idea'.
      3) You it's 'after all', not 'after'.
      4) You put a full stop at the end of a sentance.

      Learn English, or fuck off.

      Sorry, I'm in a bad mood. Male period. Or something.

    2. Re:Still in use today in the US. by Anonymous Coward · · Score: 0

      Not to be a spelling Nazi, but "sentance" is spelled with an "e" --- "sentence".

    3. Re:Still in use today in the US. by Anonymous Coward · · Score: 0

      I guess he's a hypocrite, or something.

    4. Re:Still in use today in the US. by Anonymous Coward · · Score: 0

      Learn English, or fuck off.

      I'd much rather have you send me a picture of yourself so that I can jerk off to it and eat my cum while pretending it's yours.

    5. Re:Still in use today in the US. by instarx · · Score: 1

      To be correct, your rant should have been written as "One DOES NOT say...", and "One DOES NOT start...", as opposed to your "You DO NOT..."

      I see your point, however. His grammer does suck.

      But let's face it - you ain't gonna win no spellin' or grammer prizes neither, Bubba.

  34. What it's really about by Anonymous Coward · · Score: 3, Insightful

    is not slow connections, but connections that are unreliable

    Using the phone in a country like Malawi can be a real adventure. It's not like the US at all.

    1. Re:What it's really about by tarquin_fim_bim · · Score: 1

      It's not like the US at all

      I have have studied this situation closely and think that the reason why they called it Malawi and not US was to give people a hint that this may be the case, but then I could be wrong.

    2. Re:What it's really about by BillThies · · Score: 2, Informative
      Hi, I'm a graduate student working on the TEK project.

      Thank you for your post, it's an important point -- TEK is targeting users that might have no direct Internet connectivity. In some places, it can be cheaper to have an email-only account instead of full-fledged web access; for these users, TEK provides web content using only email.

      In addition, there are cases where no connectivity is available, but emails can be sent in a store-and-forward fashion. For instance, we are working with First Mile Solutions, who provides store-and-forward services to rural communities using a mobile access point (such as a bus) that visits each kiosk during the day. Moreover, if the connections are unreliable by any measure, then email is a better medium than HTTP, as no end-to-end connection is needed at any time.

      More information about the TEK project, including some statistics on Internet rates in the regions we are targeting, is available on the TEK Homepage

  35. Re:Because... by Anonymous Coward · · Score: 0

    (Score: -1, Didn't Read the Article)

    It's designed for computer's that don't have fulltime internet connection. The program dials up at night and sends off the queries, so then the next day after the dial up/fetch/retrieve, the results are in.

  36. WHAT A GREAT IDEA!!! by Anonymous Coward · · Score: 0

    in 1988

  37. Searching over email?? by slackr · · Score: 2

    So why is it that the answer to all of my searches is either "wet teens," "Generic Viagra," or "I am a banker from Nigeria?"

    --

    * Please do not read my signature.
  38. Also good to circumvent censorship by pbox · · Score: 5, Insightful

    Coincidentally (?) it is also very usefult to circumvent the Great Firewall. Way to go, but it would also be nice to optionally have the cached content (ala google) e-mailed as well. That would send the last standing wall crumbling.

    --
    Code poet, espresso fiend, starter upper.
    1. Re:Also good to circumvent censorship by Anonymous Coward · · Score: 0
      but it would also be nice to optionally have the cached content (ala google) e-mailed as well.
      This could just be me, but isn't that what the entire thing is about????
    2. Re:Also good to circumvent censorship by JimmytheGeek · · Score: 1

      Peacefire has a more or less turn-key proxy you can set up for your friends stuck behind the Wall. If this caught on, the bad guys could just watch the traffic to the mit site.

  39. RTFA by lylum · · Score: 1

    "Let us assume you are in Malawi," explained Prof Amarasinghe, "and the computer lab does not have access to the telephone line all the time." "If you want to find some new information about malaria, you are prompted with a message that says 'we are going to send a query through e-mail, it is OK?'. "At night, when the phone line is available, the teacher can dial out and send the queries." The request is sent to computers at MIT in Boston, which then search the internet and gather webpages.

  40. Re:Because... by fliplap · · Score: 5, Insightful

    RTFA.

    Why don't you just scream "HI I'M FROM 'WESTERN' CIVILIZATION AND HAVE NO IDEA HOW THESE THINGS WORK IN LESS PRIVLEGED PLACES"

    Google is too slow when your school has one phone line that is used for _everything_, including net access. Not to mention the cost of using the phone anyway. This allows all the students to submit thier searches to a teacher one day, the teacher then submits the all searches with only a couple minutes of dialing up. He can retrieve the compressed results a few days later with only a minutes of dialing up. Now go read the article. Someone needs to mod that post down, hopefully the poster can redeem themselves later in the thread with something insightful.

  41. Maybe these poor 14.4-ers should finally get the memo - they're human rights are being violated! Everbody, including those in Estonians, know that it is inhumane for them to be stuck piping information through their small RJ11. Starvin' Marvin is weeping for their poor souls.

  42. Be warned, parent contains goatse link - nt by Anonymous Coward · · Score: 0

    nt

    1. Re:Be warned, parent contains goatse link - nt by Anonymous Coward · · Score: 0

      Moderators, don't listen to this guy. He's a fscking troll.

  43. This sounds like a Solution... by Jhon · · Score: 1

    Looking for a problem.

    I recall 'back in the day(tm)' several email services which you could send an email with a URL in the subject and it would return the web page to you in a reply (a -la a lynx like format). That was -- what? 10 years ago?

    I've used 19k, 14.4k and even 4800 baud modems to connect to the net and browsed comfortably. Something as simple as unchecking the box [DOWNLOAD GRAPHICS] (or whatever setting your browser uses) will accomplish this.

    What kind of 'slowness' are we taking here? 110 baud?

    -jhon

    1. Re:This sounds like a Solution... by generic-man · · Score: 1

      The queries will be submitted to a central repository for transmission via e-mail. The results come back the next day.

      Assuming that the results total about 20 KB in size, that represents a baud rate of about (160,000 bits / 86,400 seconds) = less than 2 bits per second.

      --
      For more information, click here.
  44. Re:Because... by Anonymous Coward · · Score: 0

    Under capitalism man exploits man. Under communism it's the other way around.

    Woman loves woman? Isn't that called Lesbianism?

  45. and for those with no internet.... by vmxeo · · Score: 3, Funny

    And for those people with no internet connection, you can mail your search requests to MIT (Please include self-addressed stamped envelope). MIT will then process your search request within 5 business days, and mail you back the results. You can then peruse the results and marvel at the wealth at information you'd be able to find... if only you had internet access.

    1. Re:and for those with no internet.... by Jad+LaFields · · Score: 1

      That would actually really be a good idea, as information slowly becomes more digitized and web-based, rather than library-based. Sure, as of now you can still much of what you need to know from libraries, but if you're in enough of the boonies, the local library is probably just a bunch of Danielle Steel books, if anything at all.

      Of course, in a number of third world countries, even snail mail can be extremely unreliable...

      --
      [SIG] It's like putting a moose in the blender -- a recipe for disaster!
    2. Re:and for those with no internet.... by Anonymous Coward · · Score: 0

      They tried this. Only they used cheap Harvard students to do the actual searching, so it never worked. The Harvard students kept looking for the 'any' key.

  46. Re:Because... by panaceaa · · Score: 5, Funny

    I tried to RTFA but MIT hasn't emailed it to me yet :(.

  47. Re:Because... by Anonymous Coward · · Score: 0

    I think _you_ need to RTFA.

  48. Well... by frankjr · · Score: 1

    I saved the html for google and it was 3405 bytes. If my math is correct that would be about 92 seconds on a 300bps connection. It will probably only be useful if you have a really slow connection.

    1. Re:Well... by dildatron · · Score: 1

      It's actually closer to 90.5 seconds. But, if you turn off images, it would be tolerable.

      But if one only has a 300 baud modem, what the hell are they doing on the internet anyways? That's like brining a knife to a gun fight, or some other poor analogy.

      --


      If you had nuts on your chin, would they be chin nuts?
  49. classic waste of money by bongoras · · Score: 1

    I'm gonna get modded down, but fuck it, I have karma to burn and a soapbox to stand on.

    These people from mit are getting research money to spend on this rediculous piece of shit project that NO ONE will ever use. Can you imagine, a farmer in Ghana or something and he gets malaria. His first thought is "Hmm, I wonder if there's any information on the internet about this?"

    I mean come on.

    I mean fucking really.

    But ok, I'll even give them that... let's pretend that farmers in countries where the internet connection sucks so bad that google, a site that is optimized to work on pretty much anything, is too slow... even know about the internet and have any use for it... are they going to go to the library, borrow software, take it home and install it on... oh yeah... install it on ... their computer... oops... WHAT FUCKING COMPUTER?

    Asshole mit people with nothing better to spend money on.

    1. Re:classic waste of money by Anonymous Coward · · Score: 0

      Yeah! You are SO right! What a waste of money. Those stupid fuckheads.

  50. Re:Because... by Anonymous Coward · · Score: 0

    I think_you_need to GAFL

  51. Wouldn't be needed if... by DeltaSigma · · Score: 4, Insightful

    ...only webdesigners had not collaborated to turn the web into the graphics orgy it is today. I mean, have these kids coming out of graphics school even browsed the relevant w3c specifications?

    News Flash !

    • The internet is for information, not television!
    • Web sites can look pretty and function better without being a giant photoshop mural!
    • Creating a clean design that's easily searchable, retrievable, and most of all, usable is a work of art!
    1. Re:Wouldn't be needed if... by Dog+and+Pony · · Score: 1

      Try telling that to a manager. Just try it. Or, if you wish, I can tell you what the response will be - in the absolute best case scenario, perfect world utopia, you will get a blank stare. From there it is all downhill.

      Notice that a *lot* of sites out there that are made by professionals for themselves or for open projects follow your guidelines. It is the commercial sites that don't (and the kiddie sites, of course, where you can't see the site for all the flash animations and counters).

      Sad thing is that there aren't any commercial sites, with a few exceptions like google, that have a simple but great site, so it is hard to vote with the wallet on this one. I also am afraid that what that the general customer wants is lots of flashing things and shiny buttons, so I do think the managers are doing the right thing froma business perspective.

    2. Re:Wouldn't be needed if... by kalidasa · · Score: 1

      Try telling that to a manager. Just try it. Or, if you wish, I can tell you what the response will be - in the absolute best case scenario, perfect world utopia, you will get a blank stare. From there it is all downhill.

      Exactly. When I was doing web design, for one customer I designed a beautiful, XHTML Strict site with incredibly low bandwidth requirements. The customer didn't think it was "snazzy" enough, and eventually had a graphic designer do a page design that was so agonizingly slow that I can't imagine anyone ever actually waited for an entire page to load.

    3. Re:Wouldn't be needed if... by orasio · · Score: 1

      The internet is for information, not television!

      Television itself was intended for information.

    4. Re:Wouldn't be needed if... by Anonymous Coward · · Score: 0

      Television itself was intended for information.

      the parent is talking about usable bandwidth

    5. Re:Wouldn't be needed if... by NDPTAL85 · · Score: 1

      Says who?

      And WTF is the point of all this computing power if its just to display boring text? For the love of GAWD even BOOKS have pictures!

      BOOKS!

      --
      Mac OS X and Windows XP working side by side to fight back the night.
    6. Re:Wouldn't be needed if... by poot_rootbeer · · Score: 2, Insightful

      The internet is for information, not television!

      Why shouldn't the internet (by which, I assume, you mean specifically the World Wide Web) be for both information AND television?

      Just because the markup language we call "HTML" was originally developed and is best suited for information-rich text documents such as academic papers a decade ago doesn't mean that we must not, or even should not, look beyond that type of content and find new uses for the system.

    7. Re:Wouldn't be needed if... by Anonymous Coward · · Score: 0
      perfect world utopia, you will get a blank stare

      No, that's the realistic best case scenario. In my utopia, my manager would be female, beg my forgiveness, offer me a blow job (I refuse, married), and give me a huge bonus instead. Then she will convince the congress to outlaw flash websites and the president will ask his people to make it there highest priority. Marcomedia's execs and marketing are put away and the programmers there are reformed.

    8. Re:Wouldn't be needed if... by Metropolitan · · Score: 1

      I know this will be modded low, but here goes:

      Amen!!!

      Usability has been sacrificed to designers run wild, marketing people with no idea how to present a clear message, and generally people with nothing to say. It's easy to hide lack of meaning behind bright colors & whatnot.

      One great big Home Shopping Network is not what the WWW is destined to become, contrary to the efforts of a great many small minds. Tell your story well and clearly, regardless of commercial or other intent, and those you need to reach will find you. Amazon does so pretty well, as do Google, the Library of Congress, and a great many others. Clarity of message, a compelling story.

      Whether this provides a path around totalitarian governments remains to be seen. Hopefully it will erode them from the inside, as a good flow of information should. The best sword against despots is truth, told to lots and lots of people.

    9. Re:Wouldn't be needed if... by Anonymous Coward · · Score: 0

      The internet is for information, not television!

      More and more I'm not bothering to use my VCR anymore, and getting my TV from the internet.

    10. Re:Wouldn't be needed if... by sootman · · Score: 1

      I don't have time to read all the responses so I'll just say this: the web is whatever we make it.

      --
      Dear Slashdot: next time you want to mess with the site, add a rich-text editor for comments.
    11. Re:Wouldn't be needed if... by xtrucial · · Score: 1

      While I agree in principle, I think we can live with some variety. I mean, what if someone is an artist and really does need Flash or whatever to show off their work? And certainly other sites that can expect people with fast connections and the latest browsers--e.g. computer gaming sites--can do the same thing. We shouldn't flatten the web's possible styles just because there are a few (okay, a lot) of bad apple designers out there.

    12. Re:Wouldn't be needed if... by RealAlaskan · · Score: 1
      Why shouldn't the internet ... be for both information AND television?

      If you could do the TV without screwing up the rest, it would be peachy. I'm not sure that's possible, but I'd surely love to be proven wrong.

      ... the markup language we call "HTML" ... is best suited for information-rich text documents such as academic papers ...

      I guess the logical question here would be something along the line of: ``If a page ISN'T information-rich, why would I go to the trouble of searching for it?''

      I know that a picture is worth a thousand words, and sometimes the information I'm looking for is a graphic. But what the grandparent post was complaining about is that most websites are information-poor, and try to use graphics to hide the fact. What he was getting at, and I'm seconding, is that most uses of pictures on the web are counter-productive. At best, they impede my access to the information I'm looking for.

      When I want entertainment, I get a book, or go outside and play. When I want to learn something, I google for recent papers. TV just isn't in the picture.

    13. Re:Wouldn't be needed if... by DeltaSigma · · Score: 1

      The other respondant was correct in his assesment of my post. But I'd like to further clarify my position here.

      I urge people to read W3C's specs, because it's a model for moving beyond the broadcast age. That's why I resist the notion of the internet turning into television, that's why I dislike the standard web site; the "look at me, I'm pretty" website. In the beginning, outside of e-mail, the web more closely resembled television, or a book, than it does now. We have many options for making our content useful for our users. The one problem is that "web-designers" usually got the job by being good graphic designers, or good marketers. They're used to pushing their content out, and assuming users view it. That's all that's important to them.

      I believe the internet can play a large role in business. Shopping online could be a new experience. By this I mean, the intrinsic nature of the internet makes for a new, exciting, and very profitable point of sale. It can provide the ultimate comfort zone for a customer. However, we must resist the urge to think in terms of broadcast. It's this thought process that produced everything that we complain about on the net today. Spam, banner-ads, pop-up windows. The story is the same from every user, tech-savvy or not. "The internet sure is cool when I finally close everything I don't want so I can see what I do."

      Imagine online shopping, if the W3C had been calling the shots at every company. Products would be listed, according to category, in a corresponding XML standard format. Some web-front businesses could sell products based on the subject. Say, they specialize in PDA's running the Palm OS. These sites would offer convenience for the customer. They could organize products dynamically by features relevant to them, or by price. Businesses would attempt to list every bit of information about the product, making it possible for the customer to choose specifically what it is they want. After a customer finds out what features they do want, and what features they don't, they filter their search to include all desired features, then sort by price.

      It would be a completely new twist on the business/consumer relationship. The problem with this system is that it is the essence of capitalism. It streamlines the purchasing process by a factor that most marketers and CEOs can't fathom. In a matter of minutes a customer has educated themselves about the features possible in the product they're interested in. They've decided what features they need, and what they don't, and after leveling the playing field they've selected the unit that's most affordably priced. That's the whole problem with running internet-businesses the way the internet wants them to be run. It's capitalism in its purist form. If you're not feature rich, and affordably priced, you're dead, and no amount of advertising can change that.

      Indeed, it's easier for a company to go by the current system, it's safer. With all the confusion they've caused, a customer will end up buying the product they've seen in advertising the most.

      This is advantageous to the big companies, as it makes the whole system quite simple. Put more money into marketing, make more money.

      I look forward to a system slightly more complex for the company, but infinitely more simple for the customer. One where the world is a database just waiting to be searched.

      Besides, in such a system, when a user searches for an operating system, and comes to learn that linux does everything they want it to, all for the low low price of $0.00. Which "add-to-cart" buttond do you think they're going to push?

      Hmm, educated consumers, a monopoly's greatest fear...

  52. Compression, NO!!! by morcheeba · · Score: 1

    CSLIP already compresses it, most modems made since 1994 compress data, compressing it again at the application level won't help. Nevermind that the mail program will uuencode the data anyway & severely bloat it.

    1. Re:Compression, NO!!! by mikeee · · Score: 1

      compressing it again at the application level won't help

      Yes, it will, especially if it's application-specific compression. CSLIP compression isn't particulary good (for good reason, it's supposed to be simple and efficient).

      Good point on uuencode, though. A compression program designed to produce 7-bit rather than 8-bit output to avoid this might be a win.

    2. Re:Compression, NO!!! by BillThies · · Score: 2, Informative
      Hi, I'm a graduate student working on the TEK project.

      We agree that you won't have too much to gain from zipping the content before sending it. The larger gains are from higher-level compression; for instance, the TEK Server keeps track of each page that it sends a given user, and it is careful not to send duplicate pages in replies to future search queries (unless the user specifically requests an updated version of a given page.) This can be especially useful in shared environments (such as a school) where there is a lot of overlap between queries.

      Also, there are some marginal gains to be made by zipping more content at once. The server sends ~20 pages at a time (or all the URL's requested in a given batch), which will compress better than if they were done separately.

      Your point about the bloat from the mail program is a great one, thanks. We should look into fixing this.

      By the way, we see the primary benefit of TEK as being the email-based access rather than the compression. You can find many more details about the project on the TEK Homepage.

    3. Re:Compression, NO!!! by morcheeba · · Score: 1

      Thanks for the reply - wish I could mod you up. It's a great idea & I think it'll help a lot of people. I lived in Peru at the dawn of BBS's, and access was a nightmare - we dialed in to a special satellite link to get out of the country (land lines were too expensive and noisy), and we still paid through the nose for it.

      I was a bit wrong on the compression - if you're running the program overnight, you might as well try an compress it as much as possible - I didn't consider that CSLIP is more speed-optimized than size-optimized. It also depends on how the internet access is billed - per byte or by minute. The modem-level compression will only help the 'by-minute' billing.

      Sorry about the email protocols & antiquaited 7-bit and non-ascii systems; it would be cool if email was naitively binary-compatible. If you've got custom software, then can you use a socket protocol? Email might have delays and if the response is delayed, would require multiple dial-ups. It's also robust, and schools probably do a morning fetch-mail anyway.

  53. what about gzip compression and paging? by jtotheh · · Score: 1
    First of all, browsers and web sites already can use compression on the fly so I don't see the big advantage there. And google or whatever engine you prefer delivers the first 10 or so at a time in case your connection is slow.

    Waiting a day to get an email from a search engine - that is like waiting for batch printouts in 1982 but worse.

    Basically I don't see the point in this as it is being described. I suspect though that there is more to it as MIT is not full of dumb people.

    1. Re:what about gzip compression and paging? by Jad+LaFields · · Score: 1

      Waiting a day to get an email from a search engine - that is like waiting for batch printouts in 1982 but worse.

      There are places in the world which are even farther behind than 1982.

      Actually, it's even crazier than that, that are like current-day first world in some respects, and third-years behind in others... an African library gets a only several-year-old used computer from some nice charity, but doesn't have the right infrastructure to utilize its 56k modem.

      You might feel that you couldn't stand waiting a day for a search to complete, but for others its better than nothing (or getting something after wasting a day trying to connect... let MIT do it).

      --
      [SIG] It's like putting a moose in the blender -- a recipe for disaster!
  54. Here's a name for it by TerryAtWork · · Score: 1

    Stopgle.

    --
    It's Christmas everyday with BitTorrent.
  55. Re:Because... by moonbender · · Score: 1

    You neither have a particularly slow connection - the majority of the net is on dial-up - nor are you (apparently) in a situation where hogging a phone line is a problem. You've probably got unmetered short-distance calls, so whether you're on for 3 minutes downloading a 400k email or whether you're online 30 minutes reviewing those 400 kilobytes of information doesn't matter to you.

    In short, RTFA. Sorry.

    Not that I'd wager that this is some kind of brilliant, revolutionary idea, but really, the article doesn't even imply that anyway.

    --
    Switch back to Slashdot's D1 system.
  56. uhhh...archie? by SCHecklerX · · Score: 1

    That's what it sounds like to me.

    1. Re:uhhh...archie? by EChris · · Score: 2, Insightful

      Yeah I thought the same thing, though this goes a step further and sends compressed copies of the resulting pages back you, not just an index of the sites.

      What I wonder is why the *client* needs any software? Why not just make an email addy that people send queries to (like you did with "archie") and get the results back in whatever mailer you've got already?

      Chris

  57. Re:Because... by donutello · · Score: 1

    I did RTFA.

    So now, you're going to have all the pages downloaded to your PC for you, when it's quite likely that the very first link was the one you wanted anyway?

    What about the bandwidth costs of doing that? And exactly how slow are these connections anyway? Google's search page is a few KBs - I can't imagine how downloading every possible hit (say top 20 hits) is feasible where downloading a single page of a few KBs is not.

    --
    Mmmm.. Donuts
  58. Free emulator by gspr · · Score: 2, Funny

    For those of you who want to try it out at home, just use one of your several hundred AOL CDs, and voila, you'll have a line slow enough to try it out.

  59. Google Voice Search by GillBates0 · · Score: 4, Interesting
    Check out the Google Voice Search page. This has been up for quite a while now, and though not directly applicable as is, to people with a slow internet connection, it's just another alternative to emails. They should design it to respond by phone though. But again, people with dialup connections would, more often than not, not have a second phone line to request searches by phone.

    To try out this demo, please follow these simple steps:

    1. Pick up the phone and call the automated voice search system at (650) 318-0165.
    2. After the prompt Say your Search Keywords, say your query to the system.
    3. Click this link and a new window will open with your voice search results.
    4. Say another query, and the new window with the search results will be updated with the new results.

    --
    An Indian-American Hindu committed to non-violent thought/speech/action alarmed by the global explosion of radical Islam
    1. Re:Google Voice Search by WEFUNK · · Score: 1

      I just tried calling and it just rings, and rings, and rings... perhaps we've slashdotted it!

      --
      My next sig will be ready soon, but friends can beat the rush!
    2. Re:Google Voice Search by Anonymous Coward · · Score: 0

      That's my number and I'm sick of it typing Slashdotter's queries into Google, so I turned of the ringer.

  60. Net connection by non-poster · · Score: 1

    Why not just use Content-Encoding:gzip to reduce the amount of time (text) pages take to transfer over the connection? Or have your ISP's proxy server compress everything on it's way over the pipe? Surely someone has thought of this before...

  61. Pop-ups by Anvil+the+Ninja · · Score: 1

    This is the first article I've read (in memory) that spawned five pop-ups upon following the primary link. A pox on greedy posters!

  62. Re:Because... by donutello · · Score: 1

    He can retrieve the compressed results a few days later with only a minutes of dialing up.

    Huhh?? How? If you download just the results page, then that is pretty useless since you then have to click on the links to see if it is the relevant link or not.

    If on the other hand you download the actual contents of the top 20 pages then given how slow your connection is supposed to be, I don't see how you could do that in "only a few minutes".

    --
    Mmmm.. Donuts
  63. Re:Because... by hondo77 · · Score: 1

    Consider areas that only have access to the Internet via radio...at certain, limited times of the day.

    Pop! (sound of a lightbulb turning on)

    --
    I live ze unknown. I love ze unknown. I am ze unknown.
  64. Re:Because... by Anonymous Coward · · Score: 0

    > I think that MIT needs to re-think their ideas

    To their credit, at least this isn't some lame wearable shit, or a stupid musical instrument which spins around or whatever. Jesus, they come up with some fucking crap!

  65. Re:Because... by EinarH · · Score: 1, Informative
    Honestly, folks, someone please explain how this could substantially save someone time surfing the web?
    In non developed countries the lack of bandwith is a serious problem.

    A year ago I was in Moscow. After 6 days without internet I really wanted to check my e-mail(webmail).
    That day we spent some time some kilometers outside Moscow, but still managed to find a internetcafe.
    After waiting for 15 minutes (the place was crowded) I started "surfing".
    Man that was slow.
    25 computers, *sharing* a 64kb uplink. And all the locals (they had an arangement; pay x numbers of rubels and "surf" as long as you want) where downloading with IRC, Kazaa, DC and ftp which resulted in *heavy* packet loss.
    I spent 8 minutes getting the Yahoo.com frontpage. And it took me almost 20 minutes before I could read the first mail.

    --

    Melius mori in libertate quam vivere in servitute.

  66. Re:Because... by Anonymous Coward · · Score: 0

    This sounds as dreadfull as my first experience with programming at school. Punch a set of cards, mail them to the data center, have them mail back the printout, or more likely, the error message.

    (Then, I would go home and play with my Altair)

    I suppose it's better than nothing, but not by much. I would just use the library.

  67. Other needs by ferreth · · Score: 2, Insightful

    First, for those of you saying 'Google is fast enough even on a 14.4K' - think school with one phone line, perhaps not even available during the day. Or how about connections via satelite phone at $$/min? Suddenly you want super efficient, when you only earn 5 bucks a day.

    As to what else this needs, the search engine needs to strip out all the crap before emailing a web page to you (Java, Flash, etc) - should focus on mostly text, small pictures only. Particulary since 486's would be a common platform for people using this, so the search engine better work well on one. You also should be able to strip out all pictures as an option to maximise text info download - remember turning off pictures in Netscape 2.x to speed up your browsing? If you need something it striped out, you should be able to query just for the bits you need later.

    Also the ability to share your cache between computers would be huge if they can't have a server to do that for them. At any rate, means of transferring those precious pages you downloaded to another computer - on a floppy, unless you have local email.

    --

    W9x:Thanks for the make-work project Bill.

    1. Re:Other needs by donutello · · Score: 1

      As someone whose school only had a 1.5kbps link to the internet, I consider myself qualified to comment on this subject.

      The ability to not download text only already exists. We used lynx to look at pages and downloaded images only when we thought they were relevant - and killed the download as soon as we saw enough to either understand the picture or realize it was not what we wanted to see.

      Also the ability to share your cache between computers would be huge if they can't have a server to do that for them. At any rate, means of transferring those precious pages you downloaded to another computer - on a floppy, unless you have local email.

      As far as I can see, that seems to be the only cool feature of this. Everything else already exists.

      --
      Mmmm.. Donuts
    2. Re:Other needs by kwerle · · Score: 3, Insightful

      First, for those of you saying 'Google is fast enough even on a 14.4K' - think school with one phone line, perhaps not even available during the day. Or how about connections via satelite phone at $$/min? Suddenly you want super efficient, when you only earn 5 bucks a day.

      No, google is fast enough at 300Baud. Damn, but folks are young around here.

      As to what else this needs, the search engine needs to strip out all the crap before emailing a web page to you (Java, Flash, etc) - should focus on mostly text, small pictures only.

      Either configure your browser or proxy to do that. Easy.

      Particulary since 486's would be a common platform for people using this, so the search engine better work well on one.

      Give me a break. 486's are plenty powerful enough for web browsing. Even with pictures.

      You also should be able to strip out all pictures as an option to maximise text info download - remember turning off pictures in Netscape 2.x to speed up your browsing? If you need something it striped out, you should be able to query just for the bits you need later.

      [sarcasm on] Really? [/sarcasm]

      Also the ability to share your cache between computers would be huge if they can't have a server to do that for them. At any rate, means of transferring those precious pages you downloaded to another computer - on a floppy, unless you have local email

      Give me a frickin' break. PPP over null modem serial.

      This has got to be one of the worst ideas I've ever heard of. Hell, I knew of WWW via UUCP (that's email, kids) in the 90's - and that didn't require ANY "special search software."

    3. Re:Other needs by Anonymous Coward · · Score: 0

      UUCP = Unix to Unix Copy. I love it when some /.ers think they know everything and other /.ers believe them.

    4. Re:Other needs by zcg · · Score: 1

      google is already compressed - for browser which can handle gzip compression such mozilla, MS IE 5.0 >

  68. What the fuck? by Anonymous Coward · · Score: 0

    How are these people connecting to the internet? By mail? Google is fucking text how fucking long does it take to load, even if you're using IE? Fuck!

  69. Re:Because... by Anonymous Coward · · Score: 0

    > I think_you_need to GAFL

    Go around flicking lemons? You think that'd help, huh?

  70. borrow and install? by Anonymous Coward · · Score: 0

    Since the program is too big to download over a poor net connection it will be mailed on CDs to libraries for people to borrow and install.

    When will our library system adopt this policy? There are quite a few programs out there that I would like to "borrow and install"...

  71. I don't understand by El · · Score: 1

    Google responses are so slow that you have to email them -- but then clicking on the links in the email ISN'T to slow? Pretty half-fast solution, isn't it?

    --

    "Freedom means freedom for everybody" -- Dick Cheney

    1. Re:I don't understand by Anonymous Coward · · Score: 0

      NO, Their solution packages everything (filtered relevent webpages), compresses it and emails it back. So just one bulk download and the results can be viewed offline.

  72. already been done(kind of) by crazyprogrammer · · Score: 1

    Back in 1998 I used to use Juno for email and had no web access. There was a free service that you could use to get web pages via email that I often used to "surf the web". the way it worked was you would send an email to webmail@curia.ucc.ie (I dont think it works now) and put "GO http://websiteaddress.com" in the body and it would email you back the html of that page.

    You could search yahoo by requesting a url like http://search.yahoo.com/bin/search?p=search+terms

    --
    "the fax machine is nothing but a waffle iron with a phone attached to it." - Grandpa Simpson
    1. Re:already been done(kind of) by dildatron · · Score: 0, Flamebait

      Good God, that's pathetic. I pity you.

      --


      If you had nuts on your chin, would they be chin nuts?
    2. Re:already been done(kind of) by BillThies · · Score: 4, Informative

      You're right that retrieving web pages over email has already been done. A present-day service that works as you describe is www4mail, and I know people that use it regularly from low-connectivity regions.

      However, the TEK system (which I'm involved in) offers several benefits over a purely email-based solution. By having a web proxy on the client side, users can use their favorite browser to view downloaded pages, complete with color and formatting, which is often absent in text-only systems. Moreover, the client keeps a local, searchable cache of all downloaded pages, and the server keeps track of which pages have been sent to avoid wasting bandwidth on duplicate content. Finally, with a web-like user interface, many users can share a single e-mail account in a public kiosk or school.

      Many more details about the TEK system are available from the TEK Homepage

  73. Uhh, google? by libertynews · · Score: 1, Insightful

    Google is obviously very light. If you really can't support the bandwidth for graphics, use lynx or turn off images totally.

    I realize that developing countries don't have fast connections, but google works fine across 24k connections (and really ought to work fine on anything).

    email requests and getting responses later seems like a bit more of a kludge than it needs to be.

    bcl

    --
    Remember Lexington Green!
    1. Re:Uhh, google? by battjt · · Score: 1

      For those of you who don't read the articles, this service wont be useful. For those of you who read the articles, this service will be useful in that it returns the results of the search, not just links to articles, but you probably already know that since you read the articles. I'll disreguard your silly comment about lynx. I just tried to read http://msnbc.com and it was very difficult. Now, imagine doing that with standard computer knowlege and weak English skills. Joe

      --
      Joe Batt Solid Design
  74. Re:Because... by fliplap · · Score: 1

    I can imagine your brainwaves are somewhere along the lines of:

    [_______________________________]

    but I'll bite anyway. I still don't think you read the article or you would have noticed the part about compression.

    "Someone using the software would e-mail a query to a central server in Boston. The program would search the net, choose the most suitable webpages, compress them and e-mail the results a day later."

    Its very likely, that since the target is to use this for information, that the pages would be _highly_ compressed, either reducing image quality or removing many images altogether.

    On top of all that, if you had read the article, you would have noticed the part about schools not having net access round-the-clock. This is because the entire joint only has one phone line. Given this limitation, it is far more feasable to have 20 students submit thier search to the teacher and have him submit it that night. Rather than dial up for 30 minutes and have 20 people sharing a dialup line try to search all at the same time, which would cause many of the queries to time out.

    " And exactly how slow are these connections anyway? " Again, something you would have found out if you had read the article. Maybe you should go read it again.

  75. Oh, now I understand! by Anonymous Coward · · Score: 0

    So they can use it to download pr0n during off-peak hours!

  76. The Slow Evolutionary Crawl of Archie by Baby+Duck · · Score: 2, Insightful

    I liked this technology when it came the first time around. Archie via email? Anyone? Yay! MIT reinvented Archie! Only with a thick client instead of a small one! Way to go!

    So what if it scans webpages instead of FTP sites. It's not that big of a leap.

    --

    "Love heals scars love left." -- Henry Rollins

    1. Re:The Slow Evolutionary Crawl of Archie by mmol_6453 · · Score: 1

      It sounds to me like it'd be more useful for truly long-distance connections, like from here to Pluto.

      --
      What's this Submit thingy do?
  77. Gopher by iacovou · · Score: 1


    I don't understand. E-mail gateways to Gopher and the WWW have been around for a very long time now. What is so special about this? Perhaps the article is incomplete.

    In anycase. There are still many applications for Gopher. The factors that made Gopher so appealing 10 years ago are still real-world factors today amongst third world countries - and cell phone users.

    In the early days of the WWW a lot of effort was spent making sure Gopherspace was available to WWW users. It sounds like the opposite needs to happen.

    E-mail is a tempting way to solve this problem - but what are you going to do? Send HTML pages via email? What does that save? Not to mention the 24 hour wait.

    --
    //iacovou
  78. Many problems with this idea by hyrdra · · Score: 0, Redundant

    First, If you don't have enough bandwidth for even the most minimialistic google page with stripped graphics etc., you don't have enough bandwidth to view the sites attached to the results of your query. What good is a list of results by e-mail when you can't view the links to establish how accurate they are?

    Also, the search process for the Internet is not only related to the quality of the search engine you use. It's an interactive process of finding what works best, refining your query based on results until you eventually find what you're looking for -- using this system would eliminate this aspect, at minimum making it painfully slow.

    Can anyone give a useful application that many people could use and benefit from?

    --


    "I'll just chip in a bit for RedHat: I actually have that installed on my university machine." - Linus, '95
    1. Re:Many problems with this idea by Anonymous Coward · · Score: 0

      As other's have pointed out, the server compresses the pages itself and sends these back in a much smaller form than "viewing the links it sends back." Works even if you only had some decrepit UUCP email link.

      Obviously you (and apparently the moderators (+3 Insightful - WTF?) can't be bothered to read before posting. Moron.

  79. Why not go all the way? by Pac · · Score: 3, Funny

    They should develop a program that strips images, animation, java aplets, ActiveX components and all HTML from Web pages, leaving only the text and the links. Then send it to the users. It could be called Gopher. Or Archie.

    1. Re:Why not go all the way? by Anonymous Coward · · Score: 0

      Hmm, so a program to strip images and other embeded content. Lynx, Opera, MS IE with images, ActiveX and Java turned of?

  80. Google by vasqzr · · Score: 1


    Google is mostly text. Pretty low bandwith if you ask me. Plus, it works great with Lynx.

  81. Software downloads... by greenskyx · · Score: 1

    Being that I am now stuck on a 56K Modem... this really wouldn't be bad IF I could pick software to have mailed to me. If I was able to get the new Open Office, Mozilla and Redhat updates for a low fee it might be helpful.

    Having webpages mailed to me seems stupid because I have high-speed internet at work and if there is a bandwith intensive site I just load it at work the next day...

  82. Re:Because... by advocate_one · · Score: 3, Informative

    cor... this sounds oh so familiar... anyone remember ftp by email??? History repeating itself...

    Query the ftp server by email and get the directory list emailed back to you. Then you could send the command via another email which would result in the file being emailed back to you overnight ready for you to retrieve it.

    And then there was "trickle" where files could be sent/refreshed to your uni's mainframe's ftp server overnight and would be there for you to play with the next morning and you would always have the most recent version of the file as they'd have been synched via trickle

    --
    Donald 'Duck' Dunn: We had a band powerful enough to turn goat piss into gasoline.
  83. Difficult way to achieve easy result. by Anonymous Coward · · Score: 0

    Why make a client-side program so big that it has to mailed? Why make a client-side program at all?

    Wouldn't it make more sense (and be much simpler) for MIT to write a program to run on their end that would receive and read the email containing the sender's query and simply reply with the results?

    It seems kind of Rube Goldbergian to go about it they way they planned.

  84. Re:Great!! I so need that... by Anonymous Coward · · Score: 0

    It's kind of funny to see your post at this time, i just spent half an hour waiting for charter to go back up, spent about 15 minutes before that trying to fix it myself before realizing the problem wasn't at my end.

  85. This just in... by McD · · Score: 3, Funny


    "MIT Reinvents Archie service from the early 90's."

    --
    "Given the pace of technology, I propose we leave math to the machines and go play outside." -- Calvin
  86. I'm On 28.8--This Makes No Sense by istartedi · · Score: 1, Informative

    This doesn't make any sense to me. I'm on 28.8, and 20 results from Google still come up instantly. Bandwidth might be an issue for the linked pages, but certainly not the search results. Even when I was on 14.4, back when Yahoo! was the hot search engine, it was no problem.

    So, what if these guys are on 300 baud and they get compressed search results via... e-mail??? The delay waiting for results to navigate e-mail systems probably negates the savings from the compression. Why not send compressed results over HTTP using a web-browser like application? Of course then you are still faced with bandwidth issues on the links you follow.

    It just doesn't make sense to me, unless they write a server-side proxy that intelligently filters Flash, popups, Java, superfluous graphics, audio, and other useless stuff that "web designers" like to use. The proxy could present pages in such a way as to offer users the option of downloading blocked files when the AI fails. That just cries out for a Mozilla mod or some other kind of custom browser; certainly not an e-mail client.

    --
    For all intensive purposes, "whom" is no longer a word. That begs the question, "who cares"?
  87. Did you know? by Anonymous Coward · · Score: 0

    Gastrocolic reflex: This is the reflex system that tells the colon to empty when food hits the stomach, or even in anticipation of a meal. This is why baby poops every time he nurses. It is also why kids with constipation complain of their belly aches right around mealtime.

  88. Re:Because... by EinarH · · Score: 1
    I know this might be a strange concept to you, but there's something called "surfing with graphics disabled". You might want to try it sometimes.
    I *did* surf with graphics disabled.
    Do you think I spent 8 minutes downloading the images on the yahoo frontpage?

    Stupid AC...

    --

    Melius mori in libertate quam vivere in servitute.

  89. Re:Because... by Nick+of+NSTime · · Score: 1

    Sheesh, I masturbated to pr0n 8 times before Yahoo.com loaded for you.

  90. thinking of something incredibly witty to say by pauly_thumbs · · Score: 1

    Oh yes:

    Imagine a beowulf cluster of these!!!

  91. No it wouldn't. by Anonymous Coward · · Score: 0

    All you get back is the search results (links), not the actual material. You still have to download that.

  92. Good idea but... by Realistic_Dragon · · Score: 3, Interesting

    It's a shame that with the way the net is going all they will get as search results will be flash heavy sites that take 20 minutes to download on broadband, let along dial up.

    Where did all the sites go that you could use wget -r to grab overnight? How about the odd few that used to offer a .tar.gz for download and offline reading.

    Content over presentation is a concept that needs to be reintroduced to the net, preferably with a stick.

    --
    Beep beep.
  93. Bah to that. A very big Bah! by frovingslosh · · Score: 1
    Is it really necessary to send CDs out for people to install when the Google flagship page is less than 30K?

    Disable graphics and google loads in no time flat. Realistically, if you can't use google with your existing tools then you can't use any links a search engine would get you.

    They are also considering trying to persuade computer sellers in developing countries to install the program on machines."

    Hell with that, if any software should be pre-installed, it should be stuff the bulk of the customers are asking for, not something the developer persuaded the sellers to install. We all know the problems that kind of thing causes, and such bloat would be worse here, since the program is clearly called large and systems for developing countries are likely to cut corners and have small disks.

    If they really want to do something useful, they should build an e-mail based portal. That way someone with e-mail could get anything they want, not just a search, but then submit a link from the search and have the result e-mailed back. Or submit the link for the program (this or any program) and have it e-mailed to you, that way it wouldn't need to be pre-installed and could be easily updated. There are actually still people who have more access to e-mail than to the Web, such an e-mail portal would be of good use, and would better address what this project claims to be trying to address.

    --
    I'm an American. I love this country and the freedoms that we used to have.
    1. Re:Bah to that. A very big Bah! by Lord+Dimwit+Flathead · · Score: 1

      Or submit the link for the program (this or any program) and have it e-mailed to you, that way it wouldn't need to be pre-installed and could be easily updated.

      Ummmmm ... isn't the point of pre-installing the application to remove the necessity of downloading it over a slow connection?

    2. Re:Bah to that. A very big Bah! by Jad+LaFields · · Score: 1
      Disable graphics and google loads in no time flat. Realistically, if you can't use google with your existing tools then you can't use any links a search engine would get you.

      From the article:

      These are then sent back to the computer in Malawi so that they can be stored in the machine's internet cache.

      "Next morning the teacher can connect, download that e-mail and when the students arrive, they can browse through those pages the way they would if they had full internet connectivity," said Prof Amarasinghe.

      They also send the linked pages from the search, compressed and undoubtably with the graphics removed. As the connection is probably unreliable, it can be done overnight when its less likely to be under use and the download can be retried several times if it fails.
      --
      [SIG] It's like putting a moose in the blender -- a recipe for disaster!
    3. Re:Bah to that. A very big Bah! by S.Lemmon · · Score: 1

      Why... why... that's copyrighted material!!! My GOD man, have you no respect for IP laws!

      Seriously, how long before the lawsuit start coming out of the coatrack eh?

  94. Re:Because... by stretch0611 · · Score: 1

    It sounds like this software could help sites from being slashdotted.....

    --
    Looking for a job?
    Want your resume written professionally?
    DON'T USE TUNAREZ!!!
  95. jeez by Anonymous Coward · · Score: 0

    MIT never ceases to amaze me. Their lack of innovation is almost as staggering as the Patent Office. How's the AI coming guys? No seriously, who would honestly pay for this shit? They have way too much money and time on their hands. Do something that's never been done - go study at Berkeley or something. Bunch of over-paid, over-hyped, over-inundated, over-inflated ivory tower loving idiots.

    Why is it that everytime someone at MIT beats-off it makes the front page?
    I remember when they prototyped hardware so you could surf the net wearing a head-rig with LCD glasses. Who the fuck cares? For the last time, we're not impressed with MIT anymore!

    1. Re:jeez by Anonymous Coward · · Score: 0

      yeah, it's like their IQs rolled up too high and went back to 0.

  96. Prior Art ;-) by sICE · · Score: 2, Informative

    I wonder in what it is different from AGORA, Web-To-Email, Gopher, and such services services? If you dont know bout them, you might want to check the Accessing The Internet By E-mail -- Guide to Offline Internet Access and Fravia's "How to search the web" lesson 10.

    Have fun.

  97. Cheaper and Easier by Anonymous Coward · · Score: 0

    Wouldn't it be much cheaper and easier to get those 5 people still using 56k DSL or am I missing something?

    M.D. Inc.

  98. If you stopped wanking you'd have some time. by Anonymous Coward · · Score: 0

    WTF are you doing now that takes up so much of your time? Is it more important than possibly creating the greatest compression scheme yet and reaping the benefits?

  99. Re:Because... by donutello · · Score: 2, Funny

    I can imagine your brainwaves are somewhere along the lines of:

    [_______________________________]


    Ok, you are obviously someone incapable of making a point without resorting to childish insults. In my experience this usually correlates well with inferior mental capacity so I would encourage you to read the below slowly:

    Downloading the contents of 20 pages when one page is the one you're looking for is vastly inefficient.

    Its very likely, that since the target is to use this for information, that the pages would be _highly_ compressed, either reducing image quality or removing many images altogether.


    Ooh! High compression achieved by not downloading fancy images or code! That's absolute genius. We can only download the text and then the user can choose if they want a particular image to be downloaded. Someone should really inform the makers of lynx so they can put this feature in the next version. Maybe the makers of IE and Netscape can have an option where they don't download images by default.

    And since you seem to be particularly dense, I will have to point out that the above paragraph is intended to be sarcasm since that functionality already exists.

    --
    Mmmm.. Donuts
  100. did anything think ALL the way through this? by perimorph · · Score: 1

    If your connection is too slow or unstable to handle Google, how are you going to load the web page your search returns??

  101. We don't need no stinkin' slow net by frovingslosh · · Score: 1

    One hardly needs a search engine to find a slower net. My first ISP, Concentric, certainly had a slower net. They are either gone now or hiding behind a different name, but you can still get a slower net from some providers. AOL users seem amazed when they see other systems using the same modems.

    --
    I'm an American. I love this country and the freedoms that we used to have.
  102. The entire internet for slow connections by Anonymous Coward · · Score: 0

    we'll e-mail you a CD daily with your selections that you make via phone

  103. paper mail? by mikeee · · Score: 1

    Maybe it should have an option to charge a few bucks and airmail you a CD-ROM's worth of the most relevent results....?

  104. [sighs] by ed.han · · Score: 1

    sadly, i had an idea to categorize sites by the dewey system back in the day. of course, once i realized what a "portal" was, that idea died a quick death, as books only have one dewey decimal number.

    the other thing that killed it was that the dewey system only treats of non-fiction and as we all know, most sites are definitely fiction. :>

    ed

  105. Information divide my arse by Anonymous Coward · · Score: 0
    "More and more we are creating an information divide in the world and this can help narrow that divide and have a huge benefit in that sense, " said Professor Saman Amarasinghe of MIT's Laboratory for Computer Science in Boston.


    What utter, socialist crap. The internet is closing the information divide.

    I tend to think too, that until they would better spend their effort trying to get reasonable speed access.

  106. email-based browsing suitable for pagers by rog · · Score: 1
    trancell.com has had email-based browsing and google-searching for years. try it The only problem is that the system has gotten too popular for it's own good and what was originally a 30-second turnaround for email-based google searches is now unlikely to return a response at all. Not that posting this will remedy the problem, but hey - I don't use my blackberry anymore anyway ;-).

    Nice idea, free, useful, wish I still had my blackberry.

    --
    Saving random seed...
  107. Fine tuning by Pettifogger · · Score: 1
    Though this idea has some merit, I don't know how useful it'd really be.

    For me, I usually go through several revisions of my Google (or other) searches before I either hit on what I'm looking for or realize that it's not out there.

    If I had to wait a day for each search query to come back, it'd take me a few weeks to do what I can accomplish in 10 minutes. Yeah, I know it's better than nothing, but "fine tuning" your query is a big part of what makes a search useful.

    --

    IAAL

  108. There's already something like this. by dacarr · · Score: 1
    --
    This sig no verb.
    1. Re:There's already something like this. by Captain+Splendid · · Score: 1

      Damn straight. Google got my vote back when I was on dialup because it didn't take several minutes to return loads of graphics, banner ads, and then, finally, my search results.

      --
      Linux, you magnificent bastard, I read the fucking manual!
  109. Because where else would you put the... by ProfessionalCookie · · Score: 1

    Where else would you hide the Spyware.

    That's cool how by sending the results via email you don't actually have to download the results (what?). Oh wait why would you ever do anything this way if you have Google?!

    It's not like you'll be able to find your searches among all the "Sponsored email" you'll be getting. But hey these things take money.

  110. Why is it so large? by MrScience · · Score: 1

    I mean... it seems like you could whip this out in a 10-20k program (tops).

    --

    You quitting proves that the karma kap worked. The most annoying of the whores shut up. --CmdrTaco

  111. Already been done. by janda · · Score: 1

    The Internet Oracle always provides the best answer(s) to your questions.

    --
    Karma: Food Fight (Mostly affected by Date Plate).
  112. Re:we're not impressed with MIT anymore! by Anonymous Coward · · Score: 0

    Speak for yourself! I was never impressed with MIT.

  113. wap google by MoogMan · · Score: 1

    Well, google.com is pretty fast even for slow connections.

    For *even slower* connections (say wap over a 9600 baud connection), people can always use http://wap.google.com/wml which is an even lighter version of the already-light google!

    Its primarily meant for wap-enabled phones etc, but you can search the entire web (and this is the default option) so there is shouldnt be a big difference to standard google.

  114. when will they learn by Anonymous Coward · · Score: 0

    ...NOT to reply to jokes seriously (this guy actually spent time to get a link!)

  115. GZIP ? by Anonymous Coward · · Score: 0


    So as just about every browser supports it, whats wrong with it ?

    seems like we have a solution looking for a problem again

  116. Please help by Anonymous Coward · · Score: 0

    I have a 1.5 Mbps up / 768 Kbps up redundant connection with fiberoptic guy coming to hook me up to the Public Utility District network. Should I request such a CD from MIT, and would it help me?

  117. UUCP. by Anonymous Coward · · Score: 0

    IBM toaster, very expensive only 5 or so exist in the world. Users submit bread in batch for over-night toasting.

    Anyone here remember how ListServ used to do something like this? Complete with a report on how much CPU was consumed in the search.

    Actually sounds like a "neat thing" for people with laptops and such, when you're going to be disconnected for a time.

    Email is perhaps one of the worst transfer system for binary data, why not use HTTP? post search (select email, jabber ID, ???) -> query ID is emailed/IM'd/etc.. to them, using supplied ID, they download the result.

    If binary data is actually emailed... hmm.. think of the possibilities for sending "gifts" to your err "friends" email boxes.

    You could submit searches from your wireless device and download them later, when you have bandwidth.

    You could share the search result with other people, instead of saying "google for XXX" just point them at a result you've already searched on, saving the CPU of the search engine.

  118. Compression by yintercept · · Score: 1

    I think the only reason for such a program would because the compression used by the modem isn't good enough, but it is an incredibly inefficent means for a small improvement in compression. I suspect that by the time you finished adding the overhead of email, that you wouldn't really get that much more speed. I mean, really, how much more room are you going to be able to squeeze out of MP3s and JPGs.



    I guess you could optimize your time a little better if you had a program that downloaded all of the pages from a particular search request. You could then view all of the downloaded pages on your local machine quickly. However such programs would create a ton of white noise, and would wreak havoc with all dynamic sites.

    This sounds more like a technology without a cause.

  119. Re:Because... by nicky_d · · Score: 1

    As the article states, the plan is to recieve the search query, bring up the most suitable pages, compress them, and mail them back. I'd assume that in general a graphicaly heavy or plugin-dependent page wouldn't be deemed 'suitable', so they'd just be receiving a zip of text pages - hardly a great burden on the line and, at the end of the day, a real improvement for, say, schools. This is technological development at the opposite end of the scale from pop-under web ads; great work, and good work.

  120. Re:Because... by SeanAhern · · Score: 1

    If on the other hand you download the actual contents of the top 20 pages then given how slow your connection is supposed to be, I don't see how you could do that in "only a few minutes".

    You download a compressed version of the contents of the results of the search. HTML pages compress very, very well, so I'd hazard a guess that it's pretty efficient.

    Go read the article. It explains a lot.

  121. efficiency of email by Anonymous Coward · · Score: 0

    Presumably the slow user would have to dial up and download their compressed pages at some point, whether or not it's an attachment. Shouldn't the user be able to download the compressed files using a protocol designed for file transfer?

  122. Grow Your Own by Anonymous Coward · · Score: 0

    Build you own using Google Web APIs, such done by CapeScience :

    "Just email google@capeclear.com and put the text of your query in the "Subject" line. You'll receive your search results via email."

  123. strange new idea. by hatrisc · · Score: 1

    Search the web by mail! have you're results in 4-6 weeks!

    --
    I write code.
  124. spam by Anonymous Coward · · Score: 0

    >> e-mail queries to a central server

    "Three days after its launch, the central server became overloaded with queries for weight loss, porn, and refinancing. Sadly it became less than responsive to even those with 14.4 dialup."

  125. Internet by mail by Smartcowboy · · Score: 3, Informative

    This FAQ explains how to access most of the internet using only a standard email client.

    The above document explain how to access:

    FTP
    ARCHIE (deprecated)
    FTPSEARCH (deprecated)
    GOPHER (deprecated)
    VERONICA (deprecated)
    JUGHEAD (deprecated)
    USENET
    WWW
    WWW SEARCH (using standard search engine like altavista, yahoo or google)
    FINGER
    WHOIS
    [...]

    All these protocols can be accessed via email, according to the FAQ. The FAQ has been around for a long time. This explains why many (most) involved protocols are now deprecated. I used this faq in the early '90 and I don't know how it works now. At the time, it was great. The last update is 2002/04/16.

  126. Copyright Problems by yintercept · · Score: 1
    Instead they are thinking of sending CDs to libraries so that people can borrow and install the software on their machines.

    I guess I should first mention the obvious that putting a bunch of other people's copyrighted work on a CD Rom, is the type of thing that gets people hauled before courts.

    As for compression, if you are using compression on the modem connection, then you don't really save any time by trying to compress the data again. You might save some space, but my experience is that a zip file full of jpgs isn't that much smaller than a directory of compressed images.

    The one area where the program has benefits is in time management. A program like Lotus Notes that replicates data sets over slow connections lets you use idle time on a network for data transfer.

    You don't need to develop new technology here. Just find all the technology that was used to replicate data when phone connections ruled corporate America.

    I could see some benefit to in having a better client side database for web pages. I can't see how the email layer in the application is making any contribution to the effort.

    Unfortunately, to be frank, this particular project sounds more like useless posturing from self righteous grant writers who are wanting to get taxpayers money by simply claiming to be doing something for the poor.

    The World Bank (et. al) have been very good about funding projects that make claims that they will help people, that simply enrich the grant writers.

  127. R...T...F...A... by Anonymous Coward · · Score: 0

    It sends you the actual compressed pages, you don't have to click the links.

  128. Why a program? by dabadab · · Score: 1

    Why do they need a program for this? Of course, it automatically unpacks stuff from the email, etc, but that's not something that you absolutely need.

    I mean, there are already http-over-email services, that do not need any special program, you just send a mail to the service's address with the links to the stuff you want and it sends it back and you could use it perfectly this way: just send a mail with a line like 'hedgehog asia' to mail@search.com and it would send you back a reply with all the pages relating to asian hedgehogs.
    The hassle with sending the CDs seems somewhat unnecessary (and it would require a CD drive and most probably MS Windows).

    --
    Real life is overrated.
  129. Where's the money? by PapaZit · · Score: 2, Interesting

    As admirable as the idea behind this project is, I don't think it'll succeed. In a word: money. The programming and research aren't the problem -- someone's getting a thesis out of this, so MIT'll foot the bill. The problem comes with finding money for maintaining and improving the servers, handling abuse, support, etc.

    It's a service that's only useful for poor third-world schools. Those organizations are probably running on a donated 486. They sure don't have money to pay, or even the money to pay to download ads. Charity-wise, "fund a search engine for poor third-worlders" is somewhat less compelling than "feed a starving child".

    I see this idea living on research and enthusiasm for a year or two then dying a quiet, broke death.

    --
    Forward, retransmit, or republish anything I say here. Just don't misquote me.
  130. Google = Media Control. by Fantastic+Lad · · Score: 1
    No. Seriously.

    There are a Lot Of Sites which the lords of Google have cut from their search engine. And I'm not talking porn. (The proliferation of sex-obsession is actually encouraged due to its weakening effect upon individuals.) I'm talking about anything which holds any real weight and thereby pisses off the wrong people. --Or anybody who needs to be punished, are chopped from what has become the Internet's de-facto public eye. (Google.)

    Easily done, too. Make a remarkable product. (Google was amazing when it began). Then corporatize it and hand over the strings to the bad men.

    This isn't a joke. There are plenty of examples. My own websites are among them; won't list them here, because I can't afford the backlash of losing anonymity, but on the 'lesser' search engines they show up fine and dandy at the tops of the lists. The same was true on Google, until I crossed a couple of lines back when Iraq was taken by Bush and his gang of psychopaths. Google has since chopped ALL my sites despite the fact that they have nothing to do with my personal political views and various beliefs about how reality works.

    There IS a war be waged to suppress human awareness. Those of us who have the balls to try to lift the curtain for others to see DO take damage.

    Anyway, people should use All The Web if they want a search engine with teeth.


    -FL

  131. You're not getting it by TrekkieGod · · Score: 1
    The grandparent had a point, that you just didn't get. Say you're on that one computer and phone line. You need to search for information about turing machines...So you can either go to google and choose the ONE page that has information you want from the list. Or you can do this mail thing and get their selected "10 most relevant pages", and download all of them. And then, you have no guarantee that's even what you want, since somebody else selected what's relevant.

    Even if what you want is among those 10, you had to tie up that phone line for at least 5x as long.

    --

    Warning: Opinions known to be heavily biased.

    1. Re:You're not getting it by anttik · · Score: 0

      I think it's actually you who didn't get my point.

      There could be many students, say 20, who wants to find and browse the results. They will give their query to their teacher and he'll send the queries all at once, thus saving a lot of time. Next day everything can be downloaded at once and then you can go offline and then there's no problem with the limited online time.

      Also remember that it's most likely that these people are not as fast browsing the Internet as we are. It's not like they're browsing pr0n all day long.

  132. Sun's Java Web Services Development Pack... by PHAEDRU5 · · Score: 1

    ...has an example of an SMTP-to-HTTP bridge.

    Google and Amazon have published web APIs.

    I see instant low-speed connectivity to two of the web's two big consumer apps.

    --
    668: Neighbour of the Beast
  133. Oops about CDs in libraries by yintercept · · Score: 1

    Oops, sorry I was thrown by the last header "CDs in libraries." I thought the article said that they were putting the result sets in the library...not just the program. Mechanized ways of publishing result sets gets people in trouble.

    The article still seems fishy because a great deal of the TCP/IP infrastructure has been optimized to handle the problems of slow connections, precompressing data isn't more efficient than compression in transit. There are as many IP collisions when sending e-mail as with http requests.

    Having a program that crawls the web and sends a lot of pages for a search engine request will increase, not decrease download times, but I misread the bottom section and apologize.

    1. Re:Oops about CDs in libraries by GigsVT · · Score: 1

      TCP/IP doesn't handle high latency hardly at all though.

      If it did, my satellite router wouldn't need to run windows.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
    2. Re:Oops about CDs in libraries by Anonymous Coward · · Score: 0

      I don't see how the article could confuse you. Maybe you only read the Slashdot story and never read the article. That would be very funny, since the comment you replied to has the subject, "RTFA" and started "For those of you wondering why someone would do this, how about reading the damn article?"

  134. Re:Because... by indros13 · · Score: 2, Insightful
    I'm from 'western' civilization and have no idea how these things work in less privileged places. describes 95% of the American population. I'm not sure why you should be surprised that I didn't understand the need for low bandwidth services.

    To "redeem myself," I'd like to make two points:
    1. I was aiming for amusing with the Google thing. I decided to tack on the "real question" because I'm honest about my ignorance of the topic.
    2. In what way will this search function highlight the control of relevance algorithms over the kind of knowledge folks using this search process will get? In a higher bandwidth society, I have the freedom to check out numerous searches and continually refine my search strings to find the best information. Folks using this service, however, will not be able to do so as readily.
    3. I lied, third point. Ultimately, this just continues to reinforce the hierarchy of post-industrial nations over developing ones by giving them a quick fix for a dearth of wealth in the Information Age. For anyone to compete globally in knowledge or business, they have to have substantial information in a timely fashion. This provides neither, and while an admirable stopgap measure, fails to address the root problem.

    --
    Under capitalism man exploits man. Under communism it's the other way around.
  135. Ain't modern technology great? by sohp · · Score: 1

    What's next? TCP/IP over carrier pigeons?

    +++ATH0

    NO CARRIER

    1. Re:Ain't modern technology great? by thebigmacd · · Score: 1

      But since we are using AMSSB and FM like the rest of the civilized world, we can just supply our own synthetic carrier and no data is lost ;)

  136. MIT does have too much time on it's hands.. by zanthas · · Score: 0

    First a search engine for slow networks, now robotic snails http://www.theregister.co.uk/content/28/31783.html :-)

  137. What's the hard part? by Moeses · · Score: 1

    I'm curious about what the hard part of writing this program is. I'm a competent developer and I could put together a rough demo in a few days (times 2 for standard estimate buffering).

    I'm sure there are good reasons, such as they are writing their own search engine tailored to this particular need, but using existing tools this could be cobbled together in short time.

    We have a server. One program writes all emails sent to a certain email address to a certain directory. This program is probably already written and open source. We have another program running on the server, this one monitoring the directory. Each email contains a query which can be run against google using their provided API making SOAP calls. The results are returned and used to create a list of URLs. Each URL is then crawled to a set depth (other more complex factors could be substituted later), written to it's own directory (or stuffed in a db), including all image files, etc. We now have a set of directories correlated to a query.

    Now we just have to get the results back to the user. A third program runs through each set of directories, it will parse the HTML to update links (including images, etc.) to how they will be seen locally by the end user, strip out unneeded stuff, perhaps replace the existing images with more compressed versions (there are plenty of existing libraries to help here), and then the whole set of directories can be compressed using a standard compression scheme such as gz or zip.

    There's your demo. So what's the challenging part of improving this? Perhaps making sure the user gets the search results they want? I'm used to rapid fire searching, repeatedly zeroing in on what I'm looking for. That would take days for these users, so perhaps there is an intelligent agent in the client program that prompts the user in ways that would hopefully improve their results.

    Perhaps the search engine component may rank pages differently than google based on information density or the correlation of information contained in the pages crawled from the original search result (so that one is more likely to get results where many pages talk in depth about the same subject).

    Perhaps they are developing their own compression technique specific to this task. Since crawled pages will probably contain the same headers, nav, etc, they may put in a custom stage where special hooks and codes to represent that before going into a general compression stage. The client program would know how to assemble this and would usually achieve better compression than using a general approach alone.

    I'd like to see how what MIT is working differs from the rather rough demo I've sketched out here.

    1. Re:What's the hard part? by BillThies · · Score: 1
      As a graduate student working on the TEK project, I can try to answer some of your questions.

      In terms of the current version of the software, you're right -- it's mostly a lot of engineering. To get things working, we put together known techniques and tools into a single package.

      However, there are two big pieces that are missing in your description. One is the client, which is a web proxy that simulates a web connection and interacts with the user. It's about 10,000 lines of Java code, plus 5,000 lines of HTML. The other aspect is that the server actually keeps track of all the pages that have been sent to a given client, to avoid sending duplicate content in the future. For this, we use a database, which complicates parts of the server code; in all, the server comes out at 16,000 lines of Java right now.

      So, I'd be impressed if you could write this in two days. However, if you're interested, we could use your help improving the software! We're currently moving our CVS source tree to SourceForge (under project "TEK").

      In reality, the hardest problems have been dealing with OS-level issues. How do you send and receive mail in a general way? We're currently using SMTP to send, and attachments to receive. But this has some limitations... we experimented at length with the MAPI interface and other approaches, but always had some roadblocks.

      Also, there are a number of "hard" research questions that we think you can consider once the basic infrastructure is in place. For example, what is the best procedure to select pages with a high "information density"? How does the selection procedure interact with the geographical and demographic characteristics of a given client? How can you gather as much information as possible at the client side to make the search successful? These are the kinds of things we'd like to consider eventually.

      Many more details about the project are available on the TEK Homepage.

  138. Collab and Shameless plug by RedHat+Rocky · · Score: 1

    Perhaps these folks would also like to get involved with this project.

    We've (http://www.vh.org/) been approached by the widernet group to provide some content, issues such as updating the information and copyright abuse crop up right away.

    Long term, I think it's a better idea to focus on getting better connectivity then band aiding the problem with solutions such as these. Notice no one addresses the langauge barrier problem, I guess all interested parties are assumed to speak English?

    Tough problem, no easy solution.

    --
    Anything is possible given time and money.
  139. from MIT so must be good department... by Anonymous Coward · · Score: 0

    Really this idea sucks.. using a 1kb dialup modem
    I can get results from google, etc..

    And the 'too big to download' issue.. hahaha.. bwhahah

    Fools

  140. Re:Because... by Anonymous Coward · · Score: 0
    HEY FUCKHEAD!
    R T F A !
  141. Back in the old days... by mikeage · · Score: 1

    you might remember agora and getweb servers. Web pages by email, formatted via lynx. _Very_ cool for those who may have only had a mail connection (either a BBS->net, or, like one place I was at for a while, UUCP for mail downloading).

    --
    -- Is "Sig" copyrighted by www.sig.com?
  142. This is barely better than GoogleMail by Zastrossi · · Score: 1

    Big deal, this is barely a step up from this: http://www.capescience.com/google/

  143. Supported by IBM? by G4from128k · · Score: 1

    Sounds like the project must be a bunch of IBMers who are nostagic for ye good olde days of batch. Perhaps they will also create special perforated cards for submitting these searches.

    --
    Two wrongs don't make a right, but three lefts do.
  144. There's this really cool site called "Yahoo!" by ScumSucker · · Score: 1
    It's modded funny, but there is a good idea deep in there. I've basically been thinking about this idea for years. Not quite like a card catalog system as he described, but more like an ever changing directory.

    Um, this is what Yahoo! is. Perhaphs you've heard of it? Also, what is Google Directory if not "an ever changing directory"?

    1. Re:There's this really cool site called "Yahoo!" by nsideops · · Score: 1

      "All of the sites that attempt to do this now are very limited and usually laid out quite poorly"

      Maybe I didn't give quite as good of an explination of what I'm looking for, but these systems you mentioned, although better than most, still lack that feel of having the entire net at your fingertips in an easliy laid out manner. It's better than just guessing at a search enging entry sometimes, but not quite as functionaly as what I would hope for. Maybe I should get to work on laying out just exactlly what I'm wanting, but then again, I spend all day at work on slashdot, so what time do I have? :)

      --
      Teach someone to use the net and they won't bother you for weeks; show them Slashdot and you may never see them again.
    2. Re:There's this really cool site called "Yahoo!" by stoborrobots · · Score: 1

      Or you could describe your ideal solution on a webpage, get it slashdotted, then cross your fingers and hope someone takes the bait, and builds your dream...

      No, seriously...

      What I mean is, what is your idea? Can anyone here help?

      This is a site where a vast amount of open-friendly development effort is available, if they see a project they consider worthwile...

      Tell us about it...

    3. Re:There's this really cool site called "Yahoo!" by notsoanonymouscoward · · Score: 1

      Yahoo isn't ever changing. Its pretty static. What you want is netnose. Its sort of a hybrid search engine / directory. Data is processed like yahoo or dmoz, where content is rated/ranked by human editors, but theres no paid editors, and no signup/registration... so anyone can do it. Yet the "catagories" aka keywords, are more dynamic and without a heirarchy. The most important thing though, is that we rate the relative significance of the keywords to the content, not just rating the content itself. We aren't as concerned with how good content is... we just want people to be able to find it.

      --
      I ate my sig.
  145. latency by yintercept · · Score: 1

    You are correct, email handles long latency and disconnected lines better the HTTP. What happened in the US is that companies made a tremendous effort to invest in technologies to handle the latency problem, but faster internet came around before there was a postive return on the investment.

    The third world might benefit from using database replication technologies.

    Ultimately, however, the real gains will come from building the true infrastructure in the third world. For example, an area with bad telephone service would do better to start with higher speed wireless networks.

    1. Re:latency by GigsVT · · Score: 1

      You have a valid, and probably more relevant, point, but I was referring to TCP/IP itself. Things start to crumble when latencies approach 1 second. I don't know the exact technical reason, but I think the retries start to avalanche.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
  146. I must have missed something there.. by Scooter · · Score: 1

    As I used a modem for years and had no problem with regular search engines.. What do they mean by "slow" connections? 9600bps? Carrier pigeon?

  147. What about those that don't have computers: by PPGMD · · Score: 1
    I guess we could use snail mail:

    Dear Google,

    Can you please me the Google Image Search for kittens please?

    Sincerely,
    Computer Less

    1. Re:What about those that don't have computers: by Anonymous Coward · · Score: 0

      Dear Computer Less,
      All kittens have been killed.
      Sincerely,
      The allmighty GOD (of photoshop)
      Fark account number: 2688

  148. Yes it would by BillThies · · Score: 2, Interesting

    Hi, I'm a Ph.D. student working on the TEK Project. TEK does send the content of pages, not just links (although it also allows you to retrieve individual links, if desired). This allows you to get information back in a single query. TEK stores all returned results in a local cache on the client machine, so that users can search through the pages and refer to them at a later date. The software provides a local search utility that allows you to peruse previous results with a standard web browser; you do not need to keep the emails that are returned from the TEK Server. We hope that this is useful not just for taking a snapshot of a given page, but also for averting future searches if some content has already been downloaded before. More details are available on the TEK website: http://tek.sourceforge.net/

  149. For more information on TEK by BillThies · · Score: 1
    The BBC article was just an introduction to the TEK project. For more information, including our publications, please see the TEK page:

    http://tek.sourceforge.net

    We are also in the process of migrating our CVS source tree to SourceForge.

  150. Content is sent, too by BillThies · · Score: 1
    Hi, I'm a Ph.D. student working on the TEK project.

    In fact, TEK does send the content of the pages it finds. In this sense, "search engine" is a bit of a misnomer -- it's more of an "information delivery tool".

    Many more details about TEK are available from the TEK Homepage

  151. Simpler solution by Guspaz · · Score: 1

    Write a PHP script to do a query to Google, strip out the images, gzip the content, and feed it to a user.

    Sure, gzip may not be as good as these compressed emails, but hey, gzip gets a google search page (for "linux") down from 23,859 bytes to only 5,724 bytes.

    Now, even a 14.4k modem would download that in (theoretically) 3.2 seconds. They don't get much slower than 14.4k modems. But even if you insisted on a 300 baud modem, that's still only two and a half minutes to download!

    Now, no matter how slow the connection, I would bet dollars to donuts it's faster than 300 baud even in a third world country. In other words, standard gzipped webpages are good enough, no need for silly programs distributed on CDs. (This is besides the fact that I can't figure out why a simple 4KB program to send/receive an email would require an entire CD)

    1. Re:Simpler solution by Anonymous Coward · · Score: 0

      Or simply use mod_gzip for on the fly compression

    2. Re:Simpler solution by Guspaz · · Score: 1

      That's what gave me the idea, but we're talking about compressing Google, which not only doesn't use mod_gzip, it doesn't use Apache entirely.

      Sure, it would be smart of Google to enable gzip compression in their custom web server, but on the other hand, with that many hits the CPU hit would be enormous.

  152. RTFA: Microsoft Bob can help... by iendedi · · Score: 1
    This doesn't make any sense to me. I'm on 28.8, and 20 results from Google still come up instantly.

    It would make alot more sense to you if your read the article.

    What these guys are doing is actually pretty cool and makes alot of sense. Here, I'll walk you through it:
    Bob, a poor Internet user in a third-world country wants to research new, cheaper ways to clone copyrighted software for sale on his corner. We'll call this guy Microsoft Bob for now, since that't the kind of stuff that Bob sells.

    Since bandwidth is very expensive, Microsoft Bob uses this new search tool from MIT and enters a variety of topics that he is interested in (e.g. "Microsoft Office key cracks", "Microsoft Authenticity Symbol Forgery", ...). He can do this offline and only needs to pay for a quick burst from the local Internet cafe to the server in Boston.

    An hour later, Microsoft Bob checks his email and downloads a compressed folder that was prepared for him in Boston. It takes a couple minutes to download, but that's okay. Bob just saved 3 hours of online search time, and that equates to like 5 copies of MS Office that he doesn't have to sell on the corner. So already, this program is proving more useful than Google at preventing software piracy!

    Microsoft Bob unpacks the folder, clicks on the index and can now surf around in the local folder through all of the pages that the search engine found that may be of interests. And since MIT researchers love AI so much, it's probably a pretty good selection of on-topic material.
    Go get 'em Bob!

    Dude! Get a faster modem. A 56k modem is like, what, $15?
    --

    It is your personal duty to fight for what is right on a daily basis. Ignoring injustice is identical to approving
  153. Clarification about funding for TEK by BillThies · · Score: 2, Insightful
    ... sounds more like useless posturing from self righteous grant writers who are wanting to get taxpayers money by simply claiming to be doing something for the poor.

    I am a graduate student working on the TEK project, and we have never received funding directed for TEK. So far, the project has been carried out using general research funds (for example, a faculty startup package) available to the PI. We have been operating with a very low budget, mainly with undergraduate students. One of the researchers, Libby Levison, worked on TEK for an entire year without receiving any pay. Most of us also work on unrelated projects that are funded separately.

    As a policy, we have never applied for funding from any organization where we will be in competition with developing nations for the same dollar. For the record, we submitted a proposal to the NSF ITR program that covers TEK, but the proposal was rejected.

  154. Good low bandwidth solution. by Anonymous Coward · · Score: 0


    But it's good to look at other things including using lynx or another text only browser.

  155. Re:RTFA: Microsoft Bob can help... by Vulpine · · Score: 1

    Dude! Get a faster modem. A 56k modem is like, what, $15? There are a lot of places with shitty phone lines where you'd be lucky to get 28.8 no matter how fast your traditional modem.

    --
    -- 'As it all washes away you know -- as it all is one, no one is alone.' -Cosmic Disorder
  156. Frequently Asked Questions by BillThies · · Score: 1
    I am a graduate student working on the TEK project, and I would like to clarify some of the issues raised in this discussion. Some brief answers to frequently asked questions are below; for full details, including publications and the software itself, please see the TEK Homepage

    Q: Who are the intended users, and how slow is their connection?
    A: The primary targets are communities where Internet access is expensive, unreliable, or completely unavailable. In developing nations, an email account is often significantly cheaper than full-fledged web access; for a few examples, see our last paper. Moreover, there are many cases where connectivity is intermittent, and it is cheaper and more reliable to send files in a batch mode during off-peak hours. Regardless of the modem speed, users in developing regions are often plagued by long latencies and low bandwidths due to congested infrastructure and inter-continental links. Many such users have expressed a lot of excitement about the TEK system.

    Q: But your server takes 24 hours to reply. How will that speed things up?
    A: Actually, our server replies immediately to each query, and processing takes less than a minute. The one-day wait in the article is just an example scenario that accounts for possible delays in the local network, as well as the night-time usage model.

    Q: Still, how does this make web access more affordable?
    A: The TEK system shortens the expensive connection time because it makes browsing an offline process. A set of pages can be downloaded from a local ISP during the cheapest and most reliable hours; users never have to pay for online time spent reading pages or waiting out inter-continental communication latencies. Moreover, the client-side cache of downloaded pages and the intelligent server processing could eliminate some searches altogether (see below).

    Q: Google is fast, low-bandwidth, and even has an email interface. What's new here?
    A: The TEK system is not really a "search engine"; rather, it is an end-to-end information retrieval tool with both a client and a server. In fact, the TEK Server queries Google for its candidate pages. The value added by the server is that it keeps track of the pages sent to each user, and avoids sending duplicate pages in future search results (unless, of course, a user requests an updated version of a page.) This ensures that the client's bandwidth will be used only to download material that is new and interesting. Note that the server also sends the actual content of pages rather than just a list of links; it does some basic filtering and compression of the content to reduce the bandwidth requirements.

    Q: Why do you need a program on the client side?
    A: The TEK Client is a very important component of the system. It provides a web proxy that simulates an Internet connection so that users can view downloaded pages in their favorite browser. In addition, the proxy stores all pages in a local cache so that they can be searched and viewed at a later time. It also provides basic user management and query tracking so that many people can share a common machine and email account, perhaps on a public kiosk or school computer.

    Q: Why is the client program so big as to require a CD?
    A: The program itself is relatively small; the JAR file is 125 KB. When we add in third-party libraries and the installer package, the size is up to 2 MB. Including Java in the installer bumps the size to 10 MB. We implemented the first version of the TEK Client in Java for portability and ease of development, though we agree that a more compact distribution is possible, and we could be interested in exploring this in the future.

    Q: Do you intend this as a permanent solution for low-connectivity areas?
    A: No. In the long term, there needs to be

  157. This is not new by Anonymous Coward · · Score: 0

    It's actually possible to access almost everything through email...

    http://www.faqs.org/faqs/internet-services/acces s- via-email/

  158. Don't forget the updated RFC by Anonymous Coward · · Score: 0

    That was a 1990 RFC. It was updated in 1999. The more current "IP over Avian Carriers with Quality of Service" RFC can be found here:

    ftp://ftp.rfc-editor.org/in-notes/rfc2549.txt

  159. Re:Because... by Fungii · · Score: 1

    Pot Kettle Black.

  160. This was how I first got "internet access"... by vidarh · · Score: 1
    I dialled into a local BBS that exchanged e-mail and news via UUCP four times a day. Thanks to various web, ftp and gopher to e-mail services, I was "surfing" with a 6-12 hour delay between request and reply... That was back in '93. A year later I finally got "proper" dialup access to an ISP running Linux 1.0.something...

    The good old days :)

  161. Frequently Asked Questions (cont'd) by BillThies · · Score: 1
    On my screen, the end of the first posting is cut off without a link. Here's the rest of it again, just in case:

    Q: Do you intend this as a permanent solution for low-connectivity areas?
    A: No. In the long term, there needs to be better communications infrastructure in developing regions. This system provides an interim solution for delivering much-needed information. It also serves as a stepping-stone to full connectivity, as it simulates a web connection from the client machine. Once the infrastructure is available, many rural users will have developed familiarity with browsers, web pages, and search engines.

    Q: How is the project funded?
    A: We have never received funding directed for TEK. The project has been carried out on a low budget, using mostly undergraduates and funded by general research funds (e.g., a faculty startup package). One of the researchers, Libby Levison, worked on the project for an entire year without receiving any pay. As a policy, we have never applied for funding from any organization where we will be in competition with developing nations for the same dollar.

    Q: Do you need any help with the project?
    A: Yes! We are currently moving our CVS source tree to SourceForge, where we welcome open-source developers. We can also use help deploying the software in low-connectivity communities -- please find our contact information on our website. Thank you.

  162. If I understand... by Anonymous Coward · · Score: 0

    If I understand they invented this :

    ssh remoteserver 'cd /tmp/; mkdir dl; cd dl; wget http://www.google.com; cd ..; tar cvzf file.tar.gz dl/*; cat file.tar.gz | uuencode file.tar.gz | mail -s 'Your search' my@mail.domain'

    Tune wget URL, recursivity and you are ok ;-)

    ***Think*** I will patent this beautiful thing !

  163. Nothing new by coldcity · · Score: 1

    Is this not just another HTTP over UUCP?

    It used to be quite common to surf in this way, or use a similar technique to request a file from an FTP server which would then mail it to you, the rationale being that you were likely to be able to get a faster connection onto your mailserver than onto the hosting server.

    --
    coldcity
    code, life, art
  164. Thirdworld by khalido · · Score: 1
    In Pakistan, and I suspect most of the third world, ISP's have way more users than bandwidth. The most common connection is a 33.6K, as the obsolete phone lines around here don't support 56K. Now with a good ISP, this is fast enough to browse the web. However, with most ISP's you get far less than 33K. To get a idea of how slow it is, people often type in their password for hotmail [for some reason its very popular here], go do something, come back click on a email, go make coffee, dust the house, etc. I am not exaggerating here.

    Email is the one thing which is fast as when downloading an email you are accesing a local server which operates at the full speed of your connection. So email searches [and or content via email] is a brilliant idea. Pakistan's entire bandwidth is less than a typical college lab's in one of the better uni's in the US, and till that gets better things aren't going to get better soon. The internet divide is very much here, and the more u can afford the faster u can browse, with the majority not being able to afford any access.