New Breed Of Web Accelerators Actually Work
axlrosen writes "Web accelerators first came around years ago, and they didn't live up to the hype. Now TV commercials are advertising accelerators that speed up your dial-up connection by up to 5 times, they say. AOL and EarthLink throw them in for free; some ISPs charge a monthly fee. Tests by
PC World, PC Magazine and CNET show that they do speed up your surfing quite a bit. They work by using improved compression and caching. The downside is they don't help streaming video or audio." And they require non-Free software on the client's end, too.
Unfortunately, these caches store only the most accessed pages, so anything of any value to the Slashdot audience will be as slow as ever. But you can be sure your porn will be delivered at 5x the speed of your normal dial-up! (yawn)
Show your love for the Hacker community
HackerLogo.com
"Web accelerators"...You mean highly-advanced technology like mod-gzip?
There are a huge number of yeast infections in this county. Probably because we're downriver from the bread factory.
They require non-Free software?
Well, why don't you go ahead and write some Free software to accomplish the same thing?
My GameCube requires non-Free software too.
Wahhhh
I don't need no instructions to know how to rock!!!!
get broadband. This will definitely help places that still don't have broadband. But, if broadband is available, it's a no-brainer. I'd rather spend a few bucks more and get broadband, rather than be stuck with some kind of software that may or may not speed up the access depending on what it is.
Why so many content providers aren't using gzip compression? The cpu time required is MUCH cheaper than the bandwidth, AND it makes users happiers because they get it faster. Oh, and it's free (for Apache anyway) and easy to set up. It even works with 99% of browsers these days.
And they require non-Free software on the client's end, too.
And I'll just bet that none of that software includes any popups, spyware or intrusive monitoring!
This seems like a really niche market nowadays. Not _too_ many people that need fast internet, that could use this, don't have broadband availible. The one key thing is price, which is even starting to get iffy.
/. or pr0n that much fast, it works, tell me if I am wrong, but I am seeing a small market for this much hype
Something like $10-20 monthly for "speedy" earhtlink dial-up, or an extra $10-20 slapped on my monthly cable bill for broadband? (Charter Communications, they suck anyways)
I guess if you need to read
Error 407 - No creative sig found
They actually work?
Next thing they find out is the new generation of penis enlargement devices actually work, too...
I don't need a signature.
OMG, not that! I know this won't get much play here, but I don't care if it's free or not as long as it works. I use the Free software that I do because it is better than Fee software, not because it is free. Shame on me for not being an ideologue.
It's just the old tradeoff between CPU power (decompression time) and bandwidth usage (download time). Much easier (and more smartly) implemented on the server side with something like mod_gzip, like HungWeiLo said.
And graphic compression's been done before too, since around AOL 3.0 or so. Most people turn it off because it makes pages look like crap.
Snake oil that works? What do you even call something like that?
My former company was checking out NetAccelerator recently to resell to our clients.
These things are a joke. The primary performance increase comes from recompressing images into really nasty JPEGs. AOL was doing this years ago (and getting blasted for it). If you turn that off, the performance improvement is not even measurable.
Furthermore, you tend to get a lot of stale caches on your machine. Most browsers don't even get this right, so they add yet another layer of potentially buggy cache abstraction.
No, these things are junk. They act as proxy servers and their source is closed. How can you trust them to handle your data? Even with all their compression features turned on, the performance improvement is seriously overrated. Don't bother. You simply cannot get something for nothing in cases like these.
Now, what would improve the download speed of the web is if web designers would start building standards compliant markup. Many web sites have as much as 700kb overhead in markup from tools that create loads of font tags and their ilk. Pure XHTML + CSS layout would do a hell of a lot more to speed up the web than these scams. Of course, don't take my word for it--read Zeldman.
Join Tor today!
They compress the packets of data. Where will this help? In compressible places that aren't already compressed. Such as the HTML markup for webpages. This wont help already compressed JPGs, or already compressed MP3s or already compressed ZIP/GZIP files or already compressed videos (MPG/AVI/ASF). So is this really going to help much? Sure, there is always going to be a small percent of space (and therefore time) saved even transferring these formats. Is it going to make a 5X difference? No. Is it going to make a noticeable difference? It's unlikely but possible. The only way this "new technology" is going to help is if you are a dialup user without broadband options.
rproxy is a really interesting project, and back when I tried it over a 56K dial-up connection, it did actually work to speed things up. You sit an rproxy web cache at each end of the dial-up connection (so you need somewhere to deply your custom proxy to make it work, but bear with me...) and then request web pages as usual. Each end caches the pages that pass through it, but the clever part is that when you re-request a page, the proxy at the far end (on the fast connection) can fetch the page and compare with the last copy in the cache. Then it transmits only the differences using the rsync algorithm. Unforunately it's not being actively developed any more given the increasing availability of high-bandwidth connections, and the decreasing fraction of web traffic that is suitable for delta-compression. Shame, since it did seem to be a real "web accelerator" without any of the illusory techniques used by the garish banner-ad accelerators.
Matthew @ Bytemark Hosting
on
dialup.... ...
you....
insensitive
clod!
And my connection is wheezing just trying to post this!
mod_gzip is manna from heaven
I turned mine off by accident once and got a phone call from the co-lo wanting to know why I was suddenly maxing out.
gotta love that 70% saving.
There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
ISPs could simply put some squid caches between the net and their dial-up banks. Turn mod_gzip on and you'll accomplish a lot of the same thing.
.diffs came across. :-)
Instead of having to traverse the Internet, with all the associated latency, pages are pulled locally - 1 hop away. Pages are also compressed.
A better way would be to figure out how to transfer pages via CVS, so only
Learning HOW to think is more important than learning WHAT to think.
GIFs, JPEGs, MPEGs, and MP3s are already compressed, so compression doesn't make them any smaller. That really leaves only HTTP, HTML and CSS to benefit from compression. And caching only helps if you're in the habit of looking at the same pages multiple times... so where's the benefit for the average porn-downloading, RIAA-infringing geek? Does it speculatively preread links before I click on them?
"Freedom means freedom for everybody" -- Dick Cheney
You can get the same thing you looked at yesterday 5x faster!!
Caching and compression will only get you so far before lossiness (sp?) kicks in and you start getting garbage, or caching works so well you get the same page every time you load it.
Get on the bandwagon and chip the money for Broadband if you're looking to boost your speeds. If you can't get a/v any faster, really, what's the point?
Low bandwidth main pages becoming less and less prevalent so it's not going to do you much anyway, plus, you're still paying for it...
Okay, so GIFs, JPGs, streaming video, ZIPs, and compressed .EXE installers are all already compressed near to their thoretical limits.
[Puts cynics hat on]
The vendors mentioned in the PCWorld article seem to be treading dangerously close to copyright infringement by compressing other people's content on their servers to be pulled through their browser proxy.
NetZero and Earthlink apparently force you to use their proprietary internet-access layer, so how are we sure their extra-cost "Super" speed isn't just normal internet speed, and their "Base" speed isn't just slowed down by the interface layer?
[Takes cynics hat off]
The only thing here that seems like it would be genuinely useful is HTML compression... surely there is/will-be an Open Source solution for this. Maybe a new MIME type, e.g. text/html.compressed? Then it could be implemented on both the browser and server side, and this would have far greater impact. This could be implemented either in the browser itself or in a lightweight proxy like Proxomitron. Anyone? Anyone?
~ Whence do you come, slayer of men, or where are you going, conqueror of space?
OK -- now you'vegot my attention. Like, um... just how nasty?
Roving Web-Teleoperated Robot
Caching webpages in a proxy is something all ISPs do. The downside is that whenever I've used an ISPs squid proxy, it slowed things down! Turning proxies off almost invariably helps speeds, in stead of hurting them. Plus, if the proxy goes down, you can still use the web. I have no idea why ISP's proxies are so craptastic (YMMV), but in my experience, they are. (BTW, it would help if windowsupdate was cacheable..)
;-) Not such a good approach for webbrowsing btw.
.ART format when you used their client? Do they still?
Compression.. Now there's something! I have in the past used an ssh tunnel (with compression switched on) to my university's web proxy, and that sped up things quite a bit! Why isn't this switched on by default on my PPPoA connection? Doesn't apache handle gzip'ing these days? Doesn't seem to be used much, though.. This speed up might be less pronounced on dial-up links though, because POTS modems usually switch on compression anyway (again YMMV).
Some download accelerators simply download different chunks of the same file in multiple sessions from either one server (shouldn't matter - unless with roundrobin DNS) or even from mirrors (better!). That's quite effective as well, but we know this, and that's why we use bittorrent for big files, don't we?
But it has to be said.. Most download accelerators are just bloaty spyware and don't do *zilch* to help your download speed.. Feh!
Didn't AOL use to convert GIF graphics to their own, lossy,
SCO employee? Check out the bounty
You admit that a $200,000 setup fee isn't "a few bucks more." Thank you; most people miss this.
But what about people who are so mobile that they need to be able to jack in and access the Internet from any of several locations, and they can't afford the price of a broadband subscription for each location? I was in just that situation for four years. Dial-up has the advantage of a last mile in almost every home in the States, brought to you by the Universal Service Tax, meaning that no matter whose house I was visiting, I could always plug my laptop into the wall and dial my Verizon Online account.
Will I retire or break 10K?
WOW, a webcache and real-time compression!
.v42Bis compression have only been able to do that for, what, nearly a decade?
My browser and my modem with
Man is the animal that laughs.
And occasionally whores for Karma.
I worked at a local ISP who managed to get a demo for a cache server a while back. (I don't anymore.) The machine arrived. We plugged it in, and started to take tech calls.
Basically, it proxied all requests through that ISP on port 80. If it found a request to an IP or sitename it had visited before, it tried to serve it out of cache. If it didn't, it proxied the result through and returned the results from the requested IP or sitename.
The problems:
The server had a difficult time with virtual hosting of any kind. About 4 out of 5 requests to a virtual host would go through. About 20% of the time, there was some critical piece of information that the cache server would mangle so that the vhost mechanism would be unable to serve the right data. This was a couple years ago, so bugfixes might have happened. Maybe.
The server definitely had a hard time with dynamic content that wasn't built with a GET url (thus triggering the pass-thru proxy). If the request was posted, encrypted, hashed, or referenced a server side directive of some kind (server-side redirects were a nasty) the cache would fail. A server side link equating something like "http://www.server.net/redirect/" to a generated URL or dynamic content of some kind was the most frequent case we rean into with this. The server simply couldn't parse each and every http request or every variety and try to decide if it should pass-thru or not. I can't think of a logical way around this that wouldn't break any given implimentation. Can you?
We used dynamically assigned IPs at the time, so proxy requests made from one PC were often returned erroneously to another assuming the IP changed between usage. Say a modem hangup, etc. This was a rare event, but I listened to at least one person complaining that he was getting someone else's Hotmail. The fix to this is either to blacklist sites from being cached-- infeasible for every site that could possibly be requested-- or assign static IPs. DHCP broadband users may have similar problems, especially for those who have new IPs every so often.
Finally, if something got corrpted on the cache server due to disk error, stalled transfer, or some other reason, the sever had little or no way to throw out the bad data. It would throw out data that it *knew* was corrupt due to unfinished downloads, etc... , but often times this check failed or data was assumed to be correct even when it wasn't. Everyone who requested the same piece of corrupt data got it. I had to answer this statement a few times. "I downloaded it on one computer connected to your ISP and got a bad download. I downloaded it on my other computer from the same ISP and got the same bad download. Then I connected to another ISP from the first computer and got a complete download. What's up wit' dat, yo?"
Cache servers are a bad idea. The very idea is to try to be an end-all be-all to everyone who uses them. There are bug-fixes to some of the problem, but no way to solve the essential problem of the fact that MOST data on the web is dynamic now. Using cache servers with dynamic data is inviting difficulty and problem.
The next Slashdot story will be ready soon, but subscribers can beat the rush and slashdot the links early!
There are two free software projects building web accelerator proxies. One is RabbIT . The other is ziproxy. They are both web proxies which do not require any special software on the client side. They both compress HTML by gzip, and compress images into lower quality JPEG's. RabbIT is written in JAVA whereas ziproxy is written in C. RabbIT has more features than ziproxy, such as caching and removing ads. Give them a try if you're using a slow line! Disclaimer: I'm a ziproxy user and developer.
If they're so good, why isn't this first post?!
Netzero offered this a while ago (maybe they still do). Basically it does speed up loading of pages greatly, however there is a drawback, and a big one at that, the pictures look like crap. The GIFs/JPEGs/etc are compressed. Compressed so much to the extent that they look like, for lack of better terms, crap. A rule of thumb applies here, if it sounds too good to be true, it probably is. You really cant expect broadband speed for the cost of a dial up, and if you do I have some lovely penis pills to sell you for the low low price of 69.95.
The web could be faster if web server admins used faster web servers. Zeus Web Server instead of Apache, for example. The Holy Grail of web serving seems to be "good enough is good enough, and performance is someone else's problem".
That's about all the article had to say:
Tests by PC World, PC Magazine and CNET show
These are the same magazines with full color, multi-page reviews of the new 0.025% faster hardware. They are the same magazines that review each micro$oft product and say that the TCO is lower than ever before. Take one look at any of their websites, and you will see:
These magazines are Advertisements
Taking anything from them seriously is like taking a presidential speech to be a serious economic discussion, or taking a realtor's web-site as gospel in the market.
Funny - just went to CNET.com to research my post, and guess what? Over 50% of the page is advertising. The rest is 'reviews' of which 100% have links to affiliate programs to purchase said hard/software and give a kickback to CNET.
They will try hard to sell anything, and get their commission. It's like they are the used car salesman of the internet - only everything is new and they don't look you in the eyes when lying to you.
Personally I've found putting my modem on a box with a large amount of disk space and running squid to be extremely useful. In between the agressive caching and blocking banner ads, most of my web browsing doesn't seem very slow at all.
Of course when it's a new site chock full of graphics, or I'm doing binary downloads, I'm painfully aware of my modem's limitations. But for general surfing, sometimes it seems almost as good as the friends' broadband.
RFC2119
Ah, you mean like the RabbIT proxy. Personally I run this on my box on a t1 and use it whenever I am stuck with all but dialup.
Speeds things up so much, it's not even funny. Although it does require that you have a machine on a decent, faster than dialup connection to make it work well.
Can I get an eye poke?
Dog House Forum
(I should check this out by timing various downloads, but I'm too lazy. Somebody else can prove me wrong!)
So why do JPEG files with "more" compression download faster? Because JPEG is a lossy format: when you increase the "compession" you're not encoding data more efficiently, you're throwing data away. Depending on the image, you can do this and still end up with something that looks the same. But push it far enough and you end up with crap.
Where I live, DSL is $30 a month and cable is $40 a month. How can you argue with that?
The CPU time isn't always cheaper than the bandwidth. Some sites get cheap bandwidth and serve a ludicrous number of database-driven pages that are slightly different from each other - they're designed so that the database servers can handle the load, but gzipping every single page absolutely slaughters the webservers, which are barely doing anything more than relaying requests back to the database servers and are *still* running at high load.
Breaking Into the Industry - A development log about starting a game studio.
Yes apache and all 4.0+ browser support the gzip/mod-gzip combination. But the fact is very few websites implement this. Lots of reasons for that (mod-gzip conflicts with many other modules for one) although many more should than do.
A service like this that acts as a compression proxy can dramatically knock down the size of content. We implemented this at my last client and saw 78-93% compression of everything other than images. That includes css, javascript, dynamic web content, etc. I don't know about images, but this alone is very significant for today's clumsily table-laden pages.
Likewise I often find myslef in some crappy hotel where the connection is so noiesy I can barely squeeze a 14.4K connection out of it. I just want to check my web based e-mail not download the encyclopedia britannica
so anything that can make a dialup work painlessly on common web pages is a good thing.
Some drink at the fountain of knowledge. Others just gargle.
You......
went.....
to the.....
William.....
Shatner....
school of.....
acting.....
didn't you?
I hate to conflict with the general Slashdot opinion that this technology is somehow bad, but the truth is this stuff rocks
I've seen Internet Connection's acheive compression rates and acceleration rates breaking 130Kbs (equivalant).
We've talked some about compression rates, but there is more to it than that. This stuff does:
The first two require a proxy server setup, the second two are just plain good ideas that haven't made it into the wild. For example, the smart cacheing could be implemented by Moz, or even by a Firebird extension. The last could be handled by a certain standards body implementing much more aggresive changes in terms of compression instead of features like the ones introduced in the v.92 standard.
I work for a wholesaler in the western united states..
...and to the claim that it requires a client? that is not true. (blatent plug http://www.pacwest.com/dialbroadband/) this wholesale solution for ISPs is clientless.
In test trials 2-3x is the average seen. 5x does happen, and so does 9x from time to time.
If you have a recent version of PHP, you don't even need mod_gzip. Just put the following lines in your .htaccess file:
php_flag zlib.output_compression onDoes everything on the fly. I once had a shell script that would wget a url with the accept encoding gzip header, and then wget it again without and show the percent savings. Was fairly interesting to see what sites were using compression, and what sites that weren't could have saved in bandwidth by using compression.
Wow. I've never encountered anyone who has spoken so confidently on a topic without knowing a damn thing about it it. You are so absolutely wrong that the example you gave is the exact oposite of what you think it is. You've demonstrated precisely what not to do when writing good markup.
I don't even know where to begin to correct your cluelessness. Here it goes...
When I talk about writing good markup, I'm speaking a purely structural sense. In otherwords, markup text based on what it is, not what it looks like. Separation of content from presentation is the principle to follow.
For example, this is bad HTML:
That's bad for many reasons.
Now, here's good HTML:
And then a stylesheet:
Now, the user can easily define what stuff should look like based on what it is. A user could supply a stylesheet that increases the size of paragraph text by a percentage. A user could specify their own margins for list items. A user could eliminate colors or adjust contrast to their liking.
On top of that, a machine can read that markup and know precisely what it is. This is a header, that's a paragraph, this is a list, etc. A search engines and screenreaders would know precisely what to do with the text.
Furthermore, the styling data is in one location, so if you have 1 or a 1000 pages, the work is the same to make them look different.
Of course, this is only the tip of the iceberg. If you use XHTML and CSS properly, you get more visual flexibility than otherwise. (This also implies discarding tables for use in layout in favor of layers. Tables are for organizing tabular data, not positioning things visually.) XHTML also gives you forwards c
Join Tor today!
I have numerous relatives, older people, that had broadband but went back to dialup over the cost. In my apartment complex, DSL is not feasible due to the age of the phone lines, so we must use the Comcast monopoly- $50/month. Juno is around $10/month- even AOL is $23/month, half the cost of broadband.
Obviously, using broadband makes sense at a certain point of usage. But if you're not using the Internet more than, say, 2 hours a week, the economics just doesn't make sense. So there is value in dialup web accelerators, especially software that's easy for those who are technically challenged.
For the most part, gzip IS, in fact, the alpha and omega. gzip is king on byte-oriented data (it doesn't matter what size the words are, it's a nth-order entropy encoder, so they all turn into pseudo-symbols).
.jar files (pkzip)
The types of data that can be specialized are much fewer than you propose.
For example, bzip: better suited for text as text has a lot of localized second order trends. However it is computationally intensive and may not do well on a server appliance over multiple connections.
PNG is better on (many) raster images since it exploits 2-dimensional relationships recursively. But it requires the source image to be uncompressed first. JPEGs might already be compressed, and that would make them larger. But recompressing JPEGs (which is the big step these proxies take) is someone of a hack since we didn't really ask for it, and it may look like shit in the end.
And forget video or audio. There's nothing you can do about it (in realtime anyway...).
If and when SVG and other XML-based content formats become prevelant components of websites, then gzip/bzip on the fly will become very useful in making sites small, fast but content-rich.
PDF and flash are already compressed heavily, so they don't need it. Java programs come in
And mozilla has browse-ahead built in.
So I don't really buy any of this. If I still had dialup, I'd rather them just be upfront, let me control MY upstream cache settings/content, and forget all the fancy software, because most of it is redundant. That part about the cache is the key thing, because that's what you're really paying for, and they should let you control how it's used.
Fuck Beta. Fuck Dice
One step most of these proxies is doing is compressing HTML files. HTML is highly redundant, so compression can save alot of space. However, it's silly for the proxy to do the compressing. Instead web site owners can do the compressing! Transferring pages gzip compressed is part of the standard. No special software is needed by end users. A 3:1 reduction in bytes transferred for your web pages (the HTML itself) is a reasonable minimum. The result is that you use less bandwidth and end users get a faster web site! Every mainstream browser supports this, and those browsers that don't support it will automatically get the uncompressed version. If you're using Apache, you'll want mod_gzip to automatically compress transfers. (You can fake the effect with MultiViews, but it's a hassle to maintain two copies of every HTML file.)
(Yes, I know I don't practice what I preach. I'm working on it.)
Search 2010 Gen Con events
People keeps saying that this technology is pretty much moot as more and more people are getting broadband connections. Why should compression and caching technology only be applied to slower connections? Why waste any amount of bandwidth even when you have "tons" of it?
I use a modem for web browsing. I've found that wwwoffle is a good proxy server, because it can operate in both online and offline modes - when offline, it serves the most recent version of each page, and if you try to view a non-cached page, it's marked to be downloaded next time you connect. If you want to speed up your browsing some more at the expense of having to hit 'reload' occasionally, you can configure wwwoffle to always use an available cached version even when online.
If you have a shell account on another machine, and that machine has access to a proxy server, then you can tunnel port 3128 or 8080 (common http proxy ports) through ssh. This makes browsing a lot quicker because there is only a single TCP/IP connection going over the modem link - you don't have to connect separately for each page downloaded. Unfortunately I found that while this gave very fast browsing for half an hour or so, eventually it would freeze up and the ssh connection would have to be killed and restarted. Perhaps this has been fixed with newer OpenSSH releases.
RabbIT is a proxy server you can run on the upstream host which compresses text and images (lossily).
The author of rsync mentioned something about an rsync-based web proxy where only differences in pages would be sent, but I don't know if this program was ever released.
-- Ed Avis ed@membled.com
If you and your ISP both use a Windows system, turning on "software compression" (and don't forget to turn off the modem's hardware compression) basically gives you mod_gzip on ALL your incoming AND outgoing data.
A drawback used to be that the server at the provider side was often overloaded, so I set up several "accounts" to switch between hard- and software compression with and without proxy. Now that my ISP is no longer "free", I haven't seen the server become overloaded any longer, so I use software compression and their proxy all the time. HTML and text download at 20kB/s over a 48kbps connection. Off course, there's no gain in already-compressed content like images and audio.
Musicians don't die. They just decompose.
Harvest all the links on a webpage, cache them on disk while you view the page content, if you still don't go anywhere, harvest links on the cached pages, and so on. If you type some URL and go elsewhere, discard everything.
Results for you: You click on a link and you have it immediately - from harddisk cache.
Result for others: A major part of the bandwidth is wasted, everyone's connection gets slower.
Effects: Everyone installs accelerators to have the net working faster. Bandwidth usage jumps 10 or so times, prices rise, connection speed drops far below what was before the accelerators. Nobody gives up the evil accelerators because without them it goes even slower.
It's called "social trap".
45 5F E1 04 22 CA 29 C4 93 3F 95 05 2B 79 2A B2
From one of the articles:
The idea is to store Web pages on your hard drive upon your first visit to the pages, and then to limit the information you download on subsequent visits to those pages to only the data that changes, making for a faster download.
Makes me think about: http://rproxy.sourceforge.net/.
New things are always on the horizon
Here's a fucking hint
http://sourceforge.net/projects/mod-gzip/
There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter