Average Web Page Approaches 1MB
MrSeb writes "According to new research from HTTP Archive, which regularly scans the internet's most popular destinations, the average size of a single web page is now 965 kilobytes, up more than 30% from last year's average of 702KB. This rapid growth is fairly normal for the internet — the average web page was 14KB in 1995, 93KB by 2003, and 300KB in 2008 — but by burrowing a little deeper into HTTP Archive's recent data, we can discern some interesting trends. Between 2010 and 2011, the average amount of Flash content downloaded stayed exactly the same — 90KB — but JavaScript experienced massive growth from 113KB to 172KB. The amount of HTML, CSS, and images on websites also showed a significant increase year over year. There is absolutely no doubt that these trends are attributable to the death throes of Flash and emergence of HTML5 and its open web cohorts." If you have a personal home page, how big is it?
That's rather personal.
It's a good thing phone carriers don't limit your data consumption....
oh wait..
With the growth of JavaScript libraries like jQuery for more UI features, plus more images, I can see it reaching that high.
Meanwhile, web developers don't care because more and more people are getting faster and faster broadband speeds. So as long as the page-load metric works OK on their rig, or perhaps on what they envision most of their viewers have... they think it's all OK.
Average information content - does a page view give me more insight as a user now than it did 10 years ago?
And how much of it is ads?
and the 3G users, and the satellite users, and everyone else that has a low-bandwidth and/or high cost per byte connection.
My parents can't get DSL or cable. They're stuck with 22k dial-up, and use AdBlock Plus, NoFlash, and Propel accelerator with compression set to the point where you can barely recognize photos, and it still takes 2 minutes for a reasonably normal page (CNN, MSNBC) to load, much less anything with a ton of Javascript or Flash.
Can't websites automatically detect connection speed the first time a client visits, and store a cookie so that us slow people get a nice, simple website?
Oh, and Propel, please move to JPEG2000 and XZ compression. Some people need every byte they can get.
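For what it's worth, the "detect speed once, remember it in a cookie" idea above could be sketched roughly like this. It assumes the server can time how long a known number of bytes took to reach the client on the first visit; the threshold, the cookie name `site_variant`, and both function names are made up for illustration, and a real site would need a more robust measurement than a single sample.

```python
from http.cookies import SimpleCookie

# Hypothetical cutoff: below this measured throughput, serve the simple site.
LOW_BANDWIDTH_KBPS = 100


def choose_variant(bytes_sent: int, seconds: float) -> str:
    """Pick 'lite' or 'full' from one timed transfer (a crude estimate)."""
    kbps = bytes_sent * 8 / 1000 / seconds
    return "lite" if kbps < LOW_BANDWIDTH_KBPS else "full"


def variant_cookie(variant: str) -> str:
    """Build a Set-Cookie header so later visits skip the measurement."""
    cookie = SimpleCookie()
    cookie["site_variant"] = variant          # hypothetical cookie name
    cookie["site_variant"]["max-age"] = 30 * 24 * 3600  # remember for 30 days
    return cookie.output(header="Set-Cookie:")


# A 22 kbit/s dial-up line moves about 2750 bytes in one second.
dialup = choose_variant(bytes_sent=2750, seconds=1.0)
print(dialup, "|", variant_cookie(dialup))
```

The cookie only saves repeat visitors from being measured again; the first page view still has to be small enough to be tolerable on a slow line.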
It's not the size of the home page, it's the motion of the .GIF
And.... when running AdBlock Plus, this figure goes down to 100kB. I run AdBlock mostly for the massive speed increase that comes with it.
My fully featured CMS that uses jQuery, jQuery UI, and a lot of heavy libraries takes 140kb. Learn to optimize, people!!
I have a homepage, and it's only 4.92Kb. Granted it is the "It Works!" page for CentOS which has all of the other text and icons and such but who needs more than that? Do people really have personalized home pages now that Facebook came about (other than some hobbyists or professionals who run a side business)?
I wonder what the average "Facebook" homepage size is... since that is what most people will be seeing regularly.
- Toast
I think eBay led the curve on this one. I complained bitterly to them about how long it took their bloated pages to load when I was still on dialup. Nobody cares.
I suppose the telecoms do. This increases the likelihood of blowing through your monthly bandwidth cap without even watching videos.
A feeling of having made the same mistake before: Deja Foobar
If the bulk of the increase is from JavaScript, wouldn't turning on compression on the web server solve the problem? They're text files; they compress down massively.
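A quick way to check the "text compresses massively" claim yourself is to gzip some script text and compare sizes. This is only a sketch: the sample string is a made-up, highly repetitive stand-in for a script file, so it compresses far better than a real library would, and `gzip_ratio` is a name invented here.

```python
import gzip


def gzip_ratio(text: str) -> float:
    """Return compressed size divided by original size for UTF-8 text."""
    raw = text.encode("utf-8")
    return len(gzip.compress(raw)) / len(raw)


# Made-up sample: one short function repeated 500 times (about 18 KB).
sample_js = "function add(a, b) { return a + b; }\n" * 500

ratio = gzip_ratio(sample_js)
print(f"gzip output is {ratio:.1%} of the original size")
```

Pointing the same function at an actual script from your own site gives a more honest number than this toy input.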
And Less Content. .
I remember the days when a site would include a 10-paragraph article on one page - not 10 pages with a paragraph on each.
. .
This only matters if people go to the first page, and never go to any additional ones.
For most websites these days, you'll take the initial hit from javascript and the 'branding' images when you first get to the site ... but the changing content per page is much lower.
If websites are using standard javascript libraries being served by Google's CDN, then it's possible that someone visiting your page already has jquery, mootools or similar cached and doesn't need to load yet another copy.
I also didn't see if they had any comparison between transferred size vs. used size. (eg, javascript that's sent compressed) ... and as this is from a new archive ... does anyone know if Archive.org could analyze their holdings to see what the longer term trends are?
Build it, and they will come^Hplain.
My home page remains where it has been since 1993 at the Calgary Unix Users Group: http://www.cuug.ab.ca/branderr ...clocks in at 9.2K, plus a 15K GIF and a 9.1K JPG (if you "turn on images" in your browser - remember when it was a realistic option not to?)
I have held the line, while Viewing With Alarm (VWA) the growth of web pages for the entire 18 years since. I wrote Bob Metcalfe when he had a column at InfoWorld 15 years back, and he was Viewing With Alarm the exponential growth in Internet traffic and predicting the "collapse of the Internet" (had to eat those words - literally) because of it. My letter pointed out that his column constituted 2K of text - that was all the generated content that was bringing in the readers, (unless you count the 10K gif of Bob Metcalfe, and I don't), and the page had an additional 100K of framing and advertising-related image GIFs. His reply was somewhat defensive.
This last year, I had occasion to travel on the Queen Mary 2, where all internet is via satellite at a minimum of 34 cents per minute with their bulk plan. How quickly I grew to resent the giant Flash blobs that would be automatically downloaded with every page of a newspaper so I wouldn't miss the animated ads for the latest in car buys. At QM2 speeds, I'd have to wait about two minutes before I even had an "X" mark to click on to dismiss the ad. I was rather quickly cured of almost any interest in the Internet content at ALL. I did my E-mail, checked the google news headlines (fewest high-byte ads), and logged off.
My point: 90% of mail is spam. So are 90% of web page bytes. We just don't call it spam. We call it "the whole outside frame around the news page that we try not to see, but keeps jumping around into our field of view".
There is absolutely no doubt that these trends are attributable to the death throes of Flash and emergence of HTML5 and its open web cohorts.
No, it's not about HTML 5. A lot of it is about bloated content management systems and templates.
I was looking at a Wall Street Journal page recently, and I brought it into an HTML editor so I could eliminate all non-story content. The story required an HTML page with only 72 lines. The original page was over 4000 lines. It contained a vast amount of hidden content, including the entire registration system for buying a subscription. All that junk appears on every page. Inline, not in an included file.
On top of that, there are content management systems which create a custom CSS page for each content page. So there's no useful caching in the browser.
Remember those people who said CSS was going to make web pages shorter? They were wrong. Look at Slashdot - bloated, slow pages that don't do much, yet consume CPU time when idle.
That's still too much flash for me
Sent from my iPhone
Ironically posted on a website that is itself a bloated pig.
10 years ago online video was virtually nonexistent, and where it did exist it was never larger than 320x240.
And now it is ubiquitous, HD and largely devoted to pointless things that would be skimmed over and disregarded in a fraction of the load time if left to text and still images.
my site's a pyjamas application. it is therefore 1,000 lines of python.... or, when compiled (and therefore including the pyjs runtime which does stuff like dict, list, exceptions etc. all emulated in javascript, as well as including the library of widgets that are used on the page) it's 1.3mb of really obtuse but functionally correct javascript.
That's what she said!
"For every expert, there is an equal and opposite expert"
It remains the size of one of those animated "under construction" gifs.
Word game?
I've been able to run both CPU and GPU based CFD and 3D visualisation on my laptop without any problems, yet some flash games which are just doing 2D animation will roast a 2.7 GHz CPU to the point that the kernel decides to call it a day and shut down the whole system.
Unbelievably, these flash games aren't doing anything more complex than playing a retro 2D platform game. I'm guessing that this is due to the way in which all the separate texturemaps/pixelmaps are treated as generic webpage images rather than as a single DOOM style WAD file.
Vintage computer adverts: http://www.vintageadbrowser.com/computers-and-software-ads
http://www.the5k.org/
It seemed so long ago. Didn't /. have an entry as well?
This sig contains repetition and redundancy.
Some sites use Javascript to display what is semi-static data that should be assembled on the server side before transmitting to the user. For example, a news site where the stories are loaded by Javascript.
Some sites even have pages that are entirely blank if Javascript is turned off. It seems that some of these "web programmers" don't even know how to dynamically build a page with server-side scripting instead of Javascript.
---------
There is inferior bacteria on the interior of your posterior.
CSS is for prima donnas and Flash is for artistes. PHP is for chatterboxes and Perl is for psychics. Javascript is for the clinically insane, and Ruby is for hipsters. Drupal is for geeks and Ajax is for nerds.
I'll stick to plain-jane HTML, thanks.
"...and plain-jane HTML is for bigger hipsters than Ruby users". Understood. Thanks.
Damned FrontPage
"UNIX is very simple, it just needs a genius to understand its simplicity." -Dennis Ritchie
From a usability perspective:
In short: You fail web page design, so who the fuck cares if your page is 10K?
HAND.
It's not the size of your homepage, it's how you use it.
I am Bennett Haselton! I am Bennett Haselton!
Well, stuff like jQuery/Dojo/etc libraries shouldn't be loading every time you view a page.
The first view, your browser will need to load all the associated CSS, HTML, etc.
After that, included files should hopefully be cached, and only page content need be loaded.
Also, with JS libraries and AJAX, one should be able to build pages that load the overall template once, but don't require pulling large HTML files for updates (rather just pull content with AJAX).
What's worse is that the "payload" of text is less and less interesting. Bandwidth isn't the problem. I have more than enough bandwidth for these pages. When they hit the browser, they take forever just to render. There are a handful of web sites I still use, Slashdot among them. Most new sites I just back right up. If your site does that on day 1, it's not worth the bother. I'm not buying a new machine just to look at your crap web site that's probably just a rehash of every Internet meme.
We're well into the "nobody comes here anymore it's too crowded" and/or "57 million web sites and nothing on" stage.
Well, that explains why my 400MHz WINXP laptop with 128M memory has a heart attack trying to scroll down a page of comments on Slashdot.
Sorry, but gray text on gray background is making my eyes bleed.
40-60% of Yahoo!’s users have an empty cache experience and ~20% of all page views are done with an empty cache.
http://yuiblog.com/blog/2007/01/04/performance-research-part-2/
Don't feel too smug. Your page isn't even compressed.
If you have line of sight, you might consider reading 'Diary of a Not-spot' posted on The Register.
Hopefully some of what he's tried and gone through could be of help to you.
Diary of a Not-spot: One man's heroic struggle for broadband
Diary of a Not-spot: The readers speak
Diary of a Not-spot – the final chapter
Diary of a not-spot: Breaking the BT barrier
Median is the measure you want.
If you use the mean, 90% of web pages could stay the exact same size, but if the other 10% go nuts and increase their size 20x, the mean will grow nearly 3x.
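The arithmetic above is easy to check with a toy sample. The page sizes here are invented purely for illustration (ten pages of 100 KB each, one of which then grows 20x); only the 90%/10%/20x proportions come from the comment.

```python
from statistics import mean, median

# Invented sample: ten pages, 100 KB each.
before = [100] * 10
# Nine pages unchanged, one page grows 20x.
after = [100] * 9 + [2000]

mean_growth = mean(after) / mean(before)
median_growth = median(after) / median(before)

print(f"mean grew {mean_growth:.1f}x, median grew {median_growth:.1f}x")
```

The mean moves a lot while the median does not move at all, which is why the median says more about the "typical" page.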
So a web page today is about 10x bigger than it was in 2003. I can accept that.
But in 2003, I had a baseline 2-megabit-per-second Internet connection and could have had a 3- or 5-megabit connection for a bit more cash.
Today, 8 years later, the "normal" connection speed for my ISP is 6-megabit.
So according to my observations and their statistics, folks are expected to download 10 times the amount of stuff using just 3 times as much available bandwidth.
In other words, the web is currently more than three times slower than it was in 2003.
Hooray!
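A back-of-envelope version of that calculation, using the 93KB (2003) and 965KB (2011) averages from the summary and the 2-megabit and 6-megabit connections mentioned above. It assumes 1 KB = 1000 bytes and counts raw transfer time only, ignoring latency, request count, and rendering, so treat it as a rough sketch; `transfer_seconds` is a name made up here.

```python
def transfer_seconds(size_kb: float, mbit_per_s: float) -> float:
    """Raw transfer time, assuming 1 KB = 1000 bytes and no latency."""
    bits = size_kb * 1000 * 8
    return bits / (mbit_per_s * 1_000_000)


t_2003 = transfer_seconds(93, 2)    # average page in 2003 on a 2 Mbit/s line
t_2011 = transfer_seconds(965, 6)   # average page in 2011 on a 6 Mbit/s line
slowdown = t_2011 / t_2003

print(f"2003: {t_2003:.2f}s  2011: {t_2011:.2f}s  slowdown: {slowdown:.2f}x")
```

With these inputs the slowdown comes out between 3x and 4x, in line with the "more than three times slower" estimate.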
Kid-proof tablet..
Unless you drive in England.
About half my regular blog readers are based in emerging markets / less developed countries. I began to notice that hittership was dropping in Africa and India. Reviewing about a thousand posts, I noticed that the more photos and "blogger apps" I put on the web page, the lower the readership in countries with low bandwidth. I've been more conscientious now about which photo resolution I post and tend to avoid videos. And a lot of the cool little blogger widgets don't seem as important when measured in seconds to open the page. http://retroworks.blogspot.com/2010/12/blog-has-widget-fever.html Of course my content also sometimes sucks, and it also helps if I lay off the haiku.
Gently reply
So now, as carriers start charging based on data usage, web pages continue to grow, and so do the ads they contain. It was fun while it lasted.
I can mend the break of day, heal a broken heart, and provide temporary relief to nymphomaniacs.
Bucking the trend, apparently.
My home page is 202 bytes.
The most used link points to a page that is 45 kilobytes. All data, no images, no css. There are 12 people that use it.
I believe my site design style could be considered Minimalist.