WCArchive sets new Record
dcs
writes "The hardware upgrade for wcarchive came
not a single second too soon. In it's first full day of
operation with the new hardware, a new record was set...
969 gigabytes of traffic was generated, thanks
mainly to the recent release of RedHat 6.0. I'm
looking forward to the first terabyte in day mark, but it
seems an upgrade on network capacity is due before that
can happen. "
granted, this doesn't prove anything about the FTP servers, but:
www.download.com is running Netscape-Communications/1.12 on Solaris
Plus, they have multiple ftp machines... so Apples + Oranges comparison.
According to NetCraft:
www.download.com is running Netscape-Communications/1.12 on Solaris
ftp.uu.net still runs SunOS 4.1.x,
nic.funet.fi (the old kernel distribution point)
runs Digital Unix V4.0D. So there are commerical
oses that can handle that kind of load. Its just that most people do not run sites like wcarchive.
Mmmmm... These specs would make for the perfect MP3 server! (All legitimately owned on CD, of course.) ;-)
ftp.cdrom.com isn't Linux, its FreeBSD.
Those sites aren't handling anywhere near the load that WCA does even on a light day.
FreeBSD: The Power to Serve
I noticed in its new configuration it does have a gigabit ehternet card.... Hmmmm....
The commercial Unixes (and NT, for that matter), could handle this kind of load. Do the math. To handle that load, they need to be able to do 12 megabytes/second disk I/O, and keep a couple of T3s full. Hell...the disk I/O probably isn't even a factor, since the Red Hat distribution will fit in the cache on that machine.
If someone pushes 900 gigabytes out of a web server that is serving a zillion small HTML pages, then it is time to be impressed, but serving up 900 gigs as 4000 or so large FTPs is not impressive. Finding someone willing to pay the bandwidth costs for 900 gigs a day is very impressive, though (a couple thousand dollars, I believe).
Sites like Slashdot, Yahoo, DejaNews, and Hotmail are much better examples for demonstrating the ability of a given OS to handle a load.
Or better yet redhatted!!!
In fact, probably it was that slashdot article that made people go there (like, this is fast and has a huge limit, so let's grab from it).
This make me think if the slashdot effect is only from slashdot... I think it's from the whole open source/free software community, which is great group of close-minded fellows.
Since cdrom.com is providing 100% service to the others, maybe the ISP hosting it as a significant interest in having it on its network, and thus lowers the price of the net connection ? I mean, it is nice to be able to say that you have a 100 Mbps connection to 1/2 TeraByte of freely downloadable files (and today, publicity on slashdot).
> I can't recall what the Microsoft record was.
Probably people downloading some "minor bugfixes"
for Win95.
The OS: FreeBSD
Ftp.download.com reports that it's running WU-FTPD, probably also on Solaris.
Well:
Hotmail: FreeBSD
Yahoo: FreeBSD
Need I say more...
Au contraire, my silly friend -
The Smart Reseller tests were quite equitable, as they compared "out of the box" OSes, with no tampering.
Your statements are simply nonsensical!
Can you explain to me why that is?
What kind of site is this (I can understand if I serves up some dynamic content, reading from databases or creating web pages on the fly). I mean, you can send static html pages over the ftp server if you wanted, there might be a little lag for the browser to do an anonymous login, but serving static html and ftp seem to be pretty much the same to me.
Oh, yeah
And what's the web site that these machines serve?
http address please!
Actually, ftp.uu.net does -not- run SunOS anymore.
And it should be a whole lot faster than it was a week ago.
This does NOT prove that a finnish sauna Unix clone can handle big loads.
Enough...
-T
The U.S. Postal Service cracked down on newspapers and magazines that made inflated circulation figures which gave a false impression to advertizers. It is now a crime to publish false circulation figures for material processed by the Postal Service.
We can certainly hope that similar fraudulent benchmark claims will someday be a thing past.
David Greenman's orignial message can be found
at
http://www.bafug.org/NewRecord.html
> It is not inherently harder to server static HTML pages than FTP files
It is inherently harder to serve dynamic HTML, though, and that is what any serious web site has to do these days.
http is a very simple protocol: In most cases the client opens a tcp connection, sends a header with "GET" and the server sends the data back through the same connection.
:-) )
ftp is *much* more complicated. There is more than one connection (one control from the client and a data connection, direction depends on wether it's "passive-" ore "active-mode" ftp.)
If people download with a browser a lot of things
happen with *every* URL:
The client opens the control connection server sends greeting asks for user, client sends user (anonymous in most cases), server asks password, client sends password (eg. mozilla@), server sends greetings, client sends cwd (it has to, see RFC-I-don't-know), server sends ok, client sends "binary mode" (well, some do), server sends ok, client sends "get", server sends data.
The handshake for opening passive/active data connections is not included... (I'm not sure how this works, ATM
This is not absolutly correct (from memory), but it should give you an impression. For more info, please read RFC 959, 1123 or/and sniff your own ftp connections.
Hey, just looking to WCArchive (ftp.cdrom.com) and what i see ?
...
Powered by FreeBSD !
A correction seems Needed
I really dislikes the spin of the article ; it let suggest that this record is "thanks to the latest release of Redhat 6.0" It makes me remember some M$ practices.
A correction seems most needed in tribute to all those who have worked hard to release FreeBSD ; This record is a great reward for them.
Instead, you won't see me anymore here.
Jean-Marten Marchi jmarten@ibm.net
Really don't like the spin of the article that let suggest it is thanks to Red Hat 6.0 .
Just thinking to all those who worked hard on FreeBSD and on setting up this server.
One day, maybe, it will be Linux turn but for now it's FreeBSD.
jmarten@ibm.net
On the same file they tell why they have it - there are plans to upgrade the network connection. Right now they have only 100Mb internet connection. And doing some simple calculations what do we get - the new record won't be broken before that 1Gb network since the record was about exactly the amount of data a 100Mb ethernet can transfer if it transfers 24h it's max speed. So atleast now they should stop whining about internet backbones being too slow since the bottleneck is obviously their own network. And the fact that this new record came "with the new machine" - everyone can guess what was the previous bottleneck. (Ever thought why those "so slow" backbones can transfer data fast between two any other systems but not from ftp.cdrom.com.. :)
>And the fact that this new record came "with the new machine" - everyone can guess what was the previous bottleneck
Not necessarily, previously the user limit was 3600 users, w/ the new machine they upped it to 5000. Now, the only way you can conclusively say the machine was the bottleneck is if you take the old machine and up the limit to 5000
I'm not wondering at all.
I used to look after an rs6000 (among others) at a small university. It was mail server, web server, etc. and with two network cards and only 256mb it
moved more than 900mb per day every weekday during the school year and barely broke a sweat.
Do the math. 100mbs ethernet, with an awesome switch, would have trouble pushing these numbers. At least maybe a second card should be added?
jbest@magnacom.net
> Ftp.download.com reports...
:-)
And we all know how accurate they can be
It's not WU, it's a custom FTP daemon, written by one of the FreeBSD core team.
230-This machine is a Xeon/500 with 4GB of memory & 1/2 terabyte of RAID 5.
230-The operating system is FreeBSD.
Hey, I find it kinda fascinating too to know what such large sites run. Someone mentioned in a previous thread though that download.com runs Netscape Communications/Solaris 1.12. I know that Yahoo! uses FreeBSD (an old copy of the FreeBSD newsletter had an article about Yahoo! and the different OSes they tried when they were starting up). Dejanews and Slashdot are linux based, and Hotmail uses FreeBSD and Solaris (from the kirch paper).
That's pretty much all I know about who uses what.
No wonder you're posting as an AC!
windows "nt" could never handle this load!
In microsoft's "best practices" documentation, they recommend a GROUP of machines more powerful than ftp.cdrom.com, just to server 6-8 GB/day.
Install windows "nt" on ftp.cdrom.com, and watch it crash, just like the debacle that occurred when microsoft tried to move hotmail from Unix to windows "nt"!
Are you taking into account that they only have to pay for what they download? i.e. tcp acks
Forget Mindcraft, this is where it really counts.
----
Every year during my review, I just pray the words "slashdot.org" aren't mentioned.
No, it shows that FreeBSD is capable, stable, powerful, and robust.
How does a FreeBSD machine's stability and power somehow prove something about Linux or NetBSD? It proves nothing more about Linux than an NT box doing the same thing would.
10 PRINT CHR$(205.5+RND(1)); : GOTO 10
to find out what server and OS are being used by a given domain name. Try egg.microsoft.com
:)
I beleive those statistics are generated by a program called "Queso" (search for it on freshmeat) that does this at the command line.
You should check it out, it's hours of fun
-Erik-
Don't forget that Slashdot itself runs Linux and Apache and handles about half a million hits a day, much of that dynamically generated. By my calculations, at peak times, Slashdot tops 10 hits/sec.
--Phil (Way to go, Rob!)
355/113 -- Not the famous irrational number PI, but an incredible simulation!
It doesn't -prove- anything. It's merely an impressive feat. I have no doubt that another OS could achieve a similar accomplishment, however. Regardless, it is certainly a testament to FreeBSD's performance (not necessarily speed, also includes functionality) under extreme load.
"Bear in mind though, that all _independent_ testing has shown exactly the opposite to be true"
Proving that Linux is faster than NT on a desktop box with 64 megs of RAM doesn't really satisfy the statement "all testing has shown exactly the opposite."
And if you are referring to Smart Reseller's test, it was hardly independent. In speaking with the authors of the test, they readily admit to being biased against NT and went out of their way to cripple it.
That 17% was server OS sales for 1998. This is not 17% of total server installations.
Microsoft and Netscape dominate the Intranet web server market. Apache has only a small minority of this market.
But a 486 is hardly what people are running NT on.
The Oracle benchmarking that was posted to slashdot a couple weeks ago was also done in a biased manner.
By selecting hardware which is known to give good performance on Linux and poor performance on NT, the test is just as biased as the mindcraft study.
Which is fine, but don't pretend that they are unbiased an independent when they are not.
Oh, and BTW, all of the production servers at my company are running SMP. The intranet servers are quad processor Proliants, the Oracles are Sequents with 16 processors.
How many Linux servers do you see in production environments at Fortune 500?
Anyone who ever claims that the free Unices aren't up to handling heavy load ought to see this.
;-)
I think this proves very conclusively that the free Unices (Linux, NetBSD, FreeBSD, etc) are all very capable, stable, powerful, and robust. I'd love to see a box running a commericial OS try to match this.
Topher
It's now stable as a rock on Alpha's too :)
;-)
Is it? Neat.
I've heard lots of mixed reports on how far the Alpha port had progressed, though last I heard it was still fairly beta, but improving rapidly. The Sparc port though, last I heard, was pre-alpha still...
Topher
Why do we need to wait for more network upgrades for a terabyte in a day? Where is the bottleneck in this situation? Do we know? Or are we just spouting out that 'oh, it's the network thats slow' because it sounds good. OF course, that was the last second comment on the post, so that's probably the case.
I don't see that there would be much problem in boosting the total G services by 31G! Only 7% more! But heck, it made a great line to end the story on (yeah ok).
yacko
-- There is no sig line, only Zuul.
David Greenman, the Co-founder/Principal Architect of the FreeBSD Project just posted a new picture of the new wcarchive, it is now available here.
Updated hardware description is also available here.
It would be amazing if someone could pull some nice effects with The Gimp and make a cool looking "ftp.cdrom.com theme" for Windowmaker or something...
- Alfred Perlstein - Programmer and Administrator, Wintelcom.
How many Linux servers do you see in production environments at Fortune 500 companies?
Probably a lot more than you would believe.
Nobody takes an assay of "servers" if it's "just that box in the corner there". The only time people worry about their servers is when they're not doing what they're supposed to.
Chas - The one, the only.
THANK GOD!!!
Chas - The one, the only.
THANK GOD!!!
I've got 4 Linux machines with dual 300 MHz PIIs and half a gig of RAM each using round robin DNS to handle a very busy web site, and it doesn't serve anywhere near 1000 gigs a day, yet it needs hardware that is much more powerfull than cdrom.com, precisely because web serving is a much harder thing than FTP serving.
You are assuming FreeBSD and Linux have identical load handling patterns - they don't. It is not inherently harder to server static HTML pages than FTP files, and if used a special light-weight HTTP server (ftp.cdrom.com use a special light weight FTP server) then I do not think it would unfeasible to serve similar amounts of HTTP data.
In order to make ftp.cdrom.com capable of transferring that much data, however, sendfile() was needed. The FreeBSD sendfile API is, if I've understood correctly, different from the Linux one, in order to be able to support HTTP. If you'd want to serve web data competitively from a Linux machine, I think you would want to implement a similar API for Linux.
You'd probably also need to do a number of mods to the Linux VM system if you want similar performance to FreeBSD; however, I can't state that conclusively, as it is a long time since I've seen any benchmarks between the two.
Eivind.
Doubting the existence of evolution is like doubting the existence of China: It just shows that you're uninformed.
I've always gotten horribly slow connections from them, too many people always hitting it (mostly gamers I think). It was great back in '95 or so, but that place is too crowded now ...
... "No one goes to that restaurant anymore - it's too crowded."
what did Yogi Berra say
support gun control: take guns from cops
If I remember right, Microsoft had the record for a time, after releasing Windows 95. Then it was set by a big bunch of servers, not one single. This traffic lasted for several days. I don't remember how much it was.
Later, when cdrom.com moved their server, they copyed all the data over a 100Mbit connection and got the new record. I don't remember how much this was, either.
I haven't heard of anybody breaking this record before now.
"The assembler gave birth to the compiler. Now there are ten thousand languages." - Tao of Programming
Is the Smart Reseller test the same that was published on ZDNet?
If so, the configs were hardly "out of the box" - the Linux box in the ZD test was heavily tuned by a member of the Samba team. Furthermore, ZD didn't publish this information, where at least Mindcraft admitted that they tuned the hell out of the NT box.
--
Business. Numbers. Money. People. Computer World.
I'm not too surprised that it continues to break it's own records, CRL is a Tier-1 Backbone provider, so probably about 1/4 of the traffic is from within the same network, and the other 3/4 go across the NAP's on pipes dedicated to wcarchive.
--Jason Bell
--Jason Bell
Faster than the light of speed!
Mindcraft's test was a bunch of BS, they were very favorable to NT and not to Linux. Anyone here have an MSCE? We could do our own linx/NT test with the exact same machine and see which one does better. And how about a real world test, Linux can support 200+ users on a network, NT has trouble with more than 40. I've seen NT networks drop where a linux network wouldnt have even noticed the work load.
I'm a loner Dottie, a Rebel.
It's now stable as a rock on Alpha's too :)
Actually, their whole architecture seems strange. This seems like something much better handled by multiple machines with connections to different ISPs. Oh, but they're colocated in their ISP's machine room....
I'd love to see this kind of information (bandwidth, machine, OS) and more (time-of-day loading curve, ...) for all the big data providers, whoever they are... (download.com? yahoo? aol? conxion? ...) They don't seem to brag about it much. If they have little info pages like cdrom.com's, I haven't been able to find them.
I don't know why, but this kind of stuff just grabs me. Lifestyles of the bandwidth-rich and cache-famous? Packed-Tranfer Pr0n?
Umm, my 5.9 gcherokee does 0-60 in 7 secs, I don't think a hyundai can beat that :-)
(little sensitive sbout my car
My point was that it's a uniprocessor machine, and that benchmarking (and real-world deployment) has shown Linux to pretty conclusively outperform NT on uniprocessor machines. Quite likely FreeBSD would outperform Linux in some server benchmarks, but that's beside the point.
Well, Linux currently has 17% of the server market, and is estimated to be growing at 25% a year for the next few years...
If you check http://www.netcraft.net/survey/ you'll see that Apache massively dominates the web server market with around a 60% share. Being open source rather than commercially backed obviously hasn't stopped it from putting a huge dent in Microsoft's sales.
I'm sure Microsoft wishes that Linux _was_ a traditional single company commercial vendor, since that would give them a target to shoot at.
As Mindcraft's web site says (paraphrasing) "you identify your goals, we do the testing to satisfy them". Given that the paying customer was identified as Microsoft, it should come as no surprise that the goal was to show NT being faster then Linux. Bear in mind though, that all _independent_ testing has shown exactly the opposite to be true, certainly for uniprocessor machines such as the ftp.cdrom.com server.
... but as those two have indicated, this is a complete farce, and you can expect the "retest" results to be as information free as the first ones.
There have yet to be any standard SMP benchmarks (TPC-D, SPECWeb96 etc) published, although an unofficial Oracle benchmark indicated Linux to beat NT there also.
Also bear in mind that the "Mindcraft" testing has since been shown to have been performed in a Microsoft lab (the "Mindcraft" e-mails originated from a Microsoft domain)...
Ultimately, all the "Mindcraft" tests really proved is that Microsoft is starting to take Linux as a _very_ serious threat to NT - not surprising given the Linux server marketshare and growth numbers.
Microsoft is attempting to recover from the PR nightmare resulting from this testing by redoing the tests with "unimpeachable" Linux configuration expertise supplied by Linus and Alan Cox
How many net servers in the real world run off SMP boxes? Most ISPs use server farms of uniprocressor machines - much better bang for the buck. No-one's denying that Linux's SMP performance could be improved, but exactly how it compares to NT (which has it's own set of problems) is really unknown to this stage due to lack of fair testing.
The Oracle test I mentioned took one approach to fairness in testing both NT and Linux out of the box with no tuning on either side.
Given how artificial benchmarks are, the real world observations of NT vs Linux performance should probably be given more weight anyway. A quad zeon box is hardly what people are running Linux servers on - many are running on 486's! Try that with NT...
You can use this site:
http://www.netcraft.com/cgi-bin/Survey/whats
to find out what server and OS are being used by a given domain name. Try egg.microsoft.com !
This works by recognising the characteristic signatures of the different OS's TCP/IP stacks as they respond to a bunch of wierd packets.
Your reply is quite correct. Furthermore, I wrote "recent release of Red Hat 6.0", which should made it even more clear.
:-)
Alas, a second paragraph existed, which did mention FreeBSD in a as least offensive manner as I could manage. I guess CmdrTaco did not like my slamming of Windows instead...
(8-DCS)
What wcarchive needs now is a nice gigabit connection to an OC-12 or so. It's actually mentioned in /archive-info/slow.txt, too. At that point, wcarchive will truly be the best. (It already is, but its link is a bit slow for its popularity ;)
Didn't their "tests" "prove" NT was *faster* than Linux? A Hyundai is "faster" than a Jeep Cherokee, but guess which is more powerful. And after all, power counts more than speed in 90% of cases.