TCP Equipped Ethernet Card

Wow, seems cool by Anonymous Coward · 1999-05-08 08:36 · Score: 0

Seems cool. But How much will they cost?.

Also, whats Linux (or Open/FreeBSD's) system load on an equiv OS TCP/IP stack situation?

Re:Err... by Anonymous Coward · 1999-05-08 08:44 · Score: 0

It seems it would be an I/O bottleneck in this. Even with an Ultra-DMA HD

Magic is indistinguishable from a Rigged Demo by Anonymous Coward · 1999-05-08 08:53 · Score: 0

seriously, where're the numbers? all i see are some claims and some broad descriptions. I want exact specs.

I'd also like to know if they're going to open the API, provided this is what it claims to be.

Re:Err... by Anonymous Coward · 1999-05-08 09:00 · Score: 0

Ulta-DMA HD? Since when do file servers use Ultra-DMA hard drives? The last time I checked most (serious) hardware configurations with 100 Mb (and higher) NICs in them are designed for server applications. Those same (serious) hardware configurations usually employ the use of a SCSI subsystem and more importantly RAID. Thus you have all kinds of storage-IO and, unlike your desktop, your IO bus probably doesn't represent a serious bottle neck (not at 100 Mb anyway).

Re:Err... by Anonymous Coward · 1999-05-08 09:06 · Score: 0

9 MB/second = ~90Mb/second. That is a MASSIVE transfer rate. Few hard drives can keep up with that in actually continuous read rates. Although of course Linux' stack just MUST be better than NTs.

Bah.

Umm.. no. by Anonymous Coward · 1999-05-08 09:22 · Score: 0

Max win95 is 40kbyte/sec? I've got a win95 box sitting next to me now that I've seen do sustained 600 kbyte/sec over cable modem (10bT).

Re:Err... by Anonymous Coward · 1999-05-08 09:32 · Score: 0

Let's see some credible URLs please.

Re:Err... by Anonymous Coward · 1999-05-08 09:34 · Score: 0

My Linksys 5 porter is still going strong after 9 months. Couldn't beat it for the price!

Re:Err... by Anonymous Coward · 1999-05-08 09:40 · Score: 0

who said it had to be server-only?

What about geek-toy :)

Just give the best source as reference by Anonymous Coward · 1999-05-08 09:51 · Score: 0

You only passing on information, its not as if you have to give credit to anyone who has ever done an artical on such a topic.

It seems logical to me that you should quote the best source of information you have as the refernce.

That said, im not a logical person either and plagarism would be fine by me (claim it as your own hehe).

Re:The next software task to be moved to hardware by Anonymous Coward · 1999-05-08 10:59 · Score: 0

youve obviously never tried to run linux 2.2.6
on a 386SX-16 with 4 megs of ram and a 43MB
HDD.

Re:Err... by Anonymous Coward · 1999-05-08 11:02 · Score: 0

I don't know how you got those numbers. I routinely get over 1MB/sec with my ftp server over the 10bt lan here. I only have a cheapo 10/100 DEC 21140 based netgear card.

Re:No by Anonymous Coward · 1999-05-08 11:06 · Score: 0

How is 900kbytes slow? 900kbytes is 75% of the maximum rate of 1.25Mbytes/sec.

Re:Err... by Anonymous Coward · 1999-05-08 11:06 · Score: 0

Typical Slashdot "news". I get more reliable information on the wall of a public bathroom.

And a ViRGE speeds 3D on a 486SX... by Anonymous Coward · 1999-05-08 11:14 · Score: 0

Of course they want people to think, "if this board can do 9MB/sec on an Aptiva, it must be good". Never before have I heard of anyone running NT on an Aptiva. Heck, you can do a benchmark of 386 with a GUS versus a 386 with an SB16 and Timidity, and completely miss the point that on a decent system, both sound the same.

In regard to Aptiva model numbers and CPU speeds, it's possible that IBM updated the chip before you bought yours--even though they're both E56's, they probably have different model/type numbers.

Anyway, none of this means diddley due to the crappy "benchmark". TCP on a gigabit ethernet card would be more worthwhile anyway, since that's when the interrupts really flood your CPU.

loadavg != cpu utilization by Anonymous Coward · 1999-05-08 11:17 · Score: 0

that simple, or do you think that when you get a load average of 5 it the CPU is doing 500% (of course not!)

The load average is just the average number of processes waiting on the cpu in order to get work done. Each measures separate things, so don't be surprised if they differ.

Re:Err... by Anonymous Coward · 1999-05-08 11:23 · Score: 0

Yeah, but for transfering a single large file from an unfragmented HD, UltraDMA/33 is going to be comprable to SCSI, unless you have some serious striping.

Re:The next software task to be moved to hardware by Anonymous Coward · 1999-05-08 11:32 · Score: 0

What do you mean by hardware-only memory management? The '386 already has a complete MMU, and the allocation/configuration algorithms really do belong in software for integration with the rest of the OS.

Re:The next software task to be moved to hardware by Anonymous Coward · 1999-05-08 11:49 · Score: 0

In fact, the company I worked for, has several
486sx25 machines with linux-2.2.x kernels. Of course, compiling the kernel on those is out of question. We tried 386sx16-s too, but they
were too hard to find.

Re:Is this new? by Anonymous Coward · 1999-05-08 11:59 · Score: 0

Sure, your point seems valid (and mostly is), except for one thing you seemed to have looked over: 9 years ago you weren't using 100Base-T, were you? If their claims about the processor usage are true, then umm, this would provide a signifigant improvement. As for the good reason they haven't gone mainstream, /.'s only been around for about 3 years, right? Hehe
-me

Re:Err... by Anonymous Coward · 1999-05-08 12:00 · Score: 0

well - the 21140 is cheap as in not expensive. it is a good piece of hardware though - I use the Kingston KNE100TX cards (21140 based) in a few machines - works great.

Re:What do you think is "average" on 10bT? by Anonymous Coward · 1999-05-08 12:10 · Score: 0

Well, if I remember well, the curve knees out
somewhere between 55 or 65% of the nominal
throughtput (assuming there are multiple nodes
contending -- single node networks are a different
story).

Re:NT, not linux == WHO CARES by Anonymous Coward · 1999-05-08 12:14 · Score: 0

Not true. Hardware is interesting, even if it doesn't (currently) work under linux. The availability of PC NIC's that have on board TCP/IP could be a real win for linux, too.

Face it, all new PC hardware is going to only support M$ stuff.

Re:NT, not linux == WHO CARES by Anonymous Coward · 1999-05-08 12:46 · Score: 0

Who cares if it runs on NT. It's a good idea, even if it is one revisited or overhyped. I don't think /. should be passing up stories because of anti-NT sentiments. If it's cool, it's cool. If it's a good idea, it's a good idea, regardless of the OS.

Yes, /. has a heavy linux bent to it (duh). But I also like the fact that /. doesn't ignore the other OSs. First, it's good overall coverage and understanding of what's changing--technology, marketing, design. Second, it should be reminding us that we, the /. community, do not have a monopoly on thought out there. Good changes can come about from "bad" companies.

Heck, just look at KDE and gnome. Even they have some efforts to mimic known desktops; I would imagine part of the reason is because they want to make it easier for people to migrate to linux, but also because there are good things to be learned from other OSs.

Re:Been There, Done That by Anonymous Coward · 1999-05-08 13:04 · Score: 0

yep, it's the old wheel of reincarnation...

application-specific hardware gets replaced by
more flexible software as soon as microprocessors get fast enough.

Re:Memory management in hardware wouldn't be progr by Anonymous Coward · 1999-05-08 14:35 · Score: 0

Silicon TCP has the potential to scale further than software based TCP, particularly on systems which are CPU bound, not bandwidth bound. Would you rather spend your CPU cycles doing real work, or building packets?

That said, I don't know how much putting the TCP stack in hardware really saves you vs. a reasonably intelligent NIC that just computes the checksums for you (which already exist)

Re:Ethernet can run at almost 100% bandwidth. by Anonymous Coward · 1999-05-08 15:44 · Score: 0

Bandwidth utilization is not directly attributable to octets os bits of payload data moved. It's actually quite a dry calculation. And I can guarantee that very rarely will two collision domains show the same benchmark results. I've seen enough poorly designed repeaters that blantantly break IEEE 802.3 spec in the way they handle runts, long events, collisions etc. Also, there's lots of NICs that do the same.

Take the same 10 boxes and do a bench mark in one collision domain of one manufacturer's repeater, place them into 9 different collision domains and repeat the same benchmark with a different repeater. I guarantee that they will not be the same.

The scenario you mention above is a rather ideal one. I'm sure it could more or less be reproduced, but be careful not to make false blanket statements in arguments against others of the same ilk.

Re:The next software task to be moved to hardware by Anonymous Coward · 1999-05-08 15:46 · Score: 0

Try 1.2.13 - it runs nicely on that level of hardware

Re: 1mbs/s by Anonymous Coward · 1999-05-08 16:06 · Score: 0

1 mbs is one milli-bit per second ;)

Re:Err... by Anonymous Coward · 1999-05-08 16:21 · Score: 0

I don't know where you're pulling these numbers from. Where I work, we regularly get 800 kB/sec and higher over switched 10 Mb ethernet between Windows machines.

Re:Ethernet can run at almost 100% bandwidth. by Anonymous Coward · 1999-05-08 16:21 · Score: 0

"Sigh. That old myth."

As the previous poster mentioned: "be careful not to make false blanket statements in arguments against others of the same ilk."

(Mea culpa: up front) I have not read the specific paper you have referenced (and will do so (I promise ;)). However, I have studied ethernet congestion formally, and there are concrete mathematical reasons why a contention based algorithm cannot produce 100% of nominal throughtput (and there is an obvious common sense argument as well (but it doesn't pin down a specific number).

This really isn't a religious argument, nor is it some kind of 'insult' towards Ethernet. Stating that there is a fundamental performance limit in Ethernet (CSMA/CD) is not equivalent to a criticism of the design.

In fact, Ethernet is a brilliant design and (along with tcp/ip, html, dns and others) it is a brilliant example of two design concepts:
(1) worse-is-better, and
(2) complex systems are doomed by complex solutions: instead, complex systems _require_ "simple" basic principles -- .

Cheers

Re:NT, not linux == WHO CARES by Anonymous Coward · 1999-05-08 16:33 · Score: 0

Depends entirely on them, and how well they document their hardware. Considering the product info on their "web site" for prospective customers is only available in Word format (spit), I don't hold much hope.

Done before, but now patented? by Anonymous Coward · 1999-05-08 16:43 · Score: 0

These guys patented "Silicon TCP." If they now exclusively own the notion of hardware that knows how to do more of its job, I may have to finally go postal.

Re:Er... why is this on Slashdot? by Anonymous Coward · 1999-05-08 16:48 · Score: 0

A new kind of coprocessor, tied to the Internet Protocol, is certainly News for Nerds. Even if proprietary (and it may not be; it'd be easier to tell if only their Web site weren't so useless) - it doesn't have to be *good* news....

Thats funny! by Anonymous Coward · 1999-05-08 17:11 · Score: 0

My PRO200 server can almost saturate 100Bt (11.25MBytes/sec) using knfsd off of a SOFTWARE eight disk raid-5 array!

Re:Err... by Anonymous Coward · 1999-05-08 17:22 · Score: 0

Hey man, the highest call so far was 8MB, meaning 80Mbits, so your 1mb doesnt match the pot. Now are you in or are you out?

Re:security implications by Anonymous Coward · 1999-05-08 17:34 · Score: 0

Then this means that the real future of TCP/IP lies in fast ethernet cards built of somekind of FPGA cards.

Re:The next software task to be moved to hardware by Anonymous Coward · 1999-05-08 17:45 · Score: 0

Recently, more and more is moving to the hardware.

Part of the Great Wheel of Being. Someone notices that you can do something faster in hardware, and ships it. A few years later CPUs are faster, and someone discoveres you can save a few bucks on hardware by doing it on the general-purpose CPU via software.

It would be interesting to see how it holds up under multicast. Lots of cards filter mulitcast groups only at the OS/TCP stack level, and even those that do some filtering at the hardware level usually have to pass at least some groups up to the OS. That puts a load on the CPU, even when the box isn't actually subscribed to any multicast groups. I've seem some 166 pentiums with 3Com cards, subscribed to no mcast groups other than ALL-HOSTS, on an ethernet that has heavy mcast traffic, suck up 15-20% of CPU just discarding packets not addressed to it.

Re:Yes, M$ is slow. by Anonymous Coward · 1999-05-08 17:53 · Score: 0

> Using a raw packet driver on a 200MMX notebook with a SMC 100Mb/s Full Duplex card and using remote loopback we are unable to get much more then 15Mb/s sustained. (That's 15Mb/s going out and 15Mb/s comming in) .
This was using raw Ethernet packets, no TCP/IP or other protocol.

'remote loopback' hopefully implies that no Ethernet packets were _ever_ sent - just the ip code realizing to loop back to itself. So, what you're describing is either a really broken implementation, or you misspoke. Which is it?

That's because it's a hoax by Anonymous Coward · 1999-05-08 17:56 · Score: 0

C'mon. Get real. Thousands of things like these get tossed out on the 'Net all the time...but usually people don't fall for them this badly. Let's see. A little tiny company, only two or so years old (try looking for "interprophet" with altavista or something, and whois shows their domain name registered in Dec '97) suddenly (Altavista didn't find bunches of pages yammering on about how great whatever they were making was going to be during development time...expected, if they're a startup looking for funding) comes out with a pretty darn impressive engineering feat (a full TCP stack is a pretty large thing to be putting in hardware). Not a lot of technical info. I've seen other scammers make *much* more plausible web sites than that. If they really had the mony to put into doing that much hardware development, they could afford to at least take pretty pictures to push their products...not trashy shots in a barren office or whatever.

Yeah, it's *real* hard to doctor screen shots.

Over 100 comments so far, and I haven't seen anyone who has realized that this thing isn't for real. :-(

Re:That's because it's a hoax by gavinhall · 1999-05-08 20:03 · Score: 1

Posted by max kreed:

http://mainz-online.de/internet/news/news130598. html

Notice the date: 13th of May 1998. If it's a hoax then they've been doing it for quite a while I guess.
Re:That's because it's a hoax by sjames · 1999-05-09 01:58 · Score: 1

(a full TCP stack is a pretty large thing to be putting in hardware)

Unless they count firmware as hardware. I have considered similar stunts with the Acenic gigabit card. At gigabit speeds, the idea looks a lot more useful.

Re:Documented Linux vs NT Performance by Anonymous Coward · 1999-05-08 17:57 · Score: 0

Are you sure your cards were running in full duplex? With a crossover cable I could only get half. And yes, I did it right for full duplex and even made one the "wrong" way and it still did only single. I think the cards need to talk to a switch to go full duplex. Full duplex is pretty important for tcp/ip on local networks because of the ACK's sent back.

With NT, Microsoft has a KB article at http://support.micro soft.com/support/kb/articles/q169/7/89.asp which supposedly helps tcp/ip speed. Not sure if it's going to make a difference...

Also, a little benchmark of my own (which is not anywhere as good as yours) is sending a 4.5 meg file between a Linux 2.2.7 and a FreeBSD 4.0 system (standard ftp daemons and clients as installed) via ftp. FTPing from the linux system to FreeBSD transfers the file in approx 1.3 seconds. Going the other way (FreeBSD doing the sending) takes 0.4 seconds... I'm on a 1/2 duplex 100 mbps network. There is a 3c905 PCI in the Linux box (PPro 180) and a DEC 21040 in FreeBSD (p5-200).

The most intersting observation: running rc5des seems to slow down the transfer rate! Although running at nice -20 priority, it seems to affect receive speed by approx .25 seconds but puts FreeBSD transfer speed up to 2 seconds! Maybe IP in silicon isn't such a bad idea after all. I will try my setup with your tests though.

P.S.
Where do you get your 21040 cards? I can only find the 3com's which cost way too much or these VIA Rhine cards which leave much to be desired. Got a couple 21040's a while back which work great on all OS's.

MP3 player offloaded to hardware? by Anonymous Coward · 1999-05-08 19:04 · Score: 0

How about... A sound card with an MP3 player easily loadable into it's onboard DSP to offload that task from the processor? Or better yet, a means to use the DSP to encode MP3s without signifigant processor load?

Re:MP3 player offloaded to hardware? by bdjohns1 · 1999-05-08 19:31 · Score: 1

What do you think DVD and VideoCD hardware decoders do? Video CD is MPEG-I, and DVD movies are stored using MPEG-II.

Of course, it's not as though MP3 decoding should be rough on a reasonably fast machine. My 333 Celeron typically has loads less than 1% while playing using mpg123 or x11amp. Comparably, running WinAmp over on my windows partiton typically uses ~10% CPU.

The problem is that adding a specialized MP3 decoder eats up yet another slot in the case (unless you make it a USB/FireWire thingy). I'm down to one PCI/ISA slot on my ATX (w/ AGP) mainboard, and once I buy a modem next month, it's full.

A potentially bigger problem is that you've got to redo all the MP3 players out there to support the decoders, and you've got to settle on a standard hardware decoding method, so we don't have to have 5 different versions of x11amp.
Re:MP3 player offloaded to hardware? by Doze · 1999-05-09 04:11 · Score: 1

Surely there is more at stake here than the CPU usage?

If the soundcard could do the decoding then you'd only be sending 3 or 4 megs over the PCI bus rather than 30 or 40 megs. Would this not be a good thing?
Re:MP3 player offloaded to hardware? by ijuz · 1999-05-08 21:51 · Score: 1

>My 333 Celeron typically has loads less than 1% >while playing using mpg123 or x11amp.

no, that is not right, linux has got a problem with this, also the CPU usage of NAD was under Windows only 0%...

winamp consums 5,4% of the cpu power, on my PII/300.
tested with the rc5-client, with and without winamp, this 5,4% are the difference between the key-rates.

try it under linux with this method
(i had not the time, yet)
Re:MP3 player offloaded to hardware? by Breace · 1999-05-09 11:08 · Score: 1

Video CD is MPEG-I, and DVD movies are stored using MPEG-II.

Video CD uses MPEG-1, layer I or II, not layer III which is MP3. Most MPEG-1 hardware decoders that I know of don't implement layer III decoding.

Most DVD movies use Dolby AC-3 for audio, not MPEG although I believe in Europe they do (but again layer I or II as I understand).
I also know of no DVD decoder that integrates MP3 decoding.

Too bad really...

Breace.

Re:Let me guess: ISA 3c509? by Anonymous Coward · 1999-05-08 20:03 · Score: 0

It's because of the busmastering! The ISA bus can only adress 16 MB of memory, if the data buffer is for example at 18 MB the driver needs to do some tricky stuff each time data is sent to and from the NIC, that's also the reason why no one should ever buy a ISA busmastering SCSI card for a system with more than 16 MB of RAM.

Re:Ethernet can run at almost 100% bandwidth. by Anonymous Coward · 1999-05-08 23:09 · Score: 0

That 10Mb/s has to carry all the TCP/IP packet
headers etc, so you can't expect to get the
entier 10Mb/s for data transmissions (like the
way you can't get 2Mb of useable data on an
HD floppy)

Does anyone have some figures as to how much
bandwidth is eaten by TCP/IP -- I would suspect
that it can partailly account for the 7.2Mb/s
observed speed.

Yes by Anonymous Coward · 1999-05-09 02:01 · Score: 0

Linksys Sucks ;-) Hell, I've got the generic DEC Tulip chipset in my alphastation and a 3c509 card in my ppro and I can easily get between 800-900Kb/sec consistently. My laptop's Intel EtherExpress PRO/100 card (16 bit) can easily sustain 800Kb/sec. On the other hand, transfers to my other machine with a generic ne2000 card in it are around 150-200Kb/sec.

Re: 1mbs/s by Anonymous Coward · 1999-05-09 03:49 · Score: 0

You're in college?

Re:Ethernet can run at almost 100% bandwidth. by Anonymous Coward · 1999-05-09 05:06 · Score: 0

Someone already mentioned how the number of stations effects the % but another overlooked one is physical net size. If you have active nodes at oppisite ends of the max size net they produce many collisions which drives down the % quite effectively. Someone just needs to solve that pesky speed of light problem now.

Re:Err... by Anonymous Coward · 1999-05-09 05:35 · Score: 0

Maybe for old decrepid SCSI II, but compared to U2W, UltraDMA is a complete joke.

Re:Is this new? by Anonymous Coward · 1999-05-09 05:57 · Score: 0

Sure, your point seems valid (and mostly is), except for one thing you seemed to have looked over: 9 years ago you weren't using 100Base-T, were you? If their claims about the processor usage are true, then umm, this would provide a signifigant improvement. As for the good reason they haven't gone mainstream, /.'s only been around for about 3 years, right? Hehe

Indeed, it might well be possible that with 100 Mpbs/1 Gbps cards and very small packet sizes the processor usage may hit the roof. However, TCP/IP is not heavy to process, so automatically you should hit a bottleneck from your server process before having to much TCP/IP to process in the kernel. Looks what happens on Slashdot :-)

Re:security implications by Anonymous Coward · 1999-05-09 06:34 · Score: 0

"Denial of Service." You know, the way some OS' TCP stacks would tie themselves in knots when receiving a flood of bogus SYN packets (every one representing a "connection request" that allocates resources in the stack) or a "super source quench" ICMP redirect ("your new route to me is ... yourself!") or similar attacks.

What you missed by Anonymous Coward · 1999-05-09 06:40 · Score: 0

Everybody seems to be talking about TCP/IP performance on a machine with no load and an empty network. Now test a WinNT database server running 10,000 data base request, that many dynamic request should jack your cpu usage to about 98% (estimate) now were is the processing room for your NIC? What is your average transfer rate at that kind of load?

To be fair try this on Linux also, start up something like MySQL, give it 10,000 request and test the average transfer speed on that.

Also on both machines set your packet size to something small (500) and see how that effects performance.

All these test on machines not really doing anything is not going to prove much, I don't know why they performed these test on regular PC's when the equiptment that is going to benifit from a technology like this would have a large riad subsystem and a ton of ram.

Just something to think about

Re:Please attribute your sources better. by Anonymous Coward · 1999-05-09 07:03 · Score: 0

i was surprised that no reference to memepool was made... Of course there's going to be a lot of overlap of "cool" info... but if everyone tries to take credit for it as their own, we all lose.

What is www.microsoft.com running? by Anonymous Coward · 1999-05-09 09:08 · Score: 0

From using nmap, it is clear that Microsoft does not use the standard Windows TCP/IP stack on their main web servers, like www.microsoft.com (though they do the standard Windows TCP/IP stack on some of their less popular servers). So what TCP/IP stack DO they use?

Re:What is www.microsoft.com running? by Anonymous Coward · 1999-05-09 15:04 · Score: 0

Check out this page:
http://www.microsoft.com/backstage/column_T2_1.h tm

Re:Documented Linux vs NT Performance by Anonymous Coward · 1999-05-09 10:31 · Score: 0

Look here for my study on Linux vs NT performance with TCP. NT's stack is indeed inefficient, but the performance may also be related to really lousy interrupt handling as well.

Your data is interesting ; but to go further it would be interesting have a tcpdump trace to see how the TCP/IP dynamic interacted. Because Windows and Linux are probably implementing different variation of TCP/IP (there are numerous behavior possible, depending of the options implemented). Note also that getting super-high performance with one flow between two machines, and getting good performance on a big network/Internet are not necessarily completly related. Because of even if you don't saturate, that means that more bandwidth will be free for others machines.

Re:The next software task to be moved to hardware by Anonymous Coward · 1999-05-09 10:52 · Score: 0

Looks like web servers are going to be implemented in hardware real soon.

There is no advantage in making webserver in hardware because you'll have to use micro-code in the hardware anyway, with a persistant storage for code. Basically to code a webserver, you need a general purpose computer, and there is no point in putting a webserver on a low-end RISC subprocessor, on a machine where you have a powerful processor (multiple way, with much cache).
Maybe you are implying that soon, say, your monitor itself alone will host a web server, but that will be because your monitor would have a powerful RISC processor and have bigger software.

Would you believe 3D graphics were once done in software?

There have been several cycles to put graphics processing to hardware to CPU to hardware to CPU etc...

For instance when the first 3D game was out (Wolfstein 3D), on 386 a multiplication was taking something like 20-50 cycles ; nowadays on MMX machines you could do 4 16-bit multiplications per cycle. I'm not sure but we could well be close to be moving 3D stuff back to CPU. If you consider Voodoo 1/3D cards prices, an indicator of the die surface necessary to implement the 3D processing, you'll see that probably putting the 3D processor on the same chip as the CPU would be even doable now (it wasn't when the first Voodoo 1 was out).

You should note also that for sound, the CPU has already took the task back ; the CDROM quality , 16 bit, is now doable even on low end cheapo cards (I remember when SB 8 bit was costing $100), audio card makers don't now what to invent now to continue to sell expensive cards, and audio compression (MP3) can be done in software (but not MPEG2 video yet right now, although...).

Re:Documented Linux vs NT Performance by Anonymous Coward · 1999-05-09 19:10 · Score: 0

The extra time with the distributed.net client should actually be easy to explain...

When the RC5 client get's a timeslice, it most likely is using the FULL timeslice, whatever it is set to. It could be 200ms, 300ms, 150ms ... on some OS's it varies (the less the cpu load, the longer the timeslice on some systems).

In addition, while FTPing, the ftp process may be waiting on I/O for some reason or another -- but has nothing else to do. So it'll give the RC5 client another timeslice. Which it uses...

...my thoughts on the matter.

Re:teach the winmodem manufacturers by Anonymous Coward · 1999-05-09 21:09 · Score: 0

I'm with you. Moving common tasks to dedicated hardware is always a winner. TCP/IP today is like WinModems--the software has to do most of the work.

More FUD by Anonymous Coward · 1999-05-10 03:20 · Score: 0

Will you Linux kiddies stop spreading FUD about Microsoft products? You are looking silly!

Re:Diff please? by Anonymous Coward · 1999-05-10 03:22 · Score: 0

In the file kernel/sched.c find the function
count_active_tasks and remove the tests for
(*p)->state == TASK_UNINTERRUPTIBLE || (*p)->state == TASK_SWAPPING

Rob

Wow. An expert. by Anonymous Coward · 1999-05-10 03:22 · Score: 0

Wow, you seem to know a lot about the internals of the Microsoft stack.

Which parts of the source code are exactly inefficient, or is this more FUD?

This is a real win by Anonymous Coward · 1999-05-10 06:07 · Score: 0

Our studies show that just doing the TCP and IP checksums in hardware more than doubles throughput. In theory, one should be able to turn off checksums and acheive the same performance gain, but in practice, the NT4.0 registry entry that is documented as disabling checksums appears to have not effect... I presume in Linux it is actually possible to disable checksums?

Re:Err... by Anonymous Coward · 1999-05-08 08:48 · Score: 1

I'm using Samba between Windows 98 and Linux 2.2.6, and my speeds are significantly lower than
that (~ 100 kb/s). I have a decent 3Com card in
the Linux box and a cheapo Linksys card in the
Windows box, along with a Linksys hub. Is there
anything I can tweak to make it go faster?

I agree by Anonymous Coward · 1999-05-08 08:49 · Score: 1

I never though CPU load was an issue, if 100 Mbit netwwork cards have that sort of cpu load then whats the go with gigabit ethernet ?
Maybe they have some wierdo network cards that dont use DMA.
I think ill stick my 10mbit NIC for now

Already have routers on silicon by Anonymous Coward · 1999-05-08 08:52 · Score: 1

There are at least a half dozen companies developing/offering routers on silicon. Most have highly parallel and exotic (e.g. hypercube) architectures with hardwired advanced pattern matching algorithms. The future of TCP/IP is in dedicated hardware circuits.

Re:Already have routers on silicon by netwiz · 1999-05-08 14:24 · Score: 1

Cisco's switching layer 3 at 40mpps, and layer 2 at 256mpps. All at wire speed gigabit ethernet, OC48 ATM, etc., all non-blocking. mmm... beefy...

Let me guess: ISA 3c509? by Anonymous Coward · 1999-05-08 08:55 · Score: 1

Banging bits out an ISA bus isn't exactly a fast process.

Re:Let me guess: ISA 3c509? by akharon · 1999-05-08 09:22 · Score: 1

true, but a ten base card can't fill the throughput of an isa slot.
Re:Let me guess: ISA 3c509? by IntlHarvester · 1999-05-08 11:08 · Score: 1

Although, how much CPU do you need to fill that ISA bus? My understanding is that by moving 10BT ethernet to a PCI slot on an older machine (like that guy's P120) could cut CPU usage by 50% or more. In ye olden days, moving the network from ISA to even EISA was a noticible improvement.
--

--
Business. Numbers. Money. People. Computer World.

Benchmark with netcat of /dev/zero by Anonymous Coward · 1999-05-08 12:06 · Score: 1

Ok, Ive got two computers with SMC etherpower II 10/100 cards running under Linux 2.2.4, connected by a crossover cable in full-duplex mode, with little other traffic. ie: ideal conditions.

host 1: nc -l -p 5050 > /dev/null
host 2: dd if=/dev/zero of=/proc/self/fd/1 bs=4096 count=102400 | time nc utrk 5050 -w1

output from host 2:
102400+0 records in
102400+0 records out
0.70user 7.70system 0:46.10elapsed 18%CPU (0avgtext+0avgdata 0maxresident)k
0inputs+0outputs (140major+22minor)pagefaults 0swaps

since netcat had a 1 sec delay, we get a total of 45 seconds for the transfer of 419430400 bytes = 9320675 bytes/sec = about what they are quoting. CPU load was higher that 2%, obviously (actually, about 25%), but the computer was still usable (it is the disk IO that would kill it, anyway)

Re:Already have routers on silicon -- yup by Anonymous Coward · 1999-05-08 12:22 · Score: 1

Yeah, this is in fact one of the "other uses" that toshiba is planning for the chip that is going to be used in the "Playstation II"

Re:Err... by Anonymous Coward · 1999-05-08 17:17 · Score: 1

Um, just wanted to point out, that the samba protocols specifies that for most machines out there, you can only send 65,535 bytes in one go. And then you have to make sure that the other side heard you. So, ftp will always be faster than samba.

Re:Err... by Anonymous Coward · 1999-05-08 08:53 · Score: 2

Remember to keep in mind the differences between bits and bytes, while its probably true that you don't _notice_ a performance hit at 900K/s, what you really mean is 900 kilobits/s. That's nearly 80 times less than 9 megabytes/s. So if the kernel spends 1% of CPU time on 900kb then its easy to see why it might spend a whole lot more time on 80 times more data.

My two cents.

Re:Been There, Done That by Anonymous Coward · 1999-05-08 13:08 · Score: 2

oh yeah, interested people should take a look at

D. Clark, V. Jacobson, J. Romkey, and H. Salwen,
An analysis of TCP processing overhead, IEEE Communications, Vol. 27, No. 6, pp. 23--29,
June 1989.

Classic paper! (well, kinda)

Re:Err... by Anonymous Coward · 1999-05-08 18:00 · Score: 2

You must be *very careful* assessing the load on a Linux system, as the measurement of the load is far from optimal or complete!
The "load average" you see displayed in the output of the "w" or "uptime" commands is quite useless in this case, it shows the average number of user processes that are "ready to run", i.e. they would take the processor if it were available. This number in no way reflects the load by interrupt handlers. It is also badly computed, as it includes processes that are waiting for the disk.
(this error was introduced into the kernel long, long ago and I have mailed Linus about it, but he did not want to change it as this figure better reflected the general feel of load on the system, in his opinion. In my opinion, it makes the whole "load average" useless, so I always patch this away when I compile a kernel)

The numbers in /proc/stat are also useless for the kind of tests required in this case, because the load imposed by interrupt handling is added to the process that happens to be running when the interrupt comes in. When the interrupt comes when the idle process is active, the interrupt handling time is counted as idle time.
This means that when a test is run that does not use much user-space processing and heavily relies on interrupt handling (like a networking test where a test process sends useless data over a TCP socket! same when it tries to use a serial port), the load results obtained from Linux will be far off the realistic load on the processor.

Rob

This is a Bad Idea (tm) by Anonymous Coward · 1999-05-08 11:45 · Score: 3

Although writting a driver would be fairly easy, this would break all sorts of features that have come into existance due to open source protocol stacks (not being able to do IP chains stuff to an external IP stack is a definite step backwards).

They would either need to open source the stack and make it downloadable from an OSS driver (like some of the SCSI cards out there) or the card will never get within 10' of my boxen.

Somone already pointed out the security implications. Personally, I don't want to be yanking a card in and out of production just becuase someone built the next teardrop and the vendor is slow to fix it.

As for NT, well, this is obviously the tact that MS is pursuing to gain equal performance with other OSS operating systems, but it has certain implications that will keep NT in the second fiddle chair. The card will probably weaken NT security initially by breaking fixes that are covered by current hotfixes and service patches. Additionally, I can think of no better way to fill up somone's disks than by having improved transfer retes across the net, operating system and disks. Worm designers should re-joice as well. Now, you can design worms that can consume more resources without being noticed. If you're really tricky-trick you could design a worm that existed only within the context of the the TCP/IP stack and if the board has NVRAM... well, a box could stay compromised for years. How long before a Microsoft Weapons System sees daylight?

-- "Most decently written TCP/IP stack applications have NO buffer overrun problems" - an anonymous programmer at Fort Mead

-- "ALL TCP/IP stack applications are a long way from being mathmatically correct." -- A mathmatician's retort.

-- "Our job is to find the differences between one and the other and keep this information from the public as long as possible." -- A manager who successfully defused the situation..

security implications by Anonymous Coward · 1999-05-08 09:06 · Score: 4

Heh, the sites been slashdotted, they should use their own cards or something... :)

Moving the stack into hardware is an interesting idea, though. Unfortunately it has some negative (and admittedly positive) implications for those concerned about security.

First, it will be impossible to tell what operating system a computer is running by using TCP fingerprinting. This is both good and bad in that it will thwart script kiddies to some extent by not revealing the platform, thus making it more difficult to take advantage of well known exploits. On the other hand things like Netcraft and the Internet OS counter will also not be able to take surveys properly.

Second, and entirely negative, is the possibility that their hardware implmentation of TCP/IP may be sub-standard. It may have scads of DOS loopholes and other weaknesses. Unless they make the thing software upgradeable as holes are found, and make the software Open Source, I don't see it gaining much marketshare against the cheap and plentiful cards we have now.

Re:security implications by Ami+Ganguli · 1999-05-08 09:15 · Score: 2

Netcraft uses ident info returned by the web server. This would still be available to both Netcraft and script-kiddies.

DOS attacks on the card might be a problem, but I imagine they would be upgradable. Also, the higher levels of the protocol would still need to be on the main processor. Presumably workarounds would still be possible.
... Ami.

--
It is tempting, if the only tool you have is a hammer, to treat everything as if it were a nail. - Abraham Maslow
Re:security implications by Gumber · 1999-05-09 04:14 · Score: 1

Um, it is equally possible it doesn't suck. What is a "DOS loophole" Open source would be nice, but it doesn't necessarily determine the success or failure of this card. I think the amount of money they can put into marketing and distribution is a bigger issue.

How far should we go? by jbaugher · 1999-05-08 09:35 · Score: 1

I submitted the news item, and thought it to be a bit overwhelming to cite more than one source. What is the appropriate limit as to how many sources to cite?

I think going 1 level down is fair -- I got the item from 1 source, the geeks list.

What if an item popped up on the geeks list that came from site1 that came from site2 that came from site3...?

Shoud geeks, site1, site2, site3 ALL be cited?

Opinions?

I meant kilobytes... by Wakko+Warner · 1999-05-08 09:13 · Score: 1

I've managed to max out the ethernet card a few times with ftp transfers between my Linux box and my FreeBSD box at home. I would certainly be complaining if I was only getting 900 kbits/second. Heck, I could get more than that over a parallel port. :)

- A.P.
--

"One World, One Web, One Program" - Microsoft Promotional Ad

--
"Remember when the U.S. had a drug problem, and then we declared a War On Drugs, and now you can't buy drugs anymore?"

Err... by Wakko+Warner · 1999-05-08 08:33 · Score: 2

Maybe NT's stack is just woefully inefficient, but I never experience undue CPU load when doing massive file transfers in Linux. At worst, the load will hit 0.05 or so. I certainly don't *notice* any performance hits, however, even when getting 900K/s on a 10 megabit card.

- A.P.
--

"One World, One Web, One Program" - Microsoft Promotional Ad

--
"Remember when the U.S. had a drug problem, and then we declared a War On Drugs, and now you can't buy drugs anymore?"

Re:Err... by Fict · 1999-05-08 09:29 · Score: 1

If your return policy is still good get rid of the linksys hubs. not because it is slow, but because in a matter of months, the hub will experience nice spiraling death-type symptoms.

So far my sohoware 8 port + 1 switch is holding up nicely.

------------------
Re:Err... by Trepidity · 1999-05-08 15:14 · Score: 2

Note that WinNT had 98% CPU load going at 4.5 MB/sec. You mention that you don't notice a performance hit at 900kB/sec. If the CPU load varies linearly with transfer speed, NT would only use around 19.5% of the CPU at the 900kB/sec you're getting, at which point you probably wouldn't *notice* any performance hits either.

--
10 PRINT CHR$(205.5+RND(1)); : GOTO 10
Re:Err... by Ole+Gjerde · 1999-05-08 17:27 · Score: 1

>since netcat had a 1 sec delay, we get a total >of 45 seconds for the transfer of 419430400 >bytes = 9320675 bytes/sec = about what they are
9,320,675

Actually, the speed he is quoting is about 9.3MB
Re:Err... by jd · 1999-05-10 02:53 · Score: 1

Depends on your MPU. :) 1500 is merely the maximum. :)

--
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
Re:Err... by jmalicki · 1999-05-08 18:10 · Score: 1

But ethernet only lets you send 1500 bytes in one go, so it's not that big of a deal.
Re:Err... by Caelum · 1999-05-08 09:21 · Score: 1

Linux's tcp/ip is many times better than NT's, someone did a comparison I don't remember the URL. But do your own tests...
Re:Err... by mikpos · 1999-05-08 09:09 · Score: 1

Give up some more information. Is this just with Samba? Try FTPing and see if you get similar speeds. If you do, then I can't help you because I'm hopeless with Samba :)
Re:Err... by Rendus · 1999-05-08 15:46 · Score: 1

5 port 10mbps 10baseT hub, $50 with 2 PCI NE2000-compliant NICs. 2 years later, still going strong, with an average of around 650kbytes/sec transfer (acceptable for what I use it for)...
Re:Err... by Extremist · 1999-05-08 13:41 · Score: 1

One thing that really made a difference for me was switching samba (ver 1.9.18 on Caldera) to user level security (at least I think that's what made the diff.=)

Tranfer speeds of 1 MB/sec (100Mb NIC, all IDE since it used to be my desktop system,) compared to our NT server (100Mb NIC, all SCSI on a $25K dual CPU Compaq server) doing about 500K. My boss (a big NT fan) hates when I point this out. Hehe.

Performance does suck when backed up to another NT server, though. Only 500K/sec. I have a feeling this would inprove if we'd map the share instead of using the UNC. Anyone have exp w/this?
Re:Err... by aphr0 · 1999-05-08 12:36 · Score: 2

I find it odd that so many linux users complain about fud directed towards linux, then go around and spread this kind of nonsense. I get 1mb/s at work easily between 2 win95 machines, both with 10mb cards with no noticable cpu overhead at all.
Re:Err... by sinator · 1999-05-08 13:58 · Score: 1

You can actually dynamically load a new TCP/IP stack if you want. Most applications ask you to use Microsoft TCP/IP stack, and ask you to reboot after making changes because the Win32 API is so brain-dead that no one is sure what changes do what to what. So long as you're fairly confident about the reliability and ability to stay within its memory space, you can plug and chug a new stack (i'd reccomend you do it in kernel land, instead of userland, the hooks ARE there, or use psxss)

--
Three Step Plan:
1. Take over the world.
2. Get a lot of cookies.
3. Eat the cookies.
Re:Err... by dirty · 1999-05-08 15:26 · Score: 1

From what I've heard Linux has the fastest TCP/IP stack of any OS. But i've never seen any numbers to back this up.

--

-matt
Re:Err... by SoftwareJanitor · 1999-05-11 00:38 · Score: 1

The first thing I would do is get a couple of better ethernet cards. The 3C509 series are good 10 Mbit cards, but the 3C905 has gotten bad reviews as a 100 Mbit card. The cheapo LinkSys card is probably an NE2000 clone, and NE2000 clones are notoriously poor performers. As cheap as 100 Mbit Ethernet cards are these days there is little reason not to get them. Assuming you have an available PCI slot in each machine I recommend getting the Bay Networks FA310TX card, they run between $25 and $40 retail (CompUSA and Best Buy both carry them) or mail order. These cards use the DEC Tulip style chipset which is very fast. Another Tulip based card that is good is the D-Link DFE-500TX. I believe LinkSys also has a Tulip type card, but be careful, all of these brands also make cheapo "NE2000" style cards, even in 100 Mbit versions. The older Tulip cards were easy to identify because the large chip had "DIGITAL" on it, but newer chips are manufactured by someone else as Compaq sold off the old DIGITAL chip manufacturing recently.

In general, NE2000 style cards are at best mediocre performers, and at worst are a major nightmare (read the Ethernet HOWTO). If you can afford it, get yourself a 100 Mbit hub. If you only have two machines and can't afford a 100 Mbit hub, get a flipover cable and don't use a hub at all. If you have more than two machines and some money, you can get a switching hub to reduce collisions.
Re:Err... by slashdot-me · 1999-05-09 22:28 · Score: 1

My IBM deskstar 10 gig UDMA gets 11 MB/s. I've got two in raid-0, though. Together they get 23 MB/s.

[root@leia root-tmp]# /sbin/hdparm -tT /dev/hda6

/dev/hda6:
Timing buffer-cache reads: 64 MB in 0.96 seconds =66.67 MB/sec
Timing buffered disk reads: 32 MB in 2.92 seconds =10.96 MB/sec
[root@leia root-tmp]# /sbin/hdparm -tT /dev/md2

/dev/md2:
Timing buffer-cache reads: 64 MB in 0.99 seconds =64.65 MB/sec
Timing buffered disk reads: 32 MB in 1.41 seconds =22.70 MB/sec

http://www.ryans.dhs.org
Re:Err... by _Quinn · 1999-05-08 08:52 · Score: 2

I don't *know* about NT's stack, but I would suspect it's woefully ineffecient; I've run tests Linux vs Win95 on the same box, and Linux wins by a minimum margin of four times on everything I've bothered to test, and it's usually higher. (Max Win95 rate is about 40k/s; max Linux is about 200k/s; this is the 2.0.34 kernel with Becker's WD driver.) This really sounds like a hardware solution to a problem in Microsoft's software. (Why, pray tell, should the transfer rate depend on 'getting lucky' with context switching? Did somebody at Microsoft decide that the scheduler worked fine because it didn't lose keyboard strokes?)

-_Quinn

--
Reality Maintenance Group, Silver City Construction Co., Ltd.

7200/10,000 RPM SCSI drives can... by Wakko+Warner · 1999-05-08 09:17 · Score: 2

I've got 4 drives of varying make and model from IBM, Seagate, and Fujitsu running at 7200 or 10K RPM. The IBM gets about 10 MBytes/s on average. I've seen it hit 13. The others get between 8 and 12 (usually on the low side.) Since Linux's load average is not just a measure of CPU activity, when I'm transferring files between drives on the same SCSI chain, even, the load will hit 1, but the CPU usage will remain at or close to 0.

- A.P.
--

"One World, One Web, One Program" - Microsoft Promotional Ad

--
"Remember when the U.S. had a drug problem, and then we declared a War On Drugs, and now you can't buy drugs anymore?"

Re:7200/10,000 RPM SCSI drives can... by Rendus · 1999-05-08 15:53 · Score: 1

Western Digital Caviar 20.4 GB IDE drive gets 10.8 MBytes/sec raw reads on my box (P200, 96 megs RAM), and 33.3MBytes/sec from cache.

But to WRITE that fast is another issue entirely.

Re:What do you think is "average" on 10bT? by Scott+Wunsch · 1999-05-09 02:48 · Score: 1

I've seen ftp reporting 1.1e+3 kbytes/s on my 10Mbit Ethernet. Of course, there wasn't really any other traffic on the network at the time.

--
\\'

Re:why is this on Slashdot? by volkris · 1999-05-08 10:01 · Score: 1

Whomever said proprietary?

First of all, many threads here have been talking about writing drivers for it for Linux, etc, and second, many threads have talked about makeing it software upgradable to fix security holes.

Consider that BECAUSE it was posted to Slashdot the makers of this card could be slashdotted with email and offers of help for making drivers for alternative OSs. Plus, if slashdotters show enough intrest they'll make OS drivers because they can see that it will increase their sales.

I think this kind of thing is EXACTLY what should be posted to Slashdot because we'd be able to make a difference.

Is this new? by Ami+Ganguli · 1999-05-08 09:08 · Score: 4

I worked with an expensive Intel NIC 9 years ago that had an i960(I think) and an OSI protocol stack on board. Never did any benchmarks, but I'm guessing the complex OSI protocol stack plus wimpy ISA '386 boxes made putting intelligence on the NIC a good idea at the time.

I figure there must be a good reason these things haven't gone mainstream in almost a decade. The proliferation of simple TCP/IP plus faster CPUs might be one reason.

... Ami.

--
It is tempting, if the only tool you have is a hammer, to treat everything as if it were a nail. - Abraham Maslow

Re:No by Nat+Lanza · 1999-05-09 04:00 · Score: 1

Not all of your bandwidth is going to go to data. Remember that you have protocol overhead to deal with -- ethernet packets have headers, TCP packets have headers, and so forth. All of those headers take up some of your precious bandwidth, so you don't really have 1.25MB/sec to play with.

Re:this would be nice by sjames · 1999-05-09 01:51 · Score: 1

The PIO is killing your performance. Switch to a PCI bus mastering card, and your load will go WAY down.

Re:Been There, Done That by Iggy · 1999-05-08 22:34 · Score: 1

Please, correct me if i'm wrong, but couldn't you
use an FPGA or something similar thereby providing
reconfigurable hardware TCP.

Now that would almost certainly put the cost up and there may be security issues, i'm not too hot on FPGA's. :(

Anybody know if this has been tried using FPGA's somewhere else ???

Iggy

The next software task to be moved to hardware by heroine · 1999-05-08 10:01 · Score: 1

I can't wait until memory management is moved to hardware only. Maybe Linux MM would stop crashing. Looks like web servers are going to be implemented in hardware real soon. Would you believe 3D graphics were once done in software?

Re:The next software task to be moved to hardware by Rendus · 1999-05-08 16:00 · Score: 1

I've run it on a 386DX/33 with 4MB of RAM and -NO- hard drive, a 486SX/33 with 4MB RAM and a 204MB hard drive, a 486DX2/50 with 4MB of RAM and a 400MB hard drive... No problems ever. 2.0.34, 2.0.36, 2.2.1, 2.2.5.
Re:The next software task to be moved to hardware by DannyC · 1999-05-08 10:19 · Score: 1

can't wait until memory management is moved to hardware only. Maybe Linux MM would stop crashing.

Are you trying to spread FUD? I've been using Linux for 7 years now, never had it crash. What are you _doing_ wrong?
Re:The next software task to be moved to hardware by Tarnar · 1999-05-08 10:18 · Score: 1

And when does this reach it's critical mass? How much CAN you take off the software? Are we going to reach a point where the only bit of software we have is that which writes the (hopefully) upgradable driver?

And then, will the standards still be there? Will they be open? We can hope so, but we have to remember that hardware companies can be extremely stuffy about giving away hardware specs. Just look at Aureal or Creative Labs who won't release specs for driver writing to the Linux community.

Recently, more and more is moving to the hardware. And as Moore's law begins to top out we can expect to see more of it. 3D graphics, 3D sound, these tcp/ip stacks, et al. Where does that put software itself?

Food for thought.

teach the winmodem manufacturers by bhmit1 · 1999-05-09 09:07 · Score: 1

Winmodem people need to learn something from this. Things perform better in hardware than software. Of course, this depends on the openness of the drivers. We may be stuck with a good card and no documentation. Only problem I see is that tcp and other pieces of this layer are intended for software. So maybe this isn't a good idea (I'm tending to agree with the "what if there is a hardware bug" comment). Winmodem people seem to have taken the opposite approach and I'm not sure who is worse, but the winmodem people can definately learn from these guys.

No by mikpos · 1999-05-08 09:14 · Score: 1

If whoever did this is getting 900 kilobits/s, he's got SERIOUS problems :). Even 900 kilobytes/s is pretty slow for a 10Mbit (although maybe SMB is just a slow protocol?). Still, my ftpd will eat up to 5% of my CPU running at ~1100 kbytes/s, so I can see where this TCP on-a-card sort of thing might come in handy for higher bandwidth stuff.

RE: No by _SkiBum_ · 1999-05-08 11:17 · Score: 1

I don't find SMB that slow... between my 2 machines Dual-100 linux to celeron 450 win98, I get 900KB/sec on 10mbit and 4MB/sec on 100mbit. And yes those are bytes, not bits.
The main reason it isn't any faster is the drive in the linux box can't go any faster, linux/ide-fireball -> win98/scsi-cheetah...

--
Just a SkiBum stuck in the east...
Re:No by remande · 1999-05-09 23:46 · Score: 2

Still, my ftpd will eat up to 5% of my CPU running at ~1100 kbytes/s, so I can see where this TCP on-a-card sort of thing might come in handy for higher bandwidth stuff.
This is ESR's cycle of reincarnation in action.
Putting TCP on the ether card should allow some serious savings in CPU time, regardless of OS, if it's bus-mastering. For ftpd, the CPU should (in theory) set up a TCP connection with the card, open up a set of blocks (or a file; I'm no expert on this particular protocol) with a bus-mastering SCSI, and just let the disk flow to the card, or the card flow to the disk. Voila, file transfers that eat only the bus and not the processor.
I think that, in the cycle of reincarnation, we have been going from everything-on-the-processor to everything-on-separate-chips. We have separate video accelerator cards (ignoring the video cards themselves), we have SCSI to drop the I/O load off of the chip. People still haven't folded sound into the processor.
Between all this and the dearth of processor vendors, offloading TCP onto a card makes sense. The less load one places on one's processor, the less tied one is to the processor vendor. Today, people talk about having Intel or not. Imagine a world where people describe their computers by their net card type or memory vendor...

--
--The basis of all love is respect
Re:No by Breace · 1999-05-08 11:21 · Score: 1

I guess on a Half Duplex that's acceptable (depending on the entire network usage though), but on Full Duplex I would wonder where the other 25% went.

Breace.
Re:No by Breace · 1999-05-09 07:34 · Score: 1

Let's see:
preamble & start frame = 8 bytes
Ethernet header = 14 bytes
UDP header = 28 bytes (TCP = 40 bytes)
DATA (not counted as overhead) = 1500 bytes
CRC = 4 bytes
Interpacket gap = 8 bytes

That's 62 - 74 bytes overhead for 1500 bytes of data: 4.13% - 4.93% overhead, not 25%.
And this is realistically _achievable_ throughput on a two node network.

Breace.

Wow by mikpos · 1999-05-10 04:14 · Score: 1

Thanks, I never saw that before. Still there is more to consider.

When you start offloading stuff from the CPU to a processor on an add-on card, it's going to be a pretty single-purpose processor. This really limits the lifespan of it, though. What if there's some new networking protocol that obsoletes TCP? If the processor on the netcard isn't general purpose enough, then it'll go to waste. What if people start using voxels instead of polygons (hypothetical), are we going to find a bunch of Voodoo cards in the trash?

Of course general purpose processors on cards would be even worse. You'd be much better off with another CPU. But it would be nice to have some means of keeping abreast with changing protocols and APIs and the such.

Re: remote loopback by jonabbey · 1999-05-08 20:57 · Score: 1

Remote loopback is a term used by some Ethernet card diagnostics for a network test in which one system sends out an ethernet packet and the receiving system immediately takes it and sends it back.. sort of a link-level ping test.

With a remote loopback, you get hardware testing, but it's probably not a great test for anything having to do with tcp/ip since it's not involved.

--
- jon

Ganymede, a GPL'ed metadirectory for UNIX

Re:NT, not linux == WHO CARES by cirby · 1999-05-08 12:03 · Score: 1

It's a hardware story... how hard would it be to implement this on a linux box?

Just about any interesting new computer toy should be reported here. If linux (and other) users don't keep up on what's new, they're going to end up in the silicon ghetto, just like Microsoft would want.

Memory management in hardware wouldn't be progress by slothbait · 1999-05-08 12:38 · Score: 1

Putting that functionality in hardware ties you to a particular implementation. Systems have been designed that would handle page faults purely in hardware. They were very complicated and rather limiting. Experience has shown this to be an area best handled by software, which is flexible and can be easily changed or fixed.

Yes, I believe that 3D graphics were once done in software. They still are, in many applications. 3D game rendering is a rather specific task, and very computationally expensive. Thus, it was worthwhile to develop specialized hardware for this purpose. Compared to modern workstation hardware, TCP stacks are fairly inexpensive, if written correctly. I question how how useful silicon TCP would be. I expect that the added cost and reduced flexibility would more than outweigh the saved CPU-load for most cases.

Then again, perhaps NT's implementation of TCP is poor enough to where this is a concern.

Oh, and I never have problems with Linux MM.
--Lenny

//"You can't prove anything about a program written in C or FORTRAN.
It's really just Peek and Poke with some syntactic sugar."

Er... why is this on Slashdot? by Sinner · 1999-05-08 09:51 · Score: 0

Yay, proprietary hardware that'll probably never work under any OS other than NT! And, wow, look, it's incredibly fast under synthetic benchmarks, so it MUST be good!

Seriously, Linux is quite capable of fully utilising 100 megabit ethernet by itself, combine that with the cheap price of processor power and this sort of gizmo looks kinda redundant. Oh, and I'll give you 2-1 odds that queso et al identify it as a *BSD OS :-)

--
fish and pipes

Re:Er... why is this on Slashdot? by Sinner · 1999-05-09 07:34 · Score: 1

it doesn't have to be *good* news....

Ah, I didn't think of that. Well said. I fell into the "this shouldn't be on Slashdot!" trap there when what I really wanted to do was inject a note of caution. Proprietary hardware is one of the few things that can kill a free operating system, and a nerd who values her freedom has cause to be wary.

--
fish and pipes
Re:Er... why is this on Slashdot? by Meathook · 1999-05-08 16:25 · Score: 3

Hmmm...

When I read the slogan under the Slashdot logo at the top of this page it says "News for Nerds. Stuff that Matters". I don't see "LINUX ONLY" stamped anywhere up there. I don't see anything wrong with the posters giving us articles concerning platforms or OSes other than Linux and the hardware it runs on. In fact even though I'm quite a Linux supporter, I'm sick of the people who think this must be a Linux only site. My understanding is that Linux is on the minds of the nerds at this point of history so we get alot of stories about Linux (if I'm wrong here, well, sorry... somebody should make that clear). That's cool, but why are the posters slammed when they post an article about something other than Linux? That's not cool, or uncool, or whatever.

bah

this would be nice by akharon · 1999-05-08 08:48 · Score: 1

i get about a 50% cpu load on a p5-120/3Com 509/10BT when moving at full speed, messes with my mp3 playing...

Documented Linux vs NT Performance by markster · 1999-05-08 15:47 · Score: 2

Look here for my study on Linux vs NT performance with TCP. NT's stack is indeed inefficient, but the performance may also be related to really lousy interrupt handling as well.

Of course, I tried to post this to /. a few times and it didn't make it. Oh well. Maybe people will find it appropriate to this thread.

Re:Documented Linux vs NT Performance by Breace · 1999-05-09 11:17 · Score: 1

Looks like an excellent study to me.

One little thing, your network card:
DEC 21040 based AsanteFAST 10/100Mbps ethernet cards
Should probably be DEC 21140. 21040 is only 10Mb/s.

Breace.

loadavg 1+ & 9*% idle by Zappy · 1999-05-08 10:32 · Score: 1

I've oft wonderd about this maybe time for an other ask how can the load avg be out of sync with the cpu utlisation by such a large margin?

Been There, Done That by Detritus · 1999-05-08 10:06 · Score: 3

When 3COM first started making Ethernet boards, they tried putting the protocol processing software on a smart Ethernet board. It never worked too well. Unless you had a slow system, with a brain dead operating system or small address space. The processors on the "smart" boards were cheap and slow. The on-board software tended to be buggy, limited and out-of-date. The cards were expensive. You still had to have some sort of light weight protocol for communication between the operating system and the card, adding another layer of software. With well written software, a 25 MHz 68020 host, could run TCP/IP at wire speed on a "dumb" 10 MBPS Ethernet card. The "smart" cards quickly disappeared.

Putting the protocol processing in Silicon will burn you when you need new features and algorithms in your networking stack. What happens if you need large windows, SACK, IPV6, IPSEC, QOS?

The right solution is to use an operating system that doesn't suffer from MBD and use decently designed network cards on a fast bus. 100 MBPS shouldn't be a problem for a decent system. 1 GBPS is where current hardware and operating systems fall down and need improvement.

--
Mea navis aericumbens anguillis abundat

As usually NT TCP is rooten eggs by arivanov · 1999-05-09 20:25 · Score: 0

See subj:

--
Baker's Law: Misery no longer loves company. Nowadays it insists on it
http://www.sigsegv.cx/

Re:What do you think is "average" on 10bT? by jhme · 1999-05-08 10:46 · Score: 1

If I remember well, an Ethernet above a 10% usage level is considered very close to being quite dead.

--
-- Fast, Cheap, Well. Pick two.

Please attribute your sources better. by peterb · 1999-05-08 09:00 · Score: 1

This item was originally posted on memepool on Friday. Hey folks, there's absolutely nothing wrong with taking items from one forum and sharing them on another. Just make sure credit is given where credit is due. In this case, the original item at the geeks list referenced both memepool and robotwisdom.

Let's get those attributions right!

Peter

SGI O2 slow as well by TA · 1999-05-09 19:25 · Score: 1

I've been wondering about how well a Linux PC would actually be doing on a 100Mbit Ethernet.. I'm a bit worried that it wouldn't be too good, as there seem to be real work for the CPU to do. I became aware of this when I found that an R5000 SGI O2 couldn't do more than max. 5 MB/sec, memory-to-memory TCP no disk involved! And this used more than 60% of the CPU, the system was completely kneeling and with all the other work going on the TCP became a real bottleneck :-(
IRIX 6.3 isn't the worst operating system in the world so this got me thinking.
That on-board TCP stack seems interesting, but only if it supports something else than NT of course.
But then there's the problem of embedded TCP stacks, I've yet to see one withouth strange bugs here and there. TCP stacks are notoriously difficult to get right, in practice it's only a real, open-source preferably Unix box that can be trusted to (eventually) get it Right.
TA

Re:Err... Idiot yourself, here are numbers by TA · 1999-05-09 19:30 · Score: 1

Who's talking out of his ass here? If you haven't got better comments than that please shut up.
Here's the .sig Dave Miller used last year:
Yow! 11.26 MB/s remote host TCP bandwidth & ////
199 usec remote TCP latency over 100Mb/s ////
ethernet. Beat that! ////

Diff please? by TA · 1999-05-09 19:38 · Score: 1

>I always patch this away when I compile a kernel)
Would you mind posting a patch for us lazy people? :-)
TA

Thanks! by TA · 1999-05-10 23:44 · Score: 1

Thanks,
TA

What do you think is "average" on 10bT? by BeBoxer · 1999-05-08 09:42 · Score: 1

If you think that 900KB/sec is slow for 10bT, what do you think the average is? Ethernet is not known for it's ability to work well at high levels of utilization, and 900KB/sec is 7.2Mb/sec. At this level of utilization and up, the contention between different hosts trying to talk at once drives up the collision rate which keeps the tranfer rates down.

Re:What do you think is "average" on 10bT? by Gumber · 1999-05-09 04:28 · Score: 1

Ever heard of switched ethernet?
Re:What do you think is "average" on 10bT? by jsdkl · 1999-05-08 11:44 · Score: 1

Just in case you wanted to know - I get an average of about 600K/s across my campus's 10BT network.

Already being done on Amigas by Sloppy · 1999-05-09 09:58 · Score: 1

How about... A sound card with an MP3 player easily loadable into it's onboard DSP to offload that task from the processor?

Such a thing has already become commonplace on the Amiga. Most of the soundcards have DSPs, and most of those are capable of doing MP3 decoding to take load off CPU. It's cool, but I would expect it to only be of interest to people who aren't able to get faster CPUs (e.g. 680x0 users). With today's PPCs and x86 chips, and clock rates approaching a gigahertz in the next year or so, it seems like the processing requirements of decoding MP3s are kinda trivial.

Don't get me wrong -- I'm always interested in custom hardware to take stuff over from the CPU. Heck, that was the whole point of the original Amiga hardware. But general-purpose CPUs are just getting so damned fast... who cares about CPU load anymore? Just get a faster processor. They're already "infinitely" fast for most people's purposes.

--
As copyright owner of this comment, I authorize everyone to defeat any technological measure which limits access to it.

no more slashdottings? by UM_Maverick · 1999-05-08 09:00 · Score: 1

hmm...does this mean that some of those poor, defenseless servers are finally going to have a /. defense mechanism?

--
Juiced? Or Not?

Re:no more slashdottings? by Breace · 1999-05-08 10:49 · Score: 1

Apparently not. I can't access their site right now. /.'ed allready. :o)

Breace.

Oh please by Gumber · 1999-05-09 04:25 · Score: 1

First, a couple of years is a long time in the networking industry. More impressive first products have been engineered in that time frame.

Second, don't be missled by claims that something is done in hardware. As often as not, this does not mean that the entire implementation is burned into silicon. Often it only means that processing that may have been done on the main CPU has been shifted to a dedicated processor. Often times this dedicated processor may be particularly well suited to the task at hand because it implements special instructions. It may also be on the same die as other discrete functional units dedicated to the task (like an ethernet controller).

What you end up with can be quite fast, but it still retains a bit of flexibility, so it can be reprogrammed to fix bugs, or meet new standards. (or something totally unrelated. Appearantly the engineers at Alteon programmed the MIPS CPUs they use in their gigabit ethernet switches (two per port) to crack RC5 keys for a laugh.)

Ever heard of switched ethernet? by Gumber · 1999-05-09 04:29 · Score: 1

Well, have you?

Re:Done before, but now patented? Questions... by CodeShark · 1999-05-10 00:12 · Score: 2

In reading this last post, I realized that I have a number of questions which I'm not able to answer. Figuring that I'm not in the minority here, here are my questions for others to answer:

There are many hardware patents out there that don't affect us that much, the question is -- is this one of them?
is the "Silicon TCP" is indeed a step forward for NICs, and if so
is the patent based on "prior art", and therefore unenforceable?

I would hate to see something that would work well for us regular (e.g., don't have unlimited funds to buy the next new great hardware) folks trapped within a "can only get it from one company" patent.

--
...Open Source isn't the only answer -- but it's almost always a better value than the alternatives...

Re:Network benchmark needed by Clover_Kicker · 1999-05-09 05:25 · Score: 1

> Maybe we should try to port netperf
> (www.netperf.org) to Windows and add raw
> Ethernet to it.

netperf allready runs on Win32, I've tested it on both 95 and NT. The netperf ftp site has binaries for Intel and Alpha.

Neither 95 nor NT has any problem saturating a 10Mbps ethernet, but at higher speeds NT kicks the snot out of 95 on the same h/w. I've seen NT get 90+Mbps on a 100Mbps ethernet, but don't have anything faster to test with.

The loopback tests show that Linux and FreeBSD have comparable maximum speeds, much higher then NT (on the same h/w). One assumes that this has to do with how often stuff is moved around in memory, etc. etc.

BTW, if you ever want to convince yourself that ISA sucks, just run some tests on a PCI NIC and then an ISA NIC, and watch the 60-80% increase in CPU use for the ISA card.

Re:Network benchmark needed by Clover_Kicker · 1999-05-10 21:25 · Score: 1

The systems were PII-333s with 64MB RAM running NT4SP4, and DLINK 8029 (or maybe 8019) PCI NICs.

I believe that the NT TCP/IP stack has been improved greatly since the 3.5x days, which was what I suppose you would have been running on a P90.

(Sorry for delay getting back to you, has been busy at work)

Ethernet can run at almost 100% bandwidth. by SEWilco · 1999-05-08 12:49 · Score: 1

Sigh. That old myth. Look at the first paper on this page. One of the designers of Ethernet tested it. Drove it at almost 100% of 10Mbps. A bunch of workstations on the net.

Look at the computers they used... by DeepBrain · 1999-05-08 09:18 · Score: 3

They make it sound like they're using fancy servers.

They're not, they're using two proprietary IBM Aptivas, low end machines. (I should know, I have one of them, the same model they used). Firstly, the E56 has a 266 mhz K6, unless IBM lied to me too :) Secondly, 48 megs of RAM seems a bit low for NT. Thirdly, the hard drive systems in there are probably also low quality to say the least.

Couldn't they have borrowed Mindcraft's server or something? At least THEY could tune an NT box :), although not a Linux box... and it was a server. I don't call a home system a server that will be pumping 90 megabits/sec.

IO2 NIC's? by adamhupp · 1999-05-08 08:38 · Score: 1

Isn't this what IO2 is supposed to accompish? This is very cool, but it would be nice for the TCP/IP stack to be configurable and upgradable. i.e. to IPv6 etc. This should be great for homebrew routers and such also. I hope linux drivers appear soon.

Re:IO2 NIC's? by soldack · 1999-05-09 16:46 · Score: 1

I20 is one attempt at solving this problem. I work over at a company that is working on I20 ethernet and raid devices. Our target OS is Windows NT, for reasons discussed above, but SCO, IBM (for OS/2), and Novell have already written preliminary drivers for it. We are looking at having these devices preinstalled on top of the line Dell, IBM, HP, and Compaq servers. Our tests have shown major improvments in CPU usage over 3Com dumb nics. Also, a friend and I, being the resident Linux people, will be working on Linux drivers in the near future.

--
-- soldack

My data indicates misplaced optimization by victim · 1999-05-08 15:30 · Score: 1

I just checked the troughput between a pair of my computers. I did this with a test program that used read and write calls to transfer the data and did nothing else with it.
The data transfer ran at 10Mbytes/second with a 12% cpu load on a PII-266. configuration info at end
Extrapolating to a modern 500MHz system, that would be something like a 6% cpu load. I suppose there are niches where the differense might matter, but with three 10Mbyte/sec transfers the PCI bus will choke anyway and we are looking at the difference between 18%cpu for network and 6%cpu for network.
I guess if you are stuck with a server OS where the TCP stack is a pig and you can't fix it then maybe this is a reasonable optimization, but I sure wouldn't bother with it for linux.
Footnote:It turns out the sending data to /dev/null is more expensive than the ethernet transfer.
Configuration of Receiver: DellXPS PII-266, Kingston 10/100mbit card(dec21140), Linux 2.2.6, tulip driver.
Configuration of Sender: Gateway PII-300, Kingston 10/100mbit card(dec21140), Linux 2.2.1, tulip driver.
I'm not sure if Kingston still makes these, we buy equivalent cards for $13 now.
Configuration of Network: Half duplex 100mbit ethernet with about 50 machines hanging on it. Mostly idle during test.
Procedural Note: I did run the test many times and exclude the slow outliers, both of these machines have operational services that I did not wish to disable, so many test runs were ruined by other loads on the machines.
Open Benchmark In order to preserve my credibility I will offer to allow anyone to run my tests exactly as I have in order to verify my results. No changes which may improve the results will be permitted. ;-)

The Obvious Question: Linux drivers? by Izaak · 1999-05-08 08:35 · Score: 1

Interesting, though I expect IO bottlenecks elsewhere in the architecture would keep you from seeing the full benefits in a real world situation. Still, might be fun to snag a couple and hack together some Linux drivers.

Thad

--

The Bolachek Journals

Too much hype, not enough info by akkem · 1999-05-08 09:12 · Score: 2

The web page describing this NIC is really unimpressive. What I'd really like to know is:

How many simultaneous connections can this thing support, and is it slower when multiple connections are used? How's the performance when one of these cards is hooked up to a machine with a standard software TCP/IP?

Putting TCP/IP in hardware is nice, I guess, but then nasty real-world issues crop up. What happens if there's a bug in the implementation? There's also the nasty challenge of writing a driver for a card like this, but I won't claim that's a defect of this NIC design.

Gig ether by netwiz · 1999-05-08 14:05 · Score: 2

the big push for gig ether is campus trunks between buildings. goes up to 10km (single mode) and Cisco's Gigabit EtherChannel can aggregate up to 16 links. Really great if your org grew strangely, and you have departments in two buildings. It also helps in linking switches in a internet server fanout. Either way, it's more of a backbone technology than anything else. In fact, PCI32's theorietical peak (132MB/s) doesn't match the throughput of full duplex 1000Base-SX. And NT's networking core prevents running the network over 400Mbit peak.

Interestingly enough, on most tested unices, i think they're getting around 800-900Mbit. It'd be interesting to see how fast ftp.cdrom.com would be with gig ether to the backbone... Maybe then i'd see more than 10KBps...

Bits and pieces by Salamander · 1999-05-09 02:44 · Score: 3

First off, to all those engaged in the "I get more bandwidth than you" pissing contest: try beating 560Mb/s sustained application-to-application bandwidth between two P2/450GX systems (running NT, BTW, but not NT's TCP/IP). So you think you can beat that, eh? Try beating 2us application-to-application latency for zero-length messages. Can't do it, even with buzzwords like VIA and I2O, can you? OK, next topic.

Regarding TCP/IP on the card: as another poster pointed out, this is not a new thing but a very old thing. I once worked on putting DECnet on old 3Com "smart" cards...ick. There are all sorts of problems with doing this sort of thing on the card. First is upgradability of the network stack. You immediately become dependent on the manufacturer for upgrades - don't expect open-source firmware any time soon, even if you had the tools to compile and load it. FPGAs aren't really a good choice here because they increase the component and design costs too much. You'd be much better off using a commodity embedded microcontroller with "firmware" stored in flash memory, although this may still increase the cost unacceptably and for _real_ speed you just plain have to chuck all this stuff out the window and go ASIC. As it turns out, most systems in most uses have more CPU power to burn than any other type of resource. Some guys at HP several years ago took this observation and ran with it; they designed a card that was even more stripped down than the typical Ethernet card, doing even more of the work in software, and they actually got excellent results.

Lastly, the conversation about NT's networking code reminds me of an exchange I had with an engineer at MS a couple of years ago. He was saying that they had to sacrifice a little on TCP/IP features and error checking (e.g. not crashing if sent a source-routed frame, or something like that) to get speed. My response was that (a) not checking unusual conditions in incoming network packets is just unacceptable, and (b) NT's TCP/IP performance is piss poor, indicating that they have bigger issues to worry about than shaving a few instructions by not checking packet headers. In the time since then I have found no reason to change either observation.

--
Slashdot - News for Herds. Stuff that Splatters.

Re: 1mbs/s by Breace · 1999-05-08 13:50 · Score: 1

O common, what do you expect?
Can we please be serious about this. 1MB/s is nothing. If win95 wouldn't be able to even sustain that...

Let us know when you get >10MB/s with a 100Mb/s board.

btw. please get your capitalization correct: MB = Mega Byte, Mb = Mega bit, mb = I dunno ;o).

Breace.

Re:Yes, M$ is slow. by Breace · 1999-05-09 07:38 · Score: 1

Using an other system doing the remote loopback, or simply wiring TX to RX on the system that's being tested itself.

Sorry about the confusion.

Breace.

Re: 1mbs/s by Breace · 1999-05-09 07:41 · Score: 1

No you don't understand.
With Linux you don't need multiple processors to sustain decent Ethernet throughput.

Breace.

Re:Network benchmark needed by Breace · 1999-05-09 07:51 · Score: 1

I've seen NT get 90+Mbps on a 100Mbps ethernet

I'm sure you are right, but can you please discribe the system (CPU, RAM etc?), just for us to get an idea. Because I've never been able to see such a thing. Then again, I think I gave up on NT when we still only had P90's...

BTW, if you ever want to convince yourself that ISA sucks, just run some tests on a PCI NIC and then an ISA NIC, and watch the 60-80% increase in CPU use for the ISA card

Right. Although we had a bad experience with the OPTi 802/832 PCI chipset. It doesn't implement DMA burst modes from RAM to PCI devices, and thus limits the max datarate in that direction to about 5MB/s. That's worse then ISA! Anyways, that's an other story altogether... :(

Thankx for letting us know about netperf. I wasn't able to connect to their ftp server yesterday.

Breace.

Re:Network benchmark needed by Breace · 1999-05-12 05:33 · Score: 1

Hmm, you must be confused about the D-Link. I don't think they have a 8029. They do have a Realtek 8029 clone though, but that's a 10Mb/s only chip. It's also a NE2*00 clone, which means that it's very unlikely to actually get anywhere close to 100Mb/s because of the silly I/O scheme. I imagine it was probably a DEC based board.

Anyways, that's a pretty fast system,- we don't have one in the office here. I'll certainly redo our tests though as soon as I can, because we do have NT4 now and I think you are right about us using 3.51. (Although most of our test are Raw Ethernet related, not TCP/IP)

Thankx for letting us know.

Breace.

Yes, M$ is slow. by Breace · 1999-05-08 11:16 · Score: 3

Exactly, this card is of interest because the Microsoft network stack is terribly slow, and costs an awful lot of CPU time.

The main reason for this is: Poor design. (What did you expect?)

The M$ network stack is (as with most M$ device driver architectures) way more complex to deal with then for example Linux and as a result many of the network drivers are not well written. E.g. they are not optimized for performance. I'm sure the writers are happy if the damn thing works at all.

The network stack itself also has much more involvement with each packet going out or coming in then with Linux. In some cases packets are actually copied in RAM. This is what causes the higher CPU overhead.

We have run several tests and the results where depressing. I think we probably lost the source code, but if I can find it I will post it somewhere.

Here's some of the figures I remember: (all tests where raw network tests, RAM to RAM, no hard drives involved)

Two P90 systems using 100Mb/s Full Duplex (DEC 21140) cards. We where unable to sustain more then 35Mb/s using UDP in one direction only.

Linux on this configuration: close to 100Mb/s.

Using a raw packet driver on a 200MMX notebook with a SMC 100Mb/s Full Duplex card and using remote loopback we are unable to get much more then 15Mb/s sustained. (That's 15Mb/s going out and 15Mb/s comming in)
This was using raw Ethernet packets, no TCP/IP or other protocol.

This same configuration in a normal network is unable to receive more than around 4Mb/s UDP multicast packets, or packets will be dropped (thus not received)

I'm surprised that people have kept up with the poor raw network performance of our Redmondians. It's a disgrace, and I will NEVER use a M$ product if serious networking is to be done. Even if the Ethernet adapter handles the protocols.

Breace.

Network benchmark needed by Breace · 1999-05-08 14:05 · Score: 5

One thing is becoming obvious.

We need some serious network benchmark tools that are cross-platform usable between Linux and Windows and maybe more.

Strangely enough not too many seem to exist. Neither does someone seem to have some hard data on network performance. It entirely useless to say 'I get 1MB/s using FTP'.

A good network benchmark tool will be able to test raw Ethernet performance as well as performance through protocol layers.

Of course it would only test the network and not use the file system or hard drive. It should be very clear what sort of configuration is supposed to be used: two systems running full-duplex, one system using remote loopback or whatever. It could also be interesting to have a > 2 system test to show what collisions do.

And most important as far as I'm concerned: Open Source. I don't believe in closed source benchmarks.

The difference between raw Ethernet and TCP/IP protocol would show us how badly we need hardware assistance on what platform.

Maybe we should try to port netperf ( www.netperf.org) to Windows and add raw Ethernet to it.

Well, maybe I'll be a bit more serious about this if there's an interest.

Breace.

Disk write speed vs. read speed by zatz · 1999-05-08 19:57 · Score: 1

Unless your file is very large, writing is easier than reading, because buffering can cover the seeking latency. The dirty blocks sit in main memory for a while, then when the bus has a chance they sit in the controller cache, then in the disk's cache, and then when the head is in the right spot they finally get committed to media.

Also, with a heavily used disk running a good filesystem, large writes are more likely to be contiguous on disk than reads.

When I'm testing TCP networking performance, I give the web server a file that is just a hole (length huge, size just a few indirect blocks) and use a command line http client. Works nicely:
# httpget -v http://.../~ambrose/hole -o /dev/null
67108864/67108864 bytes transferred in 8.26 seconds (7930.47 kB/sec).

--

Java: the COBOL of the new millenium.

168 comments