Is There a Place for a $500 Ethernet Card?

Is There a Place for a $500 Ethernet Card? by Anonymous Coward · 2005-06-20 14:04 · Score: 5, Funny

Yes, there is a place for a $500 ethernet card, far, far away from this guy.

Re:Is There a Place for a $500 Ethernet Card? by Anonymous Coward · 2005-06-20 15:00 · Score: 3, Funny

pffft.... they don't have an ipod killer.
They just lost my business.

It's only funny if you see the site.
Re:Is There a Place for a $500 Ethernet Card? by tcr · 2005-06-20 23:06 · Score: 2, Funny

Ah, I see you have constructed a new EtherKiller.
Your skills are complete.
Indeed, you are powerful...

--

Information wants to be beer.

A look into the past by bananahead · 2005-06-20 14:04 · Score: 5, Insightful

This sounds very similar to the 'smart card' concept back in the late 80's and early 90's. Intel had the 586-driven smart-cards, and I believe 3Com had them as well. They were intended to offload the CPU by putting parts of the stack on the card. They failed because the performance gain and CPU offload numbers were never enough to justify the price difference.

I wonder what has changed? I have never known the CPU to get dragged down by network traffic, but maybe in the network server markets it is different, However with the Ethernet chipsets being designed into the motherboard and integrated into the tight circle of RAM and CPU, it isn't clear there is a need for this.

How long before the network control is put into the CPU? It is going to be tough to beat that type of performance.

--
A most overlooked advantage to owning a computer is if they foul up there's no law against wacking them around a bit.

Re:A look into the past by bhtooefr · 2005-06-20 14:25 · Score: 3, Informative

The only time I've heard of that was with Ethel, an Apple II ethernet card. It used a PIC that ran the TCP/IP stack, and fed stuff to the A2.

Of course, the A2 is perfectly capable of running it's own TCP/IP stack - Uther doesn't do any of that, IIRC, and nor does the LANceGS (although, it seems that the LANce can only do pings on the //e - either that, or it just costs too much).
Re:A look into the past by bluelip · 2005-06-20 14:27 · Score: 5, Interesting

I've noticed a slowdown in computer response when using gig cards and moving lotsa' data. I thought the bottleneck may have moved to the file systems. Didn't seem to be the case as pumping dummy data throught the nic also caused issues.

I didn't pursue it far enough to see where where the actual problem was. These cards may help, but my money is on a faster cpu.

--

Yep, I never spell check.
More incorrect spellings can be found he
Re:A look into the past by PONA-Boy · 2005-06-20 14:40 · Score: 3, Informative

Back when our company was running ATM on the backbone and in our HQ offices, all of the FORE HE155 NIC's were "smart". That was also due, in part, to the particular nature of ATM. The NIC's handled their own routing in addition participating in LANE and PNNI services. Very little of the network load was actually handled by the server themselves. It was really very nice and the NIC's themselves were more than $600 a pop.

Load on our servers from network processing increased easily by 20% when we moved to an all Ethernet/IP setup. $500 for a "smart" NIC, hell yeah! As much as my boss may chide me about it, I sill lament the loss of ATM in our network.

-PONA-

--
+that's funny...I don't FEEL tardy.+
Re:A look into the past by X · 2005-06-20 14:51 · Score: 2, Informative

I wonder what has changed? I have never known the CPU to get dragged down by network traffic, but maybe in the network server markets it is different

The thing that has changed is that the frequency that frames arrive at has gone up. Unless you can use jumbo frames (and even then, if the payloads are small), GigE is delivering the same sized frames as fast ethernet, just 10x faster. This tends to create a hell of a lot more interrupts for the processor to handle (a condition made worse by the deeper pipelines in processors like the P4). If you can offload the processing of the frames a bit, just enough to give a processor a chance to get something done, you could dramatically improve performance.

That being said, changes to the protocol (such as jumbo frames) can also have a positive effect in a lot of circumstances, and have the advantage of being cheaper to implement.

--
sigs are a waste of space
Re:A look into the past by llzackll · 2005-06-20 14:52 · Score: 2, Informative

There are a lot of Cards that do this now. You can get a 3com 3c905c, which does at least partial offloading, for about 20 bucks.
Re:A look into the past by __aaclcg7560 · 2005-06-20 14:55 · Score: 3, Interesting

Jerry Pournelle had a column in the February 2005 issue of Dr. Dobb's Journal about Gigabit hardware. If you have a Gigabit PCI card, expect to see a doubling of speed over 100Mb PCI card. If the motherboard has a built-in Gigabit port, you can see a five to six times speed over 100Mb PCI card or port. PCI cards are limited by the PCI bus, but built-in ports have direct access.
Re:A look into the past by ProfaneBaby · 2005-06-20 15:16 · Score: 2, Insightful

Built-in ports have direct access... depending on the chipset/motherboard.

I've seen some 'built-in' broadcom gig-e ports that were on the PCI bus, even though they were technically built into the board. Horrible performance.

--
Video Phone Blogs send video messages straight to the web.
Re:A look into the past by toddestan · 2005-06-20 15:22 · Score: 2

I have found that the CPU load due to the disk I/O is usually a lot worse than CPU load from the network card. A nice test is to take a computer with both SCSI and IDE drives, and try moving data off of each of the drives over the network. The IDE drive will cause the system to slow down noticably, while the SCSI drive will barely (or not at all) affect the system.
Re:A look into the past by Anonymous Coward · 2005-06-20 15:26 · Score: 5, Informative

Plain old 32-bit PCI has a bandwidth of 32 bits * 33 MHz = 1.056 Gbps. A 64-bit bus would obviously be twice that. Unless there is a lot of other traffic on the PCI bus, I suspect the limitation is the driver, not the bus.

Peripherals like that built into the motherboard are generally on a PCI bus segment anyway. You can see by looking at the device manager in Windows or by using lspci in Linux. In both cases you will see a vendor ID, bus number, and slot number.
Re:A look into the past by Fweeky · 2005-06-20 15:29 · Score: 4, Interesting

I expect you can get an Intel 1000/Pro for around $30; full TCP/IP checksum offloads in both directions, interrupt moderation, jumbo frames, and Intel even write their own open source drivers.

Heh, my on-board Realtek GigE chip has checksum offloads too, but even with them on, 300Mbps would have me up to 70% system/interrupt CPU load (and I hear the checksumming is a bit.. broken); I barely scrape 30% with a PRO/1000GT.
Re:A look into the past by Detritus · 2005-06-20 15:35 · Score: 3, Informative

I/O and interrupt handling, like many things, doesn't scale with CPU clock rates.
Collisions are not a problem on switched networks. Even on older shared media and hub based networks, collisions were not the evil thing that they were portrayed as. Ethernet is not Aloha. See Measured Capacity of an Ethernet: Myths and Reality by David R. Boggs, Jeffrey C. Mogul, Christopher A. Kent. It debunks much of the misinformation that is still prevalent.

--
Mea navis aericumbens anguillis abundat
Re:A look into the past by jamesh · 2005-06-20 15:47 · Score: 2, Interesting

If you've ever had one of the recent worms come in behind a linux router, then you'll see how network traffic can make a cpu stop.

I made a boo boo in a firewall rule and opened up an unpatched mssql server to the internet (*hangs head in shame*). Within 30 seconds it had caught one of the mssql worms and had stopped the linux router dead. Pulling the network plug from the mssql server caused the linux router to come instantly back to life. With TCP and all its flow control goodness it's probably not a problem, but when something is sending udp or icmp packets at you as fast as it can, you'll really see the difference.
Re:A look into the past by Phil+Karn · 2005-06-20 15:52 · Score: 4, Insightful

And how long ago was that? What kind of servers had loads increase by 20% when you dumped the "smart" NICs? How much faster have general purpose CPUs gotten since then? And whose unusually inefficient TCP/IP stack and/or Ethernet driver were you running?
"Smart" network cards are one of those bad ideas that keep coming back from the grave, because computer science seems to lose its collective memory every decade or so.

Fifteen years ago, Van Jacobsen did a wonderful presentation at SIGCOMM 1990 on just why they were such a bad idea. The reason is very simple. A modern, well-tuned and optimized TCP/IP stack can process a packet with only about 20 instructions on average. Very few "smart" controller cards have host interfaces that can be spoken to with so few instructions! The switch to and from kernel context will usually cost you more than TCP/IP.

Not only that, but the coprocessor on the "smart" controller card inevitably ends up being considerably slower than the host CPU, because typical host CPUs are made in much larger quantities, enjoy large economies of scale, and are updated frequently. So you often have the absurd situation of a blazingly fast and modern host CPU twiddling its thumbs waiting for some piss-poor slow CPU on a "smart" controller to execute a protocol handler that could have been done on the host with fewer instructions than it takes just to move a packet to or from the "smart" card.

And if that weren't enough, rarely do these "smart" network controllers come with open source firmware. Usually the company that makes them obsoletes them quickly (because they don't sell well) and/or goes out of business, and you're left with a very expensive paperweight.

Since his talk, Ethernet interfaces have totally obsoleted "smart" network cards. They now come with lots of internal buffering to avoid losing packets when interrupt latencies are high, and they take relatively few instructions per byte of user data moved. What more could you want?
Re:A look into the past by cymen · 2005-06-20 16:18 · Score: 2, Funny

It won't just stop a linux router. It will stop other routers too.
Re:A look into the past by Jah-Wren+Ryel · 2005-06-20 16:39 · Score: 3, Informative

Although I agree with you about the stupidity of smart ethernet controllers, the interesting thing about this one is that it claims to not need to transition to kernel space to set up a packet for sending. In principle, that might actually make a difference in throughput...it it's true.

The thing driving smart ethernet cards is stuff like rdma and scsi over ip. The part of thinking behind it for rdma is that the card exerts the same load on the host as local dma (i.e. almost none). For scsi over ip, they think that doing scsi is already enough for the host cpu so let it treat the network interface as "send it and forget it."

As for avoiding the kernel context switch, I haven't looked at how this card is implemented, but with the right smarts on the card, and a replacement socket library, they could enable each process to talk directly to the card - bypassing the kernel once stuff is initialized - kind of the way an X server can talk directly to the frame-buffer without involving the kernel.

--
When information is power, privacy is freedom.
Re:A look into the past by EtherMonkey · 2005-06-20 16:40 · Score: 3, Interesting

Agreed, but it's even been more recent than the early 90's. The late 90's also had its run of so-called "Intelligent" network cards.
I worked for a large HP/Intel VAR at the time and I we were selling $500 Intel Intelligent Server NICs like they were Big Macs. Then one day one of our biggest customers called in a fit. It seems that his manager asked him to do a quick comparison between a smart Intel NIC and a regular Intel NIC, so he could tell his bean-counters to get stuffed. It turns out that we were NOT ABLE TO FIND ANY SYSTEM OR TRAFFIC CONFIGURATION that would result in higher throughput, lower CPU utilization or lower memory utilization when using the smart NIC.

In other words, the standard $100 Intel NIC (PILA8465B, as I recall) beat the piss out of the much more expensive Intel intelligent NIC with on-board co-processor.

Within 3 months we stopped getting any orders for the smart NICS. In 6 months Intel retailiated by disabling server features (adapter fault tolerence, VLAN tagging, load balancing and Cisco Fast Etherchannel support) on the basic NIC, in an effort to save the smart NIC. When this didn't work they modified the driver so the server features would only work with a re-released version of the "dumb" NIC at a higher price (the only difference between the cheapest and most expensive version was an ID string burned into a PAL on the NIC).

Similiar experiences with earlier cards from Intel, IBM, and others. In every instance I tested, a plain old NIC (not junk, but the basic model from a reputable manufacturer) always outperformed the NIC's with on-board brains and/or co-processors.
Maybe this Level-5 NIC has some new voodoo engineering, but I'd have to see real-world testing to believe it. Especially from a company that apparently intentionally is playing-off Level-3 Communications' name recognition for its own benefit.

--
--- A man with a briefcase can steal more money, than any man with a gun. [Don Henley]
Re:A look into the past by klui · 2005-06-20 16:54 · Score: 4, Insightful

It would depend on the implementation. Not all mobos with built-in ports have "direct access." Some of them go through a shared bus or worse, the PCI bus.
Intel's implementation for the 865P/875P chipset goes through the memory hub directly http://www.intel.com/design/chipsets/schematics/25 281202.pdf while the i845 chipset has the ethernet interface connected to the ICH4 controller hub that is shared among other devices like the PCI bus http://www.intel.com/design/chipsets/datashts/2519 2401.pdf. VIA's PT894/PT880 ethernet connection goes through a "VIA Connectivity" bus much like the Intel 845 http://www.via.com.tw/en/products/chipsets/p4-seri es/pt894pro and http://www.via.com.tw/en/products/chipsets/p4-seri es/pt880. There were some value motherboards that although I recall that they use good/decent chipsets, their designers decided to connect the built-in gigabit ethernet ports off the PCI bus. I cannot recall what these were but I read about them in anandtech several years ago.
Re:A look into the past by adolf · 2005-06-20 17:16 · Score: 2, Insightful

Ya know, that's the same sort of argument I've been using to promote software RAID vs. hardware RAID.

I've learned this: Nobody cares. People will blindly spend hundreds, sometimes thousands, of dollars on specialized gear to offload their precious CPUs.

When it is explained to them that better system performance can be had for less money by simply buying a faster CPU, they throw up their hands and blindly reassert that dedicated hardware must be better, by simple virtue of the fact that it is dedicated. That such reasoning is plainly a crock of shit seems to escape them.

Van Jacobsen be damned, people are an illogical bunch. They're always doing stuff for all the wrong reasons, and trying to solve problems with solutions that are only vaguely related.

That all said, if the card manages to improve ethernet latency even a little bit, it might be worthwhile in some circumstances. I'm thinking of applications like Cobranet for professional audio (where latency is always critically important), or perhaps clustering.

I mean, can you imagine a Beowulf cluster with these?

--
Kid-proof tablet..
Re:A look into the past by 0rbit4l · 2005-06-20 17:32 · Score: 4, Informative

The reason is very simple. A modern, well-tuned and optimized TCP/IP stack can process a packet with only about 20 instructions on average.
No, Van Jacobson showed that the fast-path receive could be done in 30 instructions. This doesn't include ("smart"-NIC features, which you ignorantly deride) TCP/IP checksum calculation, nor is it even remotely realistic for a modern TCP/IP stack that supports modern RFCs. You're not including timeout code, connection establishment, state updates, or reassembly on the receive side, and you conveniently completely ignore segmentation issues on the send side. If "smart" NICs are such a bad idea, then I guess Intel and Sun are really up a creek - Intel currently supports TCP segmentation offload (pushing the packet segmentation task from the TCP stack onto the hardware), and is moving to push the entire TCP stack to a dedicated processor + NIC combo.
Since his talk, Ethernet interfaces have totally obsoleted "smart" network cards.
You couldn't be more wrong. Since the 90s, the boundary of what the NIC should do and what the OS should do has been repeatedly re-examined, and industry leaders in networking have successfully deployed products that big-iron servers rely on.
Re:A look into the past by arivanov · 2005-06-20 17:36 · Score: 2, Insightful

No.

Realistically there are bottlenecks all over the place and out of them these 2 prevent nearly any computer from reaching 1G.

1. Interrupt handling bottleneck. Even with interrupt mitigation your typical pps value for a single CPU P4 is under 100 kps. It falls down to under 60 kps when using Intel dual CPUs (dunno about AMD or Via) or SMT due to the overly deep pipeline on the P4. That is way less then 1G for small packets.

2. IO bottleneck. Many motherboards have IO-to-memory speeds which are realistically way under 1G in total, usually around 600-700 Mbit.

No card can help for these 2 problems.

--
Baker's Law: Misery no longer loves company. Nowadays it insists on it
http://www.sigsegv.cx/
Re:A look into the past by Eivind · 2005-06-20 18:14 · Score: 5, Informative

Smart cards do more than just offload the CPU from handling TCP data. They offload the PCI bus from handling interrupts. Every packet of data triggers an interrupt.
With newer Linux-kernels this is quite simply not the case.

To avoid the torrent of interupts from a fast nic the Linux-kernel detects that the card gets packets so often that essentially there's "always" one or more packets waiting in the cards buffers.

It responds to this condition by disabling interupts for the card in question and switch to polling it regularily.

Normally polling is inefficient, because it amounts to asking over and over again "got anything for me now?", where in most situations the answer is "no" 99.99% of the time, which makes it a waste of resources to ask in the first place.

This changes when the answer is "yes" basically all the time. There's no need for the network card to tell the cpu over and over and over "I got a packet for you", instead the cpu collects packets regularily.

It's sorta like having your lawyer receive legal letters for you.

If you get very few, it'd be a waste for you to drop by him every day and ask if he's gotten any for you (polling) most of the days you'd be making the trip for naught. In this situation interupts (i.e. having your lawyer call you and inform you on the rare occasions when a letter *does* arrive) makes more sense.

But if your letter-flow increases to the point where there's normally 3-5 letters every day and it's rare that no letter arrives, then it no longer makes sense that the lawyer calls you every day to tell you you got letters (interupts), you already assumed that. In this scenario it makes more sense for you to come by regularily and pick up letters without being prompted to do so. (polling)

Thus, the flow of interupts from a Gb-nic being flooded with 100byte small-packets (say a loaded dns-server) is not 1 million interupts every second -- it is zero interupts.

(allthough what you write is correct for kernels older than 2.6.10 and for less clever OSes.)
Re:A look into the past by tob · 2005-06-20 18:44 · Score: 2, Interesting

100kips is actually enough to get 1Gbps throughput. 1 ethernet packet = 1500 bytes = 12k bits. 100k ethernet packets are 1.2Gbits.

In practice (did some testing 2 years ago, on very modern hardware for the time) you can do 70 MB/s NFS on an untuned Linux box. By tuning buffersizes and using jumboframes (9kB ethernet packets instead of 1.5kB) you can get 110MB/s which is amazing if you realize what kind of everhead is in that transfer (NFS, RPC, TCP, IP, Ethernet).

Tobias
Re:A look into the past by 3rd_Floo · 2005-06-21 00:23 · Score: 3, Informative

And everyone forgets about the good higher PCI-X standard (Now in 2.0a!), which permits not only 100MHz and 133MHz like 1.0, but also 266MHz and 533MHz. Although you will be hard pressed to find 2.0 devices, the 66-133MHz devices are very common in the server market, infact I have a nice 133MHz GigE card here, a very nice 64-bit addition!

Sure there's a place for them by grub · 2005-06-20 14:05 · Score: 5, Insightful

Is There a Place for a $500 Ethernet Card?

Of course there is, assuming the card performs as advertised. Sheer conjecture: the card likely has a lot of the smarts onboard. Maybe it has some of the TCP and IP stuff on board too (checksum, etc). Compare that to a crapbox $10.95 RealTek[a] card which generates interrupts like mad because it has no smarts and you'd probably be very suprised. (Think of comparing a decent hardware modem to a software based WinModem.)

[a] I had a sales-drone at Computer Boulevard here in Winnipeg just RAVE about RealTek cards. I said I really wanted 3 Intel or 3COM cards for a new work proxy server and he said 'Why? RealTeks are way cheaper and run at the same speed!' Retard.

--
Trolling is a art,

Re:Sure there's a place for them by pchan- · 2005-06-20 14:47 · Score: 2, Informative

This is different from a TCP/IP offload engine in one critical feature: it is designed so that the network interface is run directly out of user-space, without involving the kernel. It may still do the network stack in software, but it does it in-process. This means that there is no copying from/to userspace and no context switch for every read or write. When you're handling gigabit-speed traffic, this becomes a big issue. Just like video players today use special OS features to open a video port directly to the graphics card without routing it through the windowing system, this does the same with network data. Obviously, you and I won't be buying this thing, unless you happen to work in an environment where you have massive network traffic.
Re:Sure there's a place for them by Swaffs · 2005-06-20 16:54 · Score: 2, Interesting

I walked into Computer Boulevard about three weeks ago looking for a replacement hardware modem (unfortunately I don't live in Winnipeg anymore, but about 250 km north east, no cable or even DSL here.)

So the sales guy asks me if I need help and I tell him I want a 56k hardware modem. So he ushers me to the modem section to let me browse. Sure enough, there's nothing there but winmodems. He comes back a couple minutes later and asks if I found what I was looking for. I said no, and that what I was looking for was a hardware modem, because I'm running linux. I told him they probably only had it in OEM.

So he goes and gets the big book of parts and finds it, and I tell him that yes, that's exactly what I came for. Well off he goes and comes back several minutes later to tell me that those are in the back, which is why I couldn't find it. Gee, thanks, that's why I mentioned it was OEM, since they don't usually put OEM parts on the shelf. So he asks if I want one, and I say yes, and off he goes only to come back and report that they're out of stock.

I went over to Computer Avenue and was met with someone (unforuntately I didn't get his name) that knew what he was talking about and was quickly walking away with what I needed, albeit at a $10 higher price than I would have paid at CBIT if they'd had one.

The sad part about this is that I'm not a computer professional, just a user, and one that's not even that familiar with hardware. I'm all isolated from the computer world and out of the loop up here on MTS's when-they-feel-like-it internet access. I don't think I should know more than the clerk at the store.

--
--
"Karma can only be portioned out by the cosmos." - Homer Simpson [1F10]
Re:Sure there's a place for them by gad_zuki! · 2005-06-20 18:53 · Score: 2, Insightful

>I don't think I should know more than the clerk at the store.

I don't really think that's a valid complaint. In a perfect world, yes, but in retail? Not really.

Running linux is like owning a foreign car and expecting GM/Ford guys to fix it just as easily. Its one of the real liabilities of not running the monopoly/defacto standard OS. As a linux user, you should know what you're buying. I mean, users ofter get criticized for being ignorant of their systems, but you want the same ignorance and expect retailers to spend all this extra time and traning on what is really a minority OS they might get a tiny amount of sales for?

I *always* expect the salesman to be next to useless, that's why I do a little research and buy what I need. The retail sales position is there to push product, not to solve problems. It blows my mind when I see friends and family chat up the salesman and be semi-sweet talked into something thats good for them, but actually costs them an extra couple hundred of dollars or has things they don't need or is missing things they'll want in the future all because they wouldnt spend 10 minutes on the internet researching the purchase or reading reviews.

yes there is a place for it by Anonymous Coward · 2005-06-20 14:06 · Score: 3, Funny

right inside my computer :)

Short answers: PHB's by Anonymous Coward · 2005-06-20 14:08 · Score: 2, Funny

Short answer;

Where there are PHB's, there is overpriced hardware.

yes, there is by commo1 · 2005-06-20 14:08 · Score: 3, Informative

Million dollar PCs (sans gold-plating) and (quite seriously) mission-critical blade servers, customer ip routers, etc.... I have clients that pay upwards of $600 canadian now (though that's for quad cards with ample on-board processing to off-load from cpu horsepower).

Specific acceleration cards nothing new by Sv-Manowar · 2005-06-20 14:12 · Score: 2, Insightful

This isn't exactly an entirely new concept. Intel have been selling their ethernet chips with built in SSL accelerators for quite some time, and the advantage of offloading duties from the software to the hardware (see Intel etherexpress vs RealTek style cards) is obvious. Whether these cards offload enough of the normal duties of a typical cluster node to be worthwhile should be interesting to see, as there are a wide variety of cluster load types and obviously these cards will have a niche to fit into alongside their competitors in the diverse set of demands around cluster networks. As for the price tag, I seem to remember gigabit cards being extremely expensive a few years back, and its probably pretty competitive with where they're aiming this product, alongside myrinet and infiniband.

--
Business Voyeur

Knock-Offs by randomErr · 2005-06-20 14:13 · Score: 4, Insightful

I give Realtek 6 months tops to make thier own knock-off of the card for $24.95.

--
You say things that offend me and I can deal with it. Can you?

I think there is definately a market for this... by Famanoran · 2005-06-20 14:16 · Score: 5, Insightful

But not necessarily where the vendors think it is.

Back when I was working at a startup developing anti-DDoS technology, one of the biggest problems we were faced when implemented GigE, was the load on the PCI bus. (This was before we started using PCI-X).

It depends on exactly how customisable the network card software is, but if you could plonk a couple of those into whatever system you wanted - and if the cards themselves could do, say, signature detection of various flood types, or basic analysis of traffic trends then that is a very definite market.

I realise the core issue is not addressed (if your physical pipe is full, then you're fucked), but it takes the load of dropping the malicious packets off the host CPU so it can attempt to service whatever valid traffic actually gets through.

And then there is IP fragmentation. Bad fragments? Perhaps a dodgy fragmentation implementation in the stack? (you know which OS I mean) Lets just drop that before the host sees it and crashes.

I don't know, I can't find any real information describing what they do, but I can certainly see uses for this.

Linux before Windows by mepperpint · 2005-06-20 14:18 · Score: 3, Interesting

It's nice to see a piece of hardware that ships with linux drivers and promises Windows support later. So frequently applications and hardware are first supported under Windows and occasionally ported to other platforms.

Re:What good is such a fast Ethernet card... by Phroggy · 2005-06-20 14:21 · Score: 3, Insightful

if your internet connection is anything less than fiber, which is about 99.9% of all connections?

The other 0.1%, obviously.

--
$x='S24;r)>63/* h@<5+oZ)32"5cz';$me='phroggy'x$];
$x=~y+ -xz+\0-Tx+;print$_^chop$me for split'',$x;

Re:What good is such a fast Ethernet card... by ScentCone · 2005-06-20 14:22 · Score: 5, Insightful

if your internet connection is anything less than fiber, which is about 99.9% of all connections? Not to mention the fact that not many computers can actually handle that much data at once anyway

Listen, when I've got 30 web servers banging away on a single database server, I want each web server in and out as quickly as possible. Every bit of the handshake, query, and results is going to wrap up that much faster if things are faster, period. When you're dealing with a huge data-driven e-commerce site, where every page renders around a hundred or more queries, and there are dozens or hundreds of concurrent page views, this stuff really counts in the aggregate.

If you sell one more widget per day, all year long, because your web presentation layer is just a little more snappy, that's sure as hell going to pay for a $500 NIC.

--
Don't disappoint your bird dog. Go to the range.

A rose by any other name... by Ingolfke · 2005-06-20 14:23 · Score: 4, Insightful

The name Level 5 refers to the network protocol stack where level 5 delivers data from the network to the application, according to Karr. The company isn't concerned about any potential confusion with Internet Protocol telecom Level 3 Communications Inc. On the contrary, he quipped, "It's working in our favor. People say, 'Yes, we've heard of you. You're a big company.'"

As lawyers at Level 3 begin salivating at thought of all of the potential lawsuits.

Re:A rose by any other name... by Leers · 2005-06-20 16:09 · Score: 3, Funny

"As lawyers at Level 3 begin salivating at thought of all of the potential lawsuits."

Actually, salivation does not come till Level 8. However, at Level 3 you do get "Detect Potential Lawsuit." Don't worry, once you hit Level 3 you'll be a Level 8 in no time.

Deja Vu Mainframe days by justsomecomputerguy · 2005-06-20 14:23 · Score: 2, Interesting

Back in the early-mid 80's (and probably even before then) IBM mainframes using SNA instead of TCPIP used special networking processors that handled all of that "networking stuff" so that the mainframe CPU (which really was a "unit" and not just a single chip) could just concentrate on running its jobs and not be interrupted by the communications end of things. Everything old is new again. Same situation, just smaller and faster (CPU and helper communications card take up 1U in a rack instead of 1 whole corner of the head end room).

There is a place in an NFS environment by crusty_architect · 2005-06-20 14:24 · Score: 4, Insightful

We use Filers for storage at Gigabit speeds. Compared to our SAN/FC evironments, we see much higher CPU utilisation on our Sol 8 boxes, especially when attempting to get to Gigibit speeds.

It is less import today then it was 10 years ago. by jellomizer · 2005-06-20 14:27 · Score: 3, Insightful

$500 for a network card you have to have a good reason that you will need it. I am sure there are applications that will utilize it but for the price it may not be worth it. With sub $500 computers coming to age. It is probably cheaper just to split all your services onto smaller boxen and have a load balancing switch/router. Computers are cheap today $500 for a network card is steep and will only fill a niche market. Perhaps if the price was in the $50 range it would be more widely accepted. But with good enough systems at 1k and additional 500 could be used for a faster CPU other then a faster network CPU

--
If something is so important that you feel the need to post it on the internet... It probably isn't that important.

Re:It is less import today then it was 10 years ag by Desert+Raven · 2005-06-20 14:37 · Score: 2, Insightful

I highly doubt they're aiming these cards at the general public. The kind of folks who worry about this kind of performance aren't buying $500 computers, they're buying $5,000 + computers, and trying to tweak every ounce of performance out of them. I'm willing to bet my employer is going to look pretty seriously at these cards for some of our heavy-use systems.

Sometimes you can't "split all your services onto smaller boxen and have a load balancing switch/router". Not everything on the network is a web server.

Re:What good is such a fast Ethernet card... by njcoder · 2005-06-20 14:39 · Score: 4, Insightful

" When you're dealing with a huge data-driven e-commerce site, where every page renders around a hundred or more queries, "

Each page renders a hundred or more queries? Sounds like you're better off investing in a better design than better hardware.

--
Open Source Java DAO Generator

spend the money on more CPU, not specialized stuff by SuperBanana · 2005-06-20 14:49 · Score: 4, Interesting

. Intel had the 586-driven smart-cards, and I believe 3Com had them as well. They were intended to offload the CPU by putting parts of the stack on the card.

You're probably thinking of the i960-based cards, though Intel's PRO series adapters (not i960 based) do something similar (TCP checksumming is now builtin to the chipset and most OS drivers now know how to take advantage of that). That processor, and variants, were used in everything from network cards to RAID controllers.

They failed because the performance gain and CPU offload numbers were never enough to justify the price difference.

Ding ding ding. I forget who said it (maybe Alan Cox, but I'm REALLY not sure about that), but the opinion was along the lines that it would always be more benefitial to throw the money at a faster processor (or a second processor etc), because you'd get a performance boost everywhere. $300 buys quite a bit extra CPU horsepower these days, and there's no need for the hassles of custom drivers and such. Nowadays CPUs are just so damn fast, it's also not really necessary.

--
Please help metamoderate.

Is there a place? by ChozCunningham · 2005-06-20 14:52 · Score: 3, Funny

Only one place. Amongst gamers.

"A $500 LAN Card? Oh my God, Stevie, thats almost as much as my GeForce9900XTLSI+ cost!" Said the kid with the Lone Gunmen T-Shirt.*

"That's nothing, This 8-Track-ROM player off of ThinkGeekcost almost a cool grand" Stevie said, as the other nerds bowed around his glowing and chromed Frag Machine.

*Lone Gunmen T-Shirts coming soon. 8-Track-ROM's, too.

--
Looks good for your age..

Is There a Place for a $500 Ethernet Card? by sporktoast · 2005-06-20 14:53 · Score: 2, Informative

Sure there is.

--
In a related story, the IRS has recently ruled that the cost of Windows upgrades can NOT be deducted as a gambling loss.

In a word, no by bofkentucky · 2005-06-20 14:55 · Score: 3, Interesting

Take sun, some of their new server kit this year is going to ship 10Gbit/s ethernet on the board, which acording to their docs, is going to take 3 USIV procs to keep the bus saturated (6 cores). But when you are looking at 8 to 64 way server boxes, who cares about those 3 procs, especially when in 24-30 months it will take less than one proc to handle that load (Quad Cored + Moore's Law), and the eventually one thread will have the horsepower.

Surely those smart dudes at Via, AMD, Intel, Samsung, Nat Semi, and/or Motorolla aren't going to:
A) FUD this to death if it really works
B) File patent suit until doomsday to keep it locked up
C) Buy them out
D) Let them wither on the vine and then buy the IP.

--
09f911029d74e35bd84156c5635688c0

Re:What good is such a fast Ethernet card... by cujo_1111 · 2005-06-20 15:00 · Score: 2, Funny

And you can't spell, you will fit right in...

--
If I point out that you are incorrect, making me a foe does not make you any more correct.

High-Performance Computing by amjacobs · 2005-06-20 15:03 · Score: 2, Informative

The place where things like TCP offload and RDMA support really matter is the high-end space. The major limitations on building really large clusters is the interconnect. If you look at the top 500 supercomputer list, you'll find that the top computers all use something better than gigabit ethernet. (Mostly Infiniband and maybe Myrinet) The reason for using Infiniband is that the communication latency is around 5us. For comparison , a standard GigE card has a latency around 70us.

That being said, http://www.ammasso.com/ makes an Ethernet card (priced around $495, I believe) that utilizes both TCP offload and RDMA. The latency of the cards is around 10us. This is great for people needing a high-performance cluster, but can't afford the Infiniband interconnect.

IPSEC by Omnifarious · 2005-06-20 15:06 · Score: 3, Insightful

If this card can do most of the work of IPSEC for me, it'd be a big win.

My main concern though is that with two ports, how can I be absolutely certain the packet has to go through my firewall rules before it can go anywhere?

Of course, the extra ports could be an advantage. If it could handle all the rules for you, then it might even be capable of functioning as a layer 4 switch and sending out a new IP packet before completely recieving said packet.

But, I'd want all the software on that card to be Open Source.

--
Need a Python, C++, Unix, Linux develop

Re:IPSEC by dbIII · 2005-06-20 20:48 · Score: 2, Informative

If this card can do most of the work of IPSEC for me, it'd be a big win.
Snapgear/Cybergaurd make little embedded firewall/VPN/router cards that can do this and plug into a PCI slot and pretend to be a network card as far as the host computers OS is concearned - but that's a completely different thing to what is talked about in the article.
I'd want all the software on that card to be Open Source
They run on linux and the IPSEC implementation is open source as well.

One word... by jcdick1 · 2005-06-20 15:11 · Score: 2, Insightful

Virtualization.

These are the kinds of NICs that would be put into a datacenter that is leaning heavily toward VMware GSX or ESX servers. Any bit of offload of the CPU in sharing the NICs is a good thing.

--
What?

Re:A network card for gamers too? by DigiShaman · 2005-06-20 15:14 · Score: 2, Informative

Pfffttt...please!

All the TCP/IP acceleration isn't going to do you jack shit if the server is running like crap. Also, you will need a good gigabit switch or router with CAT6 in place to remotely take advantage of this new NIC.

Basic rule of thumb still applies. "Your fastest connection is your slowest link"

--
Life is not for the lazy.

Looking into EPSRC by mislam · 2005-06-20 15:50 · Score: 2, Interesting

I would wait on jumping to any type of conclusion and see what happens when EPSRC adopts this for ther 512 node cluster. If it really improves the performance by 10% it certainly should be something to look into. Now, is there is a market for it at end user's desktop? I don't think so. 9$ Realtek card would be just fine. :-)

Re:It is less import today then it was 10 years ag by taylortbb · 2005-06-20 15:54 · Score: 3, Insightful

"Smaller boxes" is relative. Google's cluster nodes are dual Xeons with terabyte+ HDs. For Google it is small, for anyone else, that is powerful computer you're going to be paying alot for. If you're buying one of those computers you're probably going to look at one of these cards, and that is exactly the market they're looking for.

30k is peanuts. spent over 10x that on 1 box. by teknickle · 2005-06-20 15:58 · Score: 2

I have a few servers right now that cost close to $30k each. (and this is at a 29 employee non-tech company).
$30k might seem like a lot to a Windows technician, but that is a cheap box for high-end Unix servers (won't even touch mainframe for that).

6 years ago, when I ran a network for a large company (in a very rural community, mind you) the most expensive server there cost over $320,000.
It was an AS/400 and it was/is a phenomenal piece of equipment.

Heck, it cost $900 just for 1 10/100 ethernet card back in 1999. (and PC nics were $15)

$30k is peanuts. Go checkout some 8-way IBM boxes. Or HP or high-end chompaq. And no, you do NOT reboot a $300k+ server during maintenance.

Lies, damned lies and benchmarks by Harry+Balls · 2005-06-20 16:03 · Score: 2, Insightful

I looked at their benchmark web page http://www.level5networks.com/prod_etherfabricperf .htm where they claim that a typical PC with "conventional" ethernet burns 83.5% of CPU for communication overhead while only 16.5% remain to the application.
But they don't say which CPU was used - probably an 850 MHz Pentium III or something similar outdated.

Fact is, on a current 3.x GHz Pentium IV or an equivalent Athlon or Opteron the communication overhead is in one digit range, percentage-wise.

A famous computer science quote is:
"Lies, damned lies and benchmarks"
and another one is
"Don't trust any statistics that you haven't forged yourself."

--
Dedicated Linux servers (root access) $45 p.M.

Re:What good is such a fast Ethernet card... by njcoder · 2005-06-20 16:05 · Score: 2, Interesting

asp on iis 5 years ago, ok, that makes sense too.

Not trying to knock your design if it works it works. Since you're working on another flavor of it, let me give you my opinion on what I would have done differently. I've worked on webapps like this in java, not sure if you could do the same in php or asp.net. For something like this I would go with Java from my experience with it and also doing PHP. Whatever you do it in this advice might be helpful.

Your merchant info.... This probably doesn't get updated every day so you can cache it on an application level and let the cache refresh itself in a smart way when there are updates. You can do the same on a per session basis with shopper info. Sounds like your tax logic can be streamlined a bit as well. You might want to think of havinga seperate process that does the tax and can keep a cache of all the information, that way you don't have to hit the database for each item like it sounds you're doing now. Most of your logging probably doesn't have to be done in real time to the database. Or even in the database at all. There are ways to link your application logic with the webservers logging mechanism. The webserver usually does it in a smarter way, then you aggregate that info on a regular basis. If that doesn't work for you try asynchronous logging. Start up a seperate thread that writes to the log. That way the user doesn't have to wait for the logging to finish. Also caching the logs locally and then aggregating it even every minute or so on a heavy site should increase network and db performance since a few larger writes are faster than a lot of smaller ones. Even with everything you mentioned I don't see how you can have a hundred or more queries per page. I'm thinking at most 5-10 queries per page will get you all of what you need to display products, cross reference products/specials, and a bunch of other stuff. Your checkout pages might need a little more because you'll want to make sure you get fresh data from the database even though a good caching method shouldn't require it. Doesn't hurt to play it safe when the money actually trades hands. Your datamodel might need some going over as well.

You might want to add some more ram to handle the extra caching and there are many open source distributed caching tools that make it easy. OSCache is good for Java apps, memcached is in C but can be used with other languages including PHP and Java and I think ASP?

Since you're thinking of doing another version of it, you might want to consider these things, and probably more. Hard to say concretely without knowing much more. YOu can probably cut down on your hardware too. A site like hotornot.com, which is granted a lot simpler can serve up about 20 million pages a day across I think 50 servers. If you have a strong DB with 30 front end webservers (assuming you got them all 5 years ago and they're standard issue type webservers) I would expect a well designed, complex e-commerce site to be able to handle around 6-9 million page views a day with good response times easily. That's trying to be conservative too considering what you said.

Getting this NICs for 30something servers is going to run you between 10-15k depending where the volume discount sits in. Like I said I don't know your whole system but that money being spent so you're pages don't hit the database 100 or more times per page should do more for you than these cards. That's my long answer so you don't think I was just being flippant.

--
Open Source Java DAO Generator

Um, no. by holysin · 2005-06-20 16:20 · Score: 5, Interesting

If you have a machine (say on a machine running linux kernel 2.4.20-30.9smp) with a built in gig port (say with eth0 identified as eth0: Tigon3 [partno(BCM95704A6) rev 2003 PHY(5704)] (PCI:66MHz:64-bit) 10/100/1000BaseT) connected to a decent gigabit switch, and another machine (same card, same os)with a gigabit card, those two machines will achieve 940Mbps talking to each other (results via iperf, 0.0-10.0 sec 1.09 GBytes 940 Mbits/sec).

However, if you plug a windows box (2000 or xp, didn't have a 2003 handy) with either an add on card, OR built in gig (2000 vs xp) you get a rather less impressive figure of 550-630. Coincidentally, you'll get the same basic number if you run two instances of iperf on the same computer... This tells me the bottleneck isn't the PCI bus, it's the OS. If you can prove me wrong please do so...

Re:Um, no. by anno1602 · 2005-06-20 20:45 · Score: 2, Informative

If you encrypt data it makes it more random>

Encrypted data should look completely random, not "more" random, or else you can use the patterns in the stream for malicious activity. That's why, if you want to compress and encrypt, you always compress and then encrypt. Compressing an encrypted stream is impossible if your encryption is worth its salt.

Ok, kids, a quick education on sessions & late by postbigbang · 2005-06-20 16:38 · Score: 2, Interesting

All GBE cards are FC on the MAC layer. Get over it.

Here's where the problems come in:

1) buses suck. PCI-X is fast; a faster bus clock is better still
2) the problems for GBE NICs are, in no special order: dropping crap packets; cleaning up dirty cache (a huge problem for clusters, where this product is poised); session/protocol relationship managment; buffering up misrouted UDP; managing evil ports (setting them up and tearing them down); managing proxies and work arounds (a little SIP anyone? Burping up your IPSec???)
3) making the driver aware of what's going on so sessions don't vomit
4) not bothering the freaking CPU chipset every few milliseconds with useless crap

So, if they do any of these things, bless them and send me the bill. Because (save TOE cards), all of them hassle the drivers and chipsets to no end with stuff that could easily be offloaded. And to those that say, just put more cheapo load-balanced hardware on the job-- you're a chump and deserve to have stuff blown to bits when you mulitply failure points with junko doorstop hardware boxes with all of the brains of a goose.

--
---- Teach Peace. It's Cheaper Than War.

Re:What good is such a fast Ethernet card... by truesaer · 2005-06-20 16:42 · Score: 2, Insightful

Not only that, but you set up your internal network to operate at gigabit speeds, can't you? There is more to the network than the connection to the public internet even for those who don't have a fiber connection.

Disagree - more valuable now than before by jimsingh · 2005-06-20 16:48 · Score: 2, Insightful

As CPUs get faster an interrupt costs you more in terms of lost CPU time. So, reducing the number of interrupts is more important now than ever before.

My 100 Mbs ethernet card generates about 5k interrupts / second when transferring data at about 30 Mbps. Gigabit cards are engineered to hold interrupts until a few packets of data come in so that a DMA can move a larger chunks of data. If this NIC reduces the use of interrupts even further (say by off boarding computation or even the entire TCP/IP stack and thus allows for even larger DMA transfers) the impact could be substantial.

Unfortunetly, my knowledge of computer innards stops here, so I can't calculate how much cpu time 5000 interrupts actually take or how the new PCI-Express bus changes interrupt processing or how much a benefit it would be to have say only 1000 interrupts / second instead of the 5000.

Re:spend the money on more CPU, not specialized st by evilviper · 2005-06-20 17:00 · Score: 2, Insightful

Ding ding ding. I forget who said it (maybe Alan Cox, but I'm REALLY not sure about that), but the opinion was along the lines that it would always be more benefitial to throw the money at a faster processor (or a second processor etc), because you'd get a performance boost everywhere.

Interrupts are the one place where it's not remotely true. A faster processor will allow your system to handle significantly more interrupts. The whole interrupt model needs to be thrown out and replaced with something much better.

And while I'm at it, there are many cases where it's not true. Wherever you have a significant bottleneck, hardware acceleration helps tremendously. Tasks like encryption and (HighDef) video playback can max-out the highest-end systems available, while a $50 card can handle those tasks easily.

I don't think purpose-built hardware everywhere is the answer, but I do think having an FPIC/ASIC as a standard computer component could make for incredible speed improvements in most/all of the tasks that are hard for CPUs to perform.

--
Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant

Search for "Dealing with high network loads" by anti-NAT · 2005-06-20 17:29 · Score: 2, Informative

and have a read of why the interrupt problem isn't a problem anymore, at least on Linux. Note the date too - October 2001.

LWN.net

NAPI has been implemented in the kernel.org kernels for a number of years now.

--
The Internet's nature is peer to peer - 20050301_cs_profs.pdf

Wow!! by EightBits · 2005-06-20 17:36 · Score: 2, Interesting

I can't believe what I'm seeing here. The majority think this is a bad thing, it seems. I disagree.

I have a few Sun E450s in my shop and I am going to be moving them to gigabit ethernet soon. A gigabit ethernet card from Sun costs considerably more than this so it is an option as long as it will run on Sparc hardware with Solaris drivers. Sorry, but the Intel e1000 just isn't going to cut it here.

I'd like to see that article about 20 instructions (I assume these are ML instructions) handling an entire packet. This may be the case on CISC CPUs, but I just don't see it happening on RISC CPUs. I am not saying it's impossible, but I would definitely have to see that to believe it and I am genuinely interested in reading this article. Please let me know where I can find it.

I don't think some of you understand the difference between intelligent network chips and networkchips with a CPU core inside them. Take a look at Cisco's solutions. Their line is moving hastily towards ASICs. The idea here is that specialized hardware designed to perform a task will ALWAYS be faster at handling that task than a CPU running on the same clock. Cisco is proving this with Layer 3 switching vs routing. It's not clear what their solution is, but I'm willing to bet this NIC is an ASIC optimized for the purpose of handling TCP/IP traffic from an ethernet network. I have a hard time seeing how any CPU will be able to beat out an ASIC in this field today.

Also of note is memory and bus bandwidth. I have seen some comments about CPU usage and how it's negligible and what not. While I don't believe that either (I pay a lot for those cycles, I want to use as many for data processing as possible), I do believe that the CPU handling the TCP/IP stack takes a little more BUS bandwidth as well as memory bandwidth. If this is all handled on the card, both bandwidth usages will be reduced. Bus and memory bandwidth is already lagging way behind CPU speed as it is. It is my number one system performance limiter right now. The more I can eliminate it, the more productive I can be. Someone already mentioned large numbers of packets. This is a good argument as well. When dealing with large numbers of small packets, CPU usage on a CPU-based TCP/IP stack increases as opposed to a smaller number of larger packets. So in some cases, it depends on your network and it's configuration.

Also, consdier that maybe I do only get 10 more cycles per second from using this card. Is the card worth it? With CPU cycles at a premium and everyone here trying to purchase as many as possible and never a single idle CPU in any of my servers, I have to give a resounding YES! 10 cycles per second per CPU times the numebr of CPUs and seconds the NIC is in place is a LOT of cycles and most certainly worth $500 over the lifetime of a $30,000 machine. If they can prove it does what they claim on my hardware, count me in.

Admission of trademark confusion: by Chmarr · 2005-06-20 18:16 · Score: 2

From the article:

The name Level 5 refers to the network protocol stack where level 5 delivers data from the network to the application, according to Karr. The company isn't concerned about any potential confusion with Internet Protocol telecom Level 3 Communications Inc. On the contrary, he quipped, "It's working in our favor. People say, 'Yes, we've heard of you. You're a big company.'"

Congrations guys... you just admitted to causing actual trademark confusion... have fun in the courtroom.

Re:30k is peanuts. spent over 10x that on 1 box. by B2382F29 · 2005-06-20 18:17 · Score: 2, Funny

they were used to reduce the over-generation margin of the power companies

By using up the excess power? :)

--
Move Sig. For great justice.

Re:A network card for gamers too? by Sj0 · 2005-06-20 18:57 · Score: 2, Insightful

It's just like people who see gold ends on something and figure it's the automatic winner in any contest.

Never mind that using gold connectors and non-gold connectors together causes corrosion. :P

--
It's been a long time.

I worked on TCP offload card at Adaptec by Krellan · 2005-06-20 20:45 · Score: 4, Informative

Accelerating Ethernet in hardware, while remaining 100% compatible with the standard protocols on the wire, isn't all that new. Just over 2 years ago, I worked on a TOE (TCP offload engine) card at Adaptec.

http://www.adaptec.com/worldwide/product/prodfamil ymatrix.html?cat=%2fTechnology%2fNAC+Cards%2fNAC+C ards

It was a complete TCP stack in hardware (with the exception of startup/teardown, which still was intentionally done in software, for purposes of security/accounting).

Once the TCP connection was established, the packets were completely handled in hardware, and the resulting TCP payload data was DMA'ed directly to the application's memory when a read request was made. Same thing in the other direction, for a write request. Very fast!

I'm not sure of the exact numbers but we reduced CPU utilization to around 10%-20% of what it was under a non-accelerated card, and were able to saturate the wire in both directions using only a 1.0Ghz CPU. This is something that was difficult to do, given the common rule of thumb that you need 1Mhz of CPU speed to handle every 1Mbit of data on the wire.

To make a long story short, it didn't sell, and I (among many others) was laid off.

The reason was mostly about price/performance: who would pay that much for just a gigabit ethernet card? The money that was spent on a TOE-accelerated network card would be better spent on a faster CPU in general, or a more specialized interconnect such as InfiniBand.

When 10Gb Ethernet becomes a reality, we will once again need TOE-accelerated network cards (since there are no 10GHz CPU's today, as we seem to have hit a wall at around 4Ghz). I'd keep my eye on Chelsio: of the Ethernet TOE vendors still standing, they seem to have a good product.

BTW, did you know that 10Gb Ethernet is basically "InfiniBand lite"? Take InfiniBand, drop the special upper-layer protocols so that it's just raw packets on the wire, treat that with the same semantics as Ethernet, and you have 10GbE. I can predict that Ethernet and InfiniBand will conceptually merge, sometime in the future. Maybe Ethernet will become a subset of InfiniBand, like SATA is a subset of SAS....

--

Dr. Demento On The 'Net!

The Great Circular migration of Hardware by Ancient_Hacker · 2005-06-20 23:54 · Score: 3, Insightful

This is yet another round of the GCMOH. Anytime there are idlle hardware engineers they find something that can be moved off the main CPU to hardware (or these days, almost always, another processor). This is almost always a bad idea:

Erecting yet another edifice brings on the huge and unavoidable overheads of yet another different CPU instruction set, yet another real-time scheduler, another code base, another set of performance and timing bottlenecks. Another group of programmers. Another set of in-circuit emulators, debugging tools, and system kernel. Another cycle of testing, bug fixes, updates.
It sets up a split in the programming team-- there's now much more reason for finger-pointing and argument and mistrust.
The extra money would usually buy you another CPU and lots of RAM, resources that would benefit every part of the system, not just the network I/O.
The separate I/O processor usually requires the geekiest and least communicative of the programmers-- not a good thing. The manuals for the I/O card are likely to be very brief and sketchy, and rarely up to date.
The I/O processor is almost always at least one generation of silicon technology older than the CPU, so even though the glossy brochures just drip with Speeeed! and Vrooom!-y adjectives, it's not that speedy in comparison to the CPU.

For examples, see the $4000 graphics co-processor that IBM tried to sell for the PC (IIRC the CPU could outdo-it). The various disk-compression cards for the PC. Also see the serial ports on the Mac IIvx (very expensive and not noticeably better). Don't forget the P-code chip for the PDP-11/03. All very expensive and blase' performance/$.

How it works. by JQuick · 2005-06-21 04:18 · Score: 2, Informative

The white-paper and the web pages on the company site which describe the implementation talk about how this is done.

This card, and the software which drives it, differ from traditional ethernet accellerator cards and from alternative network protocols (like myrinet and iWarp) in several ways.

Alternative protocols not only require using a different software API but also require custom hardware at both communication endpoints.

Traditional hardware TCP/IP accelerators run the bottom half of the stack in custom silicon. This does tend to help reduce host CPU load but suffer from a number of problems. Since host CPU speeds have tended to increase regularly, they often helped for only a brief period of time. They also tended to help most for large packets but helped little or not at all for small packets.

This technology claims to help large and small packets equally well, and also claims to reduce packet latency across the board. It does so by running the bulk of the TCP/IP stack in user space rather than via system calls. The hardware runs ethernet Rx and Tx processing but does not implement the higher level IP protocol processing. Instead, once connections are established, the ethernet frames coming from the hardware, are fed via a system call interface to the application process to which they belong. Then, no further context switching between kernel and the process are required. The top end of the hardware driver and all of the subsequent IP layers, are executed in the context of the user space process. They are linked to the app via shared libraries.

Basically, instead of the linking the IP calls against code which requires frequent switching between user and kernel space, the entire upper half of the stack is run by the application sending and receiving the packets. This offers uniform benefits in packet latency across all packet sizes, and offers improvement in throughput as well.

I assume that all that is required is to link against a different set of shared libraries to gain these benefits (and of course to have the custom hardware on at least one side of the comm. link). This looks very good in principle.

The following page provides an overview of the technology and compares it to each of the competing mechanisms.
http://www.level5networks.com/sol_approaches.htm

Slashdot Mirror

Is There a Place for a $500 Ethernet Card?

76 of 423 comments (clear)