Intel and BlueArc Set New Mail Server Record
louismg writes "With e-mail traffic continuing to explode, Intel and BlueArc announced this morning that the two companies have set a new SPECmail benchmark record in cooperation with CommuniGate Pro, offering a solution that can serve 30 million messages per day - 67% ahead of the previous record, owned by Sun Microsystems. Rather than clustering a lot of smaller servers together, large ISPs can now use fewer systems to handle massive traffic load."
And if there was no spam, I doubt there would be such a need for these machines.
Looks like spammers have a new tool ...
There will be partying in the care homes when this news spreads (via email of course).
liqbase
Now that's what I call a server consolidation booster!
Java Oracle Linux Enthusiast
Turns out it's really spammers.
"Rather than clustering a lot of smaller servers together, large ISPs can now use fewer systems to handle massive spam load."
Of course, you can always opt to include some stronger transport filter rules. Since most email is spam, this could make your MTA over 50% faster...
SunOS was BSD based, but Solaris never was.
Real World performance of any email server can be increased by blocking any IP allocation without an abuse@ contact availiable via IP whois. Yes APNIC & LACNIC, that basically means your assignments! You can also block comcast and res.rr.com by RDNS for a further significant reduction in unsolicited mail.
Why would anyone want to increase processing for the benefit of spammers?
CommuniGate Pro Dynamic Cluster - Backend Servers (4 systems)
Software
CommuniGate Pro: CommuniGate Pro v4.3.6 Operating System: Sun Solaris 10 x86
CommuniGate Pro Dynamic Cluster - Frontend Servers (5 systems)
Software
CommuniGate Pro: CommuniGate Pro v4.3.6 Operating System: Sun Solaris 10 x86
No, not the post.
Isn't part of the allure of smaller systems handling the specifically to get away from large dedicated systems that aren't nearly as reliable?
By now, google should have taught the world something - distributed computing with small cheap specced systems that can each be swapped with multiple redundancy is the way to offer both uptime, speed, and be cost effective.
It's nearly identical to the "lean" manufacturing techniques pioneered by the Japanese. Small cells that can increase or decrease output based on the amount of workers (systems) that are working that day. Very flexible.
After all, it's a COMPUTER.... do you really want it dedicated to just email, or can we use it for other tasks in the downtime.
Using massive systems for handling mail invites a single point of failure (SPF), whereas using clusters of smaller systems for the same amount of money gives failover capability.
Of course, ISPs won't realize this.
LOL!
Of course there is still life in BSD! Not everything is run on Win/Tux!!
And don't forget MacOSX is a BSD at heart.
-if at first you don't succeed, stay the heck away from paragliding.
I didn't even know there was a SPECmail, but this figure doesn't seem too outstanding to me.
;-)
Firstly I assume this is just a raw delivery setup - no spam or virus filtering. You'd be amazed how much of a difference this makes to any real world setup.
Secondly, apache.org does over 2 million mails a day on a dual 2.4Ghz Xeon using an SMTP server written in Perl. And that's with full anti-virus (clamav) and lots of different anti-spam measures including SpamAssassin (which is known to be slow - I know because I used to be one of the developers).
I also know of commercial setups doing over 50m (legit, well - mostly) mails a day. Using an SMTP Server designed with performance in mind. Perhaps they should submit for SPECmail
So 30 million doesn't seem terribly amazing to me. Perhaps Communigate Pro isn't a very fast mail server.
Matt. Want XML + Apache + Stylesheets? Get AxKit.
Sadly, the need for that much capacity is probably driven by spam, which means ISPs are spending a boat load of money to facilitate spam that neither they, nor their customers want, and the ISPs have to turn around and charge higher prices because of it.
Certainly, any ISP worth its salt is going to be filtering spam, but there aren't a lot of anti-spam systems out there that are effective over the long term or aren't annoying as all hell.
Jerry
http://www.cyvin.org/
...imagine a beowulf cluster of these...
*sigh*
Procrastination -- because good things come to those who wait.
If you would look at the details, the mail cluster uses NFS. Invented at Sun.
67% more mail, 250% higher cost! Well maybe not, but they're still only telling half of the important story.
Really though - is this really a useful metric - x million emails per day? Anyone needing this sort of mail throughput is going to be either a megacorp with a centralised mail server (do any actually do this?) or an ISP. If you're talking this level of mail you're already using some multi-box mail system. In that case individual box performance isn't so important, but rather costs.
In essence - for anyone needing this kind of system, "$x per email per day" has got to be a better metric than the useless "one single box, we won't tell you how much it costs, can do x million mails per day".
Interestingly enough, they set their record using a message store mounted on NFS. It had 140 FC/AL attached disks, and 14Gb of RAM.
Virtually every file handle an MTA writes to is opened "O_SYNC". One of the quickest ways to make Sendmail or other common MTA's go fast is to mount their delivery queues on a solid state disk. I'm betting this disk array is turning around the queues without ever committing the data to the platters. (Not that there's anything wrong with this...) I am left wondering if there isn't some bit if NFS trickery not reported in the config.
But looking at the Sun entry, the old record was set using 2 year old software, and a much smaller disk configuration. Sun will probably update their entry in the near future, just to reclaim the crown. Email is much more an I/O problem than a CPU problem. Sun used to push their mail server on much larger HW, but most ISP's don't want to buy big boxes these days. The small to medium sized boxes, connected to a SAN are more cost effective, permit redundancy and easier maintenance.
actually this isn't true either. MacOSX has a FreeBSD based compatibility layer, but in reality is Mach based. This also causes some problems. The guy at http://www.kernelthread.org/ recently had a contest which was based on a huge problem with the way processes are handled in the compatibility layer and how certain things arent actually ever seen to mach. Thus causing kernel panics in certain situations.
I write code.
Translation: spamming overtakes porn in driving computer technology forward. Senators everywhere rejoice.
Hyperthreading is a nasty kludge, it splits the cpu into 2 virtual processors instead of a single superscaler processor.. The reasoning behind this, is that 2 poorly written apps which couldn't otherwise take advantage of a superscalar processor, can run simultaneously...
The side effect, is that apps which can take advantage of superscalar processors end up only seeing half of the processor and half of the cache.. So, modern apps (or apps compiled with modern compilers) actually lose performance due to hyperthreading..
Also, hyperthreading has been the cause of a number of security issues lately.
http://spamdecoy.net - free throwaway anonymous email - avoid spam!
It's been shown that you can mount timing attacks by the cache usage patterns. This is due to the fact that the cache is NOT cut in half, rather, all the cache is shared. If one thread stalls, or doesn't use floating point/vector operations, while the other one does, there are clear benefits of hyperthreading. It is HARD (impossible?) to use all of the execution units in a "wide" CPU all the time. Real benefits are reaped if the threads have good memory locality, while running diverse instructions. It is not pointless and it's not overly insecure.
Intel says this entire performance increase goes away if you use AMD processors. Why performance goes away when you use a chip that doesn't need its own mini-nuke for a VRM is beyond me.
One of the 187.
It says a lot about the state of things when something like the fact that an e-mail server can handle millions of messages per hour makes the news.
Wake me up when there is a news item about how all the pornography on the Internet has suddenly disappeared.
I don't know the meaning of the word 'don't' - J
what we need to do is build LESS capable servers than we have now so spammers cant get as much through.. and use that slow time to both track down the persistent ones and develop better filters, not faster servers. better technology = better abuse of technology.
- How about that spiffy big Postfix or Sendmail box you've got sitting out there, whose sole purpose in life is to act as a relay? Sure, it'll process millions of messages per day. It doesn't have to do much.
- What if you're running a virus checker like ClamAV or a spam checker like SpamAssassin? Those take up CPU cycles. Sure, delivery is slower, but value was added.
- What if your back end mail system is something like the Citadel groupware platform, where MIME content drives an event handler system? Again, delivery is impacted, but the functionality of the system depends on it.
- What if your org has a global directory and your mail hub is responsible for making complex routing decisions for each message? Again, delivery is impacted; it'll be slower than the mega-fast-box, but mail won't be delivered correctly otherwise!
So you see, it's not always just about raw speed. And besides, next year's hardware will be faster anywayTired of FB/Google censorship? Visit UNCENSORED!
Thats all I want to know
So how the hell is this a record?
the company I used to work for (a spammer) - hey, I had to eat!
they were sending on AVERAGE about 100 MILLION emails a day by using a cluster of 100 small (1U) racked machines.
and we just used POSTFIX. running mostly K6-2 cpus, yeah real old stuff. lots of K6 cpus too.
so how the hell is this a new record?
oh I guess because spammers don't do the SPECmail tests... hehehe
Well they're 1/3 of the way to a simple postfix config. keep working on it guys!
We were doing nearly 2 million per hour using an intel box in 2001 at NBCi/Xoom... using qmail running on redhat in a almost stock configuration. The hard part is getting all those emails out of a database and throttling how they are sent. Its pretty easy to set off big time spam alarms if server at bigdomain.com happens to get 10,000 or so emails in matter of minutes.
My old web firm used Communigate Pro and we always found it to be a highly reliable and stable system. Their support used to be handled by one of their lead programmers and was top notch.
This can't be right. Right now, I manage a system that does 500,000 messages/day running of 8 IBM 306 boxes with FC3 & Exim and a NFS backend. Its pretty small, most of the companies I know here in japan are 100 times bigger than us and must easily break 50M messages/day ... docomo alone must do 500M/day ...
...
And the idea that one would need commercial software to do this is laughable
The world population is about 6000 million people. Considering that not all of them read mail (e.g.: infants) it's not too way off to assume that average number of mails that a person can cope with is around 10.
If this box can deliver 30 million messages, 2000 of them are going to saturate the email limit the human civilization can handle (6000 x 10 = 60,000).
But there is a catch: this is under the assumption that all those 60 billion mails are not spam.
In other words, this record breaking "system" has no market in a spam-free world!
So that includes users connecting, picking up email, deleting from their data store etc etc etc.
Disclaimer: I have two friends who work for Bluearc but have no other connection to the company
ad
Considering that the main cybersecurity (sorry) threats today are viruses and phishing attacks, how long might it be before super-high-volume email systems will be classified as munitions that require licensing and export restrictions?
Imagine if Nigeria got the e-bomb...
Kevin Fox
BlueArc's Titan sustained a performance level of 12,500 SPECmail messages per minute, or the equivalent of two and a half million SPECmail users, sending 30 million e-mail messages per day.
The math seems a little off...
12,500 messages/minute * 60 minutes/hour * 24 hours/day = 18M messages/day, not 30M.
That having been said, CGPro is fast as all heck so I can believe it topped the previous record.
Moderating "-1, Disagree" is simple censorship. Have the guts to post your opinion.
The announcement came from BlueArc. They use Intel hardware. They use CommuniGate software.
So you replace 5 Sun 480's with 4 1.2GHz processors with 9 servers with 4 2.4 GHz Xeons, and deliver mail to 2 million users instead of 1.5 million? Excuse me for not being too excited. Yes, 480's are expensive - wonder what a 4 way dual core V40z would do.
Envy my 5 digit Slashdot User ID!
Network Commerce (now chaper 7) was sending 60 million a day with 3 racks of 4U P3 servers & a few Sun boxes (just another reason to hate java) back in 2002. The day before they went chapter 11, they figured out how to double that amount.
(I knew the tech staff & they hated their jobs.)
Just thought I'd add a few details and address some of the questions here. My name is Thom O'Connor and work for CommuniGate Systems (CGS), and was the one who put together and ran these tests - you can (mostly) verify this by looking at the comments in the source on the results page.
First off, on the SPECmail test itself. SPECmail is a standardized test (the only one I'm aware of for email) that attempts to closely regulate a level playing field for measuring email performance. It is critical to understand that this is not just measuring SMTP. The 30 million message a day text is a little vague, but it is important that this includes a distribution of delivery, relayed, and retrieved email. Sure, anyone can just relay many millions of messages an hour.
SPECmail does POP and SMTP, so the test measures not just MTA behaviour but also local delivery and then retrieval of the messages. The SPECmail test also uses Quality of Service (QOS) measurements such that a message injected via SMTP to the system MTAs (the CommuniGate Pro Frontend servers in this diagram) must then be delivered locally into the users' account, then be retrieved within 60 seconds. Satisfying the QOS criteria during the benchmark is often the most difficult part.
So, SPECmail itself just does POP and SMTP, which is a little 1990s I agree, but SPEC is coming out with a SPECimap test in the near future, and CGS is also very interested in seeing a SPEC VoIP/SIP test for measuring CommuniGate Pro's Real-Time capabilities.
A few others questions I've seen raised here:
1. The CommuniGate Pro Dynamic Cluster described in this test is fully and completely appropriate for production use in all aspects. In fact, if you're running a 2+ million user ISP on a CommuniGate Pro Dynamic Cluster, we'd recommend you to use these results as a guide for your architecture (although load balancers should be added to the gateway point for all inbound connections). In fact, CGS has ISP customers running architectures which match the layout of the described system almost exactly. All systems in the Cluster service all accounts - you could lose 4 Frontend Servers and 3 Backend Servers, and all users could still access their email (albeit with decreased capacity).
2. HyperThreading was disabled in the BIOS because the downloadable Solaris 10 x86 operating system would not (yet?) support the Intel x86_64 Potomoc chipset properly. That said, on top of the recent security vulnerabilities on the topic, we have also discovered miscellaneous threading and even NFS issues related to having HyperThreading enabled on Linux 2.6, FreeBSD 5.4, and Solaris 10 x86 systems.
3. On NFS...NFS is used safely and securely in this test. The integrity of data storage is one of the major criteria that the SPEC organization closely evaluates when reviewing a SPECmail submittal. Obviously, there are many ways to cheat and/or cut corners using Solid State Disks, unsafe RAM for message queueing, and other techniques that you would never want to use on your production message system. However, the test described here was performed using a standard (albeit excellent) BlueArc Titan Storage System with write caching only in NRRAM and using proper mount options and layout for security, redundancy, and data integrity.
Hope this clears up any misconceptions. Obviously, I'm clearly biased about the work here, but assembling and then passing a SPECmail test of this size is a gigantic effort. If anyone thinks
The suppercomputer devoted to bulk mailings
"Flyin' in just a sweet place,
Never been known to fail..."
Now we know why Intel has such dominance in the processor market, it has a secret alliance with spammers. (Buy AMD!)
All your base are belong to Wii.
30000K or 347 mail per second does not sound that fantastic! I mean the Apache HTTP server can do around 2000 request per second on a standard PC today. If you do not factor in relaying the SMTP protocol is not that much different from HTTP measured in work per message, or did I miss something here?
I know of a cluster system that sends 400 million messages a day. If they have to use this BlueArc one, then they would still need a cluster of BlueArc mail senders.
"Rather than clustering a lot of smaller servers together, large ISPs can now use fewer systems to handle massive traffic load."
From the testing reports linked in the article we have:
For a total of 10 systems seperated into 3 levels (front end, backend, data storage).
High quality editing for Slashdot as usual.
Source: http://www.spec.org/mail2001/results/res2005q3/mai l2001-20050707-00039.html
So? Still no spam scanning. Still no AV. These figures aren't good, and I think CommuniGate is the bottleneck.
Matt. Want XML + Apache + Stylesheets? Get AxKit.
Interesting, I was under the impression that when they said freebsd services they had taken it that much closer.. I stand corrected. http://www.apple.com/macosx/features/unix/
-if at first you don't succeed, stay the heck away from paragliding.
yeah, it's funny as most people see something about FreeBSD and immediately assume that the kernel is freeBSD. I think the gnu/hurd people were on to something when they decided to use mach. it is a great base for doing what you want. of course they have no changed their minds again (will we ever see a stable hurd?). Luckily OS X brings mach to life, and with it's user base growing and growing, i couldn't be happier to be using such an interesting kernel.
I write code.