Comparing MySQL Performance
An anonymous reader writes "With the introduction of the 2.6 Linux kernel, FreeBSD 5-STABLE, Solaris 10, and now NetBSD 2.0, you might be wondering which of them offers superior database performance.
These two articles will show you how to benchmark operating system performance using MySQL on these operating systems so you can find out for yourself if you're missing out. While this may not necessarily be indicative of overall system performance or overall database application performance, it will tell you specifically how well MySQL performs on your platform."
what about postgresql?
I thought Windows XP was supposed to be the Fastest and the Most Reliable OS in the World
...never believe everything you read on the Intarweb. ;)
- Tuple calculus
- Transaction journaling
- Operator space/system call overhead
- Disk cache timings
And much more... in essence, you can't be certain these benchmarks hold true for the performance of all databases and it may even be a mute argument -- the same operating system may be tweaked differently if you're fileserving or mailserving or networkserving or if you're only dataserving. A useful tool, but one that must be run on each server.Try not. Do or do not, there is no try.
-- Dr. Spock, stardate 2822-3.
There was another test that included PG-SQL, but I can't find the link now. Basically stating that posgresql burned the rest of them out of the water on a mid ranged server
if anyone finds the study/test, post a link?
On a PostgreSQL install, I almost quadrulpeled performance on FreeBSD 4.10 by bumping up the SHMMAX in FreeBSD, then tweaking PostgreSQL to use it for queries and indexes.
Make sure FreeBSD has DMA turned on as well, and make CFLAGS somthing other than a 486.
All of the *BSD are *VERY VERY* conservative and will do a lot better when properly configured.
Moneyed corporations, non-working 'poor' and criminal prisoners are turning productive citizens into tax-slaves.
I'd say that depends more on your compiler than anything else. Even in windows [the OS we love to hate] once the application is in ram it can steal pretty much the entire CPU time if it wants.
...].
Then even on the heap side [e.g. libc] is fairly standard if you use GNU libc.
Things like file performance [or network] do vary because different OSes offer different storage algorithms [e.g. JFS vs Reiser vs Ext2 vs
Tom
Someday, I'll have a real sig.
Conclusion/final thoughts
Both Linux 2.4 and 2.6 had the strongest showing overall for these tests, dominating just about every benchmark no matter the workload. Scalability for both kernels was also excellent with addition of an extra processor. In fact, I was surprised how well 2.4 had done, as I had somewhat expected 2.6 to show at least a noticeable, if slight, increase over 2.4. Instead, they took turns besting each other from test to test -- and in scalability -- for a fairly even overall showing.
Solaris 10 had a very strong showing as well, having great speed as well as great scalability. I think the results show that Solaris 10 is a great platform for MySQL. Of course, I didn't have Super Smack results as I couldn't get Super Smack to port to Solaris (as detailed in the previous article), so bear that in mind.
NetBSD 2.0 also had a very strong showing, although it was tarnished by two issues. One, MySQL on NetBSD 2.0 doesn't scale with the addition of CPUs. The results would seem to indicate that it might be wise to run a uniprocessor kernel even if two processors are available. The other issue was the poor I/O performance for the 10M row SysBench test. The SMP scalability issue is easy to understand since, to be fair, this is the first NetBSD release to support multiple processors. The I/O issue is more of a mystery, however.
FreeBSD 5.3 did relatively well in both KSE and linuxthreads mode, although with all the work that's been done in the SMP and threading realms, I was a little disappointed with the results. Still, it seems that the native threading model for the production release of FreeBSD-5 is ready for prime time, and can replace the long-standing FreeBSD convention of using linuxthreads with MySQL.
For FreeBSD 4.11, however, linuxthreads definitely helped with performance (and in many cases outperformed FreeBSD 5.3). With libc_r, performance lagged far behind linuxthreads for many tests, and there was little scalability. I would say it's highly advisable to build your FreeBSD 4.11 MySQL binary with linuxthreads.
For all the time it took, I think the tests were worth it. I learned quite a bit about MySQL performance in general, and I'd like to again thank Peter Zaitsev for his methodology recommendations and input, as well as Jenny Chen from Sun for her input.
Slightly off topic, but if it's really performance you want, why don't people just use Postgres? It's had a much better feature set for years, and is starting to get enterprise level features. It seems like MySQL is somehow the default choice for open source projects, but as far as I can tell it offers no advantages and many disadvantages over postgres.
Is it just MySQL is slightly easier to setup?
AccountKiller
From the article: I used the GENERIC configurations unmodified, expect for above-mentioned changes and adding SMP support.
FreeBSD's GENERIC kernel config is for i486. If he'd commented out two lines, he could've tested for i686, which is what a P3 is. As it is, these benchmarks aren't helpful at all, because the optimizations assume a machine inferior to what's actually being used. He failed to eliminate enough variables for these to be meaningful.
Well, at least thats what Microsoft told me...
From what I gathered in the article he never GAVE benchmark results? Where the hell did all of what you posted come from?
From the article:
In the next article, I'll present the results for all six operating systems.
Linky
Oh wow! You mean there were too links in the summary.
Hee-haw! Hee-haw!
I might just be naive, but doesn't database performance depend a lot more on filesystem than OS?
There are 11 types of people in the world: those who can count in binary, and those who can't.
because everyone else does.
___
If you think big enough, you'll never have to do it.
Too bad he could not test on Mac OS X.
Of course in order to do that he would have to install the OSes on a PPC machine and I don't think freebsd on PPC is ready for prime time yet.
Get a free Mac Mini
Does anyone go: "OK, I need the OS for my mySQL project. I'll benchmark BSD, Linux, Windows, and choose the fastest OS."
Difference among OS should be pretty much unimportant unless one's an ISP or big enterprise. I would choose the OS based on completely different criteria:
1) Existing skillset (advantage to existing skills)
2) Existing deployed OS (advantage to OS already deployed)
3) My company's OS strategy (advantage to the OS and the CPU platform we chose to standardize on)
4) Existing software (if I already have X vendor's backup agent for mySQL on Linux or database tuning tools, I wouldn't use BSD just to (potentially) gain an extra 5% in some ludicrous benchmark result).
Today's hardware (and operating systems) are so cheap that it's almost irrelevant what OS and hardware goes into many a project.
Look at the new HP's 25p and 35p blades (Opteron-based) - a 2 processor 1GB RAM version is just some $1,700 more expensive than a 1 processor 512MB RAM version.
It's easy to lose that $1,700 in downtime, spend it on a Windows engineer's new RHCE or such...
Try:You'll need a stopwatch.
Marxist evolution is just N generations away!
"MySQL" sounds like something Yahoo or Microsoft would release. PostgreSQL sounds like ... an open source application.
I don't always use unix-like operating systems; but when I do, I prefer FreeBSD.
As someone who develops on PG, Mysql, Oracle and several other DBs, let me just say that if you're so wrapped up with your RDMBS that you react like that, you should really take up jogging, get a dog, start doing heroin, or something.
Christ on a cracker, some geeks are more of a downer than condom breakage. Coding is supposed to be fun.
I forget what 8 was for.
See the message thread titled "NetBSD performance" at http://software.newsforge.com/article.pl?sid=04/12 /27/1243207: an anonymous reader asks "Did you enable PTHREAD_CONCURRENCY? You have to set that variable to the number of CPUs in your system, else you won't be able to run more than one thread at a time, even you have more than one...". He replies "Sunofa. The $PTHREAD_CONCURRENCY environment variable wasn't set, as I had no idea it was an option. ...
It could very well be the issue. In the next few days I'll re-run the NetBSD tests with that set."
You should have used malloc--in Linux, you malloc till you run out of memory, Linux starts killing processos to free it up.
Slashdot: Where people pretend to be twice as smart as they really are by behaving like children.
Doh! Please no one correct me, please no one correct me...
Slashdot: Where people pretend to be twice as smart as they really are by behaving like children.
...and you claim that emacs will kick the shit out of both of them.
Fascism trolls keeping me up every night. When I starts a preachin', he HITS ME WITH HIS REICH!
Well, it's good enough for Wikipedia.org so it's good enough for me.
Treehugger? Treehugger... Treehugger!
I think you missed the joke. Not that it was that funny...
Treehugger? Treehugger... Treehugger!
That is a very good question, I don't know why has it been moderated as off-topic. Naturally it is useless to compare MySQL performance to MySQL performance ignoring any other options. (It is essentially the same tactic Micro$oft is doing all the time! Do we really want to parrot them?) First of all, there are MySQL gotchas and PostgreSQL gotchas, so you have to know whether the particular glitches are acceptable for you before you decide to use either RDBMS. Understanding the relational algebra, set theory and predicate calculus is essential to understand what the relational model is all about. Lack of this knowledge often leads to confusing tuples with OOP-style objects and other stupidity, so you will save a lot of time learning it first.
Now, the performance. Generally speaking MySQL is faster for a heavy load of simple read-only queries (like Slashdot) while PostgreSQL is faster for complex read-write queries (like a bank). Once you turn on the ACID support in MySQL it is no longer so fast, and it can really crawl because of row or even table (sic!) locking, a mistake avoided for decades by any advanced database. Here is another comparison. See also this recent thread on Slashdot. One of the best comparisons of Oracle, MySQL and PostgreSQL was done by the Computer division of Fermilab (Fermi National Accelerator Laboratory), this is a must-read.
There is a lot to read about it if you need more comarisons, but the general rule of thumb is that if you want lots of very simple read-only and very few read-write queries when the integrity of your data is not critical, you should probably choose MySQL. When you need that (or better) speed but the data is critical and you need ACID transactions which would severly slow down MySQL, try SQLite, the easiest choice there is, especially using Perl where you don't even need to install it (but just like with every other database, there are SQLite gotchas too, you need to be aware of them). If you need full ANSI SQL compatibility, ACID transactions, scalability and your data integrity is important, you should probably choose Oracle or PostgreSQL. There are also licensing issues. Oracle is proprietary. MySQL is GPL so you need to pay if you want to use it in any non-GPL software. PostgreSQL is released under a free-for-all BSD license. SQLite is public domain.
As you can see, there is no one-size-fits-all database. Every one has its strengths and weaknesses. The correct choice is a matter of trade-offs and finding out which database is optimal for your particular niche. Good luck.
Sincerely,
Pan Tarhei Hosé, PhD.
"Homo sum et cogito ergo odi profanum vulgus et libido."
What about Mac OS X? I know (actually, not really) that Solaris doesn't run on Apple hardware, but it would be interesting to compare the same stuff on an Xserve and also be able to test the OS X performance.
When you look at the state of the world, how can you not become a radical, liberal anarchist?
No serious DBA uses MySQL. Oracle, Sybase, PeopleSoft(soon to be oracle), IBM...
MySQL is what they use to teach you structured query in school. Databases have come a long way in FEATURES in the last 10 years...
------- "From bored to fanboy in 3.8 asian girls" ----------
Certainly you have read the documentation and noted their standpoint on compliance. It's a question of choice and that's all yours..
-if at first you don't succeed, stay the heck away from paragliding.
They only said 20% faster?
Free Firefox news reader.
Our company uses Firebird as a main database engine, it is indeed feature-rich compared to MySQL (triggers, stored procs, events, transactions, versions) and more mature/stable than PostgreSQL; yet we replicate the base to MySQL/PHP system for web portal functionality.
It turns out cheaper to establish a replication between two databases and use strong features of both (rich feature set and Borland's visual data controls + PHP/MySQL integration and raw speed), than settle for any single DB.
It would be disgusting to have no triggers/procedures on main DB; but working from PHP with Firebird seems to be not too native.
(We did also consider PostgreSQL but dropped it due to our 24x7 requirements and its VACUUM problems...)
Must say that it is very nice to see Solaris up there in the top in the tests that it was featured in. Seems Sun was not joking around when they claimed that Solaris 10 would be greatly improved on x86. As is often said around here; More choice is good.
What about the Firebird relational database?
Evans Data says it is the best, in a survey done for 2005, but copyrighted 2003. (I'm uncertain how much they should be trusted.)
As an other poster hinted, Windows 2000 server or 2003 server _would) be an interesting comparison. And Windows (albeit with SQL Server) does not do too badly when it comes to database performance, particularly when you consider Price/Performance.
What about the Firebird relational database?
Good point. "Firebird is a relational database offering many ANSI SQL-92 features" [emphasis added] PostgreSQL "supports SQL92 and SQL99" [emphasis added]. "New code modules added to Firebird are licensed under the Initial Developer's Public License. (IDPL). The original modules released by Inprise are licensed under the InterBase Public License v.1.0. Both licences are modified versions of the Mozilla Public License v.1.1." On the other hand, "PostgreSQL is released under the BSD license." Other than that they are mostly comparable, so you have risen a very good point. If you don't need standard SQL support and the license is acceptable, Firebird is a very good option.
Where Oracle ... can return simple select queries or complex insert or joined select queries in .5 to 1.0 seconds each
.5 to 1.0 seconds to return 'simple select queries', you are doing something wrong. Very large unindexed tables, perhaps. Alternatively if Oracle is taking the same amount of time to return simple and complex queries, that might indicate that something is wrong with the connection between your app and Oracle.
... I don't need to have nth degree optimized queries.
If Oracle is taking
Your 'code library' sounds an awful lot like what stored procedures tend to be useful for - presenting a stable external 'API' for accessing the database. If the database changes internally, you just change the stored procedures, and all applications using these procedures carry on as normal.
I don't need to have "good habits"
Uh huh.
I agree completely that you don't need to 'swat a fly with a sledgehammer' and some applications genuinely only do need a simple database with a few simple tables.
But good habits come in useful, particularly if circumstances change and you have to scale up rapidly - your website becomes massively more popular, your HR application suddenly needs to incorporate new features, whatever. And in any case MySQL has been getting a lot more advanced database features lately, so it's no harm to know them. They might just come in handy.
MySQL may be fast, because its features are so limited. Sure, it stores and retrieves records, but its partial implementation of SQL (without subqueries etc) and blatant bugs that violate SQL (try inserting '123456' into a varchar(4) column, it will silently truncate to '1234' instead of giving a 'Inserted value too large for column' error) make it useless for anything reliable.
If 'fast' is your only important concern you should be using flat files or dbm/ndbm files, but there is more to a database than speed.
Somebody posts a comparison of an application running on different OS's as a system benchmark, and what do people do? Attack MySQL.
God guys get over it, MySQL is here and it has actually proven itself to be usefull. Yes its missing features and has issues, but it fills the niche it is aimed at.
By default NetBSD's threading library in 2.0 only runs on one processor. To enable (experimental) support for scheduling a threaded process against multiple processors, you can set the environment variable PTHREAD_CONCURRENCY=.
I have always had a soft sop for Solaris since 2.51, it secure, and stable, in my experience, but alas its always been SLOW on x86 hardware.
:) I wonder if its a result of better x86 optimization or the new Filesystem
These benchmarks show that at least with mySql its pretty fast, but more importantly look at the solaris benchmarks, they are nearly identically consistent across all test, where others vary much.
Ive always kinda thought of Solaris as a 4 wheel drive truck in low range, but it looks like they added a turbo
Its good to see that Linux 2.6 kicked Solaris's ass.
One thing I noticed about version of mysql (prior to version 4.1, I believe) was that mysql didn't support the notion of nested queries, which at the time, was what I really needed to perform. An upgrade to mysql 4.1 solved this, but something that what I would consider to be an integral part of sql was just "left out" of previous versions just reinforced my decision to use PostgreSQL instead. (The mysql databaase was a product of a coworker, not using our organizational standards.)
And they said zombies weren't real!
RTFA and then check out the comments. There is a moron AC (who happens to love gentoo) who thinks recompiling the toolchain gives him faster binaries. Another AC spends a great deal of time trying to explain that this is not possible before the flaming starts. Fun stuff! If all the OS's are configured similarly (which the author didnt say they were) they should perform the same except for the 2cpu tests. There simply isnt a whole lot for the OS to do in these tests other than retrieve stuff from disk or something. The 2cpu tests are definitely interesting.
While your post is thorough and accurate,
you glossed over the fact that MySQL is now dual licensed. This DOES have repercussions. The GPL version can only be used by GPL software OR as a special exception. The special exception is made for PHP (and maybe others). If you are a Bank and choose MySQL you have to BUY a license.
I wonder how much there is to the MySQL great for websites (many read, few write) and the PHP license exception.
MySQL 4+ is not the MySQL that we all came to know and love in the 3.x days. Previously, I used MySQL 3.x but when I needed to upgrade, I moved to PostgreSQL because of the new license alone.
Let me re iterate my take. PHP license allows you to make commercial websites with it. MySQL allows its GPL license to be used with PHP regardless of purpose of the PHP scripts by special exception. Had there been no special exception, we'd have seen the downfall of MySQL and the upshoot of PostgreSQL or SQLite.
As a user/admin of all 3, I find that you can either use PostgreSQL or get away with using SQLite. Incedentally, try using SQLite with SQLRelay if you need network access for SQLite.
Slashdot's rate-of-post filter: Preventing you from posting too many great ideas at once.
Does anyone know of a nice computational/science benchmark that runs on Linux/Windows/BSD and is free?
Linpack, stream, polyhedron, livermore kernels, you name it. Most of them are freely available and require little more than a Fortran compiler.
(1) 9 GB SCSI-160 (7200 RPM)
Okay, this guy is not a database person. Now, the choice of database product to benchmark with might have been a dead giveaway, but so is the choice of configuration. If you're not going to test with a real database configuration (which the author of the article has probably never seen) why bother? Funny that he'd test on a machine with two CPUs and one drive, though. Great judgement in hardware selection.
Releasing the results against an old build of Solaris x86 a few days after the general release of Solaris 10 is pretty funny, too.
Compiling his own versions of the software without even looking at the patches the maintainers of the FreeBSD and OpenBSD ports of MySQL use was really good judgemenet. I mean, heck, they might know something about how to make that software perform properly on their platforms, and we wouldn't want that skewing the results! "After all, I'm testing the operating systems, not their pre-packaged MySQL distributions or source builds."
What... a... loser...
Now before I get modded down, I be to remind whoever might read this that what I am saying is FACT. - bogaboga
Learn more about the benchmarks and WHY FreeBSD is slightly slower NOW. This time next year you will likely fully understand FreeBSD's decisions when they are again the benchmark leader on a 'typical' hardware configuration. The future is all about multi-processors and multi-core processors. Guess what? FreeBSD 5.x is ready.
It was foolish, in my opinion, to keep this in the release. I wonder, how many points the OS lost in the benchmarks because of it...
In Soviet Washington the swamp drains you.
MySQL had better be fast - they just cranked the price again. In the last two months the per-100-copy bulk price of MySQL jumped from about $100 to about $230 a server.
(Yeah, I'm looking hard at Postgres now.)
With the introduction of the 2.6 Linux kernel, FreeBSD 5-STABLE, Solaris 10, and now NetBSD 2.0, you might be wondering which of them offers superior database performance.
Shouldn't you be using a database to do the performance testing then?
-1 Uncomfortable Truth
I pay something like $4/month for my host. I get one mysql db. If I want a different db, I have to go to a more expensive plan.
hawk
seems to me that not everyone needs Oracle, which can cost tens of thousands per CPU, even with the educational or non-commerical discount, and that we have to look at each person's needs individually.
Do we need field-level locking, or will row-level locking suffice?
Do we need distributed servers with failbacks and rollbacks or will a simpler solution with a periodic backup suffice?
Each database system design should match the needs and at least reasonable expectations of growth - One Size does not necessarily fit all.
-- Tigger warning: This post may contain tiggers! --
well, i've been having some interesting ODBC issues passing the same SQL queries to Access and to Oracle, so let's not pretend that the ANSI SQL compatible world is that compatible ...
-- Tigger warning: This post may contain tiggers! --
There's nothing preventing you from installing OpenDarwin http://www.opendarwin.org/ on x86. This puts you effectively in an OS X environment on x86, only without the Aqua UI.
The cure for cancer is coming: Reovirus
ie, we had a job that would take 40 minutes to complete and it needed to run hourly.. That only allowed 20 minutes for users to analyze those results. The root cause of the problem was the poorly designed application but we couldn't get management approval to fix it.
A contributing factor to the problem was the server... Even tho we had multiple disks, they were only setup in concat mode. We convinced the unix team to strip the data and our job went from 40 minutes to 20. (If we were able to properly redesign the app, including the disk change, we could probably had most of the application almost real-time...)
In another case a DBA had set their SGA to have a buffer cache of 1 Meg. Server was tuned, app was tuned but not the database.
What i'm trying to say is.. if your experiencing performance problems you have to make sure you find the root cause before you make any recommendations.
"Thanks to the remote control I have the attention span of a gerbil."
If you don't need standard SQL support
Firebird implements standard SQL. Firebird supports full SQL92 and most SQL99, according to the project website.
This is why the DragonFlyBSD project does exist.
{{.sig}}
After reading the benchmarks, I noticed that the Linux distribution chosen was Gentoo. I like Gentoo (and use it myself). The author of the newsforge article does not really state if the all of the kernel, libraries and applications were built completely from scratch or did he use a stage 2 or 3 install.
If the box was built from source, I would expect that Linux benchmarks would be higher simply because the kernel, libraries, and applications were most likely tuned to the hardware. Otherwise, I would like to see RedHat, or SUSE, or other "out of the box" distros in addition to the others.
Just my $0.02
Coderz 4 Life
It does, but you might not WANT to have a single process taking up more than one processor because of threads. The change is entirely trivial and not hard to do, but the flexibility allows you to share load on a per-process or close to per-thread basis, depending on this value.
But I agree it'd be saner to have the default be the number of processors, and if I read correctly this is the situation in -current right now.
Sam ty sig.
I don't think it would be out of the question at all to compare benchmarks of an Apple machine with a street price comparable to the Intel machine used. After all, if you're going to buy a machine to run MySQL on and you have budget XX, why wouldn't you consider a Mac if it turned out to be faster?
Breakfast served all day!
MySQL ships with the InnoDB database engine as a standard part. InnoDB will write to its log with every completed transaction, flushing that to the hardware. If you lose power, it'll reapply the transactions from that log automatically, without hassle, when the database server is restarted. The log writes are sequential so you can get both excellent write caching (into dirty database pages in RAM) and transaction durability (via those immediate flushes of the log to disk). InnoDB is also the database engine which offers transaction support, so if you want the normal transaction guarantees like durability, you're going to be using it anyway. This isn't a theoretical capability - I've had production Wikipedia servers go down with tens of thousaands of dirty database pages in RAM and come back up with all data, while I was asleep.
Alternatively, though it's still in alpha, there's the choice to use the Cluster engine to spread the load over many redundant servers via mirroring in the database storage engine. Very interesting technology for me, though still too early for Wikipedia to use.
For those with high read load or worries about single machine failure, replication is also standard and it's easy to set up a web server or two to do replication in the background to get the last bits of data saved before a disaster hit the master. On the Wikipedia technical to do list is a replicating slave in Paris, so the building falling down on the main site will still leave even very recent updates available.
The combination of InnoDB and replication makes it easy for me to sleep well. The one thing it doesn't yet do as a standard part of the package to complete the picture is automatic failover to the best slave if the master dies and stays dead. I think the failover bit is on the to do list for the 5.n series. Cluster does, though - one of the reasons why I'm watching it with interest.
None of this means PostreSQL is bad. Just different.
I think they only used lc_r on the 4.11 release test. For 5.3 they used the new KSE and the older LinuxThreads.
I did expect KSE to do better against LT, though. KSE has been sold as being lighter weight and faster than LT.
Jerry
http://www.syslog.org/
Funny, I had the opposite reaction. Considering that he tested optimized Gentoo (I'm assuming it was comiled from source for i686) w/ ReiserFS (isn't it still considered non-standard in Linux?) against binary installations (i.e. compiled for i486) of FreeBSD 4 & 5, I would say that FreeBSD delivered a very strong performance.
Regarding 4 vs. 5: if you consider that many code paths in FreeBSD 5 are considerably more expensive than in 4 due to the fine-grained mutexing, it's impressive that the first truly useable release of 5.x is already keeping up (for the most part) with the simpler, highly uni-processor optimized FreeBSD 4. It was also impressive that KSE delivered roughly the same performance as LinuxThreads, again considering the small amount of time that KSE has been in widespread use.
If you read the FreeBSD developer lists even a little bit, you would find that the focus of work so far has been on laying the foundation for fine-grained SMP and threading to eliminate the scalability cap of FreeBSD 4's fully locked (i.e. non-SMP) kernel. It will take some time before the fruits of that labor result in significantly higher performance (especially in the uni- and dual-processor cases). In some areas (e.g. packet throughput), FreeBSD 5 already trounces FreeBSD 4 and Linux 2.6. In others, the shorter code paths and years of optimization in FreeBSD 4 still can't be beat. This seems quite reasonable to me.
I have to point out just a few of the most significant flaws in these benchmarks in the hope that people will interpret the results appropriately:
1.) The domain socket (i.e. local, non-network) tests were performed using super smack. As the author notes, super smack does not compile out-of-the-box on FreeBSD. That's because it is developed on Linux and (obviously) the author(s) don't test or optimize it for FreeBSD. It's reasonable to assume that it's performance may therefore suffer quite a bit on FreeBSD and no benchmarks were performed to isolate this potential penalty. If you think I'm stretching, consider this: PostgreSQL was (and may still be) primarily developed on FreeBSD. If the development team never tested and optimized it to perform well on Linux, would you consider a PostgreSQL on Linux vs. PostgreSQL on FreeBSD a fair (or even meaningful) benchmark?
2.) While the author fails to clarify whether the sysbench tests were performed remotely or locally, he does mention that he thought sysbench gave meaningful results over the network, so I'm going to go out on a limb and guess that the sysbench results are over a network connection. If the sysbench tests were local, then the same criticism applies as the super smack tests - the benchmarking tool performance and load on each OS must be accounted for so that it may be isolated in the results if they are to have any meaning.
I'm sure you can see where I'm heading but, if any of these benchmarks are remote, we need to know the OS and hardware of the client machine. This is never mentioned. While remote database testing is a good idea in that the benchmark workload is automatically isolated, one must take into account that the benchmarks now reflect network performance between different OS's (assuming the same client machine is used for each test). If careful, informed TCP/IP tuning is not performed on each server to eliminate differences (and the author mentions nothing about this), the OS with the most aggressive default tuning or default tuning that most closely matches the client will certainly show superior results that have nothing to do with MySQL performance. Givens the author's obviously increased familiarity with Linux over the other OS's tested, I would guess that his client machine (if any) was running Gentoo Linux, giving that OS an extreme advantage in network performance. If you're not familiar with FreeBSD, it's default TCP/IP settings are extremely conservative. A good example is the kern.ipc.somaxconn kernel tunable which controls the maximum number of simultaneous TCP connections. It is set to
Alright, calm down tough guy. I can't really see how an imaginary database benchmark would help, but niether do the benchmarks we're discussing now becuase of the all the points I raised in my last post. What would be interesting is to see the tests repeated with the client workload properly isolated by alternately running each benchmark on two different client machines. The client machines would ideally have identical hardware but different OS's (say SUSE and FreeBSD 4.11). An attempt would then be made to tune all TCP/IP stacks (clients and servers) for the highest compatibility. That means only enabling common features (e.g. no SACK because NetBSD doesn't have it) and probably disabling window scaling altogether. Since this is not supposed to be a network benchmark, the goal would be to eliminate network performance variations to hide any networking advantages one OS has over another.
While using MySQL for a benchmark is useful because it is a real world app in every sense, the benchmarks would be far more useful if another similar app such as PostgreSQL was used in addition. This would still not provide anything close to a scientific control to eliminate unknown variations, it would at least be one step closer.
I'll close by mentioning that when I moved the busiest MySQL server that I administer (avg ~ 200 queries/sec) from Red Hat Linux 8 to FreeBSD 4 on a Dell 6650, I saw a drop in load average and CPU utilization of about 20% (same MySQL version and config). There was no noticeable speed difference. So FreeBSD ended up providing a meaningful benefit that would never have been reflected in this or similar benchmarks. All benchmarks have their shortcomings and holes, this one just happens to be especially riddled with them.
Acutally I agree with you 100% on FreeBSD4.
I still like FreeBSD and I did not intend to start a flamewar.
I am talking about FreeBSD 5.x.
If you put it on your Dell server at work I guarantee you that a decrease in performance would be observed.
All the really good developers left Fbsd and 5.x is a disaster to many old time BSD hackers who either left the project or switched to another version.
Dragonfly looks interesting and many of the former FreeBSD and BSDunix hackers are working on it. I hope it stabilizes soon but that wont happen until more people use it and more software is ported to it.
http://saveie6.com/
ARGHH!! This is worse than the cliff hanger cowboy thrillers I saw every Saturday afternoon at the old State Theatre on the East Side of Waterloo, Iowa. I had to wait a week to find out how the hero survived going over the cliff. How long will I have to wait this time???