How To Set Up A Load-Balanced MySQL Cluster

← Back to Stories (view on slashdot.org)

How To Set Up A Load-Balanced MySQL Cluster

Posted by ryuzaki0 on Sunday April 23, 2006 @03:03AM from the someone-teach-me-please dept.

hausmasta writes "This tutorial shows how to configure a MySQL 5 cluster with three nodes: two storage nodes and one management node. This cluster is load-balanced by a high-availability load balancer that in fact has two nodes that use the Ultra Monkey package which provides heartbeat (for checking if the other node is still alive) and ldirectord (to split up the requests to the nodes of the MySQL cluster)."

25 of 127 comments (clear)

'Ultra Monkey'? by Anonymous Coward · 2006-04-23 03:21 · Score: 5, Insightful

One of the major problems with a number of useful open source projects is that their names are terrible for the corporate environment. This is a perfect example of such a project. I sure wouldn't want to pitch the use of this software to a CTO or CIO, even if it was perfect for our database cluster, just because it is called 'Ultra Monkey'. That's just not a professional name, and this product will likely not be taken seriously.
1. Re:'Ultra Monkey'? by kryten_nl · 2006-04-23 03:54 · Score: 4, Insightful
  
  So? Call it the "U.M. system".....
  
  Problem solved.
  
  --
  For the perfect anti-Unix, write an OS that thinks it knows what you're doing better than you do and let it be wrong.
2. Re:'Ultra Monkey'? by Conanymous+Award · 2006-04-23 03:56 · Score: 4, Funny
  
  But, imagine a Beowulf Cluster of Ultra Monkeys...
3. Re:'Ultra Monkey'? by Mr.Dippy · 2006-04-23 04:16 · Score: 2, Funny
  
  imagine a Beowulf Cluster of Ultra Monkeys running Breezy Badger with a Hooray Hedghog backup...
  
  --
  
  -Dipster
4. Re:'Ultra Monkey'? by Eunuchswear · 2006-04-23 04:29 · Score: 2, Insightful
  
  Insightful?
  
  What is all this shit whining about names of "open source" (bleurgh) projects?
  
  I am the fucking CTO. I couldn't give a shit what the name of the product is. I want to know if it works.
  
  "Professional name"?
  
  --
  Watch this Heartland Institute video
5. Re:'Ultra Monkey'? by Reality+Master+101 · 2006-04-23 04:44 · Score: 4, Insightful
  
  And let's not forget there are plenty of stupid names for succesful, commercial software. [snip bad examples]
  You seem to be confusing "literal names" with "good names". What makes a good name isn't that it literally means what it does, it's that it 1) sounds good, and 2) doesn't have negative connotations, and 3) isn't hard to say. It's a bonus if it even vaguely resembles what it does. Lotus Notes, Oracle and Excel all have vague connotations of what they do, but more importantly, there is nothing offensive about them, and they're easy to say and remember.
  Compare to some of the popular names in Open Source. GIMP. Gnu (Gah-nu, the absolutely STUPIDEST name and pronounciation ever invented, and that goes for all their programs with the 'g' prefix). Ultra Monkey is right in line with traditional stupid naming.
  
  --
  Sometimes it's best to just let stupid people be stupid.
6. Re:'Ultra Monkey'? by RookKilla · 2006-04-23 06:13 · Score: 3, Interesting
  
  I'm with you on that one. Well, i'm a CTO, and i don't care about the names, but sometimes the initiative starts "from the bottom", so to speak, and sometimes needs to be run by not-so-open-minded people who happen to write the check. Unless the person wants to start company-wide project of opening minds prior to initiating something like "Ultra Monkey", he/she has to do something about the name. In this very case, i'd call the software "UM", i guess, if i were working in a "good old corporation" and had to submit an official project proposal, and stick to "UM" in all latter documentation. The problem with projects like this is the fact that they are relatively unknown. While Google, RedHat and others are names you can find all over the WSJ and NYTimes, projects like this are not known to general public, including the gray-haired conservative guys in financial departments that are writing the checks. At least those a my thoughts.
It doesn't look easy by BadAnalogyGuy · 2006-04-23 03:23 · Score: 2, Funny

... and they just stare at you and ask incredulously, "You had to do WHAT with the seat?"
This is for the MySQL Cluster storage engine by Jamesday · 2006-04-23 03:26 · Score: 4, Informative

This is for the MySQL Cluster storage engine, describing how to set up load balancing across two MYSQL servers that serve as front ends to it.

If you want code to set up a MYSQL cluster (cluster of MYSQL servers NOT using the Cluster storage engine) you might do something like looking at the PHP code in MediaWiki, which does application-level load balancing for reads (not writes) and has so far been tested to 25,000 or so queries per second. Or you could use the same director tools and skip using them for writes, implementing failover for the master server getting the writes.
Re:sounds cool, but MAJOR drawbacks! by Jamesday · 2006-04-23 03:32 · Score: 4, Interesting

The sort of thing people use the MySQL Cluster storage engine for include session management and telecoms. That data tends to be of modest size, even for a company with many millions of customers. If you don't mind losing the data if there's a power failure longer than your UPS life you can use the Memory (Heap) storage engine instead of Cluster. You can often back that up wth triggers and reload in a startup script.

The RAM only bit will go away once MySQL 5.1 is released. It's currently in early beta. It doesn't mean you lose your data if there's a power failure; the data is backed up to disk regularly.
Don't forget - The MySQL Protocol is proprietary. by DAldredge · 2006-04-23 03:39 · Score: 4, Informative

The MySQL Protocol is proprietary.

The MySQL Protocol is part of the MySQL Database Management System. As such, it falls under the provisions of the GNU Public License (GPL). A copy of the GNU Public License is available on MySQL's web site, and in the product download.

Because this is a GPL protocol, any product which uses it to connect to a MySQL server, or to emulate a MySQL server, or to interpose between any client and server which uses the protocol, or for any similar purpose, is also bound by the GPL. Therefore if you use this description to write a program, you must release your program as GPL. Contact MySQL AB if you need clarification of these terms or if you need to ask about alternative arrangements.

http://dev.mysql.com/doc/internals/en/licensing-no tice.html
This isn't very elegant lb'ing MySQL cluster by layer3switch · 2006-04-23 03:45 · Score: 3, Interesting

No MySQL sessions are carried over from failed node, so all MySQL connections will be dropped and need to re-establish again. For sessionless Apache or other fast and short connections may make this very useful, but for application service with shared connection pool, a DNS RR with node check makes better choice rather than elaborate setup using lb nodes in front of MySQL cluster due to +2 lb nodes administration overhead. And also because of arp problem due to MAC on loopback broadcasting, the article forgets to mention;

net.ipv4.conf.all.arp_announce = 2
net.ipv4.conf.all.arp_ignore = 1

Should take care of all interface arp announce/ignore. Doesn't need to set net.ipv4.conf.eth0.arp_ seperately as the article provides. Or easier way to control it is to set ARP=no on /etc/sysconfig/network-scripts/ifcfg-eth0 and ifcfg-lo:x

Personally I rather have Active/Active lb nodes in front and carry over sessions from failed node to active node transparently by using shared memory space to replicate all session info. OpenMosix comes to mind.

--
"Don't let fools fool you. They are the clever ones."
Without wishing to start a flamewar... by m50d · 2006-04-23 03:50 · Score: 2, Interesting

Anyone have a guide for doing this kind of thing with postgresql?

--
I am trolling
1. Re:Without wishing to start a flamewar... by BurningDog · 2006-04-23 05:24 · Score: 2, Informative
  
  Right now there is no equivalent for Postgres. Best you can do is use Slony for creating read only replicas and then send any write queries to the master database. Heres a link (though slightly old) explaining the situation.
  
  http://brianray.chipy.org/postgreSQL/cluster_vs_re plication.html.
  
  Not much has changed since then. I do think there are some 3rd party commercial apps that handle master-master replication or even shared storage but thats about it.
Such capabilities are built in. by Anonymous Coward · 2006-04-23 04:00 · Score: 2, Informative

PostgreSQL supports these capabilities natively, without needing to resort to third-party packages. The best place to learn about setting it all up is from the PostgreSQL docs. Between the FAQ and the manuals, you should have more than enough to get you started.
Re:sounds cool, but MAJOR drawbacks! by kpharmer · 2006-04-23 04:22 · Score: 3, Informative

> The RAM only bit will go away once MySQL 5.1 is released. It's currently in early beta. It doesn't
> mean you lose your data if there's a power failure; the data is backed up to disk regularly.

Seems backwards, but this is probably because mysql bought a product and has had to figure out how to bolt persistence onto it. A much simpler approach is to go the direction of most commercial databases: provide fine-tuned memory controls that easily ensure that the most-used data is kept in memory.

In this scenario (with a database like db2 or oracle) you simply organize tables into tablespaces, dedicate however much memory you want to each tablespace. If you want a table to live in memory - just give it enough memory and it will. If a different table is 100 GB in size, then give it as much memory as you want and the database will manage the memory cache accordingly.

With a solution like the above you don't need completely separate products, but instead see clustering, in-memory databases, and reliable persistence as all just different aspects of a robust and mature database.
MyPostgres? by Doc+Ruby · 2006-04-23 04:51 · Score: 4, Interesting

Anyone have an equivalent HowTo for Postgres clusters?

And while I'm getting everything in life I casually ask for, where's the SW that automatically swaps out a MySQL install and replaces it with Postgres, including revising source code that calls/queries the DB, or just uses a MySQL installation as a proxy replica for Postgres nodes in a mixed cluster?

--
--
make install -not war
Re:sounds cool, but MAJOR drawbacks! by Jamesday · 2006-04-23 05:02 · Score: 3, Informative

Cluster was designed for the telecoms case where you need to authenticate your subscriber and verify that they are entitled to make a call now even if they haven't made a call for a month, doing it in the same time window as the other calls. Also to scale out to as many servers as it takes to handle either load or data volume while delivering that predictable real-time performance. In a non-telecoms case, one user is capturing in real time performance data from thousands of vehicles for analysis, exploiting the write rate scaling and reliability.

You can get some of that by allocating data space to RAM on one server but that doesn't get you all of Cluster's properties. Cheaper if you can use the approach you gave, of course, just doesn't do the job for some situations. Same for the Memory engine approach I outlined - doesn't get you all Cluster provides, but is enough for some situations.

Like the other MySQL storage engines, you don't use the Cluster engine if you don't need its properties. It's a pick the right engine for the job situation.
Monkey Management by daitengu · 2006-04-23 05:08 · Score: 3, Funny

I use a Monkey Management Technique for my mySQL servers. If one server goes down, I have a monkey that jumps up and down at the datacenter and gets real mad until someone fixes it.

.. Works like a charm!
NDB and RAM by moogoogaipan · 2006-04-23 05:51 · Score: 3, Informative

I heard that you need to have enough RAM on the SQL nodes to hold your entire database on each cluster machine so that NDB will work. I did not see this info in the front page. I think it's critical to know.
Re:Don't forget - The MySQL Protocol is proprietar by forrie · 2006-04-23 06:10 · Score: 2, Insightful

My understanding was that your obligation to release under GPL was based on your intended usage of said modifcation. For example, if I develop a hack to this, but intend only for internal/private use, then I'm under no obligation to hand this out publicly.

The only time I would be obligated to submit this under public GPL would be if I intended to redistribute it.

Is that not correct?
Re:Don't forget - The MySQL Protocol is proprietar by DAldredge · 2006-04-23 07:18 · Score: 2, Informative

The text I posted is a direct copy from the official MySQL site. If you click the underlined text it will take you to their site.

It isn't my fault that MySQL believe it.
Re:No they aren't (Re:Such capabilities are built by Anonymous Coward · 2006-04-23 07:40 · Score: 3, Informative

> The mysql clustering is pretty much bunk anyways. All of the DBs need to live in RAM. What happens if you have a catastrophic UPS failure?

It was designed for read-only telecom use, with real time writes from "upstream" propogated down. Basically, a cache. Local writes aren't super-critical in this case, because whatever causes a failure to write probably takes out the rest of the node anyway.

It's not really a general purpose SQL database, it just happens to let mysql provide some front-end services. Personally I consider it a drag on an otherwise decent distributed cache engine (mysql isn't exactly synonymous with bulletproof) but I suppose it does gain some exposure.

Postgres provides a MUCH more flexible infrastructure than MySQL for doing real clustering and replication ... unfortunately no one's seriously built up that infrastructure.
Re:Is that true? by jadavis · 2006-04-23 09:59 · Score: 2, Insightful

As usual, MySQL is intentionally being confusing about their licensing. You can't GPL a protocol, only the implementation of a protocol.

However, the protocol itself still every bit as proprietary a format as an MS Word document. It just happens that they license their implementation to you under the GPL. You can reverse engineer it, just like the MS word doc format, but MySQL AB can always make subtle changes to break it (just like MS can do to the word format).

--
Social scientists are inspired by theories; scientists are humbled by facts.
Re:No they aren't (Re:Such capabilities are built by Jamesday · 2006-04-23 10:29 · Score: 2, Informative

If you have a catastrophic UPS failure you wait for the power to come back up and reload from the disk copy. MySQL Cluster regularly saves the data to disk. Or you use two places that are on different power setups and UPS systems. Or you shut down while you stil have UPS power. Or you switch to your alternate data center and its copy of the cluster.