Experimenting w/ High Performance Computing and Multicasting?
jessemersonuy asks: "Multicasting plays an important role in the design, development, operation, and application of next generation networks that rely on the efficient delivery of packets to their destinations. Due to the advent of broadband, wireless and web-based system design technologies, it has become possible and feasible to design and construct large scale, heterogeneous and complex wireline and wireless communication networks that can support
multimedia conferencing, streaming media distribution,
distributed data sharing, distance learning, 'push'- oriented, and QoS for wired-cable and wired-wireless applications. Now, we have a small High Performance Computing system in our university campus and I would really like to use that system for testing multicasting applications. However, I do not know what would be the best way to use the cluster for multicasting purposes. Has anyone experimented with this before? What might be the best multicasting application to use to be able to fully utilize the power of the cluster?"
Jesus H. Christ in a bucket, let's deconstruct what this idiot "Asks Slashdot".
>creamofsomeyoungguy asks:
> "Multicasting plays an important role in the design, development, operation,
>and application of next generation networks that rely on the efficient
>delivery of packets to their destinations.
We want to multicast.
> Due to the advent of broadband, wireless and web-based system design technologies,
Because the internet exists,
> it has become possible and feasible to design and construct large scale,
>heterogeneous and complex wireline and wireless communication networks
>that can support multimedia conferencing, streaming media distribution,
>distributed data sharing, distance learning, 'push'- oriented, and QoS
>for wired-cable and wired-wireless applications.
we can hook different computers together and do lots of cool things.
> Now, we have a small High Performance Computing system in our university campus
>and I would really like to use that system for testing multicasting applications.
I know of a computer or two I'd like to try multicasting on.
> However, I do not know what would be the best way to use the cluster for
>multicasting purposes. Has anyone experimented with this before?
Anybody have any tips?
> What might be the best multicasting application to use
>to be able to fully utilize the power of the cluster?"
Know of any multicast programs that will bring it to its knees?
C'mon, people, speak English.
I've spent a little time setting up and using multicast networks. I can't necessary say where a cluster would fit into the mix. I'll outline a couple of the uses we have found/use multicasting for, maybe there is something in here that you can get ideas to use your cluster for:
1. Music/sound broadcast - using a MP3 streamer you can pump stations out on multicast addresses then using mp3 players that support multicast streams you can latch on and listen to the stream. For business purposes this might be to distribute recordings of company meetings or perhaps training information that does not need video.
2. Video - The setup of stations to re-run important videos of events, etc. can be useful. In most cases these are just like TV statations - you join a multicast stream then you see whatever is on that stream at the curren time. Good for just about anything that you would like to rebroadcast to a large number of users using minimal bandwidth.
3. Applications - Under windows there are a couple applications available that will distribute applications across a campus (or any multicast network) to remote users. Broadcast once, receive many. I haven't used this function much but for file distribution that happens on a schedule to the desktop or to multiple servers it could be useful. Not sure of what support is available under Linux for this currently.
4. Video Conferencing - This is one of the applications that we have been experimenting using multicasting for. Being able to broadcast one video conference to say, 80 different sites but in a network only having to distribute a small number of streams is great stuff. An example of this is we have customers who have say 50 sites, all of the sites are connected via sattelite, if we streamed to all 50 sites it would consume a massive amount of bandwidth (say 768k x 60) - using multicasting we can cut that to one transmission that every site receives (only 768k). There are some other rules - like the returning data must follow a unicast path, etc. but overall the bandwidth used is still around 1 T1 instead of the 25+ it would normally take to do this type of streaming.
Someone else stated that this is largely a router/switch thing. It really is, proper configuration of the multicast network at on the routers/switches in the network is crucial. If you are going to be distributing across a large network you need to choose the proper protocols for your needs (sparse or dense mode protocols) and setup the routing so everything flows across the links you desire. A misconfiguration of the equipment could mean that you end up distributing the same data across multiple links to the same site, not distributing the data at all or having it traverse the wrong link which could end up destroying network performance rather than enhancing it.
-Alan
People have been talking about multicasting for a long time . I first heard about mbone in 1994 but I have yet to see it in widespread use.
Multicasting made sense when you looked at the internet, and the amount of high bandwidth content was very limited, but the user base was growing very rapidly.However, content grew at a rate that was probably not forseen. In order for multicasting to make sense, you have to assume that many people downstream of the signal are watching the exact same content exactly in sync (presumably live content) in the same format.
With a few notable exceptions (The Victoria Secret Fashion Shows come to mind) this situation doesn't come up that often. So although it's an attractive idea, neither the ISP's not the content providers have pushed very hard for it, so essentially nothing has happened.
One possible use content that is provided by one person with relativly low bandwidth that can reach many people. This comes at a cost of convienience; you can't provide information "on demand," but only in pseudo-broadcast style. Additionally, ISPs generally don't approve of "little people" providing content to many many people. They want to reserve that ability for those who are willing to pay (a lot) for it.
I'd be surprised if multicasting ever comes into very wide use; since the situations that it's useful under are limited, and the econmoic incentives to get it going don't seem that strong to me. OTOH, the trend in online advertising advertising has been to make it more like television, so maybe a shift is in the works.
Multicastings real killer app is amplification per node and cacheing remotely on nodes. I'll bet $50 US that I can multicast as successfully if not better (depending on switch) with a pentium 75 from a home DSL line than you can with your cluster.
The reason is that this is a throughput and amplification problem, not a computing one. Your cluster has to communicate through soda straws of bandwidth compared to the througput you can get on one mainboard. 1 Gbs is not as fast as 6 Gbs, which is slower than a G4's mainboard.
Once the output stream leaves the box (this really is just I/O) you send it to a node which sends it to a bunch of nodes, then those nodes do the same ad infinitum. Kinda like in "Waynes World" when Wayne does the whole "telephone" speech, with nifty visuals.
In short...using your cluster for this is a waste of computing power, and is hobbled by the connections between boxen.
"Let him go, Ralph. He knows what he's doing." --Otto Mann (simpsons)
Multicasting doesn't really equate with needing high-end hardware. In fact, the requirements would be much lower for multicasting, as opposed to having to do the equivalent number of unicasts.
Multicasting has more to do with switch and router configuration.
I've also been interested in mbone (multicasting backbone) for a while since I got broadband access. It's seems simple enough by installing a special virtual router on my firewall (mrouted).
However I have not been able to find any way to hook up my home router to the actual mbone network.
Are there any official contacts for the mbone network that can assist in finding a connection point?
//Pingo
--- Linux or FreeBSD, it's like blondes or brunettes. I like both. ---
Here is a a explanation of unicast vs. multicast:
Unicast means you (your computer) sends out N identical packets to N different (destination) IP addresses.
Multicast means you (your computer) sends out 1 packet to 1 special multicast IP address, and downstream router(s) duplicate that 1 packet into N identical packets to the N different (real) IP addresses which the special multicast IP address corresponds to.
Imagine the potential for DDoS attacks...;-)
Uh .. there IS reliable multicast. There are quite some different approaches to it.
...
.. you'll find some !
One would be to to cache some of the data in the routers (or on computers near the routers), which can be retransmitted if someone down the leaf sends NACK's
STFW
One application of multicast-IP is broadcasting Interactive Digital Television.
- Multicast an MPEG video stream.
- Multicast the EPG information.
- Multicast interactive applications & data. Massively multi-player games, quizzes (International-Interactive "Who Wants to be a millionaire" ?)
Unicast wouldmn't work because the load on the server would be a factor of the number of consumers, multicast-IP is much more efficient.
It's possible for practically anybody to multi-cast 'Pirate' TV broadcasts, (i.e. "Eyes Only" from "Dark Angel") currently the limitation factor is not the technology, but the high bandwidth cost of content insertion, compared to the [low] numbers of people with broadband connections, to exploit the content.
Soon, real soon :)
The possibilities are awsome!
"We control the horizontal; We control of the vertical; this is the Multicast-Zone"
Check this link out for a cool example of this technology in action. http://www.kit.tv
Although most of your current multicast applications are for streaming audio and video, multicast can do SOOOOOOOO much more.
Any application where you basically have one system sending identical information to a bunch of other systems at the same time is a potential multicast application. Right now, the common perception is for audio and video technologies. Makes sense...that's what broadcast radio and television is all about. And streaming media does work very well over multicast. But here's a few other apps that also could work well with multicast:
Online games. Especially games which have hundreds, if not thousands of people interacting together. How do you really know where everyone is in a game? You don't, because most online games can't spare the bandwidth to tell you where everyone is. With multicast, a server would be sending a single update on the world to everyone playing. Big bandwidth savings. Also, think about the little guys making games. They might not be able to support the kind of games they want to make, because they can't afford the bandwidth necessary. Multicast saves on bandwidth. You could end up with a couple of guys in their basement hosting games to thousands of people.
Pushing data to cache servers. Think about a company like Akamai, that's got hundreds of these web caching servers all over the country. Reuters comes out with a news flash. They could push out that flash to all these cache boxes out there with multicast. Every cache server gets it at the same time, so whomever hits that cache box sees the story when everyone else does. Also useful for...
Stock market tickers. Everyone wants to see how their stocks are doing, and they want to see it in real-time. You don't want to give an advantage to someone just because they know information sooner than you do. That's where multicast comes in. Using 5 kbps or less of bandwidth, a brokerage company can update millions of people with the stock ticker.
Sys admin. Ever want to deploy new software in your company, but dread having to install it on 500 workstations. You could multicast the files, without completely bringing the LAN to its knees, and it would only take as long as it does to transfer one file.
Popular application updates. Linux kernel 2.6.0 gets released, and instantly FTP servers around the world are hit. They could multicast the software, using a couple of different bandwidth streams to hit modem and T3 users alike, and do it with a reliable multicast protocol that makes sure everyone gets the entire file without loss. Big bandwidth savings, and much happier users who won't have to wait several days for the initial burst of traffic to subside.
Multicast can do a lot of things. Streaming media is just a small part of what multicast can do.
The whole point of multicast is that it requires very little in terms of resources. Having a high performance computing cluster isn't going to do much if you're trying to test out multicast.
More likely, you'll want a sniffer, or else access to routers, either directly or through SNMP. Because what you're really going to want to find out is bandwidth utilization. That's where you're going to see gains going with multicast.
You should head over to IETF's website and start looking at RFCs about multicast. RFCs are usually boring to read, but very insightful.
A better question to ask the Slashdot crowd would be: "Which multicast protocol should I be using?". (FWIW, I recommend going with PIM sparse-mode. See also SSM.) In terms of apps, you haven't really told us much... Are you looking for UNIX, Mac, NT? Audio, video, text, file transfer? High bandwidth, low bandwidth, reliable transport? Without this information, the question posed is so open as to be unanswerable.
The basic problem is that the most popular content on the Internet has a share of maybe 0.1%, compared to, say, 20% for top-rated TV shows. Even if TV eventually moves to the Internet, it will probably be video on demand, not multicasting. If there were any content stream that could draw a sizable market share on the Internet, somebody would have found it during the dot-com boom. Nobody did.
You know what the whole thing sounds like? Like one of those questions that's written by a parent and given to a kid to ask into a microphone at a school assembly featuring a 'special guest speaker.' I can just hear this being read in the monotone of a little six year old kid.
Vintage computer games and RPG books available. Email me if you're interested.
You could try implementing some parallel computing test using some middle ware like Tibco. Tibco is expensive, but it meant for exactly this purpose. There may be other similar middleware applications out there, possibly even Free/Open ones.
Fuck 'im up, Tim! His views are invalid! -Pirate Corp$
furthermore, multicasting has fuck all to do with servers. multicasting is accomplished within a routing/switching infrastructure. the server sends out a SINGLE data stream to a SINGLE SPECIAL ip address; this SINGLE SPECIAL ip address actually identifies a GROUP of hosts. routers & switches know which UNIQUE hosts belong to said GROUP and selectively forward the data. the whole point of this is to reduce load on the server, and prevent multiple identical data streams on the network infrastructure.
k /i cs/cs011.htm
a few notes from cisco :
http://www.cisco.com/univercd/cc/td/doc/cisintw
"Multicast---Multicast applications send each packet to a multicast group address. Hosts that want to receive the packets indicate that they want to be members of the multicast group. This type of application expects that networks with hosts that have joined a multicast group will receive multicast packets. Multicast applications and underlying multicast protocols control multimedia traffic and shield hosts from having to process unnecessary broadcast traffic."
" IP multicasting applications use Class D addresses to address packets. The high-order four bits of a Class D address are set to 1110, and the remaining 28 bits are set to a specific multicast group ID. Class D addresses are typically written as dotted-decimal numbers and are in the range of 224.0.0.0 through 239.255.255.255."
" The Internet Group Management Protocol (IGMP) uses IP datagrams to allow IP multicast applications to join a multicast group. Membership in a multicast group is dynamic---that is, it changes over time as hosts join and leave the group.
Multicast routers that run IGMP use IGMP host-query messages to keep track of the hosts that belong to multicast groups. These messages are sent to the all-systems group address 224.0.0.1. The hosts then send IGMP report messages listing the multicast groups they would like to join. When the router receives a packet addressed to a multicast group, it forwards the packet to those interfaces that have hosts that belong to that group. If you want to prevent hosts on a particular interface from participating in a multicast group, you can configure a filter on that interface by using the ip igmp access-group interface configuration command."
Install Napster across the network.
--
--
#nohup cat
How about real time stock quotes? A few companies have multicast backbones for this sole purpose (e.g., Savvis). Requirement is then to never miss a beat from opening to closing of the market(s). You also end up have to deal with multiple streams that can be very bursty.
Frankly, it all depends.
It wasn't modded down, but when you click on post anonymously it gets posted at 0 instead of 1
Slashdot: Proof that a million monkeys at a million typewriters can create a masterpiece
Absolutely nothing, and that's my point. What do 8 headless boxes have to do with multicasting? Nothing.
You didn't check the technical specifications on the equipment they are using. They have 8 Athlon machines plugged into a Intel 410T Ethernet switch. This switch does not provide anything beyond level 2 switching (based on MAC address). Their setup is identical to what you would find in a small office or computer lab, save 7 of the machines are headless.
Now lets look at this fellow's question:
I would really like to use that system for testing multicasting applications. However, I do not know what would be the best way to use the cluster for multicasting purposes. Has anyone experimented with this before? What might be the best multicasting application to use to be able to fully utilize the power of the cluster?
His question is similar to asking "I have a waffle iron and I want to look into the effects of microwave energy on the human eye". The tools have nothing to do with the application. If he had been familiar with the subject area he would have known that. He might be able to use it for load testing, but I am sure others have more important uses for it (like distributed/parallel processing).
First I would not call 8 Athlon 650 machines a "High Performance Computing system", also if you check all the machines are plugged into a singe 24-port switch. Where does the multi-casting come into play? Your testbed was designed for distributed computing, not mult-casting. I have to assume you just started in your lab. Go talk to your professor and read up on the material before asking for Slashdot to help you out.
There is plenty of room for making good use of
multicasting in high performance computing.
However, most clusters really don't use it
because either A) The switches don't support
it (other than 1-to-all) or B) TCP/IP doesn't
support it.
The message passing libraries probably being used on your cluster (either PVM or some variant of an MPI implementation probably have multicasting statements, but they are in actuality probably
implemented as a sequence of normal one-one communications. Now if you wanted to implement
your own simulated multicast in the cluster, where
you used some kind of tree structure to forward
messages to all the right nodes, have at it,
but you'll probably have to modify your message
passing library...
Dan
As gets mentioned here from time to time, Digital Fountain addresses (or endeavors to address) the "recipients in sync" problem.
I'd be surprised if multicasting ever comes into very wide use; since the situations that it's useful under are limited...
Where I work, many of us kids like to listen to streaming radio broadcasts. We've been criticized for the strain we put on our Internet connection, and it's a valid point. Often several of us are listening to the same shoutcast stream (or whatever) at the same time, and it seems kinda silly that we consume N * 50-100kbps of bandwidth to receive the same content. But, hey, this is just a personal way in which multicast could help my life.
When most people think of multicast they think of 1-to-many transmission. There are also lots of applications involving many-to-many transmission. Chat is an obvious one; chat becomes particularly well suited for multicast when you're dealing with voice chat rather than text. A more interest application is in networked virtual environments (less grandiloquently, games). A couple other fellows and I wrote the networking part of an NVE that used many-to-many multicast: the world was partitioned into octrees, and each octree was assigned a multicast group. Octree nodes split and merged based on traffic, and there were different levels of groups for messages of different levels of detail (e.g., toe movements vs. explosions). (Well, this was the plan; we didn't finish all of it, but it was a cool demo). Peer-to-peer NVEs have many advantages over client-server systems, including reduced (and hopefully optimally minimal) latency and natural scalability. This book provides an overview of the subject, but there are many papers out there that are more in-depth and informative.
Finally, check out Kevin Almeroth's research in multicast applications. He has several good survey papers that address your synchronized play out gripe and explore the gamut of potential multicast applications.
2) Multicast campus events that have a limited capacity at the site, like sporting events, concerts and plays.
3) Multicast or unicast of a remote event to the campus, via point-to-point connection, which is then multicast to the students. Again, special lectures, sporting events and such could be covered.
That or you could just multicast parties at the local frats and sororities for entertainment value.
If you can't beat them, embrace and extend them.
What might be the best multicasting application to use to be able to fully utilize the power of the cluster?
When you think about it, the applications of network multicasting (of content anyway) are pretty straightforward, and pretty much come down to 1-to-many streamed audio/visual content. Most of your ideas (distance learning, conferencing, etc.) assume that you have content to stream. As for distributed file share, etc., those all seem to me to be more applicable to unicast technologies.
If you're interested in multicast technologies, there are some very interesting low bandwidth applications. Heartbeats for distributed system applications are a good example. See the Linux-HA (High Availability) project for an application of this: linux-ha.org.
Invisible Agent
Invisible Agent
This post is a mirror; when a monkey stares in, no hacker gazes out.
Multicasting is designed to require very little hardware. I doubt that you'll be able to put much of a strain on your equipment at all. Now, Seti@home is a different story....
Kind thoughts do not change the world