Slashdot Mirror


Experimenting w/ High Performance Computing and Multicasting?

jessemersonuy asks: "Multicasting plays an important role in the design, development, operation, and application of next generation networks that rely on the efficient delivery of packets to their destinations. Due to the advent of broadband, wireless and web-based system design technologies, it has become possible and feasible to design and construct large scale, heterogeneous and complex wireline and wireless communication networks that can support multimedia conferencing, streaming media distribution, distributed data sharing, distance learning, 'push'- oriented, and QoS for wired-cable and wired-wireless applications. Now, we have a small High Performance Computing system in our university campus and I would really like to use that system for testing multicasting applications. However, I do not know what would be the best way to use the cluster for multicasting purposes. Has anyone experimented with this before? What might be the best multicasting application to use to be able to fully utilize the power of the cluster?"

29 of 78 comments (clear)

  1. WTF? by Anonymous Coward · · Score: 4


    Jesus H. Christ in a bucket, let's deconstruct what this idiot "Asks Slashdot".

    >creamofsomeyoungguy asks:

    :-)

    > "Multicasting plays an important role in the design, development, operation,
    >and application of next generation networks that rely on the efficient
    >delivery of packets to their destinations.


    We want to multicast.

    > Due to the advent of broadband, wireless and web-based system design technologies,

    Because the internet exists,

    > it has become possible and feasible to design and construct large scale,
    >heterogeneous and complex wireline and wireless communication networks
    >that can support multimedia conferencing, streaming media distribution,
    >distributed data sharing, distance learning, 'push'- oriented, and QoS
    >for wired-cable and wired-wireless applications.


    we can hook different computers together and do lots of cool things.

    > Now, we have a small High Performance Computing system in our university campus
    >and I would really like to use that system for testing multicasting applications.


    I know of a computer or two I'd like to try multicasting on.

    > However, I do not know what would be the best way to use the cluster for
    >multicasting purposes. Has anyone experimented with this before?


    Anybody have any tips?

    > What might be the best multicasting application to use
    >to be able to fully utilize the power of the cluster?"


    Know of any multicast programs that will bring it to its knees?

    C'mon, people, speak English.

  2. Multicasting Media and other Good Stuff by Saono · · Score: 4

    I've spent a little time setting up and using multicast networks. I can't necessary say where a cluster would fit into the mix. I'll outline a couple of the uses we have found/use multicasting for, maybe there is something in here that you can get ideas to use your cluster for:

    1. Music/sound broadcast - using a MP3 streamer you can pump stations out on multicast addresses then using mp3 players that support multicast streams you can latch on and listen to the stream. For business purposes this might be to distribute recordings of company meetings or perhaps training information that does not need video.

    2. Video - The setup of stations to re-run important videos of events, etc. can be useful. In most cases these are just like TV statations - you join a multicast stream then you see whatever is on that stream at the curren time. Good for just about anything that you would like to rebroadcast to a large number of users using minimal bandwidth.

    3. Applications - Under windows there are a couple applications available that will distribute applications across a campus (or any multicast network) to remote users. Broadcast once, receive many. I haven't used this function much but for file distribution that happens on a schedule to the desktop or to multiple servers it could be useful. Not sure of what support is available under Linux for this currently.

    4. Video Conferencing - This is one of the applications that we have been experimenting using multicasting for. Being able to broadcast one video conference to say, 80 different sites but in a network only having to distribute a small number of streams is great stuff. An example of this is we have customers who have say 50 sites, all of the sites are connected via sattelite, if we streamed to all 50 sites it would consume a massive amount of bandwidth (say 768k x 60) - using multicasting we can cut that to one transmission that every site receives (only 768k). There are some other rules - like the returning data must follow a unicast path, etc. but overall the bandwidth used is still around 1 T1 instead of the 25+ it would normally take to do this type of streaming.

    Someone else stated that this is largely a router/switch thing. It really is, proper configuration of the multicast network at on the routers/switches in the network is crucial. If you are going to be distributing across a large network you need to choose the proper protocols for your needs (sparse or dense mode protocols) and setup the routing so everything flows across the links you desire. A misconfiguration of the equipment could mean that you end up distributing the same data across multiple links to the same site, not distributing the data at all or having it traverse the wrong link which could end up destroying network performance rather than enhancing it.

    -Alan

  3. Multicast: Forever the protocol of the future by bperkins · · Score: 2

    People have been talking about multicasting for a long time . I first heard about mbone in 1994 but I have yet to see it in widespread use.

    Multicasting made sense when you looked at the internet, and the amount of high bandwidth content was very limited, but the user base was growing very rapidly.However, content grew at a rate that was probably not forseen. In order for multicasting to make sense, you have to assume that many people downstream of the signal are watching the exact same content exactly in sync (presumably live content) in the same format.

    With a few notable exceptions (The Victoria Secret Fashion Shows come to mind) this situation doesn't come up that often. So although it's an attractive idea, neither the ISP's not the content providers have pushed very hard for it, so essentially nothing has happened.

    One possible use content that is provided by one person with relativly low bandwidth that can reach many people. This comes at a cost of convienience; you can't provide information "on demand," but only in pseudo-broadcast style. Additionally, ISPs generally don't approve of "little people" providing content to many many people. They want to reserve that ability for those who are willing to pay (a lot) for it.

    I'd be surprised if multicasting ever comes into very wide use; since the situations that it's useful under are limited, and the econmoic incentives to get it going don't seem that strong to me. OTOH, the trend in online advertising advertising has been to make it more like television, so maybe a shift is in the works.

  4. more interesting from a weak box by mr_burns · · Score: 2

    Multicastings real killer app is amplification per node and cacheing remotely on nodes. I'll bet $50 US that I can multicast as successfully if not better (depending on switch) with a pentium 75 from a home DSL line than you can with your cluster.

    The reason is that this is a throughput and amplification problem, not a computing one. Your cluster has to communicate through soda straws of bandwidth compared to the througput you can get on one mainboard. 1 Gbs is not as fast as 6 Gbs, which is slower than a G4's mainboard.

    Once the output stream leaves the box (this really is just I/O) you send it to a node which sends it to a bunch of nodes, then those nodes do the same ad infinitum. Kinda like in "Waynes World" when Wayne does the whole "telephone" speech, with nifty visuals.

    In short...using your cluster for this is a waste of computing power, and is hobbled by the connections between boxen.

    --
    "Let him go, Ralph. He knows what he's doing." --Otto Mann (simpsons)
  5. Why high performance? by ryanr · · Score: 3

    Multicasting doesn't really equate with needing high-end hardware. In fact, the requirements would be much lower for multicasting, as opposed to having to do the equivalent number of unicasts.

    Multicasting has more to do with switch and router configuration.

    1. Re:Why high performance? by john_many_jars · · Score: 3

      I don't see any real innovative work brought to either high performance computing (which has almost no real-time applications--ie, they run models that are not dependant on time of completion) or multi-casting (which is almost completely dominated by real-time applications--ie, the delivery of information must be done in a specific time frame or the information is useless) by bringing them together directly.

      As for performance v. bandwidth, that's easy enough.. there aren't memory buses faster than a Pentium 100 cpu (ie, the fetch-execute cycle is bottlenecked at fetch), let alone network connections that can compete with the pure computational power of a--gasp--486/50. I can have a 486 just send sequential numbers to <i>x</i> recipients and tie up a 100Mb switch in the closest wiring closet heading for a gateway.

      Perhaps there may be something said for making HPCs that communicate between processors using some multi-casting like technology. For instance, many problems involve performing calculations that SIMD pipes work wonderful for [ie SUM ( iFFT ( FFT ( A ) x FFT ( B ) ) ) over many distinct As and Bs]. It turns out, problems like this in parallel environments involve every node needing to know something from everyother node. This can be done in O(n log n) communications. However, use of multicasting might bring this down to O(n) communications where n is proportional to the number of separate nodes used. For interesting problems, this could improve the performance on a supercomputer by an order of magnitude (ie from 1 year of CPU time to 1 month of CPU time).

      Of course, this involves writing your own memory management system, redesigning boards, and other not-quite-so-simple tasks.

    2. Re:Why high performance? by EvlPenguin · · Score: 2

      Multicasting has more to do with switch and router configuration.

      True. Once you hit a certain MHz on the nodes, it loses all practicality because of the bandwidth bottlenecks. A good test would be to experiment with diffrent MHz machines to see where the performace maxes out the bandwidth. That way, you can have a better price/performance ratio.

      Or just use gigabit, but that can get (very, very) expensive.
      --

      --

      --
      #nohup cat /dev/dsp > /dev/hda & killall -9 getty
  6. How to get on mbone? by Pingo · · Score: 3

    I've also been interested in mbone (multicasting backbone) for a while since I got broadband access. It's seems simple enough by installing a special virtual router on my firewall (mrouted).

    However I have not been able to find any way to hook up my home router to the actual mbone network.

    Are there any official contacts for the mbone network that can assist in finding a connection point?

    //Pingo

    --
    --- Linux or FreeBSD, it's like blondes or brunettes. I like both. ---
  7. 2 sentence explanation of unicast vs. multicast by ddstreet · · Score: 2

    Here is a a explanation of unicast vs. multicast:

    Unicast means you (your computer) sends out N identical packets to N different (destination) IP addresses.

    Multicast means you (your computer) sends out 1 packet to 1 special multicast IP address, and downstream router(s) duplicate that 1 packet into N identical packets to the N different (real) IP addresses which the special multicast IP address corresponds to.

    Imagine the potential for DDoS attacks...;-)

  8. Re:Multicast != streaming media by mbyte · · Score: 2

    Uh .. there IS reliable multicast. There are quite some different approaches to it.

    One would be to to cache some of the data in the routers (or on computers near the routers), which can be retransmitted if someone down the leaf sends NACK's ...

    STFW .. you'll find some !

  9. Multicast Application(s) by Martin+S. · · Score: 2

    One application of multicast-IP is broadcasting Interactive Digital Television.

    - Multicast an MPEG video stream.

    - Multicast the EPG information.

    - Multicast interactive applications & data. Massively multi-player games, quizzes (International-Interactive "Who Wants to be a millionaire" ?)

    Unicast wouldmn't work because the load on the server would be a factor of the number of consumers, multicast-IP is much more efficient.

    It's possible for practically anybody to multi-cast 'Pirate' TV broadcasts, (i.e. "Eyes Only" from "Dark Angel") currently the limitation factor is not the technology, but the high bandwidth cost of content insertion, compared to the [low] numbers of people with broadband connections, to exploit the content.

    Soon, real soon :)

    The possibilities are awsome!

    "We control the horizontal; We control of the vertical; this is the Multicast-Zone"

  10. p.s. Re:Multicast Application(s) by Martin+S. · · Score: 2

    Check this link out for a cool example of this technology in action. http://www.kit.tv

  11. Multicast != streaming media by jmilne · · Score: 2

    Although most of your current multicast applications are for streaming audio and video, multicast can do SOOOOOOOO much more.

    Any application where you basically have one system sending identical information to a bunch of other systems at the same time is a potential multicast application. Right now, the common perception is for audio and video technologies. Makes sense...that's what broadcast radio and television is all about. And streaming media does work very well over multicast. But here's a few other apps that also could work well with multicast:

    Online games. Especially games which have hundreds, if not thousands of people interacting together. How do you really know where everyone is in a game? You don't, because most online games can't spare the bandwidth to tell you where everyone is. With multicast, a server would be sending a single update on the world to everyone playing. Big bandwidth savings. Also, think about the little guys making games. They might not be able to support the kind of games they want to make, because they can't afford the bandwidth necessary. Multicast saves on bandwidth. You could end up with a couple of guys in their basement hosting games to thousands of people.

    Pushing data to cache servers. Think about a company like Akamai, that's got hundreds of these web caching servers all over the country. Reuters comes out with a news flash. They could push out that flash to all these cache boxes out there with multicast. Every cache server gets it at the same time, so whomever hits that cache box sees the story when everyone else does. Also useful for...

    Stock market tickers. Everyone wants to see how their stocks are doing, and they want to see it in real-time. You don't want to give an advantage to someone just because they know information sooner than you do. That's where multicast comes in. Using 5 kbps or less of bandwidth, a brokerage company can update millions of people with the stock ticker.

    Sys admin. Ever want to deploy new software in your company, but dread having to install it on 500 workstations. You could multicast the files, without completely bringing the LAN to its knees, and it would only take as long as it does to transfer one file.

    Popular application updates. Linux kernel 2.6.0 gets released, and instantly FTP servers around the world are hit. They could multicast the software, using a couple of different bandwidth streams to hit modem and T3 users alike, and do it with a reliable multicast protocol that makes sure everyone gets the entire file without loss. Big bandwidth savings, and much happier users who won't have to wait several days for the initial burst of traffic to subside.

    Multicast can do a lot of things. Streaming media is just a small part of what multicast can do.

    1. Re:Multicast != streaming media by Invisible+Agent · · Score: 2

      I don't mean any offense, but it's exactly these sorts of misconceptions about multicast that get people inappropriately excited over multicast technologies (IMHO).

      Let's look at your categories:

      Online games: You're really going to transmit the moves and state changes of all game objects to my machine? Clearly this doesn't work (unless you believe every player has the combined processing power of all of the game's server-class machines with giant network pipes). Data doesn't shrink in size just because it's multicast. The reason that large multiplayer games work is because of big honking servers that can figure out the minimum set of data that my game client needs.

      Data to cache servers: why does multicast help here? Those cache servers are in geographically distinct regions, so you either believe that a multicast solution somehow spans all NOCs, or else you're back to the (appropriate) unicast solutions. This may be a solution if you needed to transmit the same data to multiple cache servers on the same network.

      Sotck market tickers: not a bad idea actually, but not a high-speed application (as the original poster specified).

      Sysadmin: yes, this is a good application of multicast, but as a previous poster said, it only works if you're switched network actually transmits multicast packets (again, the original poster gets to control this since he has a lab).

      Linux kernel updates to ftp sites: see "data to cache servers" above.

      I think that the misunderstanding here is one of locality -- if you have individual machines on disparate networks, multicasting buys you nothing over unicasting, except for the configuration headaches. Multicasting is cool when you have a lot of machines on the same network (or a few networks) all needing the same data.

      Invisible Agent

      --

      Invisible Agent
      This post is a mirror; when a monkey stares in, no hacker gazes out.
  12. Multicast doesn't care about high-end processors by jmilne · · Score: 3

    The whole point of multicast is that it requires very little in terms of resources. Having a high performance computing cluster isn't going to do much if you're trying to test out multicast.

    More likely, you'll want a sniffer, or else access to routers, either directly or through SNMP. Because what you're really going to want to find out is bandwidth utilization. That's where you're going to see gains going with multicast.

    You should head over to IETF's website and start looking at RFCs about multicast. RFCs are usually boring to read, but very insightful.

    A better question to ask the Slashdot crowd would be: "Which multicast protocol should I be using?". (FWIW, I recommend going with PIM sparse-mode. See also SSM.) In terms of apps, you haven't really told us much... Are you looking for UNIX, Mac, NT? Audio, video, text, file transfer? High bandwidth, low bandwidth, reliable transport? Without this information, the question posed is so open as to be unanswerable.

  13. Multicasting, bleah. by Animats · · Score: 2
    Me master, you slave. You listen to what I'm sending when I want to send it. That's multicasting. It works fine technically, but so far, nobody has found a killer app. When "push technology" went bust, interest in multicasting went with it. As a practical matter, distributed web caches seem to be doing a good job of handling high-popularity content.

    The basic problem is that the most popular content on the Internet has a share of maybe 0.1%, compared to, say, 20% for top-rated TV shows. Even if TV eventually moves to the Internet, it will probably be video on demand, not multicasting. If there were any content stream that could draw a sizable market share on the Internet, somebody would have found it during the dot-com boom. Nobody did.

  14. Re:look at the question... by SuiteSisterMary · · Score: 2

    You know what the whole thing sounds like? Like one of those questions that's written by a parent and given to a kid to ask into a microphone at a school assembly featuring a 'special guest speaker.' I can just hear this being read in the monotone of a little six year old kid.

    --
    Vintage computer games and RPG books available. Email me if you're interested.
  15. Parallel computing by wfaulk · · Score: 2

    You could try implementing some parallel computing test using some middle ware like Tibco. Tibco is expensive, but it meant for exactly this purpose. There may be other similar middleware applications out there, possibly even Free/Open ones.

    --

    Fuck 'im up, Tim! His views are invalid! -Pirate Corp$

  16. Re:WHAT? by mdouglas · · Score: 4

    furthermore, multicasting has fuck all to do with servers. multicasting is accomplished within a routing/switching infrastructure. the server sends out a SINGLE data stream to a SINGLE SPECIAL ip address; this SINGLE SPECIAL ip address actually identifies a GROUP of hosts. routers & switches know which UNIQUE hosts belong to said GROUP and selectively forward the data. the whole point of this is to reduce load on the server, and prevent multiple identical data streams on the network infrastructure.

    a few notes from cisco :

    http://www.cisco.com/univercd/cc/td/doc/cisintwk /i cs/cs011.htm

    "Multicast---Multicast applications send each packet to a multicast group address. Hosts that want to receive the packets indicate that they want to be members of the multicast group. This type of application expects that networks with hosts that have joined a multicast group will receive multicast packets. Multicast applications and underlying multicast protocols control multimedia traffic and shield hosts from having to process unnecessary broadcast traffic."

    " IP multicasting applications use Class D addresses to address packets. The high-order four bits of a Class D address are set to 1110, and the remaining 28 bits are set to a specific multicast group ID. Class D addresses are typically written as dotted-decimal numbers and are in the range of 224.0.0.0 through 239.255.255.255."

    " The Internet Group Management Protocol (IGMP) uses IP datagrams to allow IP multicast applications to join a multicast group. Membership in a multicast group is dynamic---that is, it changes over time as hosts join and leave the group.

    Multicast routers that run IGMP use IGMP host-query messages to keep track of the hosts that belong to multicast groups. These messages are sent to the all-systems group address 224.0.0.1. The hosts then send IGMP report messages listing the multicast groups they would like to join. When the router receives a packet addressed to a multicast group, it forwards the packet to those interfaces that have hosts that belong to that group. If you want to prevent hosts on a particular interface from participating in a multicast group, you can configure a filter on that interface by using the ip igmp access-group interface configuration command."

  17. The best test of your multicasting capabilities. by EvlPenguin · · Score: 3

    Install Napster across the network.
    --

    --

    --
    #nohup cat /dev/dsp > /dev/hda & killall -9 getty
  18. Real time stock market data by bulgroz0 · · Score: 2

    How about real time stock quotes? A few companies have multicast backbones for this sole purpose (e.g., Savvis). Requirement is then to never miss a beat from opening to closing of the market(s). You also end up have to deal with multiple streams that can be very bursty.

    --
    Frankly, it all depends.
  19. Re:NCS, an open source distributed shared memory by Decado · · Score: 2

    It wasn't modded down, but when you click on post anonymously it gets posted at 0 instead of 1

    --

    Slashdot: Proof that a million monkeys at a million typewriters can create a masterpiece

  20. Re:WHAT? by wmoyes · · Score: 2
    Explain exactly what the hell having a 24-port switch has to do with multicasting

    Absolutely nothing, and that's my point. What do 8 headless boxes have to do with multicasting? Nothing.

    You didn't check the technical specifications on the equipment they are using. They have 8 Athlon machines plugged into a Intel 410T Ethernet switch. This switch does not provide anything beyond level 2 switching (based on MAC address). Their setup is identical to what you would find in a small office or computer lab, save 7 of the machines are headless.

    Now lets look at this fellow's question:

    I would really like to use that system for testing multicasting applications. However, I do not know what would be the best way to use the cluster for multicasting purposes. Has anyone experimented with this before? What might be the best multicasting application to use to be able to fully utilize the power of the cluster?

    His question is similar to asking "I have a waffle iron and I want to look into the effects of microwave energy on the human eye". The tools have nothing to do with the application. If he had been familiar with the subject area he would have known that. He might be able to use it for load testing, but I am sure others have more important uses for it (like distributed/parallel processing).

  21. WHAT? by wmoyes · · Score: 3

    First I would not call 8 Athlon 650 machines a "High Performance Computing system", also if you check all the machines are plugged into a singe 24-port switch. Where does the multi-casting come into play? Your testbed was designed for distributed computing, not mult-casting. I have to assume you just started in your lab. Go talk to your professor and read up on the material before asking for Slashdot to help you out.

  22. You can't experiment if your switch doesn't do it. by dstanzi · · Score: 3

    There is plenty of room for making good use of
    multicasting in high performance computing.
    However, most clusters really don't use it
    because either A) The switches don't support
    it (other than 1-to-all) or B) TCP/IP doesn't
    support it.

    The message passing libraries probably being used on your cluster (either PVM or some variant of an MPI implementation probably have multicasting statements, but they are in actuality probably
    implemented as a sequence of normal one-one communications. Now if you wanted to implement
    your own simulated multicast in the cluster, where
    you used some kind of tree structure to forward
    messages to all the right nodes, have at it,
    but you'll probably have to modify your message
    passing library...
    Dan

  23. Useful by srichman · · Score: 4
    In order for multicasting to make sense, you have to assume that many people downstream of the signal are watching the exact same content exactly in sync (presumably live content) in the same format.

    As gets mentioned here from time to time, Digital Fountain addresses (or endeavors to address) the "recipients in sync" problem.

    I'd be surprised if multicasting ever comes into very wide use; since the situations that it's useful under are limited...

    Where I work, many of us kids like to listen to streaming radio broadcasts. We've been criticized for the strain we put on our Internet connection, and it's a valid point. Often several of us are listening to the same shoutcast stream (or whatever) at the same time, and it seems kinda silly that we consume N * 50-100kbps of bandwidth to receive the same content. But, hey, this is just a personal way in which multicast could help my life.

    When most people think of multicast they think of 1-to-many transmission. There are also lots of applications involving many-to-many transmission. Chat is an obvious one; chat becomes particularly well suited for multicast when you're dealing with voice chat rather than text. A more interest application is in networked virtual environments (less grandiloquently, games). A couple other fellows and I wrote the networking part of an NVE that used many-to-many multicast: the world was partitioned into octrees, and each octree was assigned a multicast group. Octree nodes split and merged based on traffic, and there were different levels of groups for messages of different levels of detail (e.g., toe movements vs. explosions). (Well, this was the plan; we didn't finish all of it, but it was a cool demo). Peer-to-peer NVEs have many advantages over client-server systems, including reduced (and hopefully optimally minimal) latency and natural scalability. This book provides an overview of the subject, but there are many papers out there that are more in-depth and informative.

    Finally, check out Kevin Almeroth's research in multicast applications. He has several good survey papers that address your synchronized play out gripe and explore the gamut of potential multicast applications.

  24. A number of applications come to mind by sacremon · · Score: 2
    1) Multicast lectures to students, particularly those on remote campuses. This can help with scheduling difficulties for the faculty and students.

    2) Multicast campus events that have a limited capacity at the site, like sporting events, concerts and plays.

    3) Multicast or unicast of a remote event to the campus, via point-to-point connection, which is then multicast to the students. Again, special lectures, sporting events and such could be covered.

    That or you could just multicast parties at the local frats and sororities for entertainment value.

    --
    If you can't beat them, embrace and extend them.
  25. Multicasting == streaming media by Invisible+Agent · · Score: 2

    What might be the best multicasting application to use to be able to fully utilize the power of the cluster?

    When you think about it, the applications of network multicasting (of content anyway) are pretty straightforward, and pretty much come down to 1-to-many streamed audio/visual content. Most of your ideas (distance learning, conferencing, etc.) assume that you have content to stream. As for distributed file share, etc., those all seem to me to be more applicable to unicast technologies.

    If you're interested in multicast technologies, there are some very interesting low bandwidth applications. Heartbeats for distributed system applications are a good example. See the Linux-HA (High Availability) project for an application of this: linux-ha.org.

    Invisible Agent

    --

    Invisible Agent
    This post is a mirror; when a monkey stares in, no hacker gazes out.
  26. Multicasting not stressful by 3ryon · · Score: 2

    Multicasting is designed to require very little hardware. I doubt that you'll be able to put much of a strain on your equipment at all. Now, Seti@home is a different story....