Slashdot Mirror


Behind the Scenes at Hotmail

mallumax writes "ACM Queue interviews Hotmail engineer Phil Smoot on how they manage more than 10,000 servers spread around the globe. Between them, they process billions of emails per day and are overseen by hundreds of administrators. To do that they have returned to the command line. From the article: 'Our operations group never wants to rely on any sort of user interface. Everything has to be scriptable and run from some sort of command line'. The overriding philosophy seems to be KISS. Also: tape backups are out and spam levels have stabilized."

20 of 292 comments (clear)

  1. KISS my hotmail body by digitaldc · · Score: 4, Funny

    The overriding philosophy seems to be KISS.

    Don't try to tell me that the guys at Hotmail only want to Rock & Roll all night and party every day?!?

    --
    He who knows best knows how little he knows. - Thomas Jefferson
    1. Re:KISS my hotmail body by DJ_Goldfingerz · · Score: 5, Funny

      I read you subject as "KISS my hotmale body".

  2. UNIX? by IAmTheDave · · Score: 4, Interesting

    If I recall correctly, wasn't Hotmail originally run on UNIX boxes?

    --
    Excuse my speling.
    Making The Bar Project
    1. Re:UNIX? by Kraegar · · Score: 5, Informative
      It used to be on FreeBSD w/Apache, now it runs on Windows w/IIS. It's not exchange based.

      Read about it

    2. Re:UNIX? by Amoeba · · Score: 5, Informative

      Yes. Hotmail was originally run on clusters of E3500 and E4500's running Solaris 2.5.1. After they got bought by Microsoft, a major initiative to migrate all boxes to Windows was undertaken in 2000. Hotmail has been 99.9% Windows for over 3 years now. The remaining 0.1% are some legacy solaris boxes used to handle backups for clusters... and even they are being phased out slowly.

      --Amoeba (who no longer works there)

      --
      Do not taunt Happy-Fun Ball
    3. Re:UNIX? by enantiodromia · · Score: 5, Informative

      This is only half true. The _front end_ runs on Windows with IIS. The _back end_, where the email data is stored (the User Stores), are Solaris. The front end machines dont mean much. If one or twenty go down, there are tons more to take their place. They are simply removed from the load balancing and marked as "admin plz fix this some day". The back end machines however, are super critical, as each user lives one one, and only one, user store. That machine goes down, and hundreds of thousands, to millions, of Hotmail users cant get to their mail. And thats why those machines run Solaris.

  3. Better subject... by pegr · · Score: 5, Funny

    "Between them, they process billions of emails per day and are overseen by hundreds of administrators."
     
    And how does the NSA process all that email? Now THAT would be an interesting technical challenge!

  4. Does anyone know... by ehaggis · · Score: 4, Funny

    What OS it runs on and which web server? I am not trying to be funny.

    --
    One ring to bind them - should probably have more fiber and less rings in their diet.
  5. Fairly Impressive by eldavojohn · · Score: 4, Interesting
    I don't know about everyone else but this article was shocking to me.

    Not only are the questions well picked but the some of the answers are quite interesting. For instance Phil on scalability:
    The problems are those of basic client-server programming--that is, figuring out the browser/http/server data-access patterns and optimizing the protocols, extending these protocols as new functionality is introduced, and ensuring that these protocols work across geo-distributed data centers when the speed of light becomes a factor. Designing applications with built-in redundancy so that they are resilient to abuse is also a challenge.
    Before reading this article, I always had hotmail pegged as a hacked together e-mail system less organized than a monkey sh*tfight but if Phil speaks the truth, I've underestimated them. They're a hacked togethor server mess with a lot of effort put into staying afloat--and they have been doing well for a long time.

    I guess I've always taken my free Hotmail account for granted.
    --
    My work here is dung.
  6. High level of QC! by Anonymous Coward · · Score: 5, Informative

    From the article:
    Hotmail relies on less than 100 system administrators to manage it all.

    From the summary:
    Between them, they process billions of emails per day and are overseen by hundreds of administrators.

    Brought to you by the high quality control here at /.

  7. Re:I wonder.... by daikokatana · · Score: 4, Funny
    ..who they call for support? :-)

    Ghostbusters?

    --
    http://jcsnippets.atspace.com/ - a collection of Java & C# snippets
  8. Interacting without any sort of user interface by Phat_Tony · · Score: 4, Funny
    "they have returned to the command line. From the article: 'Our operations group never wants to rely on any sort of user interface"

    I always thought that the command line was a user interface. You know, interfacing between a user and a computer.

    It's hard to picture using a computer without any sort of user interface. I'm pretty sure that, in order to call it "using" a computer, some sort of interface must exist, be it keyboard mouse and monitor, binary switch, light gun, real gun, neural link, telekinesis, or whatever. Otherwise, you're not using it, are you?

    On the other hand, maybe the article is correct- a lot of operations group probably don't want to use "any sort of user interface" to communicate with their computers. They want to be sitting on a beach in tahiti drinking daiquiris, thousands of miles away from the computers they're supposed to maintain.

    --
    Can anyone tell me how to set my sig on Slashdot?
  9. From the immortal words of Henry Spencer by hackstraw · · Score: 5, Insightful


    "Those who don't understand UNIX are doomed to reinvent it, poorly."

    From the article and elaborating on the /. summary (It has a print version that consolidates the 4 pages together if you want):

    Q: Are there scaling reasons to think about the benefits of a command line for managing over a GUI, or are there other things to think about?

    A: Our operations group never wants to rely on any sort of user interface. Everything has to be scriptable and run from some sort of command line. That's the only way you're going to be able to execute scripts and gather the results over thousands of machines.

    Also, we all remember the scaling issues that MS had when they took over hotmail and initially tried to switch from freebsd to Windows.

    MS had to port over cron jobs because its not something that is installed and used by default under windows like UNIX. They had to rewrite the "inefficient" perl code that ran fine on FreeBSD to C++. They had to redo the memory allocation to prevent memory leaks in the new C++ code. Read about it from the goat's mouth http://www.microsoft.com/technet/interopmigration/ case/hotmail/default.mspx.

    I can't wait until FreeBSD and other inferior OSes get tools to find memory leaks. One day....

    (That last line was sarcasm and not a flame).

  10. Re:stabilised... by the+chao+goes+mu · · Score: 4, Funny

    Stabilized at 100% of bandwidth.

    --
    Boys from the City. Not yet caught by the Whirlwind of Progress. Feed soda pop to the thirsty pigs.
  11. F**Kin Speak English ! by CmdrGravy · · Score: 4, Insightful

    Why does he keep mistaking the word "use" for the word "leverage" ? The only possible advantage I can see in substituting the word "leverage" is that it sort of implies they are making the best use of these tools that they can in which case you would think that most people would have already assumed they are not making the worst possible use they could of the tools and it's interesting that the author feels it necessary to make that distinction.

    1. Re:F**Kin Speak English ! by kotj.mf · · Score: 4, Funny
      Why does he keep mistaking the word "use" for the word "leverage"?

      Because his team leverages best-of-breed systems to utilize the synergistic effects of the paradigm shift in relationships among stakeholders and the knowledge infrastructure, silly.

      --
      hang brain.
  12. Hundreds of administrators by Iphtashu+Fitz · · Score: 4, Interesting

    more than 10,000 servers spread around the globe ... are overseen by hundreds of administrators.

    Heh. I used to work at Akamai which provides content delivery services for many of the biggest sites on the web. They have somewhere over 15,000 servers that are managed by tens of administrators, not hundreds. In fact, a typical NOCC (yes, 2 'C's for Akamai) shift at Akamai is only staffed by 8 or so people, with only a couple of senior level admins on call. And they're delivering all sorts of web-based content, including streaming, not just e-mail.
    But then Akamai runs them all on linux, whereas I belive Hotmail is all Windows based. You do the math.

    1. Re:Hundreds of administrators by Iphtashu+Fitz · · Score: 4, Insightful

      I think running a mail server is a bit more complicated than a webserver or a streaming server for video

      It sounds to me like you don't understand what it is that Akamai does. They're not just running web & streaming servers on their 15k machines. They're distributing content in real time in a way thtat vastly improves user access all around the world. You may have heard when Victorias Secret held their first video-streaming lingerie show. Well their servers couldn't handle the load because of all the people trying to watch it. They became an Akamai customer, and Akamai was able to redistribute their streams in real-time all over the globe. To be able to take video (or just web content) from a single source and distribute it quickly and efficiently to thousands of distributed users in real-time is a huge undertaking. Akamai has some very impressive technology to be able to do this.

      I'm not saying that running a mail service like Hotmail is a piece of cake, but I do think that what Akamai does is a lot more difficult and impressive when you think about it. If Akamai's distributed environment were to drop off the net then you probably wouldn't be able to access any of the on-line services of most of their customers. (And that's just a small subset of their customer base) The ability to keep websites like those of Microsoft, eBay, Fed Ex, Red Hat, etc. all highly responsive to end users is not a simple feat by any stretch of the imagination.

  13. Hotmail was next to my cage at by wsanders · · Score: 4, Funny

    AT first, it was BSD running on a bunch of identical custom-made sub-1U servers. But No! Then it was replaced by windows boxes . . . racks and racks of 99c Fry's keyboards velcroed to the backs and fronts of racks, with miles of small-gauge track, upon which ran diabolical steam-powered robots, each with a single arm and with fingers at the end, forever fixed at the precise spacing to stab the keyboards' CTRL-ALT-DEL keys. Noisily the robots rumbled back and forth on their appointed rounds . . .

    --
    Give a man a fish and you have fed him for today. Teach a man to fish, and he'll say "WHERE'S MY FISH, YOU IDIOT?"