Slashdot Mirror


Email Offline At the Home of Sendmail

BobJacobsen writes "The UC Berkeley email system has been either offline, or only providing limited access, for more than a week. How can the place where sendmail originated fall so far? The campus CIO gave an internal seminar (video, slides) where he discussed the incident, the response, and some of the history. Briefly, the growth of email clients was going to overwhelm the system eventually, but the crisis was advanced when a disk failure required a restart after some time offline. Not discussed is the long series of failures to identify and implement the replacement system (1, 2, 3, 4). Like the New York City Dept. of Education problem discussed yesterday, this is a failure of planning and management being discussed as a problem with (inflexible) technology. How can IT people solve things like this?"

179 comments

  1. Nothing to do with Sendmail by bobstreo · · Score: 3, Insightful

    It's the backend. When you have too many connections on too few servers, with not enough storage
    you usually see this kinda issue.

    1. Re:Nothing to do with Sendmail by grcumb · · Score: 2

      It's the backend. When you have too many connections on too few servers, with not enough storage you usually see this kinda issue.

      I see it as yet another failure for the client/single server model.

      It surprises me that people are still investing so much time and effort on centralisation of services when obviously the most practical technical[*] answer is the opposite. Simple, common protocols and decentralised infrastructure are the most robust model for overall survival of a communications system. DARPA proved that some time ago, but we seem intent on forgetting as much of that lesson as possible.

      ----------------
      [*] Okay, I don't want to be disingenuous about this. The reasons for centralisation are financial and organisational. It's more costly to spread IT capacity through the breadth of an organisation, and it's hell on wheels in administrative terms. But past a certain point, you would think that IT would finally earn the right to have some input into the discussion about how best to manage an organisation's information. Unfortunately, IT managers are not always the best ones to advocate for a different approach because they're the ones who've made their mark by proving (or pretending) they could manage these big, ugly 'enterprise' systems.

      --
      Crumb's Corollary: Never bring a knife to a bun fight.
    2. Re:Nothing to do with Sendmail by vlm · · Score: 4, Funny

      It's the backend. When you have too many connections on too few servers, with not enough storage
      you usually see this kinda issue.

      Knowing the speed and flexibility of university upgrade policies, and knowing sendmail was born around 4.1BSD, and knowing the -BSDs were VAX only until 4.2 or 4.3 or so in the 80s, I'm guessing they're still using the original VAX it was developed on?

      --
      "Science flies us to the moon. Religion flies us into buildings." - Victor Stenger
    3. Re:Nothing to do with Sendmail by Anonymous Coward · · Score: 0

      Congratulations on reading the summary.

      No-one was trying to blame Sendmail.

    4. Re:Nothing to do with Sendmail by tramp · · Score: 1

      You are right that it is not sendmail causing the problem but it is quite sad that UC Berkeley has fallen to such mismanagement of their IT resources. Somehow you expect such a institute to do better then this.

    5. Re:Nothing to do with Sendmail by TheRaven64 · · Score: 3, Informative

      Uh, email is decentralised. Anyone can set up a node in the network just by pointing an MX record at a machine. The problem in this case is too many people using the same node. You'll note that while UCB was having problems, email continued to work fine for everyone else unless they had unrelated problems.

      --
      I am TheRaven on Soylent News
    6. Re:Nothing to do with Sendmail by CAIMLAS · · Score: 2

      Many educational institutions lag behind because they're an ever-evolving door. Even when they've got dedicated and experienced IT staff, most of it's just in a managerial role for the student work studies (it saves money, of course).

      --
      ~/ssh slashdot.org ssh: connect to host slashdot.org port 22: too many beers
    7. Re:Nothing to do with Sendmail by dgatwood · · Score: 1

      We have a very different notion of the word decentralized. A decentralized system is one in which the traffic for a given endpoint has no single point of failure other than the endpoint. The user's computer is an endpoint. The UCB mail server is not an endpoint; it is a node in the connection graph. Therefore, email is not decentralized. As soon as any organization has a central server, it is not decentralized.

      The problem is that email is designed around trusted hosts instead of trusted users, and at this point, those trusted hosts have gotten way too large. Worse, they aren't all trusted anymore. What we really need is something more akin to a P2P email infrastructure, in which each user is responsible for storing his or her own email, and in which each user must use a valid public-private key pair to sign each message (or encrypt, if desired).

      Such a design would not only eliminate these single points of failure, but also make spam much, much harder because it could always be traced back to either the original sender or to a specific user whose compromised computer relayed the message. Either way, cutting off a source of spam becomes trivially automatable at that point.

      --

      Check out my sci-fi/humor trilogy at PatriotsBooks.

    8. Re:Nothing to do with Sendmail by RogerWilco · · Score: 1

      Well, and the managers get more prestige if they manage a larger organisation.

      I totally agree with your points though, if things are decentralized and each department has it's own servers, then they can even be each other's fall back option.

      A single point of failure isn't always cheaper.

      --
      RogerWilco the Adventurous Janitor
    9. Re:Nothing to do with Sendmail by Anonymous Coward · · Score: 0

      Not to mention that they are run Exim and not Sendmail at all. They could not even call Eric if it were a "Sendmail" problem.

  2. It isn't an I.T. problem by Colin+Smith · · Score: 2, Insightful

    It's an economic one. It needs an economic solution.

    e.g.
    Have people buy a $10 ticket to get an account on the email server.

    --
    Deleted
    1. Re:It isn't an I.T. problem by StikyPad · · Score: 5, Insightful

      Pretty sure that's what tuition is.

    2. Re:It isn't an I.T. problem by Anonymous Coward · · Score: 4, Funny

      no I'm pretty sure tuition is more than $10

    3. Re:It isn't an I.T. problem by Jim+Hall · · Score: 1

      Pretty sure that's what tuition is.

      Tehnically, that's actually covered by a student fee, usually a "technology fee" in most universities. So yes, this cost should already be built into the cost of attending university. Whether that fee is enough to cover everything, including email, I'll leave to Berkeley.

  3. So the ultimate solution will be outsourcing by HotNeedleOfInquiry · · Score: 1

    I am depressed.

    --
    "Eve of Destruction", it's not just for old hippies anymore...
    1. Re:So the ultimate solution will be outsourcing by betterunixthanunix · · Score: 1

      Indeed, that is exactly what happened at my alma mater. First they blamed the Squirrelmail front end, then they bought a black box solution from Mirapoint, and when the Mirapoint solution proved too expensive they just went with Google.

      --
      Palm trees and 8
    2. Re:So the ultimate solution will be outsourcing by Anonymous Coward · · Score: 2, Funny

      Wow, Squirrelmail. So at least they managed to migrate from pine at some point.

    3. Re:So the ultimate solution will be outsourcing by lucm · · Score: 2

      Outsourcing would work, because when there is another failure they will have another party to blame instead of pointing fingers to a decision made in Spring 2011 (even as a total stranger I could feel the bitterness under that bullet point in the slides).

      --
      lucm, indeed.
    4. Re:So the ultimate solution will be outsourcing by twisted_pare · · Score: 2

      This is really quite common. It happened at my alma mater as well. Servers could not handle the POP requests, so they started blacklisting students that checked their mail more than four times an hour. A month later a RAID drive failed and email for 17k people (including a hospital) was completely offline for 3 days. It is sad that seemingly anyone can be a high paid "IT Professional" these days, but without a clue about HA.

      --
      HTFU
    5. Re:So the ultimate solution will be outsourcing by lucifuge31337 · · Score: 4, Insightful

      One can have all the clue in the world, yet be powerless to prevent failures if not funded to purchase the appropriate equipment.

      --
      Do not fold, spindle or mutilate.
    6. Re:So the ultimate solution will be outsourcing by AK+Marc · · Score: 2

      HA isn't an IT problem. It's a business issue. I've never seen a business who put a dollar amount on their downtime. I know they are out there, but every "real" place I've worked has never quantified their costs. How do you justify having a cold spare if it's a "waste" of money planning for an outage that is "free" (after all, if downtime were a problem, then someone would calculate the cost of downtime). IT did what they should, gave the users the best they could within their budget. That the budget was too small is partly the fault of the IT manager, but only partly.

  4. downtime by masternerdguy · · Score: 0, Troll

    Oh no, a service had downtime. Surely this is the end of the world and only the greatest sinners of the IT world ever have to bring something down for maintenance.

    --
    To offset political mods, replace Flamebait with Insightful.
    1. Re:downtime by Anonymous Coward · · Score: 1

      Well, "bringing it down for a week to handle a problem with a failed disk" is somewhat indicative of lack-of-clue.

    2. Re:downtime by gVibe · · Score: 0

      Well, "bringing it down for a week to handle a problem with a failed disk" is somewhat indicative of lack-of-clue.

      Ha! Lack-Of-Clue sounds like a Whack-a-Mole game ... /me ponders!

      --
      Keywords for the NSA overthrow oppressive regime true believers marathon Manhatten the financial district blueprints I
  5. I can tell you how by Billly+Gates · · Score: 0

    By hiring more cost accountants and requiring special and complicated business case studies with a thorough financial analysis on even the most mundane upgrade on how it will raise the companys stock price. Just ask any visionary MBA? Always buy cheap consumer grade stuff and view talent as unneccesary expenses. Do that and you will never have problems. What could go wrong?

  6. Telnet by qualityassurancedept · · Score: 2

    When I started college in 1991 I was amazed by the telnet access I had to the email account given to me by the University. I hadn't had an email address prior to that. Now I have an email addresses through hotmail, gmail and yahoo that I use for different things and facebook also gives me an email address. So, I doubt students really need email addresses provided by the university anymore. As for the NYC Dept of Ed example, I think it just shows that trying to build IT competence into a government agency basically a waste of money because the institutional culture of government. In short, all of these kinds of organizations could just offer email through gmail/google business or any number of other providers that will scale up almost infinitely.

    --
    if your life is such a big joke then why should I care?
    1. Re:Telnet by slimjim8094 · · Score: 5, Insightful

      Students need school email addresses because that way all students have an email address.

      At my school, students are expected to check their university email at least once every 24 hours. Many people forward it to a personal account, and obviously most people check it more frequently than that, but if the university issues an account to everyone, then there can be no debate about how they didn't get the email. The school takes responsibility for the email system (and any failures), and then professors can be assured that if they send an email out to the class, it will be (or should have been) read, leaving the onus on the student to actually do it. It's similar to why we provide computer labs - that way, each student unequivocally has a way to do electronic assignments, even if nearly everyone has their own machine.

      --
      I have developed a truly marvelous proof of this comment, which this signature is too narrow to contain.
    2. Re:Telnet by betterunixthanunix · · Score: 1, Interesting

      facebook also gives me an email address

      When did this start happening? Does it actually interoperate with other email services?

      --
      Palm trees and 8
    3. Re:Telnet by mrmeval · · Score: 1

      It's safer if the school has no access to your email so they should just drop that service except for those students who provably don't have the money for it.

      I'd never use their servers the same as I won't use my ISP's. I use ones I have some insulation from and if I need more privacy than that I use encryption and if it's life or liberty threatening a drop off to a hollow tree in some random park. ;)

      --
      I'd go on a Vegan diet but the delivery time from Vega is too long. --brownkitty
    4. Re:Telnet by stephanruby · · Score: 1

      Yes, but for most students, I'll bet that their official school email address is still their primary email address for all the important stuff.

    5. Re:Telnet by toomanyhandles · · Score: 1

      I like Tinker, Tailor, Soldier, Spy also. The Alec Guinness one. Although I have hopes for the new movie given those involved.

    6. Re:Telnet by Mr.+Freeman · · Score: 3, Informative

      Having a .edu address gains you a lot of credibility when communicating with people outside the university. They are quite valuable. You can often get very quick responses to questions that most companies won't even respond to if they came from a name@gmail.com or name@yahoo.com.

      Also, email is used for a lot of very important stuff like sending reports, design files, etc. Having someone on campus that can fix problems is quite valuable. Your campus email will never be "accidentally" seized, locked out, etc. like people have experienced with google and yahoo. Because the campus maintains backups (or at least, they should), you data will never be suddenly gone with no chance for recovery like people have experienced with google and yahoo.

      --
      -1 disagree is not a modifier for a reason. -1 troll, flaimbait, redundant, overrated are NOT acceptable substitutes.
    7. Re:Telnet by Anonymous Coward · · Score: 0

      So, I doubt students really need email addresses provided by the university anymore.

      I totally agree. I have my .edu forwarded to my gmail account and in 5 years of college I used my .edu only twice, both for getting deals online (Dropbox is one, forgot the other).

    8. Re:Telnet by mr100percent · · Score: 2

      Actually, most schools require an Official school email address. This guarantees the uptime from the faculty's point of view; you can't claim you never got the assignment or that you turned it in on time and nothing was there. It's also important for them from a liability standpoint; my Registrar will not send me any bills unless it's to my .edu account, and professors are instructed to ignore any student emails from any other domain. They're also organized by real name, so the school has a working internal directory and doesn't have to bother with LDAP.

    9. Re:Telnet by icebraining · · Score: 1

      From news articles, it started a year ago, and it does interoperate with other email services (you get a [username]@facebook.com) but it doesn't let the sender choose the Subject line, add CCs, forward, etc.

    10. Re:Telnet by betterunixthanunix · · Score: 1

      What I really care about is being able to communicate with Facebook users without having to actually open a Facebook account. Thank you for the information, I will look into this in more detail.

      --
      Palm trees and 8
    11. Re:Telnet by Nimey · · Score: 1

      They also need school email addresses because Hotmail et al are liable to mark a university as a spammer if enough luser students decide to click the "mark as spam" button for university-sent emails. Happened to my university a few times before It Was Decreed that all students are required to have a Google Apps for Domains email account resolving to our domain.

      --
      Hail Eris, full of mischief...

      E pluribus sanguinem
    12. Re:Telnet by QuantumRiff · · Score: 3, Interesting

      When I was admining at a small college, we DID NOT provide email for students, only for staff. We ran a listServe (sympa) and if the students gave us their personal email address, and checked a box, they would be added to a mail list for every class automatically..

      Any student that didn't have an email would be sent to the library, where they would be shown how to sign up for a hotmail, yahoo, or gmail account.

      We had students thank us, since they have gone to other schools, and though it was silly to have to check yet another account, when they already had 3 or 4.

      The ONLY reason colleges give out emails is because they have been doing it since before email was a common thing. There is no actual reason for it.. (but I have heard some neighboring colleges give very, very very good sounding arguments on why they needed to drop a few hundred grant on a SAN and exchange)

      --

      What are we going to do tonight Brain?
    13. Re:Telnet by Jonner · · Score: 1

      If the university can't competently provide its own IT infrastructure, why should they be expected to provide anything else competently? Perhaps it's just time to privatize the whole thing. I can't wait for GoogleU and Inteliversity.

    14. Re:Telnet by Chaos+Incarnate · · Score: 2

      Given my university's propensity to send irrelevant e-mails to all students, students marking some e-mail from your university as spam was probably quite legitimate.

      --
      Benford's Corollary to Clarke's Law: "Any technology distinguishable from magic is insufficiently advanced."
    15. Re:Telnet by SpaghettiPattern · · Score: 1

      The school takes responsibility for the email system (and any failures), and then professors can be assured that if they send an email out to the class, it will be (or should have been) read, leaving the onus on the student to actually do it.

      That is peculiar coming from a scientific institution. Email offers no guarantee of reception. I grant that an extremely high amount of mails do arrive well. But if I were to avoid distracting discussions, I'd communicate with more than just email as a medium and I wouldn't rely on it being high available.

      --

      I hadn't the slightest objection to his spending his time planning massacres for the bourgeoisie... (P.G. Wodehouse)
    16. Re:Telnet by KingAlanI · · Score: 1

      lusers reporting as spam because they don't want to bother unsubscribing is a problem. but can one unsubscribe from those emails without missing out on the important/relevant emails? that's what I wondered about as I ran into the same problem

      --
      I listen to both RIAA and non-RIAA stuff if I like the music, tangential business/politics nonwithstanding.
    17. Re:Telnet by tibit · · Score: 1

      I've had this problem at a big ten school. They have a mailing list where the disclaimer at the bottom of every email is that if you unsubscribe (even though they won't let you anyway), you may miss some important stuff (I'm paraphrasing). I've archived all of those email from almost a decade, and one afternoon went through them all. All of six messages out of almost 3000 were relevant to me, and most of them would be irrelevant to 90%+ of students they were reaching. Yet there was no way of unsubscribing: I tried. They'd sternly rebuke my request, saying that "I might miss on critical messages". Fuck them, idiots, that's about all that I get to say.

      --
      A successful API design takes a mixture of software design and pedagogy.
    18. Re:Telnet by Nimey · · Score: 1

      Heh, yeah, that's legit. We send a bunch of crap that most of the *staff* don't care about, let alone the students, bulk emails about Spanish Club, bake sales, and so on. From two different email systems, yet.

      --
      Hail Eris, full of mischief...

      E pluribus sanguinem
    19. Re:Telnet by tibit · · Score: 1

      Um, because the "internal directory" can't be implement using LDAP?! What's wrong with LDAP anyway? Even OpenLDAP seems to scale decently and I'd hardly consider it a "bother".

      --
      A successful API design takes a mixture of software design and pedagogy.
    20. Re:Telnet by http · · Score: 1

      You know, I could have sworn slimjim8094 had just outlined a different reason why colleges give out emails. You may consider it insufficient justification, but I think it is actually a factor that must be considered. The ITU stats on internet penetration have yet to reveal a 100% mark anywhere in the world.
      Telling someone they must sign up for hotmail to further their education is the epitome of unprofessional. I feel your pain that you were ordered and paid to do it.

      --
      If opportunity came disguised as temptation, one knock would be enough.
      3^2 * 67^1 * 977^1
    21. Re:Telnet by slimjim8094 · · Score: 1

      But that's the point. If the mail system is functioning properly, they get the email. If it's not, it's not the students' problem. Since it's all internal, there's no real room for debate. The IT department does actually provide a service where they check the logs to make sure your email did actually get delivered to its recipient. It comes in handy if a student says they never were told about a test, and the professor can give the timestamp of the email he sent out and they verify receipt, or a student had an email exchange with a professor regarding grades or something which months later the professor denies.

      Such a service would be impossible if people had their own addresses because the whole chain can't be traced. Like I said, we do allow forwarding, but the onus is then on the student to make sure the forwarded address can receive all email sent to it. A copy is still placed in the inbox, and they're still advised to check it in case there was a problem. In practice, most people don't, but then there's still no excuse. This is why important emails are advised to be sent from a school account to a school account, and it's why I don't forward. If I didn't get an email, it's because there was a fault in the system - and nothing else.

      As a student, it's been great. Occasionally professors forget email conversations, and have since deleted them. Having IT verify that you did actually send the email you're re-sending them is a good thing.
      As a TA, it's been great. We occasionally email announcements with important information; invariably, they don't get read. But when the complaints come, we can show demonstrably that it was only because they hadn't actually read the email before deleting it. Saves a lot of strife if they get a 0 for something they didn't check their email about, since there's no fallback to "but but the email was broken".

      It's not adversarial, just good sense. People lie or forget all the time - but that's no way to run (or take) a class.

      --
      I have developed a truly marvelous proof of this comment, which this signature is too narrow to contain.
  7. lol by luftrofl · · Score: 0

    I know /. is a a little slow usually, but it's a little silly to see this article pop up now as full service has essentially been restored (just now getting back mail client access, while webmail was working for the past few days).

  8. Funding by scapermoya · · Score: 1

    Maybe it has something to with the fact that the state of california has cannibalized the funding for my beloved alma mater.

    --
    Beware the Jubjub bird, and shun the frumious Bandersnatch.
    1. Re:Funding by DesScorp · · Score: 3, Insightful

      Maybe it has something to with the fact that the state of california has cannibalized the funding for my beloved alma mater.

      They wouldn't have to if they didn't have too many colleges (they do), and try to send too many kids to college (they do), many of whom may have no business being in college (they don't). Tax revenue is not an infinite resource. But California seems to have a community college on every two dirt roads, and several 4 year (or higher) colleges in a similar area.

      --
      Life is hard, and the world is cruel
    2. Re:Funding by scapermoya · · Score: 2

      I agree that the overall system is probably too large, but we are talking specifically about the flagship university of the UC system. arguably the best public university in the world, and it is getting hurt just as bad as UC Riverside. that's absurd and embarrassing.

      --
      Beware the Jubjub bird, and shun the frumious Bandersnatch.
    3. Re:Funding by toomanyhandles · · Score: 1

      And lets not forget... they have the highest population count of anyplace in the USA also. Want to build a few more uni's in the Dakotas?

    4. Re:Funding by Anonymous Coward · · Score: 0

      California has a tech industry because of all those colleges. It's the fifth largest economy in the world because of that industry. But don't worry, the right wing dream of dismantling all of it forever is rapidly becoming a reality all so Grover Norquist can keep paying taxes at a lower rate than you or me.

  9. Improper capacity planning by mysidia · · Score: 3, Informative

    Briefly, the growth of email clients was going to overwhelm the system eventually, but the crisis was advanced when a disk failure required a restart after some time offline.

    Capacity planning is supposed to account for reduced capacity due to component failures, system outages, and temporary demand spikes due to restart events.

    1. Re:Improper capacity planning by Bacon+Bits · · Score: 2

      In my experience this type of "planning failure" is caused when IT repeatedly tells management they need money to maintain and upgrade systems, and management consistently says no because they don't have the money for it. Not enough money or people to configure, install, support, and maintain any new systems because the budget won't allow any more. Yet somehow there always seems to be money for shiny new iPads and iPhones for the executives.

      --
      The road to tyranny has always been paved with claims of necessity.
    2. Re:Improper capacity planning by Anonymous Coward · · Score: 1

      At the place where my girlfriend works, she has a mailbox max of 20MiB. One or two emails with an attachment and its full. She asked why that was and their IT person said that is all they had room for on the server's disk for her mailbox because of all the other crap on that machine (website, internal files, etc.). This is the same company that bought brand new desktop computers, laptops (to work from home, which hasn't left the bag in and has stayed in the closet since she first got it) and tablets (for travel to meetings and the like) for literally everyone that works there because "that is what all the other companies are doing as productivity solutions."

    3. Re:Improper capacity planning by KingAlanI · · Score: 1

      miniscule message storage limits like that are ridiculous anyway

      --
      I listen to both RIAA and non-RIAA stuff if I like the music, tangential business/politics nonwithstanding.
    4. Re:Improper capacity planning by tibit · · Score: 1

      Well, since the IT folks must have extra capacity (in terms of storage, network, processing, etc) saved for times when there's server or disk downtime, it's fairly obvious that service must be cut/rationed until such extra capacity is restored. If the bigwigs will be getting their email delivered timely only 8 hours per day, the money will come. I think that the problem is that the IT does not apply a cut-off point. They stretch until stuff break. The correct way of doing it is: you don't stretch anything, you run your systems with extra capacity assigned to planned-for failure modes. An extra employee comes and his/her mailbox would go over your quota? You say you need a hard drive or two, etc. Yes, just for that one user. No, there's no resources left elsewhere.

      --
      A successful API design takes a mixture of software design and pedagogy.
  10. Well... by Anonymous Coward · · Score: 0

    It's called sendmail.

    Not sendmailnomatterwhat

  11. unions by Joe_Dragon · · Score: 1

    http://slashdot.org/comments.pl?sid=2556922&cid=38249652

    IT should have unions so they are not the fail guy for management mess up's / lack of funds and or planing.

  12. Scalability of Algorithms by Anonymous Coward · · Score: 0

    There was some Silicon Valley ISP whose name unfortunately escapes me just now, that had the "problem" that its service had grown so popular that the time required to search for a mailbox in /var/spool/mail was greater than the time duration between incoming mails. The result was that their system worked great right up until a certain critical threshhold, then all of a sudden most of their users' mail started to bounce.

    Their solution was to place user mail spools in their home directories rather than all in one directory, that being /var/spool/mail. Because the home directories weren't all in the same parent directory - that is, not all in /home - rather than a linear search, finding the right spool became a much quicker tree search.

    If you have a large number of users, even if you have only one filesystem for home directories, you can speed access to individual user files by placing, say, my "mike" home directory in /home/m/mi/mike, rather than just /home/mike.

    1. Re:Scalability of Algorithms by KiloByte · · Score: 1

      That's only applicable if you're using FAT, sysvfs, ancient versions of NTFS or ext2 (before dir_index) and others of that age. Any modern filesystem can handle millions or more entries in a directory without going into the O(n^2) hole you're speaking of.

      --
      The creatures outside looked from Alt-Right to Antifa; but already it was impossible to say which was which.
    2. Re:Scalability of Algorithms by thogard · · Score: 1

      The MAIL environment variable has been around since before the early 1980s and IDA sendmail had patches to look up the mailbox per user using dbm files in the late 1980s. I can't see this every being a problem with a mail server but I have seen it on usenet servers that don't expire some groups.

  13. How IT people can solve this problem... by damn_registrars · · Score: 1

    IT people need to move into management at a more useful rate. Instead most of the people who ultimately make the financial decisions for IT centers around the world have little grounding in IT and hence limited understanding of what is actually important beyond the bottom line.

    Of course, this requires IT people who are willing to put their foot down. We don't seem to have many of those...

    --
    Damn_registrars has no butt-hole. Damn_registrars has no use for a butt-hole.
    1. Re:How IT people can solve this problem... by quetwo · · Score: 1

      Of course, the smart IT people are often not allowed into management -- they are too useful at their current level (keeping the systems running, turning the screws, etc), and would be a pain to be replaced. So they promote people who are easy to replace into management. For a smart IT person, usually the only way to get ahead is to move sideways, not up. Go somewhere else, or do something else... If you are good at what you do, there is little incentive for people to move you up.

    2. Re:How IT people can solve this problem... by Ziest · · Score: 1

      Unfortunately, those smart IT people who have spent years in the trenches and understand, in detail, how to build a robust and resistant infrastructure are often overruled by the a CFO who's only qualification is they have an MBA. In many tech companies the group that handles the infrastructure (DNS, email, backups, etc.) reports to the CFO not the CTO. Why is this? After 25 years in the computer field I still have not heard a rational explanation for this idiocy.

      --
      Another day closer to redwood heaven
    3. Re:How IT people can solve this problem... by Ritchie70 · · Score: 2

      History.

      Computers originally came into companies to do accounting and related work.

      --
      The preferred solution is to not have a problem.
    4. Re:How IT people can solve this problem... by KingAlanI · · Score: 1

      excellent example of the Peter Principle

      --
      I listen to both RIAA and non-RIAA stuff if I like the music, tangential business/politics nonwithstanding.
    5. Re:How IT people can solve this problem... by tibit · · Score: 1

      It's a rational explanation allright, but still the status quo is idiotic :)

      --
      A successful API design takes a mixture of software design and pedagogy.
  14. IT is not the Problem by arthurpaliden · · Score: 4, Insightful

    IT goes to management and says "based on current usage/loadings etc the system will fail in 6 months to prevent it we need to do this....." Management says "Really, that's not what the sales man told me and its his equipment so he should know".

    1. Re:IT is not the Problem by iggymanz · · Score: 3, Informative

      no way, I work at a Value Added Reseller of hardware and the good sales guy would definitely use your fears to sell you some expandable solution

    2. Re:IT is not the Problem by Anonymous Coward · · Score: 0

      The solution: Get better management.

    3. Re:IT is not the Problem by Anonymous Coward · · Score: 1

      Right now, sure. They'll say it lasts for, eg, 10 years. But then management remembers that, and if after only 5 years IT says they need to upgrade in the next couple months, that's where the "but the sales guy said" excuse comes in.

    4. Re:IT is not the Problem by chrisj_0 · · Score: 1

      1) don't let management talk to sales guys. 2) after 5 years tell them that got fired 6 months after he sold us

    5. Re:IT is not the Problem by iggymanz · · Score: 1

      If proper expandable solution for the present situation, projected growth and strategic plan were purchased, later unexpected purchases would only be necessitated by something unforeseen, whether growth or changes to business methods or direction. Management just needs to be made aware. If they think most businesses future IT needs can be seen absolutely accurately 5 or 10 years later, they are fools. For example, in 2004, would certain major U.S. banks know that the Blackberry (or similar) and server infrastructure for it would be a core IT tool in their business five to ten years later?

  15. No. by damn_registrars · · Score: 5, Insightful

    Now I have an email addresses through hotmail, gmail and yahoo that I use for different things and facebook also gives me an email address. So, I doubt students really need email addresses provided by the university anymore.

    You are quite wrong. Email addresses - especially .edu addresses - are still quite valuable. At lot of academic resources that take registration via email won't allow registration to go to a throwaway account (a la hotmail, gmail, yahoo, etc). Many organizations that are interested in real information on users insist that users use an actual unique account and not a freebie. And when you're in college and making very little money a lot of those things can be important.

    I think it just shows that trying to build IT competence into a government agency basically a waste of money because the institutional culture of government

    You're not very accurate on that, either. Government organizations need to be able to keep track of their email - especially internal communications - which they would not be able to do if they outsourced email and other telecom.

    In short, all of these kinds of organizations could just offer email through gmail/google business or any number of other providers that will scale up almost infinitely.

    With the various privacy breeches that have occurred, that would be a terrible idea. And on top of that, IT is a lot more than just email. Do you want the government to turn to comcast for networking support while their at it? What if the IRS web servers go down on tax day? Do you want them to have to lean on an outside company to get it back up?

    --
    Damn_registrars has no butt-hole. Damn_registrars has no use for a butt-hole.
    1. Re:No. by qualityassurancedept · · Score: 2

      you can get an email address through google with any domain name you want... so, the company I work for runs its email from google but we all still have email addresses that say mrbigshot@seriousbusiness.com so I don't think the point about the .edu ending of the address is really valid.

      --
      if your life is such a big joke then why should I care?
    2. Re:No. by Anonymous Coward · · Score: 0

      ...but you won't get anyone to delegate a .edu record to you unless you're actually an educational institution.

    3. Re:No. by Anonymous Coward · · Score: 0

      you can get an email address through google with any domain name you want... so, the company I work for runs its email from google but we all still have email addresses that say mrbigshot@seriousbusiness.com so I don't think the point about the .edu ending of the address is really valid.

      Some people think that Google shouldn't have access to their private life. Of course they're criminals and their opinion is wrong - if they weren't criminals, what would be the problem, right? We all know Google will never abuse this info, neither for profit nor to please a three letter government agency (who would have access to the emails anyway since SMTP is not encrypted and very few people encrypt their messages).

    4. Re:No. by Anonymous Coward · · Score: 0

      It would be better to have your email run by the regents of the university of california? How much pushback do you think they give the three-lettered agents?

    5. Re:No. by matty619 · · Score: 1

      In short, all of these kinds of organizations could just offer email through gmail/google business or any number of other providers that will scale up almost infinitely.

      With the various privacy breeches that have occurred, that would be a terrible idea. And on top of that, IT is a lot more than just email. Do you want the government to turn to comcast for networking support while their at it? What if the IRS web servers go down on tax day? Do you want them to have to lean on an outside company to get it back up?

      If you watched TFV...that is exactly what they're going to do. They're either going to go with Microsoft's 365, or Google's Gmail. They're just working out the contracts.

    6. Re:No. by AK+Marc · · Score: 1

      But a real educational institution can point their .edu DNS to Google.

    7. Re:No. by Anonymous Coward · · Score: 0

      You're not very accurate on that, either. Government organizations need to be able to keep track of their email - especially internal communications - which they would not be able to do if they outsourced email and other telecom.

      My son is a student at SIU Carbondale. The siu.edu email system is gmail. They farmed it out to google (apps). You might want to read more at the google apps for education website.

    8. Re:No. by nine-times · · Score: 1

      I think it just shows that trying to build IT competence into a government agency basically a waste of money because the institutional culture of government

      You're not very accurate on that, either. Government organizations need to be able to keep track of their email - especially internal communications - which they would not be able to do if they outsourced email and other telecom.

      I'd also like to note that the poster seems to be trying to turn this into evidence that the government is inefficient, since they don't outsource their email, but a lot of business don't outsource their email either. I help people outsource their email for a living, and I don't necessarily recommend it. There are advantages, and there are disadvantages.

    9. Re:No. by sciencewhiz · · Score: 1

      The accounts you describe aren't freebies as described by GP. The domain costs money. Assuming your business has more then 10 users, you pay google also.

    10. Re:No. by qualityassurancedept · · Score: 1

      Yes, we pay google. I doubt it would be cheaper to run an exchange server.

      --
      if your life is such a big joke then why should I care?
  16. Re:Hate Being First .... by Jeremiah+Cornelius · · Score: 0

    It's SO HARD to point your MX record to a working host!

    And then, to populate an IMAP from your directory? Don't get me started...

    --
    "Flyin' in just a sweet place,
    Never been known to fail..."
  17. outsourcing? by bcrowell · · Score: 1

    At the school where I teach, whenever there's a discussion of how much it costs us to run our own email, someone suggests outsourcing (e.g., to gmail), and then someone else says, "No, we can't do that because of privacy laws." Am I right in guessing that privacy laws don't in fact prevent outsourcing to google? I suspect the argument is basically a way for IT folks to have job security. There are certainly laws that say, e.g., that we can't give students' grades to third parties. But it's hard to believe that letting google keyword-index emails and serve ads based on the keywords would violate these laws. (Whether google creeps you out is a different issue -- a moral/political one, not a legal one. It may also be an issue, but it's not an issue that can automatically end the discussion the way the legal issue can.) Does anyone know of any colleges or universities that do outsource to google or someone else?

    1. Re:outsourcing? by Tridus · · Score: 1

      That depends on where you are. Privacy laws in Canada most certainly do restrict Government agencies (and probably educational ones) from using Gmail because it's American and the Patriot Act is a rather severe problem that can't be mitigated.

      --
      -- "So they told me that using the download page to download something was not something they anticipated." - Bill Gates
    2. Re:outsourcing? by betterunixthanunix · · Score: 3, Insightful

      Since both my alma mater and my current institution have migrated to Google, and both are covered by FERPA and other privacy laws, I am inclined to say that that argument is bogus. However, I have a separate issue with outsourcing student email: third parties get to set the rules for student conduct without any action by the university itself.

      Typically universities have acceptable computer policies and at those institutions that run their own mail servers, such policies usually govern email. Students and faculty can demand changes to university policy if the policy does not properly align with the academic mission of the institution. Students and faculty have essentially no power over the terms of use that Google or Microsoft or any other third party email service imposes on them. It is easy to say, "Well, it is not like Google is going to demand something outrageous!" but there is really nothing preventing Google from doing so (if you do not think they have done so already). Google does not have the best interests of academia in mind when it sets its policies, nor is there any reason for Google to care about academic needs.

      --
      Palm trees and 8
    3. Re:outsourcing? by Anonymous Coward · · Score: 0

      ...nor is there any reason for Google to care about academic needs.

      Really? If Google screwed it's academic clients, wouldn't that make it harder for Google to:

      1) hire from those academic pools?
      2) convince academics to use their other products (ie click AdWords ads)?

    4. Re:outsourcing? by GIL_Dude · · Score: 2

      Tons of schools use Google as their email provider. Here's a quote from a Time article from 2009:

      Google now manages e-mail for more than 2,000 colleges and universities, enabling students to transform accounts capped at 100 mb into Google-managed inboxes that allow for 70 times as much mail. Microsoft also provides free Web-based mail for thousands of schools, including colleges in 86 countries.

      Here's the article: http://www.time.com/time/business/article/0,8599,1915112,00.html. Now, a specific school? Sure, my daughter and I just toured California State Sonoma and they use Google services.

    5. Re:outsourcing? by Mr.+Freeman · · Score: 1

      That's a very difficult question. You need to sit down with your company's legal department and figure it out.

      The answer depends on:
      1) Where you are and thus what laws are applicable to you.
      2) Who you are. Healthcare, university, private company? If you are a university, are you a public university? If so, there may be additional laws and regulations.
      3) What's being emailed. Patient records, classified documents?

      What's acceptable for people in similar situations may not be acceptable for you. I go to a university and Google handles our email, but again, this doesn't mean it's OK for you guys to do it. Set up a meeting with your legal department and they'll figure it out. Meeting with legal also ensures that you're not going to get thrown under the bus (at least not as quickly) if you do something and later get hit with a lawsuit.

      --
      -1 disagree is not a modifier for a reason. -1 troll, flaimbait, redundant, overrated are NOT acceptable substitutes.
    6. Re:outsourcing? by swillden · · Score: 1

      nor is there any reason for Google to care about academic needs.

      Sure there is: If they don't meet the need, they'll lose the customers to someone/something else who does.

      --
      Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
    7. Re:outsourcing? by msobkow · · Score: 1

      Can you provide an example of a policy that would get the students and educators up in arms?

      Can you provide an example where the terms of access to outsourced services are not set by the organization who set the terms in their contract rather than blindly accepting whatever the outsourcing company tabled as a "standard" contract?

      I thought not. More FUD. "Please panic, people, because they might possibly maybe perhaps breach their contract and do something we don't like."

      Pfffftttttt.

      --
      I do not fail; I succeed at finding out what does not work.
    8. Re:outsourcing? by betterunixthanunix · · Score: 1

      The first thing that comes to my mind is the requirement that nobody try to learn how the services are implemented (usually stated as "reverse engineering"). On principle, schools should have a problem with policies that forbid students from attempting to learn how their email software is implemented, although in practice schools generally have no problem with such agreements (perhaps they simply assume that students will never try to reverse engineer anything). Both Google apps for education and Microsoft Live@edu have these conditions.

      Microsoft Live@edu also explicitly forbids accessing the service by an automated method, which means that students are required to use the interface as provided by Microsoft -- even caching Microsoft's web pages is apparently grounds for account termination (I assume this is meant to exclude the browser cache, but that is not made clear). Microsoft also requires schools to maintain a particular academic structure -- differentiating between students, alumni, faculty, and staff, which while common is not necessarily universal and may not align properly with a school's philosophy on education.

      These may be negotiable, but I have yet to see any indication of that. Microsoft and Google both appear to require each end user at a customer institution to agree to a particular set of terms of service. Can you enlighten me by providing examples of institutions for which this is not the case?

      --
      Palm trees and 8
    9. Re:outsourcing? by msobkow · · Score: 1

      I'm just going with past experience on outsourcing details. In every case where I've seen outsourcing done, the terms of uptime guarantees, access guarantees, priveleges, security -- all were negotiable items with give and take by both the outsourcing provider and the service purchaser.

      If Google and Microsoft aren't allowing customers to set terms on critical things like system access and the validity of content, then companies are being very foolish to contract with them at all. Your provider should never be allowed to dictate terms you disagree with, even if you only disagree in principle.

      But I can't see any provider of software of services explicitly allowing users to reverse engineer their software. I think that's a perfectly reasonable stipulation for a service that is based on proprietary software. If you wanted to "know how it works", you should have badgered your school to sign up with a provider that uses open source software instead.

      --
      I do not fail; I succeed at finding out what does not work.
    10. Re:outsourcing? by captbob2002 · · Score: 1

      At our place Google and MS-hosted got scuttled as alternatives b/c they would not commit to keeping the data store withing the United States - this is a big issue with our gov't funded research which demands that the data stay within the USA. Fine by me, I think Google is pretty evil these days.

    11. Re:outsourcing? by Cimexus · · Score: 1

      It does breach privacy laws in many (most?) developed countries. I'm a part-time lecturer at a major Australian university and I can tell you, we are not allowed to communicate with students, send them material, or receive submitted assignments from Gmail/Hotmail/etc. accounts, because it could contain personally identifying information, which under the Privacy Act, cannot be stored on servers located outside of Australia. This same reasoning would also prevent the university outsourcing their email system to someone like Google, UNLESS they can guarantee that the information would remain only their servers/clusters in Australia.

      For US universities it's probably not an issue. But most countries have considerably tougher privacy laws than the US. Europe particularly.

    12. Re:outsourcing? by stevelinton · · Score: 1

      We decided we couldn't outsource staff email to google or anyone like that, but student email was fine. We do now outsource staff email, but it's on dedicated servers (which I think we legally own).

    13. Re:outsourcing? by Anonymous Coward · · Score: 0

      You should stick with RIM and their outstanding history of uptime :-P

  18. Solution: Join the Google Collective by Dachannien · · Score: 1

    Seriously, is Berkeley like the only college campus that hasn't outsourced their e-mail to Google yet?

    1. Re:Solution: Join the Google Collective by mactard · · Score: 1

      UCLA and UC Irvine do their email in-house as well.. I'm sure it's just a University of California thing. So many smart people here, but so many dumb decisions are made every day.

    2. Re:Solution: Join the Google Collective by 93+Escort+Wagon · · Score: 1

      The University of Washington has outsourced its student email as of a couple years ago; but for faculty and staff the old "deskmail" servers are still available with no announced EOL (yet) - although many do use Gmail too, since it offers an order of magnitude more storage.

      Technically UW offers both Gmail and the cloud Windows mail. It used to be just Google Mail and Calendaring, but before he left UW President Emmert dictated the university needed to have Exchange available to everyone - probably coincidental with some paid external gig from Redmond. Yes, a university president was setting IT policy. He's now running the NCAA - have fun with that sleazeball, guys!

      --
      #DeleteChrome
  19. Only 70000 accounts? by __Paul__ · · Score: 1

    Only 70000 accounts? That's not a big system at all. I was running systems with over million email accounts ten years ago, and by today's standards even those would be considered small.

    --
    worldmobilenet.com -- World Prepaid Wireless Internet plans
    1. Re:Only 70000 accounts? by Anonymous Coward · · Score: 0

      10 years ago, most email would have still been text based, not html, with signatures etc.. so volume/data wise its probably a lot more processing.

    2. Re:Only 70000 accounts? by Nimey · · Score: 1

      Not to mention not having lusers send 100MB attachments in their email to multiple recipients.

      --
      Hail Eris, full of mischief...

      E pluribus sanguinem
    3. Re:Only 70000 accounts? by __Paul__ · · Score: 1

      Nah, HTML mail was well and truely endemic by then (the ISP I worked for was even spamming our users with it every week).

      That said, how much of an impact does that have anyway? Worst case, there's an HTML copy and a plain text copy in the same email, so ... ~2.5 times the original size? That's not all that much of an imposition.

      Attachments might be more of a concern, but even today, the default maximum message size of Postfix is still only 10Mb. I daresay most servers on the internet are still running with their defaults, so most people should assume (and be educated to assume) that if they send anything big, it's going to bounce.

      --
      worldmobilenet.com -- World Prepaid Wireless Internet plans
    4. Re:Only 70000 accounts? by thogard · · Score: 1

      Two decades ago we were supporting 78,000 users on a machine that was almost as fast as 4 Nintendo 64.

    5. Re:Only 70000 accounts? by garaged · · Score: 1

      And had thesame CPU? (SG)

      --
      I'm positive, don't belive me look at my karma
    6. Re:Only 70000 accounts? by T-Ranger · · Score: 1

      And those 78,000 users, many of which used email for about 20 minutes a week. In 2011, those 78,000 users each have a desktop, laptop or tablet, and phone, connected to the system 24/7.

      Very different.

  20. What does this have to do with Sendmail? by farnsworth · · Score: 3, Insightful

    In the video, they don't even mention sendmail at all. Are they using it?

    Also, they mention that the cost of the system is something like $1.30 per account per month. I don't know much about IT budgeting, but that seems like a really low number for something as critical as messaging and calendaring. I have to imagine that they spend more money per user just cutting the grass around the campus.

    --

    There aint no pancake so thin it doesn't have two sides.

    1. Re:What does this have to do with Sendmail? by m1ss1ontomars2k4 · · Score: 2

      They no longer use Sendmail; they use Exim.

    2. Re:What does this have to do with Sendmail? by lucm · · Score: 2

      Also, they mention that the cost of the system is something like $1.30 per account per month. I don't know much about IT budgeting, but that seems like a really low number for something as critical as messaging and calendaring. I have to imagine that they spend more money per user just cutting the grass around the campus.

      Totally agree. One of my client did a major cost-cutting initiative for its email platform, and there was just no way to make it reliable under 9$ a month (per account). And this is when there is no Crackberry (which brings the numbers way up).

      --
      lucm, indeed.
    3. Re:What does this have to do with Sendmail? by TClevenger · · Score: 3, Informative

      That's amazingly cheap. I don't know how you'd do it any cheaper outsourced. Microsoft is $8.80/user in qty. 20,000, and while Google starts at $4.17/user, I couldn't imagine that even 70,000 accounts could bring down the price that much.

    4. Re:What does this have to do with Sendmail? by guruevi · · Score: 1

      That's a lot of money actually. $1.30/user is $13,000/month for what a simple server cluster (Postfix etc.) should be able to handle (~10,000 accounts). That's over $150,000/year.

      I used to work for a hosting company and our server cluster (4 servers with a 10TB backend) handled well over 50,000 accounts about 5 years ago. Sure the load would be at 10 or so but that's not a big deal for a mail server.

      --
      Custom electronics and digital signage for your business: www.evcircuits.com
    5. Re:What does this have to do with Sendmail? by KingAlanI · · Score: 1

      the point was "how the mighty have fallen", not something about sendmail in particular

      --
      I listen to both RIAA and non-RIAA stuff if I like the music, tangential business/politics nonwithstanding.
    6. Re:What does this have to do with Sendmail? by Anonymous Coward · · Score: 0

      Last year, I did the architecture & led the build for a new email system for a UK service provider. The initial build of the system was for 150k users (with 100k being migrated across from a couple of legacy platforms on day 1), but was designed to be scalable to 250k users over the next four years, so maybe we were a bit larger than the system that TFA talks about - but it should be roughly comparable.

      Our design goal was to have a minimum of 2n redundancy (with 2n+1 wherever possible) - for all elements of the system: switches, load balancers, MTA servers, POP3/IMAP/Sieve proxies and servers, Webmail (OpenXchange) servers, databases, logging & backup servers, and clustered storage nodes.

      We sold for roughly the same price as a Google account, and reckoned that we made something between a 80% and a 90% margin, depending on how you accounted for depreciation.

      So, yeah, $1.30/account/month is a bit more expensive than doing it in house, but isn't vastly so - especially if they don't have the required experience on hand.

    7. Re:What does this have to do with Sendmail? by AK+Marc · · Score: 1

      Also, they mention that the cost of the system is something like $1.30 per account per month.

      If that's operational cost (i.e. not capital/depreciation), that's a reasonable price. 70,000 accounts at that is about $100,000 per month. For email, even 70,000 accounts, that's plenty.

  21. IT cannot solve this by decora · · Score: 1, Interesting

    it's like saying IT can do heart surgery or IT can provide pscyhological counseling to a trauma survivor. IT is IT, it is not management and it is not leadership. IT is IT.

    of course, shit rolls downhill, and leaders nowdays are incompetent buffoons who gain their positions largely through bribery, kickbacks, extortion, and other 'features' endemic to societies where the rule-of-law breaks down thanks to a greedy, corrupt elite.

    again, IT cannot fix that.

    1. Re:IT cannot solve this by iggymanz · · Score: 1

      oh man, good IT people are leaders, whether in management or not. They try to identify resource problems before they become issue, and have solutions.

  22. Re:Hate Being First .... by Anonymous Coward · · Score: 1

    Considering that the outage meant that IMAP and POP couldn't be used, while webmail could, I'm not sure that an MX record change would've helped much. :)

    I still wanna know why they didn't just change the load balancer to take the bad server out of the pool (which could just as easily be a DNS round robin entry as anything). What? They didn't have redundancy built into the protocol which, short of DNS, is probably the easiest one to make redundant? Then remind me to consider "UC Berkely IT" as a back mark on future resumes when people apply to work here.

  23. The failure is leadership, planning, budgeting... by linuxwrangler · · Score: 3, Interesting

    I've only heard from people on one side of this but the story that I hear is that in the past, many departments had their own IT, mail servers, web, etc. When the campus built its centralized computing services facility, there was great pressure on departments to move to the central system. There was some griping about the costs for central services often exceeding the internal costs the departments formerly had but there was, I'm told, much need to justify the expense of and to pay for the new center. I've heard that some departments have been able to resurrect their internal systems to get through the outage.

    Perhaps someone with more inside knowledge than I have can fill in and/or correct information from both sides of the story.

    That slideshow is pure management-spin right from the opening "look how complicated and difficult this is..." I love how the "solution" to a system that is soon to outstrip its capacity is to stop expanding (and, it appears, properly maintaining) said system and hope it doesn't implode before you can toss the potato to an external party (who can then take the blame). Guess I was never learned at that school of capacity "planning".

    --

    ~~~~~~~
    "You are not remembered for doing what is expected of you." - Atul Chitnis
  24. Did the CIO just give up in the presentation? by Above · · Score: 3, Insightful

    The press pretty much reads like this to me:

    1) We didn't size the system large enough to handle the possible outages.

    2) The outage we didn't size for happened, basically taking everything down.

    3) My team is now working on a band-aid solution, which basically involves hobbling the application.

    4) Since we're incompetent, we're going to outsource this next year.

    I mean, if I was the CIO's boss I would have fired him on the spot. Maybe outsourcing is a better answer than putting in place a proper system and looking at that analysis could be interesting. I see no indication any of that was done here, basically the CIO gave the Barbie response, "Mail is hard, let's go shopping." If he doesn't understand how to do it in house, he won't understand how to arrive at a good outsourcing agreement.

    Which means this pretty much sums up everything that is wrong with large org IT today.

    1. Re:Did the CIO just give up in the presentation? by Anonymous Coward · · Score: 0

      I had a good friend who worked for Berkeley. A year or so ago, they went through an database upgrade. total cluster-you-know-what. Her laptop was loaded with crapware.

      Their IT dept. is the worst I've ever seen. I'm not surprised. The university administration has nothing to do with the university research. The admin is poor, imho.

    2. Re:Did the CIO just give up in the presentation? by SpaghettiPattern · · Score: 1

      "Mail is hard, let's go shopping."

      IT isn't a university's core business. When IT was in its infancy there was a case for letting the CS faculty run IT with students volunteering. As IT has advanced in the mean time, the CS faculty can now concentrate on CS and reduce its hands-on stuff to leading edge technologies (e.g. research in super computing, semiconductors, etc...). Nowadays for the vast amount of tasks -even for most CS tasks- ample computing resources, technologies and software can be easily made available.

      --

      I hadn't the slightest objection to his spending his time planning massacres for the bourgeoisie... (P.G. Wodehouse)
    3. Re:Did the CIO just give up in the presentation? by Above · · Score: 1

      When I was at a University known for Computer Science in the early 1990's the CS department had zero to do with running campus services. When you have 70,000 users (we had more like 40k) IT is a large function, and not something done in the CS department's spare time. IIRC the Computing Services group had a budget of like $50 million back then, doing mail, dial up, ethernet to the dorms and offices, running the large payroll printers, keeping the mainframes that scheduled and did grade tracking up, running the phone system etc. It is a big job at a university that size, and a key to making the institution work.

    4. Re:Did the CIO just give up in the presentation? by Anonymous Coward · · Score: 0

      You are assuming that outsourcing wasn't the plan all along.

  25. Wow! 50 comments, all trying to change topic! by Anonymous Coward · · Score: 0

    All comments so far are like "it's not sendmail's fault".

    Why is everybody so defensive about it?

    1. Re:Wow! 50 comments, all trying to change topic! by ksd1337 · · Score: 1
  26. How about... by Anonymous Coward · · Score: 0

    ... not treating a non-technical problem as a technical problem? Identify the problem, write the memo, keep the flimsies, and drop it on the relevant manager's trouble ticket queue. Or however the flow goes in your locality. The rest I leave as an exercise for the reader.

  27. Just move to Google Mail by dell623 · · Score: 1

    Google have 24x7 phone support now. It is really a futile exercise to maintain local email systems even for a few thousand users, it will be outsourced sooner or later.

    1. Re:Just move to Google Mail by Anonymous Coward · · Score: 0

      Google have 24x7 phone support now. It is really a futile exercise to maintain local email systems even for a few thousand users, it will be outsourced sooner or later.

      They probably will. The whole reason for the failure is lack of IT funding, and campus admin is currently studying whether to choose Google or Microsoft for their outsourcing. Of course, this is slightly tricky since federal law doesn't allow emails containing student information (eg grades) on third-party servers.

    2. Re:Just move to Google Mail by jasomill · · Score: 1

      They probably will. The whole reason for the failure is lack of IT funding, and campus admin is currently studying whether to choose Google or Microsoft for their outsourcing. Of course, this is slightly tricky since federal law doesn't allow emails containing student information (eg grades) on third-party servers

      Then why report grades via email in the first place? Presumably some of the money saved could go towards implementing a Web front-end to the registration system, or to improving your current system to cover whatever use cases led to grades being emailed to students.

      This is assuming communication between faculty and staff will remain either in-house or "seriously" outsourced (effective SLAs, NDAs that cover confidential information, etc., rather than best-effort "branded Gmail").

      Oh, and electronic gradebooks are one thing, but please, please, PLEASE don't torture your students by implementing messaging embedded insome "courseware" system that amounts to a non-interoperable inbox for each course that uses the system on compliance grounds. Any email system that's inappropriate for day-to-day course communications is unsuitable for campus use in the first place, no?

    3. Re:Just move to Google Mail by Mr.+Freeman · · Score: 1

      The general solution to this (as far as I can tell, based on being a student in a university) is to go with a third party and then implement a "no grades over email" policy. Professors can't send our grades, various things related to registration, etc. over email. At first I thought this was the university being stupid or an odd federal law with a blanket prohibition on such things, but now I'm starting to think that it's due to the interaction with third party devices. (we use Google for our email)

      --
      -1 disagree is not a modifier for a reason. -1 troll, flaimbait, redundant, overrated are NOT acceptable substitutes.
    4. Re:Just move to Google Mail by Anonymous Coward · · Score: 0

      It's simply a best practice. Since students tend to forward emails anyway - it is not acceptable to post grades via email. Our university requires logging into the authenticated and SSL encrypted student information system for all grades.

  28. Re:The failure is leadership, planning, budgeting. by MightyMartian · · Score: 1

    They left out the slide where management get great big bonuses for being such swell thinkers.

    --
    The world's burning. Moped Jesus spotted on I50. Details at 11.
  29. its also a privacy thing by RobertLTux · · Score: 0

    Due to FEDERAL LAW communications from school staff to students (and the reverse) must go to University Email accounts.
    plus if somebody "does not get" a given email then its the schools fault.

    --
    Any person using FTFY or editing my postings agrees to a US$50.00 charge
    1. Re:its also a privacy thing by mrmeval · · Score: 1

      So it would be for official use only like work.

      --
      I'd go on a Vegan diet but the delivery time from Vega is too long. --brownkitty
    2. Re:its also a privacy thing by damium · · Score: 1

      I would like to see said federal law, I have doubts to it's existence. The only federal regulation that I know of that has anything to do with university communications (not involving specific dept of education programs with their own rules) is FERPA and it only involves granting 3rd parties access to student records.

  30. No, tuition is for education by Colin+Smith · · Score: 1

    Clearly email is an afterthought thrown in for free.

    If you want a service to work, you have to fund it. You can try to fight for budgets against the football team or you can simply charge and the money automatically goes where it's needed.

    Think of money as little packets of information. You buy something there is a need for it, you don't buy it, there is no need. Resource allocation without dozens of layers of management.

    Maybe nobody cares about email and they can just shut it down. Charge for it and find out.

    --
    Deleted
    1. Re:No, tuition is for education by icebraining · · Score: 1

      Maybe they should offer redirection for free and paid storage. I find it useful to have an email address within my college domain, but I redirect everything to my main account.

    2. Re:No, tuition is for education by Anonymous Coward · · Score: 0

      The football team is getting a $321 million stadium upgrade. I imagine that a tiny tiny fraction of that could have solved this problem.

    3. Re:No, tuition is for education by Anonymous Coward · · Score: 0

      You ever seen a contract stampede, sonny? Ain't a purty sight, ain't a'tall.

    4. Re:No, tuition is for education by Nimey · · Score: 1

      At my university we split it into tuition and student fees. Fees go to all manner of things like clubs and activities and the rec center and other stuff.

      --
      Hail Eris, full of mischief...

      E pluribus sanguinem
  31. The email experts are not ... by jamesl · · Score: 1

    ... at the university. And they haven't been for decades. Microsoft and Google have thrown money and brains at these kinds of problems -- money and brains in quantities that university IT departments can only dream of. Both have economical, reliable, scalable, secure and user friendly solutions to this problem.

    Look up Microsoft live@edu and Google aps for education.

    1. Re:The email experts are not ... by iggymanz · · Score: 1

      Microsoft exchange server is a reliable, secure and scalable solution? Bwhahahahaha, my employer uses that shit, it is none of the above. If it were, the internet's backbone would use it

  32. Carrier pigeons coming soon! by syousef · · Score: 1, Funny

    Wow, Squirrelmail. So at least they managed to migrate from pine at some point.

    Yeah, they're planning the upgrade from squirrel to carrier pigeon as we speak!

    --
    These posts express my own personal views, not those of my employer
  33. IT has to deal with budgets, too by msobkow · · Score: 2

    I hate it when people try to act as if IT isn't subject to budget constraints and having to prioritize spending like any other department of a large organization. Sure the money comes out of the "client" departments, but it's an issue that IT does have to plan for and deal with.

    The summary asks "How can IT people solve things like this?"

    Forward the emails and responses to the demands for planned capacity growth to the public.

    Oh, you didn't keep the email from your manager refusing to pay for a needed capacity upgrade? I guess you haven't been in IT long enough to learn to cover your own butt.

    --
    I do not fail; I succeed at finding out what does not work.
  34. Poor management by riley · · Score: 1

    Sigh...

    Look at the first bullet point of the timeline. Productivity suite approved, upgrade to Calmail cancelled. Then a week ago, they decided on an interim upgrade because not upgrading in the first place caused problems. So, rather than a planned upgrade, the IT folks were thrown into panic mode because their (probable) proposed timeline for safely doing an upgrade, including burning in and testing of new hardware, was cut to a fraction of what it should've been.

    You can argue about the budgets, or the IT folks, but this is a failure of management. If (in Spring 2011) they cancelled the upgrade, and then had to have an emergency upgrade, what you have is management that fundamentally does not understand the system. This would (probably) not be the IT folks managing the system, but rather the budget and personnel management that doesn't quite grok how upgrades should be done in a safe and controlled manner. They misjudged the initial cancellation, and then (likely) pushed through a poorly planned emergency upgrade.

    If the slides are correct, there is very little having to do with a failure from a technical aspect, and everything to do with a breakdown of management.

  35. Radical Approach by Anonymous Coward · · Score: 0

    As IT personnel, we could demand that no shiny new toys be allowed to talk to the nerd stuff in the back until there is 2x the amount of nerd stuff in the "back" as would be needed to effectively handle all the new shiny things.

    Of course this would equate to a bunch of loathsome, Grinch-like, bureaucrats needlessly handicapping the business/school/non-profit to serve there own petty need for authority.

    All we want to do is hook all these toys to the e-mail cloud. Why do we even need to talk to you people?

  36. Re:Hate Being First .... by CAIMLAS · · Score: 4, Insightful

    Believe it or not, maintaining a mail host for a larger, geographically diverse

    If it were easy, there'd be no push to outsource it to "the Cloud" (or anywhere else), and countless organizations wouldn't be moving from the "burden" of administering something like Exchange (ie, a trivial amount of knowledge is required compared to any other MTA) to Office 365 or Google.

    It's not just as simple as setting the mx to point to a 'working host', especially not in academia (though many try). Do you have to deal with this kind of thing?

    As someone who has to deal with this stuff on a daily basis - I had dealings regarding CalMail last week on a similar mail related problem of their's - and with academic mail systems in general, let me clue you in:

    * This is not your business mail system, where everyone has a uniformly specified mailbox.
    * It is not dictated from the top down how mail is run. In a corporation, there is standardization. CalMail is the exception in academia, as far as I can tell, in that it's run somewhat like the business model. However, there is still somewhat of the "Greek" (vs. "Roman") model of management involved, and this does tend to lead to problems. (This is much more true with other academic mail systems, from what I can tell.)
    * Unlike in the work place, there is very little systems experience where it is needed (ie in the actual administration). Even with dedicated IT, very few people are actually good with the mail system due to how broad and complicated mail management can be.
    * Running a mail server effectively is now quite difficult. Not only do you have to "just make it work" - ie, dealing with all the misbehaving mail systems out there from other academic institutions and verifying the VIP email makes it through (regardless of how much spam that means letting through - but never let any spam through!) - but it's got to run like a top.
    * Often, you're dealing with decades of systemic dependencies. Mail was the first connected application, after all, and nobody's had it as long as Berkeley. Based on my own experience with networks which grew around their mail system, small changes can compound any sort of change or update. Suddenly, there's something everywhere that needs a specific mail system functionality which can't simply be copied over during a move to replicate it.
    * An organizational system like this is big, it's not garden variety email. Hell, i guarantee you they don't have as many IT people maintaining accounts as they have admissions people, probably not even a 10th. Yet the IT people have to actually make sure those records get to the right places all while assuring the admissions people that the information transits securely.
    * There is undoubtedly a faculty member with his pet requirements for email. He probably has things which will not migrate properly.
    * There will undoubtedly be the people using their mail account for file storage.
    * Believe it or not, it's actually fairly difficult to migrate mail from, say, Cyrus IMAP to anything else. It takes time (and anything at all with Cyrus, which I'd not be surprised if they were using, takes a lot of time). Sieve scripts, procmail, IMAP states, et al. It's a pain in the ass, and takes a loooong time to do seamlessly. Doing it under duress of hardware failure is something else entirely.

    From my reading of the events (and seeing some other things not mentioned in OP or linked article) there were a number of things which caused this prolonged outage. First and foremost, the system was not designed to be resilient so much as it was designed to scale up (or proper failure condition testing was not performed beforehand). Second, they either don't have the necessary (knowledgeable) human resources, or enough time allocated to those resources, to effectively manage this system. (You would not believe how difficult it is to find a "mail administrator". Everyone's done it, but nobody seems to like it or is all that good at it. If they are, they want a LOT in compensation.) Third, they may

    --
    ~/ssh slashdot.org ssh: connect to host slashdot.org port 22: too many beers
  37. In a perfect world, yes. But... by TheQuantumShift · · Score: 1

    First and foremost, it has to account for budget, and the rationalization thereof. It's scary how often suits (and more and more "engineers") say things like "Come on really; how often does that kind of thing actually happen?" This is usually uttered after staring at a couple dozen slides of metrics that detail exactly how often it happens...

    --

    Shift happens. Fire it up.
  38. Nginx by Anonymous Coward · · Score: 0

    Has anybody thought of using Nginx e-mail proxy to solve the issue?
    http://nginx.org/en/#mail_proxy_server_features

    1. Re:Nginx by Bronster · · Score: 1

      nginx is not a magic bullet - it would not help in this case.

  39. Why do campuses provide email? by HalAlpha · · Score: 1

    University IT departments providing email and calendaring services is like the facilities employees being required to build all the chairs for classrooms.

    --
    "If I can't dance, I don't want to be part of your revolution" - Emma Goldman
    1. Re:Why do campuses provide email? by solidraven · · Score: 1

      Cause else students are forced to use unreliable services meaning you can't ensure e-mails arrive at location.

  40. Aren't all these users supposed to be on Facebook by Anonymous Coward · · Score: 0

    I thought nobody (especially college kids) used email anymore. Facebook is where all the cool kids hang these days, right? California is doing a bang up job alright. When they're done with this project maybe they can consult themselves out to the Feds. I hear they've got a mail problem of their own with this Post Office thing. For those of you unfamiliar with Post Offices, the wikipedias have a decent write up: http://en.wikipedia.org/wiki/Post_office. Anyway, I can't wait to see the powerpoint slides for the Post Office Turn Around Medical Marijuana Home Delivery Program.

  41. A lot simpler than that by dbIII · · Score: 1

    Students and faculty have essentially no power over the terms of use that Google or Microsoft or any other third party email service imposes on them.

    There's a more mundane problem. Unless you are an incredibly huge customer the large service providers are just not going to care if there is an outage. One example I ran into last year is a University of 45,000+ students that lost their student email hosting (hotmail) for a week due to a DNS typo for a machine in a hotmail MS Exchange server farm. To get a job offer to a student I had to put an entry in /etc/hosts of my mail server - meanwhile no other mail was getting to any of the students for a week.
    That's the price of outsourcing. Your important services are farmed out to people that just do not care enough to fix a typo for a week.
    You don't even have to go as far as malice when apathy is enough to provide unacceptable problems.

  42. Coincidence? by solidraven · · Score: 1

    I think the mail gods are angry at the academic community. Our mail server crashed as well last week. Took them a 4 days to get a backup server online. And another week to get the emails to the new server.

  43. Re:Hate Being First .... by Anonymous Coward · · Score: 0

    Far out - this comment is so bad that it's not even wrong.

    Making 70,000 people change the mail settings and lose all their state and access to their old mail while you spool it somewhere, and then integrate it back into their main spool. Make sure all the filtering rules got applied correctly along the way.

    Trivial, I could do it in my sleep.

  44. Re:Hate Being First .... by Bronster · · Score: 3, Interesting

    As a mail administrator for a big system, I completely agree with you.

    The biggest problem was that they had everything on a single SAN, so when they ran out of IOPs, there was no spare capacity anywhere, and nowhere to mitigate it to. I've had people try to sell me on putting all our systems on a SAN too "it's so simple to administrate. It has plenty of IOPs, see, look at these shiny numbers". Fine when it's empty and you're only hitting the battery backed cache.

    Which is why we have hundreds of separate little disk sets managed with templated configurations rather than any single points of failure. I'm really glad to be there!

  45. Re:Hate Being First .... by Anonymous Coward · · Score: 0

    Building rockets is hard. E-mail is a piece of cake and getting easier by the year.

  46. Re:Hate Being First .... by MartijnL · · Score: 1

    A SPOC is never a good idea. So putting everything on one SAN isn't either. Why would you purposely introduce so much fragmentation in your data when you could just build a proper SAN environment which had high availability built in?

  47. Sendmail not at fault...Poor design is. by DaveGillam · · Score: 1

    Watching the video, two things were apparent: Sendmail wasn't being used--Exim was, and the fault was not the MTA, but rather the use of a single SAN backend for everything.

    I've been in the Messaging Infrastructure business for many years. The UC problem is poor design. They left themselves open to a single point of failure by not splitting the mailbox load across multiple SANs. Their load isn't really all that great--I've designed for much larger email volumes. What they need is an LDAP-based routing (or similar) mechanism to send different recipients' emails to different SAN backend stores--say, alphabetically by last name, or by entry (employee/student/alumnus/account) id. When a disk failure occurs, it then would only affect a small percentage of the population, and for a much shorter time. By enforcing RFC compliance on the front end, they would also reduce the load on the back end, and could easily handle their traffic load with far fewer servers--thereby costing far less than what they currently have.

    They certainly can pay someone else to do proper design, of course, but they should understand that technology and budget did not cause their problem, their poor design did.

    -David Gillam
    www.davegillam.com

    1. Re:Sendmail not at fault...Poor design is. by Bronster · · Score: 1

      They already have a Cyrus murder cluster - no need to do the LDAP routing. And they already have multiple backends. The IO issues are purely one SAN, and everything striped to spread load out so evenly that there's no spare capacity anywhere online when it runs out.

      It's certainly cheapest to run that way, but running everything at 100% utilisation means no capacity to adapt to change. It's not just computer systems either - if you work all your staff at 100% utilisation all the time just to do current tasks, you're unable to adapt to change, because nobody has time to do anything else. It's the flip side of perfect efficiency - total ridigity in the face of change.

      But yeah, SPOF. Most horror stories start with one of those.

  48. Re:The failure is leadership, planning, budgeting. by mako1138 · · Score: 1

    I don't know a whole lot about this but I'm on the mailing list for a department that was in the process of migrating to Calmail. (My email goes through a different system so it hasn't affected me.) After a slew of messages this past week about Calmail problems, they've decided to cancel the migration for now. Apparently Calmail is going to the cloud in the future, so they're hoping the existing servers last until then.

  49. its what we refer to as by nimbius · · Score: 1

    a cascade failure.
    1. data storage failure.
    2. database crash, presumably due to the fault in data storage
    3. heavy backlog of deferred mail begins hammering a generally neglected piece of Berkeley IT.

    email, calendaring, and instant messaging arent mythical, and they need constant competent care
    just like any part of the IT infrastructure. Having worked on complex email systems for the better half of my career
    some of the fault lies with the berkeley teams "set it and forget it" mentality as shel waggener scolds the audience about at the start of the video.
    is that backend database known to the DBA and in a healthy state? are the front-end components configured for email as it ran in 1993 or have they
    over time been upgraded with new features to address email as it operates in the 21st century.

    --
    Good people go to bed earlier.
  50. Re:Hate Being First .... by RogerWilco · · Score: 1

    I have a story of a mail system at a university running on a SAN, which ran out of file descriptors and took the mail system down for a week as well. I think it was some form of ZFS, although I've seen similar things on GPFS and other things as well.

    Just rebuilding a system like that after a crash can take days to weeks. (please note I'm talking Petabytes here).

    --
    RogerWilco the Adventurous Janitor
  51. Re:Hate Being First .... by Bronster · · Score: 1

    I've run out of inodes once. Now I monitor them as well. Thankfully it was just one machine and changing the replica was simple.

  52. Re:The failure is leadership, planning, budgeting. by Anonymous Coward · · Score: 0

    Here it is in a nutshell:

    UC Management had decided to delay the purchase of a replacement system a year ago. UCB has been working on OE (Operational Excellence - euphemism for downsizing) for a few years now. Because of the decision, one of the main techs that help created the stable, top notch mail system left for Twitter a few months ago. Someone on joked on the Micronet mailing list that mail was going to crash and burn.

    The university decided at the beginning of the year not to replace the aging, out of warranty hardware while they evaluated outsourcing to gmail or Microsoft.

    Mail services were already experiencing problems during the past few months prior to the crash. This was due, according to the CIO, to the unprecedented amount of users using cell phones & iPad to access mail constantly starting this semester. Then a critical server crashed the day after Thanksgiving. The techs brought it back up over the weekend, but they were still recovering data.

    Then on Monday, everyone came back to work and when the staff all started to log in, the system degraded so much they had to take it down again. They disabled imap & pop services to keep all the cell phones and ipads off the servers, which also destroyed the productivity of the staff that relied on Thunderbird and the Mac mail client to access imap. The web based squirrel mail and roundcube clients were unfamiliar and lacking functionality of real mail clients. Even the techs were figuring it out with each other on the Micronet mail list.

    Basically, it was a management decision. It was a gamble that failed.

    Capcha = raided