Slashdot Mirror


Should Developers Have Access To Production?

WHiTe VaMPiRe writes "Kyle Brandt recently wrote an editorial exploring the implications of providing developers access to the production servers of a Web site. He explores the risk introduced by providing higher level access as well as potential compromise solutions."

76 of 402 comments (clear)

  1. For me by enderjsv · · Score: 5, Insightful

    Whenever an error occurs that I can't replicate in a dev environment, I'm always SO tempted to hop into prod and start adding in some output statements.

    Yeah, it's probably a good thing I don't have access to prod.

    1. Re:For me by IICV · · Score: 4, Insightful

      That's why you shouldn't have access to prod, but you should be able to either A. get a clone of prod made fairly quickly or B. already have one running so you can mutilate it however you want.

      Seriously, hardware is cheap and people are expensive. Minimizing person-time is worth a bit of hardware gluttony.

    2. Re:For me by stillpixel · · Score: 5, Funny

      User: There is an error on page X
      I tweak that page code on the production server after looking at the error log. Me back to User: An error really? Have you tried pressing F5?
      User: Oh.. hmmm I guess I must have done something wrong. Sorry for bugging you!
      Me: Hey, no problem.

    3. Re:For me by x2A · · Score: 2, Interesting

      Eugh yeah I hate that. So what I try to do is code in such a way that if a bug should occur, the whole thing stops working, that way there's no point in my /not/ fixing it on the production server! I'm a freakin genius! No of course I'm joking, but a recent project has hit some problems where I've been able to explain and the client has actually been able to understand the challenges of trying to reproduce an intermittent undiagnosed problem without touching the production code (ie, is just not worth the time trying to do) and lets me fiddle with the code. Usually tho it's enough for me to be able to add logging code where it's needed and there's no end-user-visible effects. There've also been problems that have languished, but as soon as I've had the go-ahead to try resolve it on the live system and resolved it quickly and without interruption, so they're getting more okay with letting me do it that way. Sometimes I'll just fix a problem and not tell them, to avoid all the hassle. At the end of the day, I know better than them (which is why they come to me) and sometimes you do just have to make a judgement call. BUT, it's not a massive project with many developers, and in those conditions obviously you need to retain more order.

      rm -rf /^H^H^H^H^H^H^H^Hoops wrong window

      --
      The revolution will not be televised... but it will have a page on Wikipedia
    4. Re:For me by FatSean · · Score: 4, Insightful

      Test environments are essential, but they do require people-time to keep them matching production.

      --
      Blar.
    5. Re:For me by idontgno · · Score: 3, Insightful

      If I screw up, people can't get the correct pills.
      It's fun to make other people live dangerously. :-p

      FTFY. Well, for certain values of "pharmacy benefit management system". If your production hacking can botch scrip fulfillment, please say what company you're working for so I can try to avoid it like the plague it is.

      --
      Welcome to the Panopticon. Used to be a prison, now it's your home.
    6. Re:For me by Americano · · Score: 3, Insightful

      Deploying to an extra "prod-clone" test server should require pretty trivial amounts of people-time if it's set up properly - it's just another test server, if it takes ridiculous amounts of manual effort to deploy to a single extra test server, you're probably doing something remarkably inefficient.

    7. Re:For me by TheRaven64 · · Score: 4, Informative

      Depends on how your deployment system is designed. Mainline Xen now has support for brining up complete live duplicates of a running VM and keeping them in sync (or letting them diverge), and this support has been in high-availability hypervisors for decades. If you're using a system like this, it's just a couple of commands to clone the production VM and get something that you can break without impacting users.

      --
      I am TheRaven on Soylent News
    8. Re:For me by PotatoFarmer · · Score: 2, Interesting

      Use an automated process that rebuilds your test environments nightly from production backups. Test environment synchronization and backup verification rolled into one.

    9. Re:For me by Gerzel · · Score: 3, Insightful

      This article also assumes that the developers (or often developer) is not also the server admin. Many shops have one or a few IT people wearing many hats.

    10. Re:For me by characterZer0 · · Score: 2, Funny

      Unfortunately, copying prod to test does not copy the users, which are often an integral part of the error.

      --
      Go green: turn off your refrigerator.
    11. Re:For me by Anonymous Coward · · Score: 3, Insightful

      Yes, we call that staging.

      We work on dev.

      We promote changes to stage, which is a dup of the prod environment.

      When stage is approved we promote to prod.

    12. Re:For me by tomhudson · · Score: 2, Insightful
      And it's sometimes necessary.

      The article is crap. Sample quote:

      Account privileges, file permissions, web server configuration are often not what developers have experience in or are very interested in

      Retarded. Absolutely retarded. Anyone who can write that hasn't got a clue.

    13. Re:For me by gorzek · · Score: 2, Interesting

      The whole article is absurdly vague, anyway. Sometimes developers need access to production--such as on a critical system--and sometimes they don't. Had the article between written toward a narrower domain, something more specific than just "Web sites," it might actually be useful. As it is, it's too light and fluffy to have much real-world impact.

    14. Re:For me by es330td · · Score: 2, Funny

      I did this once and hosed a production Informix system. We had over 100 external users call in before it was realized and fixed. Fortunately, I did it because my manager told me to so I kept my job and she was reprimanded for A) changing production and B) asking someone else to do it and not doing it herself. I learned my lesson and in the subsequent 10 years have never modified a production system without thorough dev testing first.

    15. Re:For me by mwvdlee · · Score: 4, Interesting

      In the mainframe shop we used to have 5 stages; (production, shadow (with similar load to production), functional acceptance, system integration and development), next to that 2 well secured "emergency" stages linking to prod and shadow and a single "free for all" development area outside the control of the basic stages.

      Mainframe shops tend to be much more closed and mature than more modern environments, and hence much less goes wrong.

      I've also worked in a Java shop at the same company, where they had 3 stages and a locally for dev, but the stages were much less controlled and you could easily skip straight to production. Obviously only the most experienced of programmers did this and only when they were absolutely certain. Obviously quite some more fixes went wrong on production.

      Currently working in an environment without stages, I try to work on test copies as much as possible but the temptation of bugfixing directly in production is quite large.

      --
      Slashdot social media options: AIM, ICQ, Yahoo, Jabber and Mobile Text. Why no MySpace?
    16. Re:For me by tomhudson · · Score: 3, Insightful

      You've obviously never had your boss direct live traffic to your "test" server at the datacenter, and not tell you for 2 weeks.

      I'm comfortable working without a net - but I'd like to know about it.

      Obviously, from that point on, any time we did a make, we made sure that we could install the new code within a few seconds, so as to avoid downtime (we could tolerate a delay of 15 secs once or twice a day without raising eyebrows).

      And sometimes a fix HAS to be done live to prevent bad data from propagating. If you don't trust yourself to get the command just right the first time, then step away from the keyboard and find someone who can type the right sql command the first time with 5 people hanging over their shoulder sh*tting bricks. In other words, you need one of the devs - fast. You certainly wouldn't let a sysadmin do it.

      In theory, you may be right. In theory, theory and practice are the same. In practice, they're not. There are ALWAYS exceptions. Not realizing this, and being flexible, is a mistake.

    17. Re:For me by Americano · · Score: 2, Informative

      I can't disagree with that, but the stuff you're listing is more or less a one-time setup cost: The point made by GGP was that deploying to a prod-clone system ('keeping it matching production') takes time - and that should not take any more time or effort than deploying to your existing prod or qa or uat systems.

      If it takes great, ongoing manual effort to install your software on a single extra system, you're doing it wrong.

    18. Re:For me by tomhudson · · Score: 3, Interesting

      ... and really bad planning happens all the time in the real world.

      Look around you. Try to convince yourself that all this was properly planned. The real world is messy.

    19. Re:For me by x2A · · Score: 2, Insightful

      That was kinda my point, 'tho in reality I don't tend to do that... you learn quite fast after you accidentally shut down a remote server thinking you were rebooting your local firewall! Custom and highly different prompts helps here though.

      One thing that does from time to time happen though is the mistake-paste, where the contents of the clipboard weren't as expected (for example, while clicking to select the ssh console, maybe accidentally selected a few characters which replace the clipboard). This can mean that *anything* can happen.

      Either way, you do have to apply changes to the live system at some point, and I've seen problems come up here before, like permissions not being changed correctly, live system log file overwritten by a development system log file, killing any chance of debugging anything that needed that log etc - although once I pointed out that this was going on, the update proceedure could be corrected so it no longer happened, but it does demonstrate that the more actions you have to take, the larger the mistake-surface-area is. You just hope that the intersect of mistake-surface with the live system is smaller, even if the overall mistake-surface is larger.

      --
      The revolution will not be televised... but it will have a page on Wikipedia
    20. Re:For me by mwvdlee · · Score: 2, Informative

      How do you get shadow to have the same load as production? That's the really important part that often makes the production-clone quite different: much lower load.

      The idea of shadow was to test real-life production. It was basically identical to production in every way (including that developers had no access to the data; AFAIK it was anonymized anyway), except the data wasn't used for anything. Software had to run uninterrupted for a number of weeks before it would be allowed to go to production.

      Sometimes all those stages are a pain, but in the end it results in a much more stable production.

      --
      Slashdot social media options: AIM, ICQ, Yahoo, Jabber and Mobile Text. Why no MySpace?
  2. Short answer by Issarlk · · Score: 5, Insightful

    LOL! No.

  3. What a silly question. by Score+Whore · · Score: 3, Insightful

    No. It just encourages sloppy development practices.

    Would you want to drive over a bridge that wasn't actually designed and engineered, but rather they just piled some stuff up and will fix it if it collapses? Or have a surgeon chopping you open with the idea that they'll figure it out as they go? So why would we want developers to work with the expectation that they get to intervene at the last instant to resolve their failures?

    1. Re:What a silly question. by jameson71 · · Score: 5, Insightful

      On the other hand, I wouldn't want the surgeon to have to give instructions to a trained monkey on how to do the the surgery because the surgeon does not have access to the production patient.

    2. Re:What a silly question. by dkleinsc · · Score: 5, Insightful

      So why would we want developers to work with the expectation that they get to intervene at the last instant to resolve their failures?

      Because if there's a problem, there will be an expectation that they need to intervene to resolve their failures.

      To play Devil's Advocate here, there are some semi-legit reasons why developers might get production access:

      • If there's a serious production failure, developers are often called upon to assist the admins, because while they aren't admin experts they generally have some administration skills.
      • If there's a bug that makes it to production, the time it would take to fix the bug using proper procedures may cost more than doing a quick-and-dirty fix now and cleaning up using proper procedures later.
      • Diagnosing production-only bugs, which frequently require read-only access. For instance, developers may need read-only access to determine that their software didn't deploy correctly.
      • Helping admins properly configure their software.

      Now, none of this should be done willy-nilly. The basic rule at my workplace is that a developer can do nothing that could potentially alter behavior without managerial approval and admin approval where appropriate. At the same time, the primary enforcement of that rule is trusting our devs, so very little of that is actually enforced technologically.

      --
      I am officially gone from /. Long live http://www.soylentnews.com/
    3. Re:What a silly question. by jellomizer · · Score: 3, Insightful

      Bad analogy.

      It is more like having a Cardiologist diagnosing the problem then telling the heart surgeon what to fix.
      We don't give our fixes to a trained monkey we give them to System Administers or implementation specialists.

      I would say that I am a good developer. However I am only an OK System Administrator and I often when my code is done and working well. Deploying the code offers a new set of issues to work threw.

      However If I give to people who are good at taking my code and implementing it, the process runs much smoother without much problems.

      --
      If something is so important that you feel the need to post it on the internet... It probably isn't that important.
    4. Re:What a silly question. by Desler · · Score: 2, Insightful

      We don't give our fixes to a trained monkey we give them to System Administers

      There's a difference?

    5. Re:What a silly question. by mini+me · · Score: 2, Insightful

      I do prefer to drive over a bridge that is well designed, but if that bridge is found to have design flaws, I prefer to have them fixed right away. I do not want to wait for the crew to build a second identical bridge next door in order to test their patches.

    6. Re:What a silly question. by rtfa-troll · · Score: 2, Insightful

      Ahh. Well then I don't advise you to visit any medical colleges or hospitals. 'cos, whilst the doctors that will be treating you aren't going to be exactly trained monkeys, they definitely won't be the ones that developed the procedure. In fact, most of the time you will find that they are pretty much following the documentation.

      --
      =~ s,(.*),<sarcasm>$1</sarcasm>,g if any_point_you_wish();
    7. Re:What a silly question. by SatanicPuppy · · Score: 4, Insightful

      I work for a big company where every significant business unit has it's own devs and own processes. And while I'm primarily an admin, I still deploy some dev-level stuff a few times a year.

      And nothing, nothing annoys me more than going to a shop where the admin doesn't understand the tech, doesn't read the project requirements, and will not, under any circumstances, let me see his production hardware before he tries to deploy.

      Because inevitably it fails and then he tries to throw me under a bus, and I have to get on a conference call and try to use my magic mind powers to debug a system I'm still not allowed to see.

      Back when I was primarily a java dev, I used to have to configure Tomcat all the time, and it was the sort of thing were you needed to have some experience. And time after time I'd end up dealing with some windows admin who knew absolutely nothing, but was utterly convinced that I could never know more than him about a server technology.

      In short, common sense and a reasonable respect for experience beats a hard and fast rule any day.

      --
      ad logicam Claiming a proposition is false because it was presented as the conclusion of a fallacious argument.
    8. Re:What a silly question. by John+Hasler · · Score: 5, Funny

      > There's a difference?

      Sure. The monkey is trained.

      --
      Warning: this article may contain humor, sarcasm, parody, and perhaps even irony. Read at your own risk.
    9. Re:What a silly question. by vegiVamp · · Score: 5, Funny

      As a systems admin, I can assure you that there is definitely a difference.

      Trained monkeys get free bananas and are allowed to fondle their bits in public, to name but two.

      --
      What a depressingly stupid machine.
    10. Re:What a silly question. by John+Hasler · · Score: 2, Insightful

      I do prefer to drive over a bridge that is well designed, but if that bridge is found to have design flaws, I prefer to have them fixed right away. I do not want to wait for the crew to build a second identical bridge next door in order to test their patches.

      How about if, having built the bridge, they could build a second one in a few minutes for a dollar or so, store it out of the way in hyperspace, and swap the two at will? Would you still not want them to work on the offline one? Read I35 Bridge before you answer.

      --
      Warning: this article may contain humor, sarcasm, parody, and perhaps even irony. Read at your own risk.
    11. Re:What a silly question. by DrgnDancer · · Score: 2, Insightful

      You work for Boeing don't you? God I hated that place. Needed 4 people looking over my shoulder while I followed a script designed for a monkey. If you wanted a monkey, why did you hire an experienced HPC analyst/engineer?

      --
      I don't need a million points of light, just two points of multi-mode fiber and a 10 Gig-E router.
    12. Re:What a silly question. by recharged95 · · Score: 2, Interesting

      Then again, when I interviewed with Google for youtube:

      "We develop on the main servers, it's typical here, but now, aside from that and more importantly, I have a question : can you show the result of inserting the following values into an empty AVL Tree ... in a Python context"

      Being more of a software engineer than a pure CS wonk (couldn't answer the question completely), and having worked on spacecraft control software........ just say not working there was mutual.

  4. no. by pdp1144 · · Score: 2, Insightful

    It is my experience that giving development access to production gives you a production environment that looks like it has been vandalized. Although meaning well and trying to make the best application as possible; they need their own development lab, and their own staging / production lab.

  5. Read-only, if that, and nothing more. by Junior+J.+Junior+III · · Score: 4, Informative

    If you want to have control over your production code, you need to have assurance that it is not changing in an uncontrolled fashion. Allowing developers to have access to production locations makes it all too easy for this to happen. Read-only access allows developers to see the running code and perform file comparisons which can be useful in troubleshooting. They should never need more than this.

    And in some cases, even read access can be risky -- I've seen production web sites with resources linking back to development server URIs. It's a good idea to firewall your production servers in such a way that it is not possible for them to reach resources on development servers. This shouldn't prevent developers from being able to read the files on the production server, though.

    --
    You see? You see? Your stupid minds! Stupid! Stupid!
    1. Re:Read-only, if that, and nothing more. by SatanicPuppy · · Score: 4, Interesting

      See, to me this is more an issue of devs not obeying the rules. They damn well shouldn't be changing production code, and they damn well shouldn't be linking code from other servers.

      Either your devs are a bunch of barely trained lunatics, or they're breaking the rules in a vain attempt to get things done in a timely manner.

      Most times, when I see devs screwing with production it's either a "hero" coder who is way too good to use best practices, or a situation in which the environment is so hostile that the "best" solution seems to be breaking the rules.

      I once did some contract work for a company where the Q&A and testing process took a minimum of two weeks for the most trivial changes, and where the admins on the production servers refused to deploy things like security patches without a testing period that ran close to a month. The devs there had a hundred tricks for sneaking their code into production, and linking production code to the development servers in an attempt to meet their productivity goals.

      Fucking nightmare. Once we ironed out the Q&A thing, and split the admins into two groups (one who maintained, and the other who upgraded and approved changes) the whole process evened out and the devs stopped screwing around on production.

      --
      ad logicam Claiming a proposition is false because it was presented as the conclusion of a fallacious argument.
  6. Everyone agrees... by SatanicPuppy · · Score: 5, Insightful

    Everyone agrees that developers should never have access to production...Unless they're the developer, in which case it's different.

    Its a good practice to keep them separated, but in the end its just a pissing contest. The server admins don't want some filthy dev messing with their stuff, and I can appreciate that.

    However, admins often lack appreciation of some dev-specific issues, and their ignorance can lead to problems down the line.

    In the end, its the best practice to have everyone work together sensibly, than throw down inflexible rules that cause more trouble than they prevent.

    --
    ad logicam Claiming a proposition is false because it was presented as the conclusion of a fallacious argument.
    1. Re:Everyone agrees... by SatanicPuppy · · Score: 2, Insightful

      I work both sides of the fence myself, so I have a better than average appreciation of the actual risks and the actual benefits.

      It's my experience that the Berlin wall-like separation that many companies enforce between dev and production causes it's own set of problems, and that, instead of treating everyone involved as if they are involved, they just impose even stricter separation, which causes other issues, and makes the whole process glacial and inefficient.

      --
      ad logicam Claiming a proposition is false because it was presented as the conclusion of a fallacious argument.
    2. Re:Everyone agrees... by Aceticon · · Score: 2, Insightful

      Everyone agrees that developers should never have access to production...Unless they're the developer, in which case it's different.

      Developers should not have access to Production and I say this as a developer.

      Having a way to check logs in Production, maybe read the databases yes, more than that, no.

      Two reasons, one "good" and one bad:
      - If people have access to Production willy-nilly, sooner or later they will break it. This can even be true for Support people only they're the ones getting calls at 5 AM to fix it, so they soon learn not to do it.
      - Forcing a proper release procedure avoids creating expectations on non-developers (read managers/clients) that minor changes are fast to do and put in prod. Nothing like a Release Procedure in place to instantly kill any expectations of the "you can do this change in 5 minutes" type and create some space for actually doing UAT testing of even those seemingly innocuous changes that sometimes end up poluting a database with garbage beyond the point of recoverability.

      I work in finance and I've seen more than once cases of seemingly innocuous fixes done in Prod that brought down the systems for several hours, pretty much paralising the business and costing millions in lost business. This more often happens is seemingly non-mission-critical systems (where Prod access is more likelly to be open) that turn out to be required for know critical systems to keep on working.

    3. Re:Everyone agrees... by avandesande · · Score: 2, Insightful

      I don't think developers want access either- if something goes wrong, do you want to be responsible? I never want access to anything I don't need access to.

      --
      love is just extroverted narcissism
  7. No, if you can afford for them not to by dmomo · · Score: 2, Insightful

    So, yes.

  8. Uhm, no. by HerculesMO · · Score: 4, Insightful

    This is why there is a change control process, and a testing environment.

    If you're doing it wrong, you're asking for trouble.

    --
    The price is always right if someone else is paying.
    1. Re:Uhm, no. by Anonymous Coward · · Score: 2, Insightful

      The issue I have with this attitude is that the testing environments are never identical copies of production environments. You can have exactly the same hardware / software, but the instant the network topology differs, you no longer have identical environments. I've seen far too many config / routing / firewall / Akamai / replication issues that were initially flagged as software bugs, but were in actually environment bugs. The problem is that the SAs typically do not have the internal knowledge of the apps required to diagnose those kinds of configuration problems. I'm not arguing that devs should have unfettered r/w access, but a controlled measure of r/w access on at least one server in prod makes complete sense for senior devs.

  9. No correct answer by Motard · · Score: 3, Insightful

    There's no correct answer to this question. It depends on the size of the organization and the nature of the system. I've worked in different companies that have been on either sides of where I thought the line should be. The line is drawn in a very different place for a 20 employee company than where it is in a 20,000 employee company.

    1. Re:No correct answer by RyuuzakiTetsuya · · Score: 2, Interesting

      Bingo, and I don't think the line is drawn at the size of the company but how mission critical the application is. Granted, in a 20 person firm versus a 20,000 person firm, the developer's probably also the administrator. But OTOH, if your business is 20 people big and one of them is a developer, it's easy to assume it's probably IT based and as such, some sort of administrative control is probably a good idea in general. Think about it. Well, sure, 20 people but how many machines? Half rack? rack? 5 racks? A whole datacenter?

      --
      Non impediti ratione cogitationus.
  10. Jesus no by BigJClark · · Score: 2, Informative


    The day developers can write code that compiles the first time, then yes, otherwise, jesus, no.

    I work as an Oracle DBA for a mid-size company, and I provide a day-old cleaned copy of production in a different environment/box, and it does the trick.

    --

    Hi, I Boris. Hear fix bear, yes?
  11. Hmm by mark72005 · · Score: 2, Insightful

    I think it's helpful in analyzing real-world data and getting an idea about real system loads, testing issues to see if they are in the wild today, etc. For a good developer, it makes life much easier.

    In a very healthy development ecosystem all this data is replicated and there is never any need for a developer to touch prod. In the development ecosystems that exist in the real world though, most are very unhealthy, frustrated by ham-fisted security, process flaws, red-tape, inconsistency, and incompetence ranging from scattered to mostly cloudy.

    The answer is, do you have the class of developer that knows what not to do and desires to play nice, or do you have the usual.

  12. Developers need access to production DATA only by mikein08 · · Score: 2, Insightful

    As a developer I can tell you that it's impossible to test programs properly and thoroughly without access to production data. However, developers should NOT be granted access to production logins/sites - production data should be copied into development work areas so that developers have an appropriate "sandbox" in which to work/test.

  13. Re:Backdoors by alanebro · · Score: 2, Insightful

    I don't think he meant malicious backdoors. I read that as backdoors to allow debugging/etc.

  14. And the concensus is ... NO by mr_stinky_britches · · Score: 2, Interesting

    And the concensus is ... NO

    Who let this question through? It doesn't even seem controversial. I am not aware of any good reason to routinely give developers access to production.

    --
    Censorship is obscene. Patriotism is bigotry. Faith is a vice. Slashdot 2.0 sucks.
  15. Biggest issue for us... by i.r.id10t · · Score: 2, Insightful

    Biggest issue my cow-orkers and I have is that the sysadmin *claims* that the dev box and production box have the same packages, configuration, etc. but in reality, they don't. Most often we find out when we ask for production stuff to be copied over to the dev site to test errors, etc. and just loading it - which works on the live site - generates errors on the dev site.

    --
    Don't blame me, I voted for Kodos
    1. Re:Biggest issue for us... by PPH · · Score: 2, Interesting

      Good point. Lots of people are jumping in with remarks about developers tweaking production code. But there are other sorts of access. As a 'developer' I've had very good experiences with having shell access to production systems to read logs, inspect packages and even run little test suites to verify the configuration of the production system.

      For example, at one outfit, I was one of the few non administrative users with shell access to production servers. One day, everything came to a screeching halt. Nothing worked and the admin claimed no changes to the application. So I logged in and got a message to the effect that the /tmp directory was inaccessible. It turns out that IT management was under orders to clean, sweep and get rid of all unneeded 'junk'. So this manager asks his admin what each directory structure is for and is told that /tmp is where 'junk' is written. Orders went out that 'junk' was not to be kept on production servers and /tmp was to be deleted. Now, the admins knew better. But at this company, nobody bucks the chain of management. If the boss says, "Bolt the wings on this one backwards", you do it and move on. Being outside that chain of command, I was able to get things put back the way they were supposed to be. But having shell and read access to that system was what enabled me to see the problem (of course, being a *NIX geek gave me the experience to know what /tmp was).

      There's a large debate going on in many organizations on how much information to give each employee. DoD security requirements aside, giving everyone broad read access empowers employees to handle exceptions and solve problems without having to go up through the management chain. The down sides are: When it's easy to fix instances of problems, some people settle for that. Rather than searching for the root causes and making the necessary changes to eliminate them, the decision is often make to maintain the status quo. Because its so easy to fix things when they inevitably break. It creates the image of the hero or industrious worker when one can be seen to jump in and save the day. Repeatedly. I'm motivated by sloth. I like fixing things so they don't keep bothering me. The other down side is that undocumented work arounds make processes hard to reorganize and eventually outsource. If one needs to move their IT process offshore, for example, its much better to have everything compartmentalized and documented. This minimizes the amount of information the supplier will need in order to perform the job.

      --
      Have gnu, will travel.
  16. Depends on the size of your organization by jaymz2k4 · · Score: 3, Insightful

    If you are a small software shop then I can see reasons for allowing your small technical staff to have access to production. It's all well and good saying that only the admin of that server should have access and there's a full rollout procedure in place to be followed only on certain days, certain times; but even when I've seen that sort of structure in place there are times when it's useful for the developers to have access to production. Nothing is perfect and we'd all love to have multitude's of staging servers, replicating the typical load and uses of production but for a hell of a lot of (non critical I'd add) systems that just doesn't happen.

    There simply is no one rule fits all. Sometimes I wish we had extremely rigorous rules & regulations in place - I'd probably get to go home a hell of a lot earlier. I'm not suggesting you start chucking exceptions all over your checkout code on live but I think you should asses your own situation (and staff for that matter).

    --
    jaymz
  17. Install Good Logging Practices by FatSean · · Score: 2, Informative

    With some fore-thought and some discipline an application can be developed with very robust logging techniques. It takes development time, but there is nothing cooler than asking the production guys to turn the logging detail up for a few packages and seeing tons of data in the logs. It's not perfect as you can't log every variable at every moment but it certainly does help.

    I understand some shops can't or won't modify the logging levels on production servers.

    --
    Blar.
    1. Re:Install Good Logging Practices by GuidoJ · · Score: 2, Informative

      That's why you should always try to log all inputs and outputs (which includes configuration data, etc.) along with a timestamp. Afterward you can replay the situation in a test environment. If you properly set this up during development phase, this will already pay back during test phase. The developer can analyse the problem offline and proof that the fix actually solves the issue before it is shipped to production. It could even be part of an automatic testing environment.

    2. Re:Install Good Logging Practices by GryMor · · Score: 2, Insightful

      I don't know where you work, but when dealing with a few hundred or thousand tps per server, logging can account for a good chunk of latency.

      --
      Realities just a bunch of bits.
  18. ... am I the only one? by merlinokos · · Score: 2, Interesting

    I work in an environment where the devs fix bugs before adding features, so the code is stable almost all the time. I have less than 1 callout a week that's caused by something a dev has done to the code.
    We hire the best devs, and work in an environment where fixing bugs is more important than adding features. The result is that our devs get full access to production, and even offer to provide support in order to ensure that they're the ones that are woken up if something they've broken falls over OOH.
    I've been at my current company long enough that I'd forgotten there were places where devs and ops didn't trust each other.

  19. As a developer: read-only access by Todd+Knarr · · Score: 2, Interesting

    Speaking as a developer, I want/need read-only access in production. All too often I need to dig out information while troubleshooting, and most commonly I don't know what all bits I'll need when I start. If it were easy to identify exactly what I'd need to find the problem, I usually already know what the problem is. The hard ones are the ones I can't replicate in development and I only have a starting point, something that won't identify the problem but might help me narrow down where to look next. In those cases the only place I can look is production (since I can't make it happen in a controlled development environment) and I can't give the admins a list of what I'll need (because I need to dig through logs and config files before I'll know what I need to look for next). And if we've gotten to this point, it's probably a priority problem impacting production so it needs to get fixed Right Bloody Now.

    OTOH, while I may need to look at production, I don't need and don't want the ability to modify production except by going through the admins. This, of course, also requires admins who can follow basic instructions like "Look at config file FOO. Find the line in section X that starts with Y. It's value should be XYZZY followed by the number 1. Change that 1 to a unique number for that machine/instance. Repeat this for every machine/instance.". But all too often the response is "That's too complicated. Can you just give us config files to install?". And of course when I ask for the current config files, so I can be sure I'm not overwriting any other modifications to them (which may have happened since the admins control them and do modify them), I get "We can't do that, they've got production passwords in them.". Now all I can do is throw up my hands and go "Whatever.".

    1. Re:As a developer: read-only access by EricWright · · Score: 2, Funny

      You gotta know how to talk to admins. Tell them they also can be replaced by a very small shell script.

  20. No! No! A Thousand Times No! by OldeTimeGeek · · Score: 2, Insightful

    It's not necessarily a case of the admins versus the developers, its more of practicing good data governance.

    Our developers used to have direct access to all of the production databases. This was bad enough, but because of this the organization permitted them to directly "clean up" databases (meaning they wrote to tables directly), we had data that was being changed without the ability to really know who did it. The DBAs hated it and the developers were extremely uncomfortable doing it but it happened anyway. We eventually had a real process audit and the auditors had a field day.

    Needless to say we changed. I hope.

  21. überdev by Anonymous Coward · · Score: 3, Funny

    I am one of the few people who can run correct code the first time round. I am also proficient enough in OS matters to be able to circumvent access to locked down resources. So I don't care what this post says, I'm doing it myyyyy waaaaay.

  22. Hope you are never audited! by TarPitt · · Score: 4, Interesting

    I worked as an IT auditor for a very big public accounting firm. Reviewing IT controls was a key part of the financial audit (and more so now with Sarbannes Oxley).

    If I found developers had access to production, it was automatically a "no reliance" finding.

    This means the financial applications are inherently untrustworthy that the financial auditors would have to review original source documents for validation.

    "No reliance" meant the audit became much more expensive as a result.

    Also - if the auditors can't rely on the financial reports, should management?

    --
    If your children ever found out how lame you are, they'd murder you in your sleep
  23. No by istartedi · · Score: 2, Insightful

    Even when the suggestion of "would you like root on this internal box?" was put to me, my answer was always "No". I write code. Others test it. Admins deploy it.

    People specialize for a reason. If you want half-assed administration, give root to a developer. If you want half-assed code, let admins write software. If you want half-assed testing, have admins and/or developers do it.

    --
    For all intensive purposes, "whom" is no longer a word. That begs the question, "who cares"?
  24. Long Answer by IndustrialComplex · · Score: 2, Insightful

    If I found out a developer changed something in a system I tested without it going through the proper process...

    Let's just say I would be very interested to hear why they shouldn't go back and rerun everything again on THEIR dime. (at the very least) In fact, we DID do just that to someone who let a revision slip into their UUT because a developer felt it would fix something and make it perform better.

    It wasn't too expensive of a mistake, just $250,000 to rerun that portion of the test. Although that was just the physical cost of performing the test. I don't even want to know how much it cost in labor especially considering it was a 22 day test.

    Even if the change was removed, how do I know that without physically verifying checksums (do I even trust it anymore since their CM process is obviously flawed)

    --
    Out of modpoints but really liked a post? 1BDkF6TtmmeZ3yqXbz9yhdYVqRYnwFoXDj
  25. With Great Power... by Sandor+at+the+Zoo · · Score: 2, Interesting

    Of course developers should have some level of access to the production environment. No matter how good your test environment is, it's not going to match the live server in load, or what's in cache, or the concurrent access to some resource, etc.

    Our process was to have one person with access, investigating whatever problem via the SQL command line, or the Rails console (let the RoR jokes commence), with another person watching, to make sure they were doing select * and not update or delete. Even then we'd execute stuff in a transaction or sandbox so that we weren't making any permanent changes, although changes to memcache generally can't be rolled back so easily.

    I've seen admins, who are adamant that dev not be allowed to change anything, change psql configurations at a whim, crippling DB performance. And then blame dev for poor response times. That's so not cool.

  26. Comment removed by account_deleted · · Score: 4, Insightful

    Comment removed based on user account deletion

  27. Blue Cross Blue Shield Anyone? by Bourdain · · Score: 4, Interesting

    If I screw up, people can't get the correct pills. It's fun to make other people live dangerously. :-p

    FTFY. Well, for certain values of "pharmacy benefit management system". If your production hacking can botch scrip fulfillment, please say what company you're working for so I can try to avoid it like the plague it is.

    I don't know if Blue Cross Blue Shield has fixed this but, as of a few weeks ago (and this probably has existed for a while), living in EST has made it impossible for scrips to be fulfilled via insurance between midnight and 3AM. This is because, according to the late night pharmacist who is familiar with the issue, the servers are in PST and won't allow fulfillment from the anything but the "current day" regardless of time zone. Too bad the devs there don't understand time zones adjustments / UTC/GMT. Yet again, non-profit environments don't tend to attract the swiftest of folk in general.

  28. Dependent on size of shop/experience/environmet by ChronoFish · · Score: 2, Interesting

    For a one man show the answer is self evident.

    For a small web company developing "brochure-ware" - probably more efficient.

    For a small team it's ideal to have individual sandboxes - with one sandbox listed as "staging". Assign the lead developer to turnover code to production. Individual developers have access but are told not to touch anything. They will typically sift through live environment making sure it matches what is in their sandbox, looking at logs, etc.

    For a mid-size team you need one person for maintenance (which includes monitoring nightly builds, responding to code turnover requests, managing automated testing). Even more critical if the code you write is compiled, fragile, or highly sensitive. - Individual developers don't have access to the live box - maybe the team lead will.

    For large teams or small team "units" part of a large production shop : Several layers of "staging and testing" will exist. Code turnovers are mostly automated. Developers don't have access. Automated rollbacks are possible from a robust code management system.

    The key is discipline. If you find yourself modifying live code - you're not disciplined. It means you're not willing to insert logging code and would rather pollute the production environment. There should never be a need to copy from production back to a sandbox (that is what version control software is for!) And version control files should never live on the production server (i.e. in Subversion you never do a checkout of code on the production server - you do an export instead).

    Even with controls in place, there may be a tendency to "develop on production by proxy". Which means instead of re-creating the problem in development, the developer is saying "here try this, here try this, here try this". The team lead should recognize this and put a stop to it.

    -CF

  29. And the winner is ... by galego · · Score: 3, Funny

    ... in a split decision, vi wins. Oh wait ... wrong holy war!

    --

    Que Deus te de em dobro o que me desejas

    [May God give you double that which you wish for me]

  30. The more developers work in production... by Zenin · · Score: 2, Insightful

    The more developers work in production, the more they can ONLY work in production.

    I'm all for read access (the more eyeballs the better), but actual access to change anything is a train wreck. The devs will forget to check the changes in to the source repo, or they'll check them in differently (bad copy/paste), or they'll check them into the wrong branch/tag. Regardless the next release that goes out silently adds the bug back into production.

    And if developers think it's difficult to fully clone a prod environment configuration into dev now, wait until they try to do it after developers have been hacking on it directly for a while.

    Pretty soon every release is a train wreck requiring tons of post-release tweaking and hammering to get it in place. Every release is a stressful mess as you're all crossing your fingers because you really have no idea what you are actually changing and no way to find out.

    Just don't do it. Hire a good build engineer/release manager/software configuration manager that can sort out, automate, and track environment management well enough that yes, you can reliably clone an accurate representation of production in a matter of minutes. He'll cost you about as much as a good sr developer, but the savings across the board will easily dwarf his salary.

    --
    My /. uid is better then your /. uid
  31. Short answer, maybe. by Lord+Kano · · Score: 2, Insightful

    Long answer, it depends on the situation.

    I (and a couple of my coworkers) have access to production servers, but we don't develop on prod. End of story. We have other devs who do not have access to prod. Dev is for dev, prod is for prod and don't let anyone without the discipline to keep that rule have access to prod.

    LK

    --
    "Hi. This is my friend, Jack Shit, and you don't know him." - Lord Kano
  32. You say potato and I say potaeto by jhughe90 · · Score: 2, Interesting
    I've worked for 3 different Fortune 500s over the last 11 years, a defense contractor, a telco, and a bank. Big companies, big IT departments, many many custom-written applications.

    Reading a lot of comments it looks like there's a wide variety of definitions for some of the job titles and roles people are discussing here, so I'll list how I see them:

    * System Admin - Person(s) responsible for the hardware and supporting (OS, Web service, code language and client libs, JVMs, etc) software. They do not in any way support the applications running on said system and would be incapable of debugging or supporting an *application* problem even with a gun to their head. Most can only describe 2-3 sentences of what the applications even do. They do not report to or answer directly to the application teams. They also do NOT install application code.

    * Database Admin - Only want to address roles here. At every location, the actual application data stored in the database is NOT the role or responsibility of the admin. It belongs to the application team and any changes are their job and their accountability. The DBA only deals with schemas, packages, procedures, scripts, access roles and grants, etc. DBAs should NOT MANIPULATE DATA. Asking or allowing them to do so opens up a never ending blame game and is counterproductive. If you want to create some title and role within the application team where all data manipulation funnels through, that's the way to do it.

    * Implementation Specialist (Code Migration) - Trained monkeys who are supposed to follow a set of pre-delivered instructions for deploying application changes. In my experience their technical knowledge is limited, they cannot verify copy/paste correctly, and screw up (transferring ZIP files in ASCII instead of binary) more than they succeed. I don't feel this position is even necessary. The PROCESS is necessary and it can be performed by anyone, even a developer, as long as they switch their role hats before starting and are held accountable for accurately following the deployment instructions given.

    * Production support - They act both in a technical and relationship role, being the contact point between the customer (internal or external) and the application team when issues arise. Generally have read-only access to production. They are able to debug many problems and resolve a few, but definitely not all of them. They do not participate in any part of the development lifecycle processes.

    * Developers - Not going to discuss or debate any pre-production roles here since it's irrelevant to the topic. Developers are the only ones I would be confident could debug ANY problem. They are going to need some reasonable level of access to production, logging, or information if you want to have an application that can maintain high availability and recover quickly from any type of outage.

    If your definitions to these roles differ significantly, then my answer for your company's situation would change.

    Depending on the size of the application and the team allocated to run it, I've performed up to 4/5 roles and was pretty much the 5th as well since the Sys Admin only could barely squeak by supporting Windows 2k and definitely had zero knowledge of any of the supporting software. Are you going to hire someone to do 6 hours of work per month just to separate the responsibilities? Of course not. So the OP's generalized question is open to a million different interpretations because of all the different variables that weren't specified.

    My most recent application team recently went through our production lockdown after finally migrating over an application suite purchased from another company. Developers and Prod Support have read-only access. Database passwords used by the applications have been restricted down to just a couple individuals. When changes need to be made to either the application or data, an Emergency ID is checked out to a requesting individual with the appropriate access level,

  33. Two Words by Kirrilian · · Score: 2, Insightful

    Hell. No.

    I'm a developer as well as a sysadmin and I NEVER tweak anything in production and I have full access to it.

    I have an exact copy of my production environment for development and I do all my tweaking/test deployments there.

    In fact nothing gets deployed to production until everything has been checked in development.

    My previous job had dev/qa/prod environments where the devs had full access to development and it was so bad that we had to virtualize it for them just so we could revert back to a pristine snapshot whenever they jacked up the dev server.

  34. I think I speak for all programmers when I say by geekoid · · Score: 2, Funny

    No. Do not give developers access to the production machine ever, except me...just this once..

    --
    The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect