10 Dos and Don'ts To Make Sysadmins' Lives Easier

i am impressed by digitalsushi · 2010-12-23 08:01 · Score: 5, Funny

10 is an even number. There's no duplicates. None of them are filler.

I don't understand how this happened.

Did someone plan this before they wrote it? What gives?

--
slashdot: where everyone yells sarcastic metaphors to themselves to understand the issue

Re:i am impressed by mcgrew · 2010-12-23 08:06 · Score: 3, Insightful

Not only that, but I'd say almost all of them don't just apply to making admins of large networks' jobs easier, but to ALL software development for any computer use.
#11: NO DRM, dammit!

--
Free Martian Whores!
Re:i am impressed by kimvette · 2010-12-23 08:31 · Score: 3, Funny

No slashdot editors were involved in the production of the list. ;)

--
The Christian Right is Neither (Christian nor right). See: Matthew 23, Matthew 25, Ezekiel 16:48-50
Re:i am impressed by fuzzyfuzzyfungus · 2010-12-23 09:00 · Score: 5, Funny

There is a special place in hell for vendors who sell bulk licenses(50+ seats) for software whose DRM prevents automated installation, and requires that the IT office's picker of the short straw go around and type in a gigantic license key on all machines.

If a hole has to be punched in the firewall for the online activation/authentication step; because they were just too damn special to use SSL on a standard port like everybody else, that special place in hell is filled with screwworms.

If there is a hardware dongle component(that looks exactly like a USB flash drive, and thus wanders accidentally if not carefully hidden) and requires a new purchase order and a nasty pile of cash to replace, that special place in hell automatically inserts bullet ants into the scrotum of anybody placed there.
Re:i am impressed by mlts · 2010-12-23 14:05 · Score: 3

To elaborate on #11:
#11: No DRM. The BSA would turn any company inside out and have their entrails for Christmas lights if they are caught pirating even a single copy of WinRAR. Businesses who value being open are not going to be pirating anyway. So why add DRM which removes value from the product?
#12: Ability to rebuild the product if it gets corrupted. Have it as an option to have the .cab/.bz2/RPM/.deb/etc. file stored in a directory, including patches. This way, if there is concern about registry/NetInfo/ODM/whatever corruption, it shouldn't be hard to have the product reinstall itself.
#13: An uninstaller. Shit happens and crap gets in a half installed state. It would be great to be able to have a utility that completely removes any and all traces of a program, move aside/archive config files, and rename the config directories. This way, if a config document is causing problems, it is out of the path.
#14: Ability to send reports to a third location, via E-mail or whatnot. This way, either by system logs or E-mail, there is proof that a package was installed or maintained, and not just the install mechanism; but from the application itself.
#15: Ability to install as a non-administrative user if the functionality is relevant (this wouldn't be doable for system utilities, but a Web browser, yes.)
#16: Ability to have a way to completely block installs of the product.
#17: All executables are signed. Not just with the OS signing mechanism, but either with a manifest, or PGP/gpg detached signatures.
#18: A "master console" program that can check for updates, store them, check installed clients if the update is needed, push out updates (either by a program or through the OS's install mechanism), perhaps even allow for removal en masse.
I just wish more operating systems had not just an install mechanism (msiexec, rpm), but an update mechanism from repos (yum, macports). This would make life a lot easier, especially if it can be configured from custom repositories so enterprises can have their own mirrors.

Holy crap, a slashdot first by subreality · 2010-12-23 08:09 · Score: 4, Interesting

It's a top-10 list that actually has insightful information on how to do software right, instead of being a random collection of ten things to make a fluff article. Bonus points for being things that I actually agree with.

Re:fucking apostrophes, how do they work? by Anonymous Coward · 2010-12-23 08:09 · Score: 3, Funny

"sysadmins' lives" is correct. It is referring to the lives of sysadmins.

Unless, of course, you are referring to the sexual practices of punctuation marks. Then, I don't know.

The Practice of System and Network Administration by XanC · 2010-12-23 08:10 · Score: 5, Informative

The article author is also behind The Practice of System and Network Administration, truly an excellent text into the practicalities of work in IT.

#11: Meaningful error messages by eln · 2010-12-23 08:14 · Score: 5, Funny

If you want to make a sysadmin's life easier (as if any programmer ever wants to do that), you can start by making your error and status messages 1.) plentiful and 2.) easy to understand. Also, provide several logging levels so we can drill down as needed, and make sure the logging levels are meaningful. Too many programmers put just two log levels: one which shows nothing useful, and another that spews out indecipherable hex dumps of every call it makes.

Face up to the fact that no matter how awesome your software is, it's going to fail. Not only that, but it's going to fail in ways you never thought possible at the worst possible times. Make sure we have enough information to figure out what happened. Otherwise, stuff like this happens:

Program: *crash for no apparent reason*
Sysadmin: Why did you crash?
Program: Because something went wrong.
Sysadmin: What went wrong?
Program: Something.
Sysadmin: I need more detail. Increasing log level.
Program: Something bad went wrong.
Sysadmin: I need more than that. Increasing log level again.
Program: Fuck you. Here's a 16GB hex dump of system memory. Figure it out yourself jackass.
Sysadmin: *picks up a crowbar and goes off to find the programmer*

Re:#11: Meaningful error messages by Monkeedude1212 · 2010-12-23 08:36 · Score: 5, Insightful

That reminds me of a Web Developer I once knew.
He said he didn't bother putting try/catches around certain standard things (Like Database connection opening, closing, transactions, etc) - because if anything ever went wrong it was easier for the user to take a screenshot of the Stack Trace if and when it went wrong from the Webapp. Said it took too much time to build in proper exception handling and error messages.
He said that the user experience basically means nothing if your application doesn't work, so when something doesn't work, don't bother making it pretty.
He no longer works here, though I can't imagine why.
Re:#11: Meaningful error messages by Jaime2 · 2010-12-23 09:36 · Score: 3, Interesting

Of course, good error handling is best. But, no error handling is usually better than cargo-cult error handling that displays a pretty message, but doesn't record the error detail anywhere. Very few things bother me more in a code review than somebody who put in the extra effort to ensure that an error message can never be found, I would have rather they simply skipped it.

click-wall. by nblender · 2010-12-23 08:21 · Score: 5, Insightful

Don't make me use a real browser to click all the way through your site, make me agree to a stupid set of conditions for using the software, and then provide my browser with a cookie that it can subsequently use to download your software; when my browser is on one continent and the machine that wants the software is on another continent; you ass-fucks...

That's plain ASCII to you... by sl149q · 2010-12-23 08:23 · Score: 5, Insightful

> DO have a configuration file that is an ASCII file, not a binary blob.

And by ASCII we mean something that can be edited by any editor.

XML is the equivalent of a binary blob when you are up to your ass in alligators trying to get things working again with minimal tools available.

I disagree on the GUI by Zarhan · 2010-12-23 08:33 · Score: 5, Interesting

...if the GUI is well done and complements command line.. Some tasks actually ARE much better performed with Point&Click.

One example of a "good" GUI that I use a lot is the ASDM for Cisco ASA firewalls. Most of the simpler admin tasks are in fact *faster* via ASDM. If you have your network objects all properly set up and you need to add a firewall rule, it's far simpler to select it from a list (actually, in this case it's a combobox - just type first few letters to filter your choices and then click) than typing that stuff in manually. Packet tracer to check the rules is much nicer to use via the GUI. Setting up VPN profiles is simpler via ASDM. Handling network object groupings is simpler via ASDM.

Editing access-lists, doing routing configuration and most of the more "rudimentary" tasks are still something I do via command line, though.

Re:I disagree on the GUI by Qhartb · 2010-12-23 08:53 · Score: 5, Insightful

I think it's more a matter of not making a GUI instead of a command line interface. Making both is, of course, perfectly fine, so long as the CLI is fully-featured and reasonably usable.

Re:fucking apostrophes, how do they work? by blind+monkey+3 · 2010-12-23 08:35 · Score: 5, Funny

I thought they just followed Jesus around.......

--
BM3

Windows CAL cost by tepples · 2010-12-23 08:36 · Score: 5, Informative

From the article:

8. [...] Similarly, use the operating system's built-in authentication system and standard I/O systems.

This can be a bad thing if your application runs on a platform whose built-in authentication is a nickel-and-dime revenue stream for the platform's publisher. Microsoft Windows Server is like this: each user account on the built-in authentication system requires a Client Access License.

Re:Windows CAL cost by quacking+duck · 2010-12-23 12:42 · Score: 3, Insightful

This is why I hate having to deal with Windows on the side. In this aside about user CALs, there's three different takes (so far) on when you need a Windows CAL and when you don't.
I got sick of researching Windows Small Business server when I read their FAQ, and the section on licensing was longer than all the other sections combined!

Amendment to #2 by c++0xFF · 2010-12-23 08:42 · Score: 4, Insightful

Feel free to make a GUI for the administrative interface, but not at the expense of an underlying CLI.

There are two ways to do this: have your GUI call the CLI when necessary, or use a common API behind both. Other methods will lead to bitrot in one of the interfaces, most likely the CLI.

GUIs are fine and even enjoyable to a certain extent, but the author is right that the CLI takes priority.

Re:DON'T make the administrative interface a GUI by Chris+Mattern · 2010-12-23 08:50 · Score: 3, Insightful

GUIs are (sometimes) better when you want to do something *once*.

They really suck when you have to do that same thing hundreds of times. Which sysadmins do. On a regular basis.

Re:Channeling Philosoraptor by greed · 2010-12-23 08:58 · Score: 3, Informative

Which is a perfect example of a terrible error message. And there's plenty of bad examples like that to crib from, too. (In your particular example, sure, you'll have the "at line XXX" so someone can start digging around in the code... but that's something only suitable for quick-and-dirty hack scripts.)

What you need to know is WHAT, WHERE and HOW. You know WHO (the program), and are trying to figure out WHY. I've often had to resort to strace -etrace=file to find out "What file couldn't be opened? Why couldn't it be opened?"

So, sticking with perl:

open FILE,"filename.txt" or die "Cannot open \"filename.txt\" for reading--$!\n";

Your example will give only the errno, which is what I'm calling HOW [it went wrong]. WHAT went wrong is the "open for reading". WHERE it went wrong is "filename.txt".

I generally wrap such calls with a library; that way, I don't have the error handling littering up every call-site. But if you're using an exception-oriented language, we need the SAME INFORMATION once it turns into an error message!

Oh yeah: For error recovery code, files can't be opened for more reasons than just, "It's not there." You can try all you want, but if (say) the filesystem has gone read-only due to a disk controller failure resulting in journal abort, you might want to do something different. That one's strictly hypothetical, haven't had it happen in over a week--ever since I replaced that faulty cable....

Re:fucking apostrophes, how do they work? by daremonai · 2010-12-23 09:22 · Score: 5, Funny

"sysadmins' lives" is correct. It is referring to the lives of sysadmins.

No, I'm sorry, it is not correct. Sysadmins don't have lives.

Re:1 Do for being a user by DragonWriter · 2010-12-23 09:54 · Score: 3, Insightful

No.. the users are the ones who can't figure out how to use the system, that's why there's an admin.. if users knew what the fuck they were doing, we wouldn't NEED sysadmins in the first place.

If the system was designed properly for the userbase, so that users could use the system, you'd still need sysadmins to administer the system, which is notionally what sysadmins are for (hence the name.)

You wouldn't need sysadmins to take breaks from administering the system to handhold users through basic usage tasks, but then, that's not really the point of a system adminstrator in the first place.

Its an acm.org article ... by perpenso · 2010-12-23 09:56 · Score: 3, Informative

10 is an even number. There's no duplicates. None of them are filler. I don't understand how this happened. Did someone plan this before they wrote it? What gives?

Its an acm.org article. Not only did the author probably plan, re-read and revise the article before submitting it but a technically knowledgable editor probably read it and may have offered useful and insightful suggestions. Now there may not have been a formal peer review process but the editor may have also had one or more experts in the field read it and offer comments and suggestions.

Yes the above seems an archaic process but consider that the acm is full of old people who had experience publishing back when things were done with dead trees. ;-)

Eventlogs by Spad · 2010-12-23 10:37 · Score: 4, Insightful

In reference to point 8, this is something I wrote I while ago after dealing with several Windows apps that either horribly abused the Eventlog or refused to use it entirely:

DO create your own event message DLL(s) where appropriate to avoid your events looking like this
DO log important errors and warnings. Application failures, communication issues, invalid configuration data and the like. Things that will help administrators to troubleshoot issues that may occur.
DO make your logs intelligible to someone other than you. Not having developed the application myself, I have no way of knowing if “Invalid foo in bar. More cheese needed at 0×8003387 means that someone’s made a typo in a config file somewhere, a firewall rule needs changing or that the application doesn’t support running during the vernal equinox.
DO throttle your logging. Don’t log the same error every second, it’s pointless, generates a lot of “noise” and – much worse – forces other, potentially useful events out of the log’s retention.
DO make your logging level easily configurable by the user and DO set a sensible default.
DON’T log every single informational or debug event that your application generates. Nobody gives a shit that you successfully checked a message queue and found it was empty; either use a Custom Event Log or a log file in the application directory if you want to record that kind of information.

#1 big dont by MrLint · 2010-12-23 10:40 · Score: 5, Insightful

Do not assume that your software is running with elevated access... (root/administrator)

Re:It's noce to know by swordgeek · 2010-12-23 11:05 · Score: 4, Interesting

A GUI is NOT fine for administering a broken system over a slow link to the other side of the world.

I used to remotely administer a set of servers in the middle east. The bandwidth was tiny, and the latency was insane. I would type a command out, then take a sip of coffee while waiting to see it displayed before hitting "enter." I had to use a GUI for one application, and it took over 40 minutes to fire up and display on my machine.

Mandatory (and well-designed) GUIs should be for using an application, not administering or installing it.

--

"People who do stupid things with hazardous materials often die." -- Jim Davidson on alt.folklore.urban

Re:1 Do for being a user by skarphace · 2010-12-23 11:19 · Score: 4, Interesting

I wonder if there are forums on the Web where plumbers shit all over eachother.

--
Bullish Machine Tzar

10 years of personal experience... by CAIMLAS · 2010-12-23 12:58 · Score: 3, Insightful

1. DO have a "silent install" option.

Silent install is nice, but so is an intelligent install, or a well thought-out, correctable upgrade process.

These systems do it well:

Debian and RedHat derived; Windows, post-2003. OS install is still a bit of a bitch with Windows. The upgrade process for MediaWiki is also stupid easy and effective (basically: untar new tree and run db alter scripts).

Poorly:

FreeBSD, and, really, most BSDs, are horrible for upgrading. I suspect OS X is similarly stupid when it comes to "promptless installs". Cacti, likewise, is awful.

2. DON'T make the administrative interface a GUI.

A useful amendment to this is: don't make the administrative interface shitty. GUI is fine, as long as I can leverage it progmatically. CLI tool is great, as long as it's fucking documented and not obtuse.

Case in point (in opposition): MegaCLI, for MegaRAID cards. Absolute. Shit.

3. DO create an API so that the system can be remotely administered.

An API is great, and allows for programmers to dig in and extend the product. I'm thinking of VMWare, XenServer, and Virtualbox right now. The latest Windows versions with PowerShell and the management consoles are not a bad combination of usability/power/utility.

Most sysadmins don't have the time to dig into the API, though, so a good initial tool that isn't terribly dense or limited in functionality is a must (XenServer, please improve your shitty-useless UI on xsconsole and XenCenter; I'd like a little more access to my VM disks without digging into lv/pv commands, too).

4. DO have a configuration file that is an ASCII file, not a binary blob.

No argument here. Likewise, configuration should be human-readable and not have vague incantations.

Good: samba, and all tools which use similar configuration syntax.

Bad: sendmail is the worst offender I can think of at the moment. I'm sure all the djb* stuff, too.

5. DO include a clearly defined method to restore all user data, a single user's data, and individual items (for example, one e-mail message). The method to make backups is a prerequisite, obviously, but we care primarily about the restore procedures.

Good: any UNIX system and it's $HOME; modern Unix MTAs like Courier.

Bad: Cyrus IMAP. Pretty much any tape archive system comes close to frustrating as hell. Windows still has a long way to improve until it's capable of Unix-style $HOME utility.

6. DO instrument the system so that we can monitor more than just, "Is it up or down?"

WMI is great. SNMP on Unix/Linux hosts, not so much, due to the configuration and divergence involved. Most OEM Linux/Unix based machines or systems (XenServer) are relatively shitty in this regard, too.

7. DO tell us about security issues.

Telling us about them is great, but upgrading these things are the most important, time-sensitive upgrades we need to make, so they should also be the easiest. We should not have to break two-three different things just to get the upgrade done.

BSDs are bad about this; horrible, even. The time consumed by a simple upgrade is enormous.

Linux is mediocre, but better than most.

Windows, in this case, "just works". Except when it doesn't (though I'd argue the degree is no greater than, say, the Linux upgrade process). Your biggest cost will be when it installs something you've explicitly told it not to (*cough* new IE versions) or in bandwidth and/or uptime requirements.

8. DO use the built-in system logging mechanism (Unix syslog or Windows Event Logs).

Something which doesn't do this isn't even worth looking at. It's yet one more thing to manage and uses exponential

Addition: make your logging sensible, please. I don't want to see a full trace of everything in the logs and not be able to configura

--
~/ssh slashdot.org ssh: connect to host slashdot.org port 22: too many beers

Re:1 Do for being a user by jombeewoof · 2010-12-23 13:24 · Score: 4, Insightful

I disagree,

take any person of reasonable intelligence and place them in an unfamiliar settting. They become retarded.
The fact that they have been in front of that unfamiliar device for 20 years means they just don't care.

Give me a user who cares to familiarize them-self with the system and 6 months, I'll give you a half decent sysadmin. At least better than half of the paper certified MCSE's I've had the pleasure to work with.

--
Linux Zealots: Smarter than Mac Zealots, but still zealots.

Slashdot Mirror

10 Dos and Don'ts To Make Sysadmins' Lives Easier

30 of 246 comments (clear)