Undocumented Open Source Code On the Rise

Comment removed by account_deleted · 2008-06-15 04:47 · Score: 3, Insightful

Comment removed based on user account deletion

Source code is its own documentation by mangu · 2008-06-15 04:47 · Score: 4, Insightful

I'd rather have the source code which I can read and try to understand than an executable file alone.

The only reason why we don't see an article "Undocumented Commercial Software On the Rise" is because the public cannot see how badly documented the commercial software is.

Re:Source code is its own documentation by mikael_j · 2008-06-15 04:56 · Score: 1, Informative

I disagree, I tried changing some stuff in the rTorrent source code and noticed that sometimes the only comments/documentation to be found was the GPL notice at the beginning of each file, I never did manage to make the changes I wanted (but I got kind of half-way there at least).

/Mikael

--
Greylisting is to SMTP as NAT is to IPv4
Re:Source code is its own documentation by jps25 · 2008-06-15 05:40 · Score: 5, Insightful

I disagree. This isn't about closed vs open source, this is about decent programming. Comments in code are neccessary and a minimal requirement for any project. At least add one line to any function explaining what the function does, what its input is and what it returns. This isn't so hard and it won't kill you, but it'll make life easier for you and anyone else who will have to deal with the code later. It also makes finding errors easier, as your code may not be doing what your specifications say it should do. I don't understand this hatred for comments and the "code-is-its-own-documentation"-philosophy. I really don't. <code> #include <iostream> #include <algorithm> #include <iterator> #define ch_ty(ty) std::istream_iterator<ty>::char_type #define tr_ty(ty) std::istream_iterator<ty>::traits_type #define cin_iter(ty) std::istream_iterator<ty, ch_ty(ty), tr_ty(ty)>( std::cin ) #define void_iter(ty) std::istream_iterator<ty, ch_ty(ty), tr_ty(ty)>() int main( int argc, char *argv[] ) { while ( (cin_iter(size_t)) != void_iter(size_t) ? ( std::cin.unget(), argc += *cin_iter(size_t) ) : ( printf( "\nsum: %d\n", --argc ), system("exit") ) ); } </code> Perhaps easy to understand, but one comment-line would save you minutes wasted understanding and reading it. or <code> #include <stdio.h> int v,i,j,k,l,s,a[99];main(){for(scanf("%d",&s);*a-s;v=a[j*=v]-a[i],k=i< s,j+=(v=j<s&&(!k&&!!printf(2+"\n\n%c"-(!l<<!j)," #Q"[l^v?(l^j)&1:2])&&++ l||a[i]<s&&v&&v-i+j&&v+i-j))&&!(l%=s),v||(i==j?a[i+=k]=0:++a[i])>=s*k&& ++a[--i]);printf("\n\n");} </code> Well, obviously obfuscated, but one comment and it's immediately clear what it does.
Re:Source code is its own documentation by Anonymous Coward · 2008-06-15 06:19 · Score: 2, Insightful

Ha! You better believe it.

Recently, we bought a $middleware-thingy$ needed for a specific client installation..

Cost: 2000$.

Documentation and support: zero (apparently the original coder had left [without documenting anything], but the company keeps on selling licenses for something they have no idea of how works).
Re:Source code is its own documentation by Coryoth · 2008-06-15 06:38 · Score: 1

It also makes finding errors easier, as your code may not be doing what your specifications say it should do. In this day and age, if the code has some (suitably formatted) comments regarding what it is supposed to do, you can even get some pretty useful tools to automatically check the code against the specification. It's not bullet-proof, but it can catch a lot of subtle bugs that might get overlooked, particularly in subtely incorrect {use of|calls to} code that you didn't necessarily write yourself. See ESC/Java2 or Spec# to see what I mean.

--
Craft Beer Programming T-shirts
Re:Source code is its own documentation by Anonymous Coward · 2008-06-15 07:31 · Score: 0

The article is *not* talking about documentation in the code, but rather it is talking about documenting the fact that project A has code from an open source in it.
Re:Source code is its own documentation by Splab · 2008-06-15 08:00 · Score: 4, Insightful

I keep hearing people pro open source code say "I can check it!" Well can you? Have you done so - in a project spanning more than a few thousand lines of code? Just because the code is there to see doesn't actually mean its doable to waddle through it.

I'm not for either open source or proprietary code, my employer pays me money to produce code, what he does with it is his business, but what I do have, is experience using both proprietary code and open source code - both models have pros and cons.

With proprietary code there are someone I can call and they are by contract obliged to fix problems within a certain time frame. One particular instance is a database we are paying license fees for, I will not name them but to this date I have found more than 10 vectors that causes crashes. Those problems have been addressed by the vendor in a timely manner (I have yet to find bugs that would be show stoppers, but some did require annoying workarounds). With OSS we don't have this possibility, yes, we can log a bug in whatever bug tracker they use and hope someone will address our issue, but we have no guarantee - also in my experience logging a bug with OSS developers can be quite a daunting process, people can have some serious egocentric issues, while this of course is also applicable for proprietary software, there are someone higher in the food chain who can be called.
With OSS we of course got the good fortune of being able to go through the source code and try to fix the code ourselves... right?
Have you ever even considered just how bloody huge the code base is for something like a database? Tracking down a bug, well yes, the gdb can tell you where the program stopped working, but unless you have some really really good code reading skills and are up to date on everything that happens algorithm wise you have close to zero chance of fixing anything without causing major problems.

Also as a developer I got enough to do creating my own applications, I simply do not have the time to dig through thousands of lines of code every time something new breaks. Yes open source is nice, small projects are easy to help get along, fixing small bugs, but at some point the project grows so big that anyone using it needs to have someone they can call at 4 am in the morning to help them.

Oh and just because some software is proprietary it doesn't mean you don't have access to the source code, even at Microsoft you can buy access to the source.

We got builds with debug flags from the database vendor because we cannot share our database with them, therefore stack traces etc. has to be generated locally and shipped to them. (yes this is a bit annoying, but having sensitive records out in the wild is a tad more problematic).

I don't pick OSS over proprietary or visa versa, I pick what ever tool fits my needs.
Re:Source code is its own documentation by Xtifr · 2008-06-15 09:27 · Score: 4, Informative

I keep hearing people pro open source code say "I can check it!" Well can you? Have you done so - in a project spanning more than a few thousand lines of code? Yes, all the time. Not every line of code, of course, but with my Debian Developer hat on, I have at least browsed through the vast majority of the code for, e.g. tcl/tk, and at least skimmed the code for hundreds of other projects. And even with my day-job hat on, I have done a lot of ad-hoc browsing through random open-source projects that we're either using or thinking of using. Evaluating the code base is, or should be, a big part of deciding whether to use (or continue to use) a given project or library.

You seem to be suggesting that the only way open-source can be safe or useful is if everyone evaluates every line of code they use. That's silly, of course. Open source can be safe and useful as long as enough people evaluate enough of the code. And given the number of random patches (some good, some bad) that the Debian project alone receives on a daily basis, I can assure you that a lot of people our there are reading a lot of code.

Of course, I don't personally need to evaluate every line of code in a project as long as I know (and I do) that there are others out there like me who at least do spot inspections. A little pro-active inspection up-front to give yourself at least a basic idea of how the code works can save a lot of grief further on down the line. I count it time well spent.
With proprietary code there are someone I can call and they are by contract obliged to fix problems within a certain time frame. That has nothing to do with the code being "proprietary", and everything to do with having a support contract. Do you imagine that companies using open-source don't have support contracts?
Have you ever even considered just how bloody huge the code base is for something like a database? What does that have to do with anything? I've seen tiny projects that were incomprehensible messes of tangled spaghetti code, and huge projects that were clearly and cleanly laid out, well organized, and a piece of cake to maintain, support, study and evaluate. Frankly, I'll take the latter over the former anyday. It's not about the size of the code base, it's about the structure and organization.
Also as a developer I got enough to do creating my own applications [...] Ah, well if you're the kind of developer who works in complete isolation on your own projects with no interaction with anyone else, I can understand your point of view. But that kind of development is pretty rare these days. Most of us work on teams, and evaluating other people's code is an almost-daily part of the job. The majority of that, at least in my case, involves code reviews (formal or informal) for other people in the company, but our code reviews are by no means limited to in-house code. We take more care with our own code because we know that we're the only eyes on it, but that doesn't mean we're foolish enough to assume that all third-party code is perfect and flawless.
Re:Source code is its own documentation by Bloater · 2008-06-15 10:17 · Score: 4, Funny

I disagree. This isn't about closed vs open source, this is about decent programming. Comments in code are neccessary and a minimal requirement for any project. But far less important than most people realise. Most code should be self documenting.
At least add one line to any function explaining what the function does, what its input is and what it returns. Isn't:template<typename InputIterator> typename iterator_traits<InputIterator>::value_type sum(InputIterator begin, const InputIterator& end)
enough?
I don't understand this hatred for comments and the "code-is-its-own-documentation"-philosophy. I really don't. If your code is unreadable, then it is bad (see your example). Oh wait... I think I just had a "Whoosh" moment... I did, didn't I? Somebody mod parent up +1 Funny
Re:Source code is its own documentation by Bloater · 2008-06-15 10:35 · Score: 2, Insightful

I keep hearing people pro open source code say "I can check it!" Well can you? Have you done so - in a project spanning more than a few thousand lines of code? I've checked a few lines here and there that interest me, other people check the lines that interest them. An awful lot of stuff gets checked.
With proprietary code there are someone I can call and they are by contract obliged to fix problems within a certain time frame. But that doesn't mean they will and it doesn't mean you get to sue them, and if you do win in court, it doesn't mean that your business is still viable. This is the big proprietary fallacy "There is someone to blame", you can blame all you want, meanwhile you're on the dole because you /can't/ get stuck in, fix it, and carry on your business. With Open Source you can be sure that the worst case isn't financially devastating.
With OSS we don't have [the vendor fixing things on demand], yes, we can log a bug in whatever bug tracker they use and hope someone will address our issue, but we have no guarantee. I know vendors that just tell you its a feature and that crippling your business is what you paid for. They still make money because not everybody hits the problem but you can't switch because your budget is already spent on this "solution" and hiring more employees is a different budget and never gets accounted as the TCO of the solution in question. A small business will notice but a small business will already be dead by this point.
With OSS we of course got the good fortune of being able to go through the source code and try to fix the code ourselves... right?
Have you ever even considered just how bloody huge the code base is for something like a database? There are no serious OSS databases where you will have a problem with bug filing or serious bug fixing. Since there is only one serious OSS database (PostgreSQL), this is quite easy to determine. PostgreSQL has a great reputation and the chance of you having to fork out on demand for a bug to be fixed is as slim as with the proprietary vendor. You can also buy a support contract from many of the core developers. OSS and money are not mutually exclusive.
but at some point the project grows so big that anyone using it needs to have someone they can call at 4 am in the morning to help them. So get a support contract on your OSS software and call them.
Oh and just because some software is proprietary it doesn't mean you don't have access to the source code, even at Microsoft you can buy access to the source. Buy access? Even after paying for the support contract? surely you should be able to pay for the support contract and have the source for nothing extra, along with the right to extend the functionality and a vendor receptive to the idea of including your work to take the support burden on themselves under you support contract?
Re:Source code is its own documentation by nasch · 2008-06-15 15:26 · Score: 1

You both have good points, but what I took from the GP is that this rejoinder from the open source community that you can always just fix the bugs you find is an oversimplification. Yes, it might be possible to fix the bug. But first you have to wade into an unfamiliar code base to find the bug, and I don't care how well designed or well documented the code is, that takes time. Time that has already been spent by others who you could maybe pay to do it for you (as parent said, this has nothing to do with whether the code is open). Then, as GP said, you have to be sure that you're not breaking other things. More time. Then you have to worry about what happens when the next version of the software comes along. Did they incorporate your patch? If not, what do you do, repatch and retest to see if your patch still works with the new version? More time. In the end, yes it is possible to fix things in an open source package, but IMO it is generally far from practical. OSS certainly has advantages, and I think it's great, but often the ability to edit the code yourself is, ironically, not one of them.
Re:Source code is its own documentation by tibman · 2008-06-15 16:11 · Score: 1

The article doesn't really state that open source isn't documented(commented) very well. Just documented less. Even if their figures are correct, it's still overkill as far as i'm concerned. It's 3 comments in each 10 lines of code on average.

As you said "At least add one line to any function explaining what the function does, what its input is and what it returns." If 30% of your source is directly commented, you've probably done more than enough.

These numbers are probably wrong though.. they say it's 70% undocumented now compared to 30% in 2006? Rarely do i have to time to put comments over 70% of the program. But i admit, i don't even come close to representing a sample of programmers. So i really can't say what's comment overkill and what's minimalist. But that is a huge gap to have crossed from 2006-2007. Not to mention zero of their figures are exact. Especially the "audited between 300M to 500M lines of code" part. That reeks of some figures pulled from someone's ass.

--
http://soylentnews.org/~tibman
Re:Source code is its own documentation by Anonymous Coward · 2008-06-15 16:28 · Score: 0

Nice trolling...
Re:Source code is its own documentation by Kjella · 2008-06-16 00:02 · Score: 1

The only reason why we don't see an article "Undocumented Commercial Software On the Rise" is because the public cannot see how badly documented the commercial software is. No, because to rise it'd have to be documented to begin with...

--
Live today, because you never know what tomorrow brings
Re:Source code is its own documentation by Splab · 2008-06-16 01:28 · Score: 1

Thanks for trolling me, as always I'll feed you a bit, can't have you starving can we?

At no point do I talk about safety, thats the OSS mantra, if you doubt its secure you can always read the source. If YOU think thats what I'm addressing you are just reinforcing my point.
Ah, well if you're the kind of developer who works in complete isolation on your own projects with no interaction with anyone else, I can understand your point of view. But that kind of development is pretty rare these days. Most of us work on teams, and evaluating other people's code is an almost-daily part of the job. The majority of that, at least in my case, involves code reviews (formal or informal) for other people in the company, but our code reviews are by no means limited to in-house code. We take more care with our own code because we know that we're the only eyes on it, but that doesn't mean we're foolish enough to assume that all third-party code is perfect and flawless.

At no point do I make such a claim. What I DID say was that we have enough work to do with our own code, around here developers are hard to come by so we strangely enough value our own projects over external peoples project. If something doesn't work we need someone we can call.

But you are obviously OSS fanatic to the point of being ready to blow up the evil corporations - might wanna seek help on that.
Re:Source code is its own documentation by Anonymous Coward · 2008-06-16 06:46 · Score: 0

You can pay companies with developers who specialize in certain open source projects to fix the bugs for you.

It's called "Open Source Enterprise Support".
Re:Source code is its own documentation by jabjoe · 2008-06-18 01:09 · Score: 1

This is bait, but hey, it's tasty even with the hook. The reason open source will win out is because you really can fix it. It's not work related, but a good example. I was processing some video and the java app doing it crashed out telling what line the problem was it was a simple -1 index problem. I could just stick a index check in and carry on (the bad index was due to bad data I was running the app to fix!). A good proportion of crashes are something silly like that, or a null pointer being used/freed, any programmer can see it and fix it with out having to get to know the whole code base. I get crashes in closed software at work some times, and I can't do a thing about it! I can submit it as a bug to the vendor, and if I'm very very lucky I will get a patch, but most of the time your told it will be fixed in the next version and maybe a work round. I did programmer support for a middleware API for years, I use to do exactly that to customers myself. If there is a work round, ok, unless it's not really workable, but if not, what then? I'm willing to fix it but unable to because I don't have the source. You then get into "Are you willing to pay mega bucks to get it fixed?", again, I use to have to do this to customers myself. With open source you can fix it yourself if need be, or pay to get it done just like you would in closed source. So basically with open source you have all of the safety net of closed source (if you pay for support that is) with the extra advantage that you can help yourself.

Obligatory Slashdot response... by Anonymous Coward · 2008-06-15 04:47 · Score: 0

Well it's better than closed source where you can't even audit the code...

(except nobody is accountable for open source, so you can't even sue the culprit)

Re:Obligatory Slashdot response... by Devin+Jeanpierre · 2008-06-15 04:52 · Score: 1

Yes you can. Get a code auditor, make them sign an NDA, done! In fact, that's exactly what they were doing.

--
-Devin Jeanpierre

Avoid projects with one developer by Animats · 2008-06-15 04:48 · Score: 2, Insightful

The original article is an ad for a service that looks at code for you. But it's a real problem.

A basic problem with open source is that once you get beyond the top 50 or so projects, the quality is usually crap. Look at the source from a few random projects on SourceForge. There aren't that many real "community" projects, where multiple programmers are working on the same code. The long tail isn't very good.

Re:Avoid projects with one developer by Rakshasa+Taisab · 2008-06-15 05:08 · Score: 1

Perhaps you shouldn't be looking at SF for the top open source projects? Having your own infrastructure is a good indication that you might be more dedicated to your project, than making a SF account does.

--
- These characters were randomly selected.
Re:Avoid projects with one developer by jgrahn · 2008-06-15 05:34 · Score: 2, Informative

A basic problem with open source is that once you get beyond the top 50 or so projects, the quality is usually crap. Look at the source from a few random projects on SourceForge. There aren't that many real "community" projects, where multiple programmers are working on the same code. The long tail isn't very good.

You have a point, but s/the top 50/the top 1000 or so/. You have to count various C libraries, and things like the Perl modules at CPAN. Many of them are in wide use, and should be trustworthy.
Also, I'm not so sure that community projects are generally better than single-person projects -- if you don't count crap projects which only the author can love.

70% Undocumented, huh? by Devin+Jeanpierre · 2008-06-15 04:49 · Score: 5, Insightful

How do you measure something like how well things are documented with a percentage? Some code simply doesn't need documentation. Other code needs plenty. Is 0% a 1:1 relationship between lines of code and lines of comments? That whole thing seems a bit strange. They could certainly back it up if they wanted to, but that'd be too much effort.

--
-Devin Jeanpierre

Re:70% Undocumented, huh? by Aphoxema · 2008-06-15 05:02 · Score: 1

Regardless of how meaningless a percentile is, it is apparent that many programs, especially ones included with *buntu-desktop packages, just don't tell you enough.

--
"Most people, I think, don't even know what a rootkit is, so why should they care about it?"
Re:70% Undocumented, huh? by TheRealMindChild · 2008-06-15 05:21 · Score: 1

I hear ya. I read the following line and was completely flabbergasted:

Of that 50% of open source code, 70% was undocumented

When it told you up front that they were doing line counts, "70% was undocumented" tells me that a little under every three lines has a comment. If you ask me, that sounds like an awful lot of documentation.

--

"When life gives you lemons, don't make lemonade. Make life take the lemons back!" -- Cave Johnson
Re:70% Undocumented, huh? by Anonymous Coward · 2008-06-15 07:44 · Score: 1, Informative

If you have ten projects, and two use open source and the others don't, then if your records indicate one project uses open source, your records are 50%. If you don't record the use of open source, you are 0%. Note: this has nothing to do with how well the *code* is documented, it is how well the sources for code are documented.
Re:70% Undocumented, huh? by civilizedINTENSITY · 2008-06-15 07:54 · Score: 3, Informative

But its not per line, but per application. If they used open source and documented "we used code from project whatever", that counts as one case of documented code.
Re:70% Undocumented, huh? by civilizedINTENSITY · 2008-06-15 07:55 · Score: 2, Informative

No, it means that in 100 projects that used open source code, 50 of the projects documented that they had code from a certain open source code base.

Statistics ... by tomhudson · 2008-06-15 04:50 · Score: 3, Insightful

Of that 50% of open source code, 70% was undocumented.

They talked about looking at 300m LOC. I'd hope 70% was "undocumented". 70% of most code is just common-everyday stuff that doesn't NEED to be documented in the sense that comments are completely wasteful. It's the "glue code" that needs to be documented, and the non-intuitive stuff, and stuff that is done for a reason that, on first glance, looks like the writer had a brain fart, but, in this special case, makes sense, or "corner case" situations.

Do *NOT* "insert comments like "for (i=0; i

Re:Statistics ... by tcopeland · 2008-06-15 05:11 · Score: 3, Interesting

> 70% of most code is just common-everyday stuff that doesn't
> NEED to be documented in the sense that comments are completely wasteful.

So true! Rather than this code:
# Finds the most recent orders for the passed in person def get_rec(p) # blah blah end
I'd much rather see an intention-revealing method name (hat tip Marcel Molina):
def find_recent_orders_for(person) # blah blah end

I'm still not really sure what documentation is really useful - maybe a few diagrams plus some use case descriptions that go through the code, maybe? I'm not sure. I guess it depends on the project - it is a widely used library? Is it an internal department app to track the coffee fund? etc.

My experience with open source code has been that the large projects have decent docs... I was just reading through some of the PostgreSQL docs on backups this weekend and they're quite good.

--
The Army reading list
Re:Statistics ... by Splab · 2008-06-15 08:21 · Score: 1

Call graphs!

I recently created a lexer and parser for our database so we could generate maps of how the database objects interacts - we got 300+ procedures calling 70+ tables, having comments tells you what each of them do, but how stuff interacts is a life saver.

Consider:
someddl.sql
--Table for user login
create table login(
login char(64) not null primary key,
password char(32) not null -- applications serve us md5 sum
);

someproc.sql
--Procedure to check login
CREATE PROCEDURE doLogin (username varchar, password varchar)
RETURNS (valid int)
READ SQL DATA
BEGIN
exec sql prepare somecur select 1 from login where login = ? and password = ?;
exec sql execute somecur using(username, password) into(valid);
exec sql fetch somecur;
exec sql close somecur;
exec sql drop somecur;
END;

Comments give a quick oversight of whats going on, however, as a programmer you have to know that doLogin is using login table - it might seem obvious, but when you got hundreds of procedures/functions interacting with tables having oversight is suddenly very hard.

Lets say for instance you have to modify login table, to do so you have to verify that no procedure will be affected by the change - if any is, you have to update them appropriately. This is where my lexer+parser comes into play - I can generate complete call graphs over our system, and using that I can quickly determine what tables and procedures has to be taken into account when doing any modifications. (Also as an added bonus, since I know how everything is interacting I can automate certain tasks - for instance if I need to reinsert anything (lets say someone accidentically dropped 10.000 rows) I can use my program to work out in which order to restore data from running backups without having to rollback the entire database).

Same goes for tables - ER diagram sucks when you hear about them first, but at that point you probably have never had a big database to administrate. When it grows suddenly having a diagram to quickly lookup how everything is screwed together can save you hours hunting for code.

Documentation inside code helps you figuring out what the code is supposed to do, but before you try to figure out what a certain snippet of code does you need to figure out what to look for.

*Disclaimer above sql code is by no means used in live environment, was just whacked together as an example
Re:Statistics ... by tcopeland · 2008-06-15 10:37 · Score: 1

> This is where my lexer+parser comes into play -
> I can generate complete call graphs over our system

That's very cool! Do you do data flow analysis as well?

--
The Army reading list
Re:Statistics ... by Splab · 2008-06-15 16:53 · Score: 1

No unfortunately its created on company time so it only does what I need it to do right now. :-)
Re:Statistics ... by tcopeland · 2008-06-15 17:01 · Score: 1

> it only does what I need it to do right now. :-)

Cool, sure, no harm done there!

--
The Army reading list
Re:Statistics ... by Anonymous Coward · 2008-06-15 18:35 · Score: 0

I think you're missing the point. Yes, a basic for loop doesn't need a paragaph of comments explaining its purpose, but there are all kinds of commenting that people fail to do.

Eg: This function is an implementation of algorithm X, it takes Y parameters and returns Z. It is based on the work of so-and-so from the university of such-and-such. For more details on the algorithm, read the following book/website.

That kind of basic higher-level commenting can make or break a project if it gets passed from one person or group to another. I have worked on some horrible projects where the previous developer thought he could do it all in his head, and perhaps could have if the project hadn't grown beyond his capabilites or grown in scope...

or if the developer hadn't had developed a taste for the nose candy...
Re:Statistics ... by Actually,+I+do+RTFA · 2008-06-16 09:46 · Score: 1

I'd much rather see an intention-revealing method name (hat tip Marcel Molina):

Funny, I'd much rather have a simple function name to type, with comments easily accessible, then have to retype them all the time. Keep comments write once/read many. As opposed to:

Orders* FindRecentOrdersForCustomerOrReturn -Avoiding Lameness Filter- INVALID_HANDLE_VALUEForAnInvalidCustomerOrNULLForNoOrders ( const Person& Customer )....

--
Your ad here. Ask me how!
Re:Statistics ... by tomhudson · 2008-06-17 14:03 · Score: 1

Higher-level stuff is great. Unfortunately, when you see something like someone complaining that "70% of the code wasn't documented", without any further explanation, you've go to wonder just how the did their analysis - was it like the "MIT Dumpster Divers" that SCO claimed to have hired, but who must have been hiding in Blep's suitcase along with the "millions of lines of code".

Same old, same old. by khasim · 2008-06-15 04:52 · Score: 5, Insightful

In today's world of 24/7 and persistent network access, developers dispersed across multi-national sites can include open source, freeware, public domain, evalware (demos of commercial software), etc, into the code they are writing without triggering the usual checkpoints in the procurement process.

I've seen that same issue YEARS ago. And I'm not talking code snippets. I'm talking systems that had "evalware" tools in them.

This has NOTHING to do with "multi-national sites" or any of that.

This has EVERYTHING to do with clearly stating the rules and ENFORCING those rules.

The rules do not enforce themselves. Someone, somewhere has to approve the code that goes in.

The problem is that management does NOT understand code and will happily farm out the work to anyone who says that they can produce X lines for $Y. Without oversight. The less oversight, the less expensive the project is. Which means bigger bonuses for those same executives.

I notice an omission by Todd+Knarr · 2008-06-15 04:55 · Score: 3, Insightful

They talk about how much of the open-source code is undocumented. I notice that they don't bother to mention how much of the in-house code is also undocumented. My experience as a software engineer is that their in-house code's probably at least as poorly documented as the open-source stuff. And if the business finds this state of affairs acceptable for their in-house code, why's it any more of a problem for the open-source parts?

I've also found that when the business does get a consultant in who demands documentation, they usually demand something that's completely useless for the actual developers. Eg., they demand UML models for all the software. Well, that's nice and all, but most of what's in the UML you can see by glancing at the class definitions. The things a developer needs, like what the methods are supposed to do and what gotcha caused a particular way of doing it to be picked and what assumptions the code's making about it's inputs and outputs, have no place in a UML model.

Re:I notice an omission by civilizedINTENSITY · 2008-06-15 07:57 · Score: 2, Informative

Actually, they never said anything about whether the open source code was well documented or not. They said the projects using opensource didn't document that a particular opensource code base was part of the project.
Re:I notice an omission by Todd+Knarr · 2008-06-15 12:00 · Score: 1

Then I suspect the consultants here are making a big deal out of very little. Where I work we use a lot of open-source in our programs, and there's nothing in any of the official documentation about what we're using. All there is is a notation that these programs use outside libraries, and of course the makefiles and such list every such library used. And over in a completely separate team's workspace is the complete set of external libraries we use, along with their dependencies and build instructions. I'm sure these consultants would be horrified at the lack of documentation in our programs of what we're using, but it's simply not a problem. Our programs are for internal use, and all the libraries we use are fine for that (we make sure of that before we decide to use them). It only becomes an issue if we ever want to distribute our software outside the company, which a) isn't going to happen and b) if it ever does happen we know we need to vet all our external libraries and their licenses with Legal before doing so whether they're open-source or proprietary.

Comments are overrated by Anonymous Coward · 2008-06-15 04:56 · Score: 2, Insightful

How much of that "documented" code in the article was documented correctly? Good code is easy to understand by good programmers. Documenting things is just another dependency to fall out of sync. Would you rather fire up a few neurons to grok some code yer working on or spend hours pulling your hair out only to eventually figure out your documentation was wrong?

Documentation should be used sparingly and as tightly woven into the development process as possible. The programmer should document their code when necessary as soon as they think it not at some later pass. Provisions for inline documentation should be used. When a programmer modifies some code they are more likely to also modify the documentation when it is immediately adjacent. The probability that documentation will remain in sync is the inverse of the square of that distance.

Kind of makes the underhanded code contest by the_humeister · 2008-06-15 04:59 · Score: 2, Interesting

hit closer to home perhaps? A quick glance at some of those code snippets and they can be easily missed. Now place them in large applications with thousands upon thousands of lines of code and who knows how long it'll take to find them.

Re:Kind of makes the underhanded code contest by bockelboy · 2008-06-15 07:19 · Score: 1

The funny thing about that contest is they state you have a very low probability of winning if your program is over 200 lines of C.

So - 1 underhanded line out of 200 LOC takes skill to do. Imagine what you could do with 1 million LOC.

Documenting is kind of hard. by Aphoxema · 2008-06-15 05:00 · Score: 3, Insightful

I tried to get into documenting software for Ubuntu. I wanted to help but I didn't really have any programming skills, I think the most complicated stuff I've ever done is scripts for mIRC and some HTML.

After really sitting down with some programs, I realized I just had no idea where to start. There was certainly more to be said than who made the program and what license it was under like many programs have in their 'help' and 'about' menu, but it really does get to be an enormous task and it's a certain amount of responsibility because the few people that will read the documentation first will take everything it says to heart.

I might try again, but I'm going to be sure I really have time to do it and the patience to read through source code. mangu is right, even though I don't know how to program but it's not hard to figure some things out and sometimes there's vital comments 'between the lines'.

I have noticed more programs (included in Ubuntu) have the information I need when I care to look at it now, I generally check documentation for command line arguments and stuff in case --help won't tell me everything or anything at all. At least someone's getting the job done.

--
"Most people, I think, don't even know what a rootkit is, so why should they care about it?"

Re:Documenting is kind of hard. by flnca · 2008-06-15 05:39 · Score: 1

BTW, also check out the "man" and "info" pages. You can do that by writing "man program" or "info program" from the command line or by using man page browsers like XMan or DevHelp (if properly configured). Some OSes like OpenBSD always include the man pages, while Ubuntu apparently doesn't in some cases and put them in separate packages. Sometimes documentation is hidden somewhere in the file system as HTML pages, even. Many open-source packages are documented well; if not in the source code, then at least in the man pages or similar documentation.

Re:Not just for security by Otter · 2008-06-15 05:05 · Score: 4, Informative

"Documented" in this story means that the company's developers have documented what the hell is going into their codebases (with respect to licenses, keeping things updated, and so forth). It has nothing to do with either user documentation or source code comments in the original open source project.

That said, the "70%, up from 30%" numbers are absurd. There is no way that the failure rate to document use of open source code more than doubled in 2007.

--
What I'm listening to now on Pandora...

Send it Back by Blackknight · 2008-06-15 05:06 · Score: 1

Undocumented code eh? Better send it back to Mexico.

Re:Send it Back by Anonymous Coward · 2008-06-15 10:26 · Score: 0

Good idea. But first, run it by Miguel de Icaza and see what he thinks.

We need a fence! by Anonymous Coward · 2008-06-15 05:13 · Score: 0

We need some kind of national fence to keep these undocumented source codes out of the country!

They're taking our jobs!

Trust by Nerdfest · 2008-06-15 05:21 · Score: 1

Even if it was documented, are you going to trust the documentation? If you want to make sure it's doing what you think it's doing, read the damn code, that's what it's there for.

Re:Trust by bluefoxlucid · 2008-06-15 06:33 · Score: 1

verification is easier than discovery. Hash algorithms and digital signatures work off this principle: it's easy to verify an input matches a hash, but it's hard to make an input from a hash. It's easy to verify a given private key encrypts a hash in some way (though in real life you never do this), but it's hard to generate that private key (good -- it needs to stay secret). Code often works the same way, because you are trying to reverse engineer the instruction set to determine its logical function rather than following it to determine if it carries out an already known logical function.

--
Support my political activism on Patreon.
Re:Trust by tomhudson · 2008-06-15 13:40 · Score: 1

Even if it was documented, are you going to trust the documentation? If you want to make sure it's doing what you think it's doing, read the damn code, that's what it's there for.

Too many times there's a divergence between the docs (and even the embedded comments) and the code, because of a "the code is important - the documentation can wait" mindset. People will claim to understand that good documentation, kept in sync, will save money, but their actions betray them - "this is an exception - we can do it later!"
Or "we'll catch it in the debugging phase" - even though not writing the bug in the first place, or catching it at an early stage, can easily be 100 times cheaper (no, that's not a typo - 100 times cheaper. Heck, it can make the difference between success and a death march to failure).

Let me complain a bit too by 32771 · 2008-06-15 05:46 · Score: 1

Frequently I want to have a look at somebodies source and normally I find a comment section at the beginning of a file and I'm happy about this. So when reading beyond the first /* I find the GPL blurb which is great. Then upon reading a bit further I come across the */ so I'm thinking ok I'm fine to read his code since it is GPLed. Any description of the code the programmer could have put into the initial comment is not there probably because the code is new.

When going further into the code I would expect to find some explanation about what each function does even if it is just a construction site.

Ok, so that is missing maybe at least the names are descriptive. What do I get? Something not much better than O_oOo_0(int a,...)

This is just not what I need a GPL blurb for. Why don't you omit that too and just keep us guessing.
Don't worry nobody will touch your precious code - ever!

--
Je me souviens.

Streaming video by Meneth · 2008-06-15 06:10 · Score: 0, Offtopic

I've recently been trying to implement streaming video in a cross-platform system, and the main open-source libraries, ffmpeg and Live555... well, Live555 at least has unstable release packages. That's the best I can say about their project management. :/

Undocumented OSS Code On the Rise? by Anonymous Coward · 2008-06-15 06:11 · Score: 0

I for one welcome our undocumented overlords with open arms.

So Much for RTFM by russlar · 2008-06-15 06:24 · Score: 1

me: "I installed this module, and it borked the kernel."
buddy: "Did you RTFM?"
me: "I can't. there isn't one."

--
Anybody want my mod points?

Re:So Much for RTFM by Bloater · 2008-06-15 10:46 · Score: 1

There isn't a manual but you installed it anyway? You deserve everything you get :)

Meh, I'll save y'all reading all of this by sticks_us · 2008-06-15 06:31 · Score: 3, Interesting

The money shot:

Again, open source is not any more risky than any other kind of code. What is risky is not documenting your use of 3rd party code...here are some quick examples: OpenSSL, phpBB, xt:commerce

So what they're basically saying is that if you use OSS tools in your company, someone should probably be keeping track of them and patching them as needed.

Should this not hold for *all* software you've deployed? Few programs are immune to eventual obsolescence (including ongoing bugs and security problems), so if you think you're safe just because you're running a bought-and-paid-for solution that you've subsequently ignored, you're probably in trouble.

That being said, I wonder about this: ...applications written within the last five years contain 50% or more open source code, by a line of code count

I get the impression that we're not getting the full story here. If their code audit showed that 50% of software X was copypasta from sourceforge, that would be something (you probably have crappy developers plus possible legal hell if there were copyright infractions).

On the other hand, if they figured "hey, your hello world program uses library Y, which is 2 million lines that we don't think is documented properly," then the "application" does not *contain* 50% or more open source code, but rather *references* a certain amount of open source code, which is probably a meaningless statistic.

--
"Beware of bugs in the above code; I have only proved it correct, not tried it." -- Donald Knuth

Re:Meh, I'll save y'all reading all of this by civilizedINTENSITY · 2008-06-15 08:01 · Score: 2, Informative

"hey, your hello world program uses library Y, which is 2 million lines that we don't think is documented properly," then the "application" does not *contain* 50% or more open source code, but rather *references* a certain amount of open source code, which is probably a meaningless statistic.
Its not that the 2 million lines of code is undocumented. It might be documented very well. Its that the project doesn't record the fact that code is used from Project OpenThingee. Thus, when OpenThingee finds a problem and patches it, no one knows that their codebase needs a patch, because no one knows it uses OpenThingee code. Therefor, its not a meaningless stat.
Re:Meh, I'll save y'all reading all of this by sticks_us · 2008-06-15 12:39 · Score: 1

You're right, of course. I think the way they measure this, though, is a little nebulous/sensationalist, and that was my beef above.

For example: If the OpenThingee source tarball is 75,000 lines of code, and I install it somewhere (in an undocumented way), does that mean I need to write 75,000 lines to get the ratio up to 50%?

--
"Beware of bugs in the above code; I have only proved it correct, not tried it." -- Donald Knuth
Re:Meh, I'll save y'all reading all of this by civilizedINTENSITY · 2008-06-16 02:10 · Score: 1

I believe that all you need to do is write one line documenting the use of the opensource tarball. That one line covers the whole tarball, because its really *project* documentation at issue rather than programmer's source code documentation.
Re:Meh, I'll save y'all reading all of this by sticks_us · 2008-06-16 05:54 · Score: 1

I believe that all you need to do is write one line documenting the use of the opensource tarball. That one line covers the whole tarball, because its really *project* documentation at issue rather than programmer's source code documentation.

Ah, that might clear it up, thanks.

Evenso, I'm not sure I think the people in the interview were being a bit sensationalist. Then again, it makes for a fun topic!

--
"Beware of bugs in the above code; I have only proved it correct, not tried it." -- Donald Knuth

Re:Not just for security by tacocat · 2008-06-15 06:33 · Score: 4, Interesting

I would be interested to know what languages you have used.

I have found Perl to be very well documented, even though it appears to be on a decline or leveled off on the number of developers and active projects.

Meanwhile, I have looked into use Rails and found it a great example of shitty code practices. I've stated this very case to the development community and they pretty much debunked my statements as one belonging to an inexperienced developer unwilling to "go the distance".

I hope this might be slightly helpful in getting people like the Rails community to either understand that they really do need documentation or get companies to throw aside Rails as POS software that is so lacking in documentation that it's a greater burden to have it than to use the alternatives.

There is an excellent case where if you have a highly experienced and knowledgeable developer then you maybe don't care. But if you have to replace this developer with one less knowledgeable or want to expand your development team, you suffer a huge start up cost of trying to bring someone up to speed at your expense.

Specifically, the Rails plug-ins are documented with over simplified tutorials that aren't even available for free and so you have to make an extra effort to find the documentation for the software that you download since they aren't in the same location. Restful Authentication is one example in particular.

Add to that the documentation in Ruby DBI. There isn't any. The documentation says to see Perl DBI for documentation. Considering this is a reference to a different language with different syntax and some of the Perl methods aren't possible in Ruby and likewise Ruby DBI has methods that aren't available in Perl. WTF? This is documentation.

News Flash: Code is frequently undocumented by dkleinsc · 2008-06-15 06:45 · Score: 1

Film at 11.

Seriously, this isn't news. I don't care what context you're talking about, programmers often skip over documenting their work. That's largely due to the pressures of how much time they have to work on something (either imposed by The Boss or other time commitments).

--
I am officially gone from /. Long live http://www.soylentnews.com/

Re:News Flash: Code is frequently undocumented by RobertLTux · 2008-06-16 09:44 · Score: 1

Well the problem is not the code having functional comments but code having paperwork stating where it came from
(who actually wrote said code)

Why You Inc should care
You begin writing programs and then selling them. One of your Core Products is say Manager Mind Reader/Parser (this program can be used to figure out what a manager means from what he is saying and a rather nifty General Manager MindMap. Your developer took the code (and data) from a SF project. You then base 85% of your business on this one product and begin making billions. Time passes the developer leaves (vertically or horizontally) and then you get this funny letter from Morrison & Foerester stating that the original developer is now in the process of suing you for the illegal use of GPL'd code (but the suit will be dropped if you A comply with the GPL version 5 that the GM^3 is licensed under B write a check for a few hundred million oh btw C you need to update to the newest version due to a rather nasty bug)

if this happens YOU ARE DEAD win against MoFo on this kind of slam dunk?? Not happening

--
Any person using FTFY or editing my postings agrees to a US$50.00 charge

Palamida is lying. by Anonymous Coward · 2008-06-15 06:56 · Score: 0

Palamida is full of crap. I am a senior employee at a comprehensive security services company, and prior to this job, filled a similar role at an extremely large software company most of us are aware of. One of our services is to review source code for our customers. We can review approx 1200 to 1500 lines of source code per day per analyst. Therefore, *complete* source code reviews are very rare, as they're long and expensive. Therefore, targeted ones are performed. However, 300 to 500 Million lines of source code would take quite a while to review. To be more precise, using the lower bounds of their stated numbers, and the upper bounds of what a highly skilled analyst can achieve, 300,000,000 lines of code / 1500 lines of code reviewed per day = 200,000 man days of review. Assuming that they are fairly stingy with paid time off, their staff will work about 50 weeks per year. 200,000 man days works out to 40,000 work weeks, which at the rate of 50 work weeks per year, would result in 300,000,000 lines of code taking 800 man years. As it was stated this 300 to 500 million lines of source code review occurred in 2007, and not from 1207 to 2007, it can be presumed that they are either: A) time travelers, B) employee more highly skilled security analysts than any other company on Earth C) Doing a piss-poor job using automated source code scanning tools and checklist monkeys D) Full of shit, or E) C and D. My bet is on E. By way of illustration, Windows Vista has somewhere in the neighborhood of 50 million lines of code, and took a the combined resources of multiple security vendors over 6 months to perform even a very limited and targeted source code review. This one company is claiming that they were able to review 10 Vista-sized software projects in one year?

Re:Palamida is lying. by civilizedINTENSITY · 2008-06-15 08:03 · Score: 1

Who said it was a line-by-line review? They were not reviewing the codebase, but whether or not the code base documented its derivative sources. Ask a developer, "Are you borrowing open source? Where did you document it?" That one question just reviewed a 10 million line application in less than a minute.
Re:Palamida is lying. by Anonymous Coward · 2008-06-15 15:43 · Score: 0

Newsflash: that blog article is a complete waste of time that really doesn't explain what they're talking about. It's not clear whether this was a result of unskillful interviewing, or a vendor marketing guy who didn't fully know what he was talking about, or both. Or maybe the marketer's attitude was "call us, invest some time and resources and find out what we mean". In any case the stats that were quoted were utterly meaningless because the terms (such as "documentation") and context were not explained.

Finally competitive by plopez · 2008-06-15 07:23 · Score: 1

With the proprietary code I have been privy to. So much of it was poorly documented and commented.

But seriously, the trend needs to stop as it creates an excuse ban open source from the workplace.

--
putting the 'B' in LGBTQ+

Gotta love Slashdot by Anonymous Coward · 2008-06-15 07:35 · Score: 5, Insightful

Gotta love this place. At the time of this posting, there are 11 comments modded 3 or higher. Of those, only ONE makes any reference to the act of documenting where the code is coming from (which is what the article is about). All the rest are talking about writing documentation for code, or commenting code as its written. Way to miss the ball, guys! This article is addressing you specifically, yet you have no idea what they're even saying because you can't be bothered to try to listen. Nice.

Re:Gotta love Slashdot by Anonymous Coward · 2008-06-15 10:46 · Score: 0

You must be new here. TFA is considered only a starting point for discussion, which not infrequently meanders into Soviet politics, and audio gear demo'd in mock-umentaries of hard rock bands.
Re:Gotta love Slashdot by Anonymous Coward · 2008-06-15 14:04 · Score: 0

Amen, brother, it just gets funnier the more comments I read. Somebody should tag this article 'whoosh'.

Re:Not just for security by Ethanol-fueled · 2008-06-15 08:03 · Score: 1, Offtopic

I agree, that metric sounded much too high for open-source. The 70% up from 30% was actually the incidence of undocumented Hispanic workers in the US's past few years. Statistical cross-contamination?

Sarcasm here I come... by elysium-os · 2008-06-15 08:48 · Score: 1

That is the reason why you should always buy all software from Microsoft.
It will come fully documented!

The article is FUD by SpaceLifeForm · 2008-06-15 10:11 · Score: 1

You are right, if you compare commented lines of code
with total lines of code, the number is possibly plausible.

How often do you comment stuff like

{
}

i++;

--
You are being MICROattacked, from various angles, in a SOFT manner.

Re:The article is FUD by Richard+Steiner · 2008-06-16 07:31 · Score: 1

Depends on the context. It isn't that uncommon for me to add a comment like the following to your example:
{ } i++; // Increment status line counter

which can really be helpful if the amount of code in brackets is somewhat large. It might not always be obvious what "i" is to a support programmer not familiar with that portion of the codebase.
I've worked on code that was written 30 years in the past (no, I'm not kidding -- FIELDATA FORTRAN V is still in use here as well as at one of my former places of employment), and when code is that old the original designers are often retired or dead. You're on your own, and if the project doesn't have a good set of programmer docs for each source module, the source code might be all the documentation you have.
It isn't uncommon for me to spend a paragraph at the start of a complex routine explaining WHAT the routine is doing and (in some cases) WHY the routine does it that way if it's using a novel or nonstandard approach to the problem. Such comments can save you and your teammates (or successors) a lot of headaches in the long run, and I can think of several occasions where such comments have been a lifesaver at three in the morning when I'm on coverage trying to walk my way through a production issue...

--
Mainframe/UNIX Bit Twiddler and long time Windows/Linux Hobbyist.
The Theorem Theorem: If If, Then Then.

Re:Not just for security by MikeFM · 2008-06-15 11:02 · Score: 3, Interesting

Any software product without good documentation is difficult to use. Proprietary programs are potentially a lot worse than open source because you don't have an easy option for figuring out the hard bits for yourself.

My example would be Activant Eclipse (formally Intuit) - the ERP software the company I work for uses. It's expensive, performs poorly (even on the expensive IBM mini that is required), is buggy, is largely undocumented, is hard/expensive to customize and is completely required to keep our business running. We pay for an expensive support contract and support is still often a joke - complicated issues are usually figured out by our staff because their support staff isn't able to do much. A horrible painful experience.

--
At what price learning? At what cost wisdom? The price is a man's peace of mind, and the cost is his life.

At least an executable is well documented by EmbeddedJanitor · 2008-06-15 12:06 · Score: 1

The definition of each instruction is very well defined by the CPU's instruction set.

--
Engineering is the art of compromise.

The more interesting statistic is... by jonwil · 2008-06-15 13:00 · Score: 1

The more interesting statistic is the number of products being shipped that use open source code (e.g. GPL) where they are violating the license, have not released any of the code they are required to and have not acknowledged their use of open source code (sometimes they will comply after a lot of pressure, sometimes they never comply)

Re:The more interesting statistic is... by T3Tech · 2008-06-15 19:10 · Score: 1

I agree. TFA hints at legal issues although it does so WRT security of the code which is the primary focus.

Common sense should dictate that any 3rd party code, where used, is notated as such in order to facilitate security related patching if/when needed and to possibly indicate further review as to how it relates with the rest of the code. But then again common sense isn't so common and introducing 3rd party code into another codebase has the potential to open up an entirely different can of worms as to the security of the application which should naturally be audited.

So what I read the article as saying is that there is a greater chance of applications being insecure of the type which were part of this sample base and that the FSF is now looking for more volunteers in the enforcement department. :)

--
Of course I didn't RTFA... why would I do that? You really are new here aren't you? Don't let my UID fool you.

Nowhere does this article mention unit testing by bADlOGIN · 2008-06-15 13:24 · Score: 1

Comments that are wrong or outdated are worse than comments that are missing. Both are crap compared to well written unit tests. I didn't notice anywhere in the article a mention of weather or not this analysis took unit testing into consideration. Perhaps Palamida's VP of Product Marketing should quit telling people to build software like it's 1999 and perhaps rebuild some of the tools they're trying to sell to the public so they check for test coverage. I'll bet open source developers could go out over the next 3 months and start adding: /* Palamida's coverage tools are crap */

To every function and method in C/C++ apps and they could get a glowing reversal on what wonderful improvements have been made to documentation.

I swear. People who think about software like this should be kept away from computers. And perhaps sharp objects for their own good...

--
*** Sigs are a stupid waste of bandwidth.

Well of course... by T3Tech · 2008-06-15 18:27 · Score: 1

Documentation!?
We don't need no steenkin documentation!

--
Of course I didn't RTFA... why would I do that? You really are new here aren't you? Don't let my UID fool you.

The problem with reading the source by Cro+Magnon · 2008-06-16 01:30 · Score: 1

The source can tell you WHAT the program is doing, but it doesn't tell you WHY. Especially when the programmer did something tricky.

--
Slow down, cowboy! It has been 4 hours since you last posted. You must wait another few hours.

Rails attracts shitty code by metamatic · 2008-06-16 03:10 · Score: 1

"Meanwhile, I have looked into use Rails and found it a great example of shitty code practices."

As someone using Ruby and Rails on a regular basis, I have to say that this is my experience too.

I got involved in a flamewar with developers of a well-known Rails application over my contention that documentation was required. Their position is that you don't need any documentation if you have unit tests; you can simply read the unit tests to work out what the supported API is.

Riiiiight.

--
GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak

Re:Not just for security by RT+Alec · 2008-06-16 03:51 · Score: 1

Let me add PHP to the list! PHP is wonderfully easy to allow inexperienced programmers (read: non-programmers) to build something that looks and feels like it works, but often without the coder knowing HOW (or why) it works.

You say Tomato, I say Tomahto by Anonymous Coward · 2008-06-16 12:45 · Score: 0

Well, no, I say tomato(e) but the point is that the story itself doesn't even make it clear what it's about. There's no reason to make the point that "open source" code is poorly, if ever, desribed in plain ENGLISH text (lack of proper training is the fault), but there IS a reason to make a point about using "open source" code in places where you would not expect to find it. Now, don't confused "open source" with "GPL". One is a hideously communistic plot created by a madman, and the outer is, basically, public domain ("BSD" and other "licenses" notwithstanding). Microsoft uses "open source". But then "open source" uses "open source" so this just became a story that has no ending. I'm outta here --- zing !!

Re:Not just for security by doom · 2008-06-16 19:32 · Score: 1

Do you think you could document your claim that perl use has "declined or leveled off"?
Perl hype has leveled off, now that O'Rielly is focussing
on selling books about other things, but perl usage remains
pretty high, as far as I can tell.

I do agree with you about the high quality of perl
documentation. Perl has always attracted people who like
to write about their code, and one advantage of having
a reputation as a "write-once" language is that people
bend over backwards to document (both internally and externally).

(You want to stay away from people who use phrases like
"self-documenting code".)

Re: Answer by Anonymous Coward · 2008-06-17 05:23 · Score: 0

"How can businesses protect themselves and still draw on open source code effectively?"

By paying the open-source developers.

Re:Not just for security by tacocat · 2008-06-20 13:35 · Score: 1

I absolutely cannot document any decline. I do think the hype has leveled off. But I was in a meeting at YAPC::NA two years ago where they were discussing the decline in new programmers coming into the community.

Bad example. by Photo_Nut · 2008-06-23 06:32 · Score: 1

Your examples try to make a point, but miss the mark. The point of the article is not about understanding what the code in question does, it's about maintaining the code - there is a big difference between understanding *what* code does vs. *how* the code does it. A one line comment before a block of unreadable code doesn't help to debug the code. A one line comment in front of a block of semi-readable code at least helps a little.

The real problem is undebuggable and unreviewable code presenting security risk. Did you use your pointers and buffers correctly? Did you check inputs and deal with errors correctly? Does your code follow the contract specified by all the annotations?

If you write a block of code with an annotation which says that you will initialize an output parameter, but on an error fail to initialize that output parameter, and do not throw an exception, then you can cause bugs upstream. Similarly, if you do not understand the contract of a method and do not check it for errors, you also can cause bugs upstream. Documenting these things is a basic requirement that experienced and unexperienced developers make, and code reviewing is an important part of the development process (as is testing).

Re:Not just for security by ChrisDolan · 2008-06-25 13:56 · Score: 1

The decline of Perl is a myth. A graph of CPAN uploads vs. time shows a dramatic increase in the last couple of years, and 2008 is already ahead of the entirety of 2007.

http://blog.timbunce.org/2008/03/08/perl-myths/

Re:Not just for security by ChrisDolan · 2008-06-25 14:00 · Score: 1

I remember that YAPC discussion. As I recall it, the point was that the average age of Perl developers was increasing, indicating a decline in younger programmers entering the community. Perl is rarely a programmer's first language, so this isn't entirely surprising. PHP is taking Perl's place as the newcomer's first web programming language (which is OK in my opinion -- PHP is easier to learn)

Slashdot Mirror

Undocumented Open Source Code On the Rise

94 comments