When Making a Comprehensive Retrofit of your Code...

object orientation by apsmith · 2001-12-21 10:23 · Score: 3, Informative

we're doing something similar - and switching to java (JSP's + Tomcat, struts) to replace a lot of old perl cgi's. The java code is much, much cleaner. But object-oriented perl code can help if you don't want to take the plunge too far into a new language. And at least find a way to go mod_perl rather than CGI, for the things where performance matters at all.

--

Energy: time to change the picture.

Re:object orientation by TheRain · 2001-12-21 10:43 · Score: 3, Interesting

why is this flamebait? all this person is saying is that object orientation could make the code easier to manage and, thus, help to reduce redundancy. it's a good statement, modularize your code and it becomes more reusable... and therefore less redundant. who cares if he likes java over perl and CGI.

--
Please help! I'm stuck inside my virtual reality headset!
Re:object orientation by apsmith · 2001-12-21 10:47 · Score: 5, Informative

Alright, maybe I posted a little too soon, but shouldn't "flamebait" be attracting flaming responses? I don't see any...

Anyway, if I'd spent a little more time thinking about the advice side of it, taking a look at appropriate programming methodologies (like Extreme Programming advocated in another thread here) would be one piece I'd advocate. Given the size of the code (1 MB = about 20-30,000 lines?) there's no need for major heavy-weight processes here. More important I'd say is sitting down and figuring out in the appropriate level of detail what exactly your system is doing right now - you can do this using UML diagrams which seems to be becoming a standard, though the main use we've found is to try to get an overall view of things which we then throw out when we get into the details again.

The other thing to do along these lines is look for your use of standard patterns within your code - the Design Patterns book is extremely helpful if you're moving to an object-oriented framework at all; following well-known patterns and indicating clearly what you are doing can make your code much easier for others to follow.

--
Energy: time to change the picture.
Re:object orientation by ergo98 · 2001-12-21 15:40 · Score: 2, Insightful

...refactor...

Just as a point of reference: I've noticed that the term "refactor" has gained popularity with the whole XP push. Of course "refactor" is just a cooler way of saying "rewrite". i.e. It's not "We should rewrite that module", but rather it's now cooler to say "We should refactor that module". Blah.
Re:object orientation by anacron · 2001-12-21 16:22 · Score: 5, Insightful

Refactoring is not re-writing. Refactoring is a methodical approach to moving, redesigning, and making the code cleaner. It's not throwing away and starting from scratch. The two words are wholly semantically different.

In his book, Refactoring, Martin Fowler talks about how code "smells". He identifies a whole slew of reasons why code can "smell" bad, such as having a method that's too long, having a method on the wrong object, incorrect use of polymorphism, etc. He then outlines "Refactoring Patterns" which can be applied to certain "smells" to make the code more managable.

The last project I managed was a 3 month re-write of over 75,000 lines of code for an auction component that had gotten way out of control. The original component integrated with a very very poorly written (yet still very expensive) auction engine, and our task was to take it out.

Because there was no time allotted to proper requirements gathering, the powers that be decided that we could use the existing system as a requirements base, and just make the new system "act" like the old one.

So I was thrown two things: A staff that was inexperienced in design, and a lack of requirements. We were re-writing from scratch -- throwing all of the code away and starting again. I'm one of the only ones on my team that actually knows what refactoring means, let alone how to apply it.

So we built the new system based on a solid design. We even did the design in UML. But that only goes so far. Given a properly designed system, there is still another lower step -- how to actually implement the code. Invariably, the design doesn't take into account helper methods that are necessary, other objects, etc. Because we were on a tight schedule, I left it up to each developer to design these new tidbits.

If they had known what they were doing, everything would have been fine, but there are several places where the code is absolutely abysmal. It's not that it will perform poorly or is totally unmanagable, but in some places it's close.

Granted, the business rules for the system are some of the most complex I've ever seen, but if the developers had known more about refactoring, they might have made better choices.

There's a whole notion of Quality here that should be discussed as well. Most good has a good enough quality, and though it looks ugly, I'd throw it away. There's an interesting interview by Joel (a former microsoft programmer) which tackles these problems of Quality head on. I also reccommend reading Zen and the Art of Motorcycle Maintenence: An Inqiry into Values for a pretty good discussion of the whole notion of quality.

I'd say just keep the codebase unless you are totally switching languages (or in our case taking out a 3rd party integration). Otherwise if you re-write you'll wish you hadn't. The new codebase will end up being in the same state as the old codebase, it'll just take a little bit more time to get there.

I'd also reccommend just doing a performance test of the site, and find out which modules really need to be re-written. The 80/20 rule will invariably apply -- 80% of the execution time will be spent in 20% of the code -- so at a minimum just re-write the 20% of the code that's executed on the most frequent paths. Leave the rest alone. Re-writing it will just be a waste of time, and you have to fix all the old bugs that were re-introduced during the re-write.
Re:object orientation by Anonymous Coward · 2001-12-21 16:44 · Score: 1, Informative

Good point about object orientation.
It's often a good thing to take a couragous step like this and rewrite & refactor code (unfortuneatly management tends to see it as unproductive time, but it can pay off in the long run)

Now my main point: I can't imagine having to deal with megabytes of Perl code. It's hard enough to figure out the intent of a simple perl script I wrote a few months ago, I can't imagine having to slog through MEGABYTEs of the stuff. Perhaps while your at it you should take another couragous step and rewrite in another language.
May I suggest something like Ruby: it feels pretty familiar to Perl folks and its quick to learn. Ruby's object oriented-ness certainly would beat trying to create an OO system in Perl (it makes my head hurt just to think of OO perl!)
Check out: http://www.ruby-lang.org
or http://www.rubycentral.com
for more info!
Re:object orientation by anacron · 2001-12-21 18:02 · Score: 3, Interesting

anacron, interesting experience and solid advice. I didn't see Extreme Programming mentioned. Did you use it at all?

Actually we didn't use it formally. But we used some of the best ideas. Iterative development, frequent code reviews, programming in pairs etc. We had a pretty small team so XP might have helped a little bit more, but it's a fundamental paradigm shift that most people I work with aren't comfortable making. My team would have been less effective had I said, "You *must* use XP."

I have been on projects in the past that have embraced more of the XP model, but something like Rational's Unified Process (RUP) kind of fits in more with what we do.

For traditional software shops (e.g. those that make products) a methodology can be set in place by the management, because the company is in essense it's own client. For other types of software shops -- such as consulting firms, or web design shops -- where one-off software is commonplace, the type of methodology that's used is usually determined by the client.

For example, if I'm working with a client that isn't very savvy, I probably don't want to do all of my requirements in formal use-cases, followed up by UML design documents. They won't add any value to the client at all, yet we still have to make sure we understand what it is we're building.

It's an interesting challenge and it's usually best overcome by having solid people on the team that can speak to both sides. A solid requirements team which can talk to the client (be it internal or external) and truly understand the business problems, all the whole clearly articulating those to the technical team. Sometimees double documentation is necessary -- one set for the client and one set for the developers or development lead.

There is just no easy way to do software development correctly. It's amazing that in nearly 60 years, no one has come up with a "right" way. There's only so much a methodology can bring to the table -- after that it's about finding and keeping a competant staff who understand the value of what they're doing.

The methodology is just a roadmap. How the car is driven is completely up to the development team. Even a good technical leader can become a backseat driver at times, as there is just not enough hours in the day to provide oversight to every single member of a team.

I think the more traditional software shops should look at the console gaming industry for tips. They have a difficult challenge -- it's not possible for them to release a fix or update for their game once it's completed. So how do companies like Activision, Square, Nintendo and Sega do their software development, and is there anything that the development teams at large can learn? Rigerous testing is one of the most obvious things that comes to mind for them, but how do they know what it is they're testing when a game's vision usually only exists in the mind of the creator?

.anacron
Re:object orientation by Error27 · 2001-12-21 19:11 · Score: 1

>>Alright, maybe I posted a little too soon, but shouldn't "flamebait" be attracting flaming responses?

It's because the good people of slashdot recognized your feeble attempt at attracting flames and did not deign to give you the pleasure.

Slashdotters are known world wide as the most discerning and reasonable of all people. Please don't consider a personal afront that you were not flamed. Lesser mortals most certainly would have.
Re:object orientation by ergo98 · 2001-12-22 05:06 · Score: 2, Interesting

Refactoring is not re-writing. Refactoring is a methodical approach to moving, redesigning, and making the code cleaner. It's not throwing away and starting from scratch. The two words are wholly semantically different.

"Refactoring" is a trendy term (which alone makes it suspect) that came into mainstream acceptance very recently, but that most certainly doesn't mean that those in the software industry having been doing EXACTLY that for many years since? Why is it that once someone gives a name to something that people have done for years they think they've invented it?

I have never known a software project where the participants didn't actively look through, clean up, and where necessary rewrite old code, but now this is "refactoring" and it's a whole new paradigm in software engineering. Blah I say. I hereby name designing quality code and implementing it in a platform independent fashion "requantization", and my new paradigm is requantization development.
Re:object orientation by anacron · 2001-12-22 07:52 · Score: 1

"Refactoring" is a trendy term (which alone makes it suspect) that came into mainstream acceptance very recently, but that most certainly doesn't mean that those in the software industry having been doing EXACTLY that for many years since?

I agree -- the term 'refactoring' is pretty trendy right now. But for the first time, there's a methodology behind it. Your argument is like saying, "Medicine has been around for 2000 years. All the advances that have been are just trendy and don't really benefit us."

Just because something is done, doesn't mean that it's done correctly, to plan, and for a good reason. Refactoring formalizes the best of what most people have already been doing, puts rules behind it, and quantifies exactly the reasons for doing it.
Re:object orientation by ergo98 · 2001-12-23 02:36 · Score: 1

Your argument is like saying, "Medicine has been around for 2000 years. All the advances that have been are just trendy and don't really benefit us."

I would say that a more apt analogy would be saying "For 2000 years doctors have checked your temperature and done an initial physical evaluation, but now we've renamed it `physioevaluation" and it's a whole new paradigm in health care.

Use a pretty printer by renehollan · 2001-12-21 10:24 · Score: 2, Insightful

for consistent style...;

refactor as a result of learning from your mistakes and redundancies;

and try to minimize the busy parts (where all developers have a hand) when things change (like lists of unique symbols, numbers, etc.)

--
You could've hired me.

Sleeping dogs by Chairboy · 2001-12-21 10:25 · Score: 5, Insightful

I'm sure a bunch of code nazis will disagree with me (please note the clever way I attempt to pre-emptively undermine their arguments by labeling them as 'nazis') but sometimes massive engineering re-writes are not necessary.

Your tangled mass of spaghetti code paths are probably full of almost incomprehensible little design decisions and seemingly out of place declarations and functions, but most of those were probably added as specific fixes for bugs encountered under real-world use.

Most companies that decide to massively re-engineer their code (do a big rewrite) usually end up regretting it because it forces them to re-fix the problems that caused the original strange looking code in the first place.

Does your CGI nest work? If so, maybe you should leave it alone. If you are fixing specific problems, then go ahead, but if this is a generalized attempt to fix the 'not invented here' syndrome that plagues engineers (who will almost universally agree that it is easier to write code then it is to read it), perhaps you should reconsider.

Re:Sleeping dogs by kz45 · 2001-12-21 10:31 · Score: 3, Informative

Your tangled mass of spaghetti code paths are probably full of almost incomprehensible little design decisions and seemingly out of place declarations and functions, but most of those were probably added as specific fixes for bugs encountered under real-world use.

This is a lesson to be learned. Engineer your code from the beginning. Use easy to understand commenting, and strucutured code. Although it takes some discipline, you will almost never have to reconsider "re-writing from scratch".
Re:Sleeping dogs by smileyy · 2001-12-21 10:34 · Score: 2

++ to that comment.

A general refactoring, without the intent of doing it to add new functionality / fix bugs, isn't worth it. What business value do you gain from it?

However, there are times that it is worth undertaking substantial refactoring to add what may seem like relatively small pieces of functionality. I think it was in his _Refactoring_ book that Martin Fowler likened this to reaching a local peak in a mountain range, where you have to go down into a valley before you can climb the next peak.

ObOfftopic: This is my first post in a while that hasn't been a troll or flamebait. It just isn't as fun being informative.

--
pooptruck
Re:Sleeping dogs by Plisken · 2001-12-21 10:41 · Score: 1, Interesting

What we plan on doing is rewriting a module at a time and then doing extensive testing on that module. I know that sounds obvious, but alot of outfits will _figure_ out some grandiose design and then parse out the parts to different developers, working simultaneously. Many times you end with something just as bad because either you didn't learn from your mistakes or that the design was flawed from the beginning.
Then again, if your code is one _huge_ pile of spaghetti, you might have no choice except to rewrite all of it at the same time.
My 2 cents
Re:Sleeping dogs by pyrrho · 2001-12-21 10:48 · Score: 1

But the design is probably flawed at the modular level as well... you will find making cleaner versions of modules in place is like making a fossil... in place replacement can turn flesh to stone, but it will keep the overall shape unchanged. In the case of hard to maintain or troublesome code, this is probably not what you want. If it is... you could just debug the modules and not rewrite them.

--
-pyrrho
Re:Sleeping dogs by Nyarly · 2001-12-21 10:50 · Score: 5, Insightful
Certainly, but if you find yourself trying to make a change to existing code, it will probably not be the last change you need to make. At least refactoring anything your change touches will make future changes easier to make.
Related note: the original poster doesn't say "refactoring" and he does say "Perl." Informative statements relating to these two facts:
- Buy and read Martin Fowler's Refactoring. The examples are mostly Java, and he references the Group of Four's Design Patterns. "Retro-fitting," in which you probably plan to rewrite portions of code from scratch will break your app, your mind, and your budget. Learn what refactoring implies and entails.
- Learn about design patterns in general, and consider how they might apply to your code. One description of a design pattern is "a target for refactoring."
- (Donning asbestos) You might want to reconsider perl as the language of choice for a large scale application. I realize I'm posting this comment to a Perl system, but Perl hangs together like an immense kludge of a language. That said, you're probably stuck with it, and AFAIK, you may be forging new paths in programming for reusability by applying the above concepts to Perl. Good luck, and be sure you can trust your machete.
--
IP is just rude.
Is there any torture so subl
Re:Sleeping dogs by bbqdeath · 2001-12-21 11:20 · Score: 5, Funny

I second the concern about PERL. And I offer my advice as someone who has virtually no qualifications to talk about large systems of code. I just like Python better than PERL because it doesn't hurt my eyeballs like PERL.

Yeah, I guess this is a troll. But it's honest. I use Python like most people use toilet paper: several times a day, and for more things than it was originally intended.
Re:Sleeping dogs by curunir · 2001-12-21 11:27 · Score: 1

The rule that I've learned is, If there is something wrong with the way it is architected,
then go ahead and re-write it if you have the time. If you aren't going to change the design, then there is no reason to throw away the resource you have.

Proper program design (modularity, well defined interfaces, bla bla bla...you've heard it all before) makes code much easier to manage and improve. If you don't have it, then re-writing might be worth it.

However, far too often, people feel the need to re-write code because it feels messy and it is hard to maintain. The fact is that it is easier to write code than to read it and an engineer will almost always find more enjoyment in writing something from scratch than in fixing/upgrading someone else's code. Be careful listening to engineers bitch about messy code. It usually isn't half as bad as they say it is.

However, in this case, it sounds like they do need a serious design revision. I'm not sure I can think of any justification for using Perl CGI for an app that size (mod_perl, java, c++ and even asp would be better).

--
"Don't blame me, I voted for Kodos!"
Re:Sleeping dogs by inbox · 2001-12-21 12:02 · Score: 1

Perhaps the reason for the code-rewrite is because everyone is being fired.

There's your business value. The current group is not cutting it, but their code is crap. They need to go, but the code is useless without them. An idea: get them to clean it up and then fire them and bring in a new team that now has nice, new, clean code.

Conspiracy theory, perhaps. Possibility, definitely.
Re:Sleeping dogs by thogard · 2001-12-21 12:22 · Score: 3, Insightful

You can't engineer your code from the beginning in the part of the real world where I work. I've got a huge mess of perl that do reports and about one out of ten clients wants something different. These request could have have been predicted since many of them have no business reasons and have no function other that look more like their older system which just happed to have done it that way.
Re:Sleeping dogs by thogard · 2001-12-21 12:33 · Score: 1

If perl hurts your eyeballs, you don't understand it.

At work we have one system that was built with python and its just not as stable as everything we have written in perl. At this stage there will be no more large project written in python but I'll be watching how the small project go.
Re:Sleeping dogs by Jeremi · 2001-12-21 12:57 · Score: 2

Your tangled mass of spaghetti code paths are probably full of almost incomprehensible little design decisions and seemingly out of place declarations and functions, but most of those were probably added as specific fixes for bugs encountered under real-world use.

... which is why whenever you add one of these "little design decisions" whose purpose isn't blatantly obvious, it's important to put in a comment saying what it does and why it is there. Otherwise someone might come through later on, think it's an error, and remove it.

--

I don't care if it's 90,000 hectares. That lake was not my doing.
Re:Sleeping dogs by ZeroConcept · 2001-12-21 13:04 · Score: 1

Another strategy is to have a group of persons updating/fixing the current code while another group writes a framework and migrate the pages one by one to the new codebase.

Try to make the iterations as small as possible, that way if you missed something in your design you can fix your design earlier.

I've seen projects where ppl create a HUGE design, implement it and at the end, requirements changed/were missed and the whole thing needs to be hacked to make it work...test your design the earliest possible by migrating pages early.
Re:Sleeping dogs by sunking · 2001-12-21 14:15 · Score: 3, Insightful

So you're saying your problem is too hard to be solved with a clean design? Good software design is all about preparing for the unexpected! In fact, if you ever had a perfect understanding of the problem then it wouldn't be worth designing a solution carefully - you wouldn't need the flexibility!
Thinking that writing one-off code is giving you flexibility is grave mistake.
-sam
Re:Sleeping dogs by Trepalium · 2001-12-21 15:02 · Score: 1

If perl hurts your eyeballs, you don't understand it.

Perl makes it easy to write a quick hack that looks ugly but fixes a real problem. The hurting eyeballs factor only comes in when that quick hack gets progressively promoted to a full mission-critical program with little or no planning. There are well written, structured perl apps, and there's the unstructured ugly ones. Most people have seen more of the ugly ones.

--
I used up all my sick days, so I'm calling in dead.
Re:Sleeping dogs by PD · 2001-12-21 15:08 · Score: 2

Sometimes, but not always, if your code is a mess, then that means that your business is a mess too. If your programmers have a hard time understanding the business, then your customers will too.

When you investigate those twistly little lines of code, see if a business rule can be simplified. Try really hard. If it can't be, then put it into code. Don't make the opposite mistake of making the rules TOO simple.

--
If tits were wings it'd be flying around.
Re:Sleeping dogs by foobar104 · 2001-12-21 15:23 · Score: 3, Interesting

You might want to reconsider perl as the language of choice for a large scale application.

I agree 100%. My company started to bring a commercial application to market a little less than a year ago. I prototyped the code in Perl, and the prototype was sufficiently okay that the decision was made to evolve the prototype into the release code.

This was A Mistake. It was A Dumb Idea. It was also My Decision. I have taken Much Shit for this from my coworkers. But you live and you learn.

I have since (over the past four months or so) rewritten the entire application-- every line, every file-- in C++. The source tree is 3.8 MB, and it compiles to about 100 MB of object code. (The actual executables, of course, are much smaller than that.) It was a pretty big job.

Not only is my code tighter and cleaner than the original Perl stuff (which was actually pretty okay code) but it's between 2 and 10 times faster.

I love Perl, absolutely adore programming in it, but there are some things that are easier to do with C, or C++, or (presumably) Java. When you split a project up among a number of people, for example, using the Bridge design pattern and distributing read-only interface header files makes modular integration so very much easier. That's just one example.

We would not have been able to get our app to market without the Perl prototype. And I don't think it would have been worth a damn if we hadn't rewritten it in C++.
Re:Sleeping dogs by JamesOfTheDesert · 2001-12-21 15:50 · Score: 2

... because it doesn't hurt my eyeballs like PERL.
Do you write your PERL CODE IN ALL UPPERCASE? That *would* hurt your eyes.
Please; it's Perl, or perl, but not PERL.

--

Java is the blue pill
Choose the red pill
Re:Sleeping dogs by V.P. · 2001-12-21 15:52 · Score: 1

What we plan on doing is rewriting a module at a time and then doing extensive testing on that module.
An even better way to go (a.k.a. extreme programming) would be to write the "extensive tests" first. This way you can uncover problems in your design before you start messing with the original code. Besides, having the tests available from the beginning makes it much easier to ensure that your changes don't mess up the intended functionality.
Write tests first!
Re:Sleeping dogs by Dwonis · 2001-12-21 16:25 · Score: 1

Exactly. I think that if it's mission-critical, it should probably be written in C. If you aren't confident that you can write mission-critical code in C, then you shouldn't be writing mission-critical code at all (even in Perl).
Re:Sleeping dogs by bay43270 · 2001-12-21 17:08 · Score: 2

This is a lesson to be learned. Engineer your code from the beginning. Use easy to understand commenting, and strucutured code. Although it takes some discipline, you will almost never have to reconsider "re-writing from scratch". Even if the people maintaining the code had anything to do with it's original design (doubtful), they can't do anything about it now. A more practical solution would be to create tests, and refactor the code piece by piece. I am only beginning to use the ideas presented in Martin Fowler's "Refactoring", but they are really helping me clean up my code. More importantly, the changes are safe. Fowler's book warns me of problems I might run into, and gives hints as to what I should look for. Although I'm using Java and am lucky enough to be using IntelliJ's IDEA (which is an IDE with built in refactoring capabilities), I'm sure there is a solution for Perl (even if it is more manual).
Re:Sleeping dogs by bmajik · 2001-12-21 18:24 · Score: 3, Insightful

"Mission Critical" does not imply C. Infact, it points away from C.

Unless you're writing an operating system, why use C ? What does it buy you ?

IIRC, the Avionics system of the F-15 is done in Ada, because its "Safer" than C. I'd say thats "Mission Critical".

I too am weary of perl, but like everything else, its the right tool for _some_ jobs, arguably, some pretty important jobs.

--
My opinions are my own, and do not necessarily represent those of my employer.
Re:Sleeping dogs by Anonymous Coward · 2001-12-21 19:06 · Score: 1, Insightful

This is a very common mistake that I've been seeing lately.

Frameworks can be used to great effect where each client wants different things out of the same module. Understand the problem at the highest level and create protocols for solutions for the class of problems. Implement common aspects of all solutions as part of the framework. Then each specific solution for a customer can be implemented as a very small add-on module.

Check out the ideas of framework in OO.

I've used this pattern successfully in several large-scale systems over the last 8 years. It works whereever there are uncertainties about customer requirements. And, especially, where it is certain different customers will have different requirements.
Re:Sleeping dogs by consumer · 2001-12-21 20:03 · Score: 1

Isn't it possible that your rewrite was better just because it was a 2nd draft? There's really no reason you can't define interfaces for Perl classes and share them among a group of developers, like you did with your C++ header files.
Re:Sleeping dogs by ameoba · 2001-12-22 00:19 · Score: 2

One other reason I see to move away from Perl as the language for the project is the complete rewrite (IE fundamental changes to syntax, deprecating all existing code) of the language that's currently going on. Perl5 is going the way of COBOL; it will become increasingly difficult to find coders both competant and up to date in their skills.

I'm not really qualified to make any recomendations about which language you should use, but if you're doing a rewrite now, when Perl5 coders are still plentiful, choose a language that you can be reasonably sure will be used 3yr from now. The whole point of the project is to ensure future maintainability for the software..

--
my sig's at the bottom of the page.
Re:Sleeping dogs by dgroskind · 2001-12-22 02:14 · Score: 2

The hurting eyeballs factor only comes in when that quick hack gets progressively promoted to a full mission-critical program with little or no planning.
The trick then is to pick a platform that is suitable for quick hack but can be promoted to mission critical with little of no planning.
Perl is a suitable candidate because
it's reasonably fast in execution
there are a reasonable number of Perl programmers around for future maintenance
it has some cross-platform capablility and scalability
Most people have seen more of the ugly ones.
There is a trade-off between speed of development and maintainability. In the current business environment, speed of development is ascendant. The people who say you should've planned better and used Java or C++ usually come along later and try to score some political points. Where were they when there was work to be done?
Re:Sleeping dogs by m_ilya · 2001-12-22 05:29 · Score: 1

choose a language that you can be reasonably sure will be used 3yr from now.
Perl6?

--
--
Ilya Martynov (http://martynov.org/)
Re:Sleeping dogs by foobar104 · 2001-12-22 06:16 · Score: 2

Isn't it possible that your rewrite was better just because it was a 2nd draft? There's really no reason you can't define interfaces for Perl classes and share them among a group of developers, like you did with your C++ header files.

Yes, TMTOWTDI.

But I still think that C++ is my language of choice for big projects because it helps do some of the job of collaboration for me. If you, as a team member, don't implement the interface completely, your module won't compile. And if you change the interface, nobody else's modules will compile. So less time is spent doing module-level integration.

And C++ has a lot going for it in the preemptive bug-squashing arena, too, with strong typing at compile time and such.

But I didn't intend for this to be a "C++ is better than Perl" comment; as I said before, I love Perl. I just think C++ is more appropriate for some things than Perl is, and medium-and-larger programming is one of them.
Re:Sleeping dogs by Ian+Bicking · 2001-12-22 19:24 · Score: 2

Good software design is all about preparing for the unexpected!
I couldn't disagree with you more. Putting generality into the code that you don't forsee a need for is very bad design, IMHO. Generality is messy, and generality in one piece often makes it difficult to make another part general.
For the most part, your code should be designed to do what it is supposed to do, and no more. Of course, you will have an idea of what it might be needed in the future -- and it will sometimes be a good idea to design the code with that in mind. But usually it is not, because you will often be wrong. And if you don't even have an idea of what need your code will fulfill -- i.e., you are truly planning for the unexpected -- then you would be a fool to mess up your code in planning for it.
One-off code is perfectly fine. Of course you want to reuse code as much as possible, but it is not possible to reuse all your code. If someone wants to do something new, then you'll have to write new code. Hopefully you will do it without using cut and paste -- a horrible way to do code reuse -- but you will have to write some new code. Maybe you'll make a new subclass (my favorite), or add a new option, or refactor, or a combination of the above. And, sometimes, it really will be more expedient to copy and paste.
Re:Sleeping dogs by shannara256 · 2001-12-23 20:03 · Score: 1

> it's Perl, or perl, but not PERL.

No, not PERL, but the first two aren't interchangeable... Perl is the language, perl is the compiler. Intro to the Camel book, v5, IIRC.

-Jason-

Extremem Programming - Refactor by Strudleman · 2001-12-21 10:26 · Score: 3, Interesting

You should read up on Extreme Programming, in particular Code Refactoring. It's a method of cleaning up old code. A very well written book, as well as an excellent code-housekeeping method.

--
Do it doug.

Re:Extremem Programming - Refactor by Anonymous Coward · 2001-12-21 10:50 · Score: 1, Funny

Do 'Extreme Programming' principles dictate that I get an afro perm and adorn my body with piercings and generic "tribal" tattoos? I can't even spell snowboard. Oh my...
Re:Extremem Programming - Refactor by Anonymous Coward · 2001-12-21 10:57 · Score: 2, Funny

Do 'Extreme Programming' principles dictate that I get an afro perm and adorn my body with piercings and generic "tribal" tattoos?

First time I heard the term, I asked the guy "Is that where you jump out of an airplane with a parachute and a laptop?"
Re:Extremem Programming - Refactor by (H)elix1 · 2001-12-21 12:37 · Score: 2, Insightful
Yah, I keep hearing this. Read the whole book - it recommends strong test plans, incremental development, pair programming, tons of snacks, and a maximum of a forty-hour work week. In itself, not a bad thing.

Interpreted by management?
- They see the part about everything being stuck out in the open, with the computers set up in the center of the room - cubes are overrated, offices are evil
- half as many boxes are required since you are "coding in tandem". See above for hint on how personal space is valued.
- Demand complex test plans that will cure everything. These same people never have actual requirement documentation for what the code should do. Classic example? Validation rules for a phone number... gets ugly when someone wants to put in 1 800 FOO-BARR, add an extension, deal with international numbers. Think you can put together a test script? Got to have requirements first, and that is never a hard thing to get from the business (shudder).
- I'm sure you have had complete management buy in on realistic time estimates now, so I wonder where the rumors come from that development tends to work more than an eight hour day. When they find out how long it will take to refactor the code base with through testing and daily updates, they will tell marketing and adjust the dates accordingly. Heck, I thought the Microsoft Project Plan defaulted to an eight-hour day - must be one of those rare Pentium errata issues that messes things up for me.
Not that I'm bitter, cause I'm not... but XP takes a handful of generally good practices and assumes perfect management buy-in and team communication. With any management support and teamwork, almost any method works.
--
+++ UGUCAUCGUAUUUCU

Modules by SCHecklerX · 2001-12-21 10:26 · Score: 2

Break the code into re-usable modules (or objects if you go another route besides perl, or even if you do, if you like that sort of pain), each programmer responsible for his own set (ie layout, calculation, database, etc)

CYA by The+Gardener · 2001-12-21 10:27 · Score: 5, Insightful

Don't stop maintaining the old code code until the new code is on solid ground. No matter how sure you are that you can do it, the new code might never come through.

The Gardener

--
--

Don't do it by mrpotato · 2001-12-21 10:28 · Score: 3, Informative

See the recent Joel Spolsky interview here, that was discussed on /. here.

Basically, Joel's take on a similar problem is: don't do it.

Unless you have a _really_ good reason to do huge change to a big codebase, don't bother, and make something more productive instead.

--

cheers

Code style and large rewrites. by The+Panther! · 2001-12-21 10:28 · Score: 2, Interesting

It always seems to me that large rewrites (though I've never done one as large as the one discussed) tend to make bad code bases worse for a while. If there is any effort made to maintain functionality in a changing codebase, a piecemeal rewrite is more of a headache than a help. So, if you're planning on a big rewrite, please give your developers the full thumbs up to break everything and expect them to put it back together later--and expect to see nothing tangible in the short term. Or call it off.

That being said, my last company sat around and bickered about code style for nearly 4 months and produced no code that wasn't rewritten later. If you are going to concern yourself about style, settle that well in advance and make sure it's logical and consistent.

It's also been my experience that conformant code style is highly overrated. Once the Best Practices document extends beyond language constructs and caveats, into brace styles, spacing, tab size (yes, there was a 3 space tab stop standard at my last job--wretch), and even the naming of locals, parameters, members, constants, enumerations, etc... it got to be a thick ass bible of stuff that only a few people would digest or attempt to adhere to. The point I'm trying to make is, choose your battles. The hope is your developers will make sane choices independently, and use standards to help integrate different peoples' work together. Anything beyond that and it's pissing in the wind.

My $2e-2 or less.

--
Any connection between your reality and mine is purely coincidental.

Horror Story by Brontosaurus+Jim · 2001-12-21 10:33 · Score: 3, Informative

My firm went through this sort of thing just two years ago. The PHB at the time decided, for some reason, that our 300,000 lines of semi-poorly written C code, and 50,000 associated lines of Java (Dont' ask).

Anyway, it took 7 of us over 2 months to get even halfway done. The pressure the boss was putting on us was awful, and he didn't really even understand what we were doing, even though he was the one demanding it. I think she read it in a trade mag somewhere. God I'd do a lot more work if she didn't read that shit.

Anyway, about halfway through the "Great Leap Forward" (as we [appropriatly] named it) the boss quit, and the next boss, who so far has been fairly clueful. He didn't think the whole deal was needed, but he was pressured by the former bosses husband (the CTO) to get it done. Seriously.

Hope yours goes better than ours. From what we did, heres some tips I can give you.

1. Be consistant through the whole thing.
2. Make sure everything is planned before you start. This was the one part we got right.
3. The team you have should have worked together before, because this sort of task requires previous knowledge of eachother.

Other than that, my condolences. Or maybe it will work better for you.

Good luck!

Re:Horror Story by bolthole · 2001-12-21 12:17 · Score: 1

Anyway, it took 7 of us over 2 months to get even halfway done.
That was the problem. Too many people involved.
Re:Horror Story by ScottMaxwell · 2001-12-21 14:51 · Score: 1

The pressure the boss was putting on us was awful, and he didn't really even understand what we were doing, even though he was the one demanding it. I think she read it in a trade mag somewhere. God I'd do a lot more work if she didn't read that shit.

Maybe he/she was just distracted by his/her sex change operation.
the former bosses husband

I'm not touching that. (So to speak.)

--

``Life results from the non-random survival of randomly varying replicators.'' -- Richard Dawkins

Don't ask this here! by Will+The+Real+Bruce · 2001-12-21 10:34 · Score: 1, Offtopic

Slashdot is a site built around perl scripts, so one might expect there to be a considerable amount of knowledge and interest about a topic such as this.

Could you, dear writer, perhaps be the maintainer of a GIANT collection of perl scripts badly in need of a rewrite?

Are you the maintainer of Slash??

--
Will the real Bruce Perens Please Stand Up

Rule #1 by ackthpt · 2001-12-21 10:34 · Score: 2, Informative

We're still in the middle of a sweeping change and lemme tell ya, make d@mn sure there's someone accountable for managing the whole project from beginning to end, particularly this being their main focus.

Transitioning in new managers or having the current manager only look in on the project once in a while is as sure a path to madness and doom as no management at all.

Our due date was mid-August, we'll be lucky to get it through testing and into production by January 31st. All the while with the logjam we're having to put pieces of it into production and cross our fingers that the new changes don't break anything.

Love to talk more about it, but need another gallon of coffee.

--

A feeling of having made the same mistake before: Deja Foobar

If it ain't broke... by bc90021 · 2001-12-21 10:34 · Score: 1

...don't fix it!

"In the interest of consistency and reducing redundancy." There's only 1 MB of code, and while that is a lot considering it's all text, I suspect that you will find yourselves recreating (and then refixing) most of the original problems that are more than likely fixed by your existing codebase.

Old adages got to be old adages because they represent sound wisdom. So if it ain't broke, don't fix it. ;)

bc90021

"Python is executable pseudocode; Perl is executable line noise". - Someone wiser than I.

--
libertarianswag.com

That there is no such thing... by pyrrho · 2001-12-21 10:35 · Score: 4, Insightful

You are going to rewrite the system from scratch. Design from scratch. Your new design might be able to use some old code, if the old code is useful.

A large scale retrofit is really an oxymoron.

IMHO, but with 15 years experience.

--

-pyrrho

Re:That there is no such thing... by BryceH · 2001-12-21 11:05 · Score: 1

Mod parent up.

very well said.

--
"Shut up brain or ill stab you with a Q-tip" Homer Simpson
Re:That there is no such thing... by awol · 2001-12-22 02:49 · Score: 2

I am working for a large company, which was only tiny when I started working for them (over seven years ago). In the early days we were a bunch of really smart guys, a few of which could be characterised as zealots. One would g home on a friday and come in on a Monday to find a fundamental component of the system rewritten in a new and interesting way, even things like new tools or processes that we would all then haveto use. This kind of retrofit never caused us a problem (seriously never) some of the decisions have even been subsequently validated by industry changes.

BUT

Two things. First, in truth we wre cowboys. That kind of thing is such a bad idea that you cannot do it and develop consistent quality product, particularly in larger teams, greater than six or so (or geographically diverse teams). Second we always were able to run these systems in parallel before we switch over to the new way. This is IMPORTANT and some other posters have said it before. Don't get rid of the old until the new has been proven (as for how to prove it well that is real hard but don't underestimate just how hard).

As an aside, we tride a pretty major redesign of a system a few years back and it did not really add any value, but that was because of a lack of a clear objective and we lost the budget after six months. So Try and make keepable improvements with shorter timscales.

--
"The first thing to do when you find yourself in a hole is stop digging."
Re:That there is no such thing... by Kirruth · 2001-12-22 06:03 · Score: 1

For a major refit, re-design should be the bigger part of the effort. For one thing, eliminating features or modules that nobody really uses can cut a huge amount of effort. That's not to say code should be thrown out wholesale: quite often real gems that solve real problems are found buried amongst the junk.

--
"Well, put a stake in my heart and drag me into sunlight."

Not all at once, either by dimer0 · 2001-12-21 10:35 · Score: 1

This is an excellent suggestion. What I always try to do is break the codebase up into loose-coupled modules, and then take the nastiest one of them (well, I guess you could pick which one you want to do first based on anything, really) - and convert that to your new technology.

Doing this will help big time when something goes wrong, which it will. The area that caused the problem will be much more focused.

This also lets you start taking advantages of the benefits of your new technology much sooner. Roll the new in with the old - in production. Less scary as well. Small steps.

Listen To The Experts by ChuckPollock · 2001-12-21 10:35 · Score: 3, Funny

The First Rule of Program Optimization: Don't do it. The Second Rule of Program Optimization (for experts only!): Don't do it yet. -- Michael Jackson

Document by Tosta+Dojen · 2001-12-21 10:35 · Score: 5, Insightful

I suppose what I have to say almost goes without saying, but I know so many programmers who neglect it, I will say it anyway. There are two things you have to know when going back into old code.

1) Know what you are doing. 2) Know what the code does.

Both are expedited by good documentation. This is so important, it deserves to be written thusly:

DOCUMENTATION!

Write everything down. If the code is not commented, figure out what it does and write it down. When you add a line or a module, write down what it is supposed to do. Declarations? Write those down too. Document everything so that you can figure everything out, both now and down the road when you decide to fix something else.

This is the voice of experience. I have had to reverse-engineer my own code 6 months after I wrote it because I failed to document anything. Learn from my mistake.

--

I have a strong belief in the Second Amendment.

Re:Document by JeffFurry · 2001-12-21 11:22 · Score: 1

Tosta Dojen wrote:
This is the voice of experience. I have had to reverse-engineer my own code 6 months after I wrote it because I failed to document anything. Learn from my mistake

Which leads to one of my favorite development sayings:
Write maintainable code, so that a stranger working onit six months from now will be able to figure it out. And remember, that stranger will most likely be you!

"I may make you feel, but I can't make you think" - J. Tull
Re:Document by chris_mahan · 2001-12-21 12:28 · Score: 1

Ever so true...

I have a saying: There should be 5 lines of comment for each line of code. (i'm including the lines of ************ in there too though)
Each program should have a complete changelog, with dates and emails and www pages of authors, a complete breakdown of all functions, at the top, in english, not a mix of english and pseudocode.

Also, a second document should exist, in html (or whatever) that shows the purpose of the program, the strengths and shortcomings of the program, the runtime requirements (dependencies, OS versions, hardware, memory, HDD access permissions, etc), and any outstanding issues, requests for change, etc.

Also, there should be a third document, created before, outlining in english all the expectations of the program to be written. Specs, in fact. This can be html or text, as the programming team prefers.

I program in ASP, so it's very easy to spaggettize. I have learned to enforce iron discipline. Because there is nothing like trying to read through 900 lines of html-producing code for that one thing management wants to change (I call those manabugs--things management considers bugs)).

--
"Piter, too, is dead."
Re:Document by Arandir · 2001-12-21 15:01 · Score: 1

Also keep around a UML diagrams for the program. On Linux, the best UML tool I have seen is Dia. Save these as png and put them in your html design document. I would have one large "overview" diagram, with detailed diagrams for reference. When someone isn't familiar with the code, a roadmap goes a long ways. A UML diagram is a roadmap for your code.

--
A Government Is a Body of People, Usually Notably Ungoverned
Re:Document by mbrod · 2001-12-21 16:59 · Score: 1

We have a new manager of the department that is pushing the documentation aspect real hard. We have a nice tidy drive share with volumes of documentation. I have been laying the groundwork for a massive new web system we will be creating and I find myself spending more time changing and organizing the framework document and DFD's than writing code.

I think it is very dificult to balance getting things done and writing the documentation. It seems to me a number of the programmers are a bit burnt out on a project they were excited about by the time they do the Milestone graph, the DFD's, etc.

My point is there has to be a balance and I see programmers who are hard enough to motivate to begin with just spin their wheels in the documentation and keeping it all up to date if it is to extensive can kill them.

Anyone else have a success story of balancing making developers document well with writing kick ass code please share how you did it.

avoid the "sweeping rewrite" by CamShaft · 2001-12-21 10:36 · Score: 1

In my experience, if you have something, and it is functional, then you are better off to migrate and refactor what you have to end up with improvements rather than doing the slash and burn and rewrite.

The advantage of the migration and refactor approach is you can do it in a way that leaves you with a functional product the entire time, giving you the ability to respond to new requests/requirements as they arise.

If you try to do a full rewrite, and some new urgent deliverable comes up in the middle of your rewrite, the only way you are going to be able to deliver in time is to hack it into your old functional code.

Think again by pkphilip · 2001-12-21 10:37 · Score: 2, Insightful

From what you say, you are planning on making these changes to clean up the code and make it prettier.

I would strongly urge you to reconsider this as the probability that you will end up breaking parts of the code while "cleaning" it up is quite high..especially since you seem to have a fairly large code base ~1MB

Ugly code containing redundant stuff is still better than beautiful code that doesn't work.

Re:Think again by flavor · 2001-12-21 10:46 · Score: 1

I agree with most of what's being said here. Indulge me while I try to rephrase it a little differently.

Rewriting a large codebase may gain you flexibility in building out future functionality, and reduced operational costs.

But be warned that there is very little you can say to justify a rewrite to management/marketer types. To them, you're taking something that works and making it ... still work.

I've found that many users are willing to risk encountering bugs if they're going to be rewarded with better functionality. But you would risk introducing bugs while offering users no reward. This could get rough if they're the ones paying the bills.

Regression tests by Weasel+Boy · 2001-12-21 10:37 · Score: 2, Offtopic

Make sure you have a suite of tests that produce known output for your old code, so that you can ensure that the new code works in exactly the same way. Don't add anything new until you are proven conformant with what you had before.

Re:Regression tests by Doctor+Memory · 2001-12-21 13:47 · Score: 1

Why is this modded Offtopic? Absolutely the first thing you do when you are rewriting code is to write tests against the existing code, so you can make sure you haven't broken anything.

I would suggest that you start small, have lots of code reviews so that you can build consensus on best practices (clue: "best practices" that only half the team agree with are not "best"), and try to do this *without* incorporating any bug fixes. This way your new code will be "bug-compatible" with the old, and you'll get consistent results from your test suites.

--
Just junk food for thought...

See no evil, hear no evil, ...? by ProtonMotiveForce · 2001-12-21 10:41 · Score: 2, Flamebait

Why do people mod comments about alternatives as Flamebait (and presciently, this one)? Are people afraid to hear that you should't write large scale systems in Perl?

It really is valid (and in my opinion, correct) to say that if you _are_ going to do this you should look at other technologies and languages. Perl is for system administrators and system administrators-cum-developers, not real software development. Look at java. Look at PHP. Look at commercial and non-commerical web application systems, like Zope. Or don't rewrite it at all if it works. But for God's Sake, don't rewrite it in Perl - it's pointless.

Re:See no evil, hear no evil, ...? by hondo77 · 2001-12-21 12:05 · Score: 1

Why is this marked as interesting instead of flamebait?

Perl isn't used by real software engineers but PHP is? Gimmie a break (and don't get me started on Java--talk about pointless).

Not understanding Perl is no excuse for name-calling.

If they want to rewrite their Perl (which is probably a bad idea in the first place), Perl IS an excellent choice.

--
I live ze unknown. I love ze unknown. I am ze unknown.
Re:See no evil, hear no evil, ...? by m_ilya · 2001-12-21 12:38 · Score: 1

I can understand when people say that Java is better than Perl for large scale systems (I don't agree but I understand why people think so). But PHP??? Why do you think it is better suited for large scale systems than Perl?

--
--
Ilya Martynov (http://martynov.org/)
Re:See no evil, hear no evil, ...? by chromatic · 2001-12-21 12:54 · Score: 1

Perhaps people consider these comments flamebait because your comment boils down to "Perl is unsuited for large applications because I don't like it."
If you'd care to back up your assertions with specific examples, you might have a better chance of being taken seriously. I'd tend to avoid assertions like "because Matt Wright wrote some awful stuff several years ago, it's impossible for anyone to write clear, thoughtful, or well-designed programs in Perl."
Since you didn't spell it PERL, I'm assuming you do have a serious point.

--
how to invest, a novice's guide
Re:See no evil, hear no evil, ...? by spudnic · 2001-12-21 13:25 · Score: 2

What would the advantage be in changing to another language? Assuming the team develops coding standards and creates clean, documented code that can run under mod_perl, why would PHP or java be better?

The team is obviously already familiar with perl, and some code could be reused after a bit of cleaning. Perl is perfect for applications that require a lot of string manipulation and as a front end for a database. This is what most web programming involves.

The only drawback that I can see in using perl is if your team is messy. Perl will allow you to be messy, but it doesn't have to be. Perl can look just as clean as any other language. I go back to apps I wrote 3 years ago in perl and can follow every bit of it. It's all commented and has plenty of white space to make it easy on the eyes.

I've written several large perl applications for the web and yes, I do know other languages, but perl is better suited for this type of work. It's fairly quick to put together and performs well. Now I wouldn't try to write an OS or first person shoot'em up game in perl, but for this it's just as viable of a choice as anything else.

I'm also not saying that PHP or java would be bad if you have a good grasp on it and are starting a new project. Remember, TMTOWTDI is good!
Just don't use ASP! ;)

--
load "linux",8,1
Re:See no evil, hear no evil, ...? by khkramer · 2001-12-21 13:49 · Score: 3, Insightful

> Perl is for system administrators and system > administrators-cum-developers, not real > software development. Look at java. Look at > PHP.
Mmmm. I write large-scale web applications for a living, and I do it in Perl. By large-scale, I mean sites that are expected to support hundreds of thousands of page-views per day, serve hundreds of thousands of distinct users per month, and collate hundreds of thousands of distinct chunks of content into dynamic pages.

My company is a high-end development shop. We generally bid on projects that will take six to nine months to complete, and we only do jobs for clients who understand how we work and why. Part of our approach is to use very small teams of extremely experienced web developers. We usually deploy four programmers on a project. Other companies that bid against us sometimes use several times that many people on a single development team. Another part of our approach is to build everything on top of our open-source development framework. That sometimes used to be a tough sell ("what, give software away for free, heaven forfend") but these days, most customers are pretty receptive to the twin "more eyeballs means better code" and "you're not locked into our closed, proprietary product" arguments.

We also generally build small clusters of dual-processor Athlon or PIII machines, whereas our Java- (Oracle-, IBM -, BEA-, etc) wielding colleagues often specify absurdly-expensive hardware.

Perl is flexible, complete, performs relatively well and has an extroardinary base of skilled developers and re-usable components (CPAN). We couldn't work as quickly or write code that is as concise and maintainable using any other currently-available language/approach. Most of the lines of code that we write actually go into Perl modules, with HTML::Mason templates handling the dynamic web-page generation. We can push Mason out of the way and use straight mod_perl for small, defined tasks. And we can easily integrate C code into our Perl frameworks in places where performance is really, really critical (though those places are rare, as when push comes to shove, one is almost always waiting on the database).

There are lots of things that aren't "right" about web development. The package that results from gluing HTML and program logic together in a stateless execution environment is sometimes a little lumpy, and unavoidably so. There's no magic bullet toolkit, and (as with other specialized programming arenas, like graphics or embedded systems) a lot of hard-won, domain-specific knowledge goes into the development of a fast, reliable, maintainable web app.

The Perl/Apache/Mason combination that we use is far from perfect. But it's better -- for us -- than any of the alternatives.

I really like Java, and have written big systems in that language, too. If for some reason I had to manage a very large team of programmers, or had to manage a team with a large percentage of less-experienced programmers, I would use a Java-based solution. Java is a more rigid language than Perl, and the structure that the language provides would be a useful management tool in those contexts. But for my small teams of skilled hackers, Perl is more productive. (We have an extensive, evolved, self-imposed "structure," so we don't need the language to impose one on us -- in fact, it gets in the way.)

I would never use PHP for the kind of work we do. PHP just isn't the kind of powerful, flexible, complete environment that Perl is.

Zope and Python are really neat. I'm a fan of the work that folks on that side of the fence are doing. But Python+Zope don't offer us anything new that Perl+Mason+Comma don't. I also like Perl more than Python (which is a subjective preference), and think that the Perl development environment is more mature (which is a subjective judgment).

So don't listen to the folks who tell you to dump Perl. You should certainly consider all of your options and make an informed decision about core tools, but anyone who thinks that Perl is just a "scripting" language, or that it doesn't scale, hasn't been paying attention.

To finish this up with a little more specific advice to the original poster: You mentioned "150+ perl CGIs" in your question. You should consider moving away from the CGI model, if possible. Take a look at HTML::Mason, which is a very good embedded-perl environment. You can build solid, consistent application layers using Mason as a base. Also, I couldn't agree more with the folks recommending writing perl modules and requiring complete regression tests for each module. There are lots of ways to write tests, but in perl-land one of the easiest is to simply make a t/ directory down your module tree, write a bunch of scripts in that directory named <some-test>.t that print out a series of "ok <n>\n" lines, and use make test or Test::Harness::runtests() to invoke them.
Re:See no evil, hear no evil, ...? by m_ilya · 2001-12-21 14:29 · Score: 1

Having seen different style projects I strongly dislike "Database centric" design (i.e. very weak or missing middle layer). I'm not sure if you talking about such design but it looks like you mean it.
Bad programmers can write cryptic Perl but for good developers it is not a problem to develop clean middle layer and frontend layer in Perl. Perl is full featured language which can be used to build big OO projects. On other side I always was under impression that PHP is quite limited language because initially it was designed as quite niche language (embeding in HTML web pages). This is why I think that Perl have more potential than PHP for large scale projects.

--
--
Ilya Martynov (http://martynov.org/)

Architect it to death by ADRA · 2001-12-21 10:41 · Score: 2, Insightful

There is no use in doing a rewrite unless you do it right from the beginning. If you can't spend a decent amount of time planning the architecture of the system, then stop now, and quit.

Also, since you have decided to pour resources into this thing, then my opinion would be to make as much of your code generic so that you don't have to make code changes later. It doesn't matter if there is an initial performance hit with the systems, becuase in the short term, you can convince your boss to get a new leet server, and in the future, hardware needed to run your apps will be trivially cheap anyway.

If you are going to cut a new release, try to avoid going back and taking snippets of code from your old system. It makes people slide back into the odl paradigm and can cause detrimental effects on the bottom line of your new system. This is new, so the least exposure to the old one, the better.

--
Bye!

Re:Architect it to death by Hairy1 · 2001-12-21 17:01 · Score: 2, Interesting

No Don't!

In my early experience in programming I found there was an advantage to up front planning. It seemed like the more planning you did the better the project would do. So there was a tendancy to do more and more up front planning, getting into more minute detail.

However although we spent more and more time planning the payoff in terms of being able to predict schedule and budget diminished. Its classic diminishing return. But why?

Planning a project is like forcasting the weather. You can plan a small project in minute detail for a short time (like being able to predict the weather a day in advance), and you can make vauge predictions for some time out (ie you can know the probability of rain based on the month - statistically on previous years), however you can't plan a large project in minute detail.

The reason is that if you get any of the initial decisions wrong, and there is a minor deviation from the plan early on, this will be reinforced to cause more changes, and this will snowball to change many parts of the final system.

An analogy is trying to get somewhere with a set of instructions. If the instructions are faulty at the very start of the list it doesn't matter that the rest of the instructions are correct - you still get lost. Detailed plans are a little like a set of precise instructions. There is no assurance that all of the instructions are fault free. In fact there is a good chance they will be faulty as they will almost always be written by someone who hasn't been to the destination (ie hasn't written the same program before).

It is better to be an explorer. Your "plan" should be like a map. Only you start out not knowing the terrain - only the general direction of your destination. You map your first days travel and set off. Every day fill in some detail on the map and plan the next days walk - always keeping in mind where you final destination is.

Programming is a journey - not through any physical space, but through the endless possibilies of the programming solution space. This space is filled with broken, useless programs. Its your job to navigate your way to the best solution.

PS : I know this hasn't been very practical, however I have previously posted some more concrete real life advice under this article.

Agreed by ackthpt · 2001-12-21 10:41 · Score: 1, Insightful

A large scale retrofit is really an oxymoron.

Often I've had to modify systems which, once I understand how they work, which parts are actually still used, what's redundant/poorly written, I've usually pulled off changes in 1/2 to 1/5 the amount of code. Nice, neat, clean well documented stuff, too.

--

A feeling of having made the same mistake before: Deja Foobar

Use framework, seperate code form display.. and.. by cybrthng · 2001-12-21 10:42 · Score: 3, Interesting

Well, i can't say it enough. Use a web framework, i don't know of any for Perl off the top of my mind, but i use Resin.

Resin is a JSP, Servlet, XML, XSLT application server that support all the latest and greatest EJB components and Managed persistance on the database and makes a great framework to build from.

I have Postgresl for the Database, and use beans to run queries and output via xml and then i have XSLT draft xml data into the wonderfull HTML code.

The beauty is, my code is in beans, servlets and jsp's. My HTML is in .xtp files (in the case of resin) and simple XSLT sheets parse the XML to render to the user.

Just means i can produce output easily for WAP, Palm, CE, Normal Web Browsing, EMail, and what not without modifying the backend. Just create xslt using a session identifier to bring up the corresponding stylesheets for whatever device is acesssing the page.

Enough about java, but something similar, be it in house developed would be your best bet.

I also get away with using a Swing applicatoin to manage the database, users and run reports provide an easy to navigate gui which just interprests the same xml data that would be retrieved by the html client. Not a single change to the backend since i'm using the beauty of soap, jsp's, servlets and java.

Virtualize your interfaces, standardize on your backend and use re-useable components. Perl is similar in many ways that you can load libaries and abstract your code from the display which will save you tons of hours of hassle in any future upgrades compared to the bit of changes you would do now

Don't Listen by augustz · 2001-12-21 10:43 · Score: 5, Insightful

A ton of people will tell you a ton of things, never having retrofited anything.

- Do not undervalue the investment you have learning your existing coding language. New challenges await you if you jump on a new language like java. Make the jump if you are excited about learning about the new language.

- If you use your existing coding language you will literally fly through the retrofit. Do it piece by piece. Make all those changes first, then test app, then make next set of changes then test. The simple fact is, most wasted time is spent on bugs not working on performance, and you've already knocked down a lot of bugs, don't let them pop back up by blowing everything up. There are books on this.

- Sometimes blowing everything up is worth it. Do it right this time. Realize it won't be as perfect as you might think it will be.

- Remember there are countless open source and shareware products that tried to create TNG with a total rewrite, got nowhere, and ended up improving their existing product. Remember the lesson, bite off what you can chew.

- Spend a week poking around researching possibilities. I do this all the time, bookmark things I think are important. Then for the next project you've got all the little things you might forget at your fingertips. Optimzations/Tools/Paradigms. Think you know it all? You'd be suprised at what is out there and what you missed. And what you spent a month in house re-inventing. This one's important.

- Use open source software. Nothing beats free. Nothing is more fun. Java's ugly standardization history makes me puke... the BS Sun has pulled with Java is staggering. That the Java Lobby swallows it and loves it even more so. This is irrelevant to your question, and not fair to the Lobby, but I like to give them a hard time.

- Colorary to Java. You need less abstract design then you think. Endless object hierarchies will weigh you and your app down... Their are books on this too.

- You need more documentation then you think. Ever found code someone ELSE wrote too EASY to follow. I don't think so. Especially if you are using perl and someone is enjoying the line noise capabalities perl allows. Perl has 20 ways to do EVERYTHING, you may not know the latest or twistiest. Document as you WRITE the code. Do not leave at the end of the day without catching up the docs. A week of documenting is the worst form of hell, avoided with a minutes worth of clarification each time you write a function/class.

- Hardware is cheap.

Anyways, have fun... and good luck. Be interested to read what others have to say.

Re:Don't Listen by Beryllium+Sphere(tm) · 2001-12-21 11:14 · Score: 2, Informative

>- If you use your existing coding language you will literally fly through the retrofit. Do it piece by piece. Make all those changes first, then test app, then make next set of changes then test. The simple fact is, most wasted time is spent on bugs not working on performance, and you've already knocked down a lot of bugs, don't let them pop back up by blowing everything up. There are books on this.

This is good advice. To be more specific:
1. START with your regression test suite
2. Then add self-documentation features like standard naming conventions. Seems dull and bureaucratic and pointless but really truly saves maintenance time.
3. Have a standard comment header for each function. The standard should answer questions like "Can that argument be NULL?" and "What do the error returns mean?"
4. If you're going through every line already, do a security audit.

There's good advice in the refactoring books, for example http://www1.fatbrain.com/asp/bookinfo/bookinfo.asp ?theisbn=0201485672&vm=
Re:Don't Listen by banky · 2001-12-21 11:45 · Score: 2

hardware is cheap, but bumping up the hardware requirements when you're talking about pushing out your new code to several hundred machines in a hosted data center will quickly kill that Xmas bonus you were hoping for...

--
ZOMG I WOULD LOVE TO KNOW ABOUT YOUR FEELINGS ON MACINTOSH VERSUS WINDOWS, VI VERSUS EMACS, AND HOW YOU'RE NOT A DORK
Re:Don't Listen by augustz · 2001-12-21 12:41 · Score: 3, Interesting

Yes, I agree with this. I actually dislike some of the easy but (what I consider heavy) choices like Java for this reason. You can get by with so much less so easily for many projects, and support literally 5x as many users per server. For a small business this is the difference between profit and no profit. For a large business this can be the difference between .com and .bomb.

All too many times I see sites begging for money for hosting/bandwidth. Take a look at their HTML/CSS and see it is HUGLY bloated (no linked css, prevent all caching including image with default cache buster installs) and not gzipped, and I wonder, if what I can see is so bad, behind the scenes it is probably even worse. (ie dynamic page gens where none are needed). Which I had included this in my list and left of the hardware point, which I agree is the wrong message.

But damn, if you do code right, hardware is so cheap I can't beleive it. I'm convinced some 10 machine projects with bad coding can be supported on a single machine now.
Re:Don't Listen by Lumpy · 2001-12-21 14:46 · Score: 2

I agreed with everything until the last line. That statement needs to be violently beaten out of every programmer, manager, and engineer on the planet. I almost want to suggest that the death penalty be decreed on that statement od heresy.

The "hardware is cheap" mantra is exactly why we have crapy code and bloatware. "I dont have to optimize, hardware is cheap." "Why make it efficient? hardware's cheap!" "I dont care that it is slow, hardware is so cheap these days"

Please, everyone, cary stones in your pockets and throw them really hard at the next programmer that makes that statement.

--
Do not look at laser with remaining good eye.
Re:Don't Listen by Arandir · 2001-12-21 15:05 · Score: 1

Remember there are countless open source and shareware products that tried to create TNG with a total rewrite, got nowhere, and ended up improving their existing product.

And there are dozens of examples where the reverse is true. Enlightenment is one example where the entire code base was rewritten from scratch resulting in a far better product. E17 is going through the same process all over again, but for a different reason. From what I can see it's going to be a huge improvement that will spill over into other projects.

--
A Government Is a Body of People, Usually Notably Ungoverned
Re:Don't Listen by RollingThunder · 2001-12-21 15:21 · Score: 2

That last line is making me have to BS my boss.

"Sure, the dual tbird appro should handle 10000 users"

omitting, of course, if it was all written in clean tight C, instead of java.

see joel on software by kubalaa · 2001-12-21 10:44 · Score: 4, Informative

He says the exact same thing as you, only better.

--

"If you look 'round the table and can't tell who the sucker is, it's you." -- Quiz Show

Re:see joel on software by bay43270 · 2001-12-21 17:18 · Score: 3, Interesting

Although I enjoyed his book, and respect his opinion on matters of interface design, Joel Spolsky is no champion of well-designed code. In his book, he even suggests that a programming language shouldn't be used if another can create the same user interface in less time... as if the time it takes to create the first version of a product should take precedence of other factors such as maintainability of code, reuse, and flexibility. He does a good job of regurgitating the works of other interface specialists, but he is not an expert in code maintenance.
Re:see joel on software by kubalaa · 2001-12-22 01:22 · Score: 1

I haven't read his book; his site deals with coding issues, not interface ones. And I don't agree with a lot of what he says, but he's very practically-minded, and a good antidote to academic thinking.

--
"If you look 'round the table and can't tell who the sucker is, it's you." -- Quiz Show

KISS by ZaneMcAuley · 2001-12-21 10:44 · Score: 2, Insightful

Keep It Simple Stupid. Keep code simple, why have funky lines of code that looks awe enspiring. Its pig ugly. I work on the principle of KISS. When I am working on a code area that is being cleaned up or fixed, I try to simplify that area. The simpler it is, the easier and quicker it is to maintain and bug hunt.

Im a strong believer in managed code (whether its C# or Java, except managed extensions to c++ which are damn pig ugly :) Exception handling for seperating error handling from actual code that performs the actual work).

Make sure there is GOOD in code documentation (and out of code documentation) to explain the intention of that code. (Intentional programming is a research area btw, to program ones intentions).

You know when something is bad when you have to maintain that code later on, 6 months down the line or whenever. Thats when it bites you in the buttie.

--
----- Whats wrong with this picture? http://www.revoh.org:1234/whatswrong

Don't by Pussy+Is+Money · 2001-12-21 10:45 · Score: 1, Interesting

Don't do it. Don't go over your code making sweeping changes with the intent of making sweeping changes to your code.

Instead, very globally, identify the features and structures that you require in 3, 6 or 12 months time, from a marketing or similarly broad perspective. Identify the parts of your codebase that would need to be changed in order to achieve those goals (this is as good a definition of "out of shape" as any).

Then as you fix up your code: look at your fixes. And from these, formulate the rules that should guide your next development cycle.

--
Pushin' 'n dealin', shovin' 'n stealin'

Refactoring your project by adamy · 2001-12-21 10:45 · Score: 3, Insightful

To expand on the concept of Refactoring:

1. Write a test for a specific block of code.
2. Appliy the refactoring

You are going to want a good testing framework.

To expand on the modules post: Do a dependency analysis. If you are writing DB based code, look at what tables can be logically grouped together.
We did something like this at my company not too long ago. The basical level package we had was the security package, which identifies users and roles. Most other packages depended on this. All contact management stuff went into a package called Directory. All stuff for the people our system was managing went into Participant etc.

For each of the packages, split the code out into a set of interfaces, and a set of implementing code for business logic, and the UI required to drive that business logic. This is the standard breakdown for code. You may want to further pull out code into helper packages. Avoid the urge to call these helper or util, and instead try to name them based on what they contain: we have one called format for stuff like phone numbers and social security numbers.

Don't forget the make scripts. What ever build you use, it should be used to specify which modules you want to compile/deploy

I recommend a little UML modeling session for the end package structure.

Go in little pieces. After each refacotring, makes sure stuff still runs.

Good Luck

--
Open Source Identity Management: FreeIPA.org

Write the regression test suite _first_ by CatherineCornelius · 2001-12-21 10:46 · Score: 1

If you really have decided that you _must_ rewrite your code (and I presume that you have examined all the alternatives and found none of them viable) then my advice is to write test suites for your code _first_. To perform a functional rewrite, you need a way of automatically testing that the function is performed effectively and faultlessly by the rewritten code.

Get a copy of Expect, or even better, DejaGnu, and start scripting your tests.

Playing with fire... by ZaneMcAuley · 2001-12-21 10:46 · Score: 2, Insightful

Do you have the original engineers for that code?

Most knowledge is in theyre head, not on paper unfortunately. Theyre experience in that area can never be written down.

--
----- Whats wrong with this picture? http://www.revoh.org:1234/whatswrong

Re:Playing with fire... by ZaneMcAuley · 2001-12-21 10:51 · Score: 1

erm, how is this a troll?

--
----- Whats wrong with this picture? http://www.revoh.org:1234/whatswrong
Re:Playing with fire... by ZaneMcAuley · 2001-12-21 10:56 · Score: 1

Maybe the moderators (which seem to be on crack) should have to write an explaination for each moderation done to explain why they moderated that way. :)

--
----- Whats wrong with this picture? http://www.revoh.org:1234/whatswrong

Write the test suite first by z19752002 · 2001-12-21 10:46 · Score: 1

I suggest writing a comprehensive test suite first and making sure that the existing code either works or that the bugs are documented. Then run the test suite against the re-write as often as is reasonable. Use a good source control system. Have seperate development and test systems (it's amazing the number of people who try to test on development systems).

From experience: by tqbf · 2001-12-21 10:46 · Score: 2

Plan a big sweeping rewrite; idealize the system you design based on your experience with the real one. Make the design clean, simple, and flexible.

Don't build it. Instead, evaluate the code you have now and plot a course towards the idealized system. Approach the actual work of the "retrofit" incrementally. Count on having multiple customer-facing revisions of the software tagged and QA'd before the system you're delivering looks anything like the planned rewrite.

Taking baby steps towards a new design is probably the only way you'll ever migrate your project to that design. With the knowledge you've accrued working on the old system, it probably seems straightforward to start from scratch. Even if this isn't wishful thinking, though, it's a waste of time. Part of the discipline of design is an understanding of where the "hot spots" are that can't tolerate inferior implementation, and how to tell those hots spots from the spongy mass of integration, reporting, tracing, and sanitizing that is neither performance sensitive nor mutable enough to justify engineering effort.

You can take early baby steps that make it easier to make holistic changes down the road. Refactor relentlessly. Migrate code recklessly out of subsystems and into common repositories and libraries. I've found it handy to distinguish between "proper" shared library and "dumps" of utility code that don't need scrupulously conceived interfaces.

Most importantly, design for testability. In this respect, the biggest asset you have is the steaming lump of old Perl code you're facing; use it to figure out the expected behavior of subsystems. Write replacements, in modules with clean interfaces, and unit test them. A unit test probes code (functions, statements, internal states) --- NOT entire programs. You'll work ten times faster when you can move forward as a team knowing what components you can trust and what components you need to worry about. You'll work ten times slower if you haven't clarified your outputs, side effects, and return values enough to know whether your replacement parts are valid!

We've seen articles on Slashdot before about this and I agree with the prevailing opinion: rewrites are often seductive traps and time sinks that don't offer value to customers. A better mentality that will eventually get you where you (think you) want to be is to adopt a strategy of constant measurement (testing, profiling, debugging) and improvement.

profile your code by avandesande · 2001-12-21 10:47 · Score: 1

I have coworkers that throw around issues like performance, which can be a red herring if you are doing a web site with only a few users. (one guy insisted on doing all is string stuff as string buffer instead of as strings because it would increase performance)
You may want to look at your code in a similar fashion. There may be just a few modules that are causing problems with redundancy(or whatever is making you want to rewrite them). Why rewrite the ones that have worked faithfully without any problems?

--
love is just extroverted narcissism

Re:profile your code by avandesande · 2001-12-22 04:09 · Score: 1

Performance is nice, but so is fast time to market and maintainability. Unless a site is going to have high traffic and hardware constraints, the latter two come first.

--
love is just extroverted narcissism

Parallel development by rgmoore · 2001-12-21 10:47 · Score: 2

It might seem like and obvious step, but don't throw away the old system until you're sure that the new one works! Keep somebody minding the existing, working system so that if/when your attempt to completely rework it fails you won't be stuck. Once you have rewritten it, try setting it up on a trial basis in parallel to the working system so you can find the crippling bugs before they take down your system.

While it's not a perfect example, Slashdot is actually a decent example with their switch to their new system. They kept the old, crufty version as the primary and set up a beta site with the new software. They knew that there would be problems and got some of their more loyal users to test the new system and only switched over it after they were pretty confident that they had gotten the worst problems out of the way.

You can afford to take a few more risks as long as you keep a known working system around as a fallback.

--

There's no point in questioning authority if you aren't going to listen to the answers.

Document More! by doorbot.com · 2001-12-21 10:49 · Score: 1

Actually, one thing that can be very very helpful is to also write down your assumptions for each function. What format are you assuming the variables will be in when passed to a function? What if they're passed incorrectly? What if someone tries to "hack" the function? How would they go about doing it via a form submittal (for example)?

This is a good exercise and should lead to much more secure code.

Unexploded bombs, not sleeping dogs by ChaosDiscordSimple · 2001-12-21 10:50 · Score: 5, Insightful

Your tangled mass of spaghetti code paths are probably full of almost incomprehensible little design decisions and seemingly out of place declarations and functions, but most of those were probably added as specific fixes for bugs encountered under real-world use.

Yes, and if they're cryptic and uncommented, they are worthless. Eventually one of these incomprehensible, magical fixes will stop working. Perhaps the bug it works around is fixed. Perhaps how the function is being used changes to previously unexpected behavior. Some poor engineer will look at the little big of magic, scratch his head, and be forced make a blind decision about how to fix it. Perhaps he can change the code while leaving the bit of magic in working, but he can't be certain, since he doens't understand it. If the collection of cryptic tweaks becomes dense enough, any attempt to fix a bug or add a feature becomes highly risky.

On a related note, don't let this happen to you. If you add one of these strange little fixes, for the sake of the programmer that follows you, document them. Just a little "Need to toggle the Foo because Qux 1.4 is correctly fails to do so" will bring tears of joy to the eyes of future programmers.

Re:Unexploded bombs, not sleeping dogs by William+Tanksley · 2001-12-21 12:52 · Score: 3, Insightful

You're wrong. The comments you recommend will only /add/ to the mass of code to be grokked. Don't forbid commenting; but don't consider them a solution.

Instead, refactor and unit test. Unlike comments inserted into the middle of the code, unit tests will fail and point you to the reason for the failure. In the above example, when we upgrade to Qux1.5, the unit test which asserts that the Foo is untoggled will fail, and will point right to the function which made the assumption. Bingo -- a quick fix.

-Billy
Re:Unexploded bombs, not sleeping dogs by CodeRx · 2001-12-21 13:38 · Score: 1

Comments are fine as long as they explain why the code is doing what it is, not what the code does or how it does it (that should be obvious from just looking at the code).

Bad Comment:

/* Disable widget by calling disable() method */

Good Comment:

/* Disable widget so it can't be accidently clicked */

-sean
Re:Unexploded bombs, not sleeping dogs by William+Tanksley · 2001-12-21 14:12 · Score: 5, Insightful

Comments used that way are like perfume -- they cover the smell of stinky code. Perfume isn't bad when your code stinks, but better is to make your code not stink.

A better solution would be to extract the thing you're commenting into a function, and name the function so that the comment is unneeded. Don't forget to write a test which checks the purpose of the function. Then (and only then) delete the comment.

MakeNotAccidentallyClickable(widget);

See what I mean? Now it's impossible for your comments to become out of sync with your code -- your code IS your comments.

-Billy
Re:Unexploded bombs, not sleeping dogs by Anonymous Coward · 2001-12-21 16:18 · Score: 2, Funny

My favorite comment to see in the code "This is a bit of a hack"
Re:Unexploded bombs, not sleeping dogs by p3d0 · 2001-12-21 16:49 · Score: 2

I think that the idea that code can somehow take the place of comments is bogus. Code, no matter how clearly written, only tells you what is happening. It can't tell you why.

For instance, why did you make that widget not accidentally clickable? Why not others? Is that widget in particular danger of being accidentally clicked? Are the consequences of accidental clickage very bad?

Code like this seems clearer to me:

/* Make sure user doesn't accidentally delete the document */ disable(widget);

Or:

/* This widget is very large, and liable to be clicked accidentally */ disable(widget);

--
Patrick Doyle
I mod down every jackass who puts his moderation policy in his sig. Oh, wait a sec....
Re:Unexploded bombs, not sleeping dogs by medcalf · 2001-12-21 18:37 · Score: 2

If it's that important to you that "your code IS your comments", just write in COBOL. COBOL is documentation that runs.

--
-- Two men say they're Jesus. One of them must be wrong. - Dire Straits
Re:Unexploded bombs, not sleeping dogs by ednopantz · 2001-12-22 00:13 · Score: 1

Mine too, and you know, it is the most helpful comment I have ever seen. Just when I am thking, "WTF??", I suddenly understand why the code looks like this, and yes, I feel their pain.
Re:Unexploded bombs, not sleeping dogs by William+Tanksley · 2001-12-22 10:18 · Score: 2

I coded for the example I was given, at the level of detail I was given. You've added some more detailed scenarios; and I can code for those too.

The first comment should be a function name, not a comment. There's a LOT more you have to do to prevent document deletion! The comment, as it stands, is misleading, and later modification will possibly have to add more functionality. Just make it a function, so the functionality can be kept together.

The second comment says, "I, the programmer of this system, don't like the UI design." BAD SMELL. Use comments if you must, but a FAR FAR better solution is to fix the UI. Don't you agree?

Now, I'm sure you can invent other details under which those specific solutions are inapplicable -- but for every scenario you invent, I'll invent a way to code it so the comments aren't needed. The only exception is when time or lack of knowledge prevents removing the comments -- and the result is tragic.

-Billy
Re:Unexploded bombs, not sleeping dogs by p3d0 · 2001-12-22 12:33 · Score: 2

The second comment says, "I, the programmer of this system, don't like the UI design." BAD SMELL. Use comments if you must, but a FAR FAR better solution is to fix the UI. Don't you agree?
Hey, you're the one who gave the example, so you tell me why you would call the "MakeNotAccidentallyClickable" function. I can't tell from your code, and that's the exact problem.
It seems to me that my comments have helped you to diagnose problems in the code that were not obvious without those comments.
for every scenario you invent, I'll invent a way to code it so the comments aren't needed
Why would you want to do that? What is the advantage of your scheme over comments?
Your aim of having clearly-written code is admirable. I think code should explain very, very clearly what it is doing. However, explaining the "why" when it is not obvious from the "what" is exactly what comments are good at.

--
Patrick Doyle
I mod down every jackass who puts his moderation policy in his sig. Oh, wait a sec....
Re:Unexploded bombs, not sleeping dogs by William+Tanksley · 2001-12-22 13:21 · Score: 2

Hey, you're the one who gave the example,

I didn't give that example -- look at the history. That was given by someone who thought that the comment was needed (they provided a comment, too). I made a function which contained the code being commented, gave it a name which was essentially identical to the comment, and called it where the comment was. I also claimed that in any real work, I would have first built unit tests which confirmed that the function did what its name claimed.

so you tell me why you would call the "MakeNotAccidentallyClickable" function. I can't tell from your code, and that's the exact problem.

You use it when you want to make something not accidentally clickable. I haven't defined what "something" is, and I haven't defined in what context it's useful. The first job is for the type system, and the second job is for the object system.

It seems to me that my comments have helped you to diagnose problems in the code that were not obvious without those comments.

No! There were no problems in my code; there couldn't be, since I wrote no code. What I did was respond to someone else's problem sketch with a solution sketch. You responded not by pointing out omissions in my solution, but by inventing new problem details and situations. The fact that my example solution didn't cover them hardly matters; it was a solution to a /different/ problem.

What is the advantage of your scheme over comments?

A superb question.

Obviously, it can do everything comments can do except contain language syntax; so it's clear that for most purposes a good function name equals a comment. So why should we choose a function name _over_ a comment?

The first answer is that when more than one line is covered by a comment, those lines form a functional group and MUST be a function of their own. Once you have that, it's silly to not choose the new name to be as descriptive as the comment.

But there's more. If only a single line is covered by the comment, it's still beneficial to extract a function and delete the comment: by encapsulating the "how", you provide opportunity for later changes to the "what".

But the final BIG statement is this: comments can't be tested. Functions can, and a unit test will make it obvious when a function's name doesn't match its function.

However, explaining the "why" when it is not obvious from the "what" is exactly what comments are good at.

Please tell me SOME information in a comment which could not be conveyed in a function name. Fact is, you can't do it. Function names are exactly as expressive as comments.

There is some info that can't be conveyed by code; it's called "requirements". That info can't be conveyed by comments, though, either. It should be contained in use cases and acceptance tests, one acceptance test per use case. Acceptance tests should be automated, but that's a different story, and up to the customer, not the programmer.

-Billy
Re:Unexploded bombs, not sleeping dogs by p3d0 · 2001-12-22 17:26 · Score: 2

I didn't give that example -- look at the history.
I was referring to this code:
MakeNotAccidentallyClickable(widget);
That's what I meant by "example".
No! There were no problems in my code; there couldn't be, since I wrote no code. What I did was respond to someone else's problem sketch with a solution sketch. You responded not by pointing out omissions in my solution, but by inventing new problem details and situations.
Ok, perhaps I made a straw man. The point I was trying to make is that there must be more details to the situation, whatever they are; and because your code didn't specify them, the problems you found in my commented version of the code may or may not exist in yours. One can't know without more information.
Perhaps I can sum up my thoughts this way. Programmers make mistakes. It's hard to tell mistakes apart from deliberate decisions without some indication of the programmer's frame of mind. Thus, comments are only unnecessary in perfect code. If you choose to encode your comments using function names, rather than the features provided by the language specifically for the purpose, that's your choice; but it's not yet clear to me that it is a beneficial one.
A superb question.
And you have provided some superb answers that deserve some thought. I'm not convinced yet, but your position no longer seems ridiculous. :-)
Have you ever come across (or written) any material that discuss the pros and cons of your approach?
I do have a bit of a problem with the assertion that a block of comments should always become a separate function. There is a "sweet spot" when it comes to decomposing functions: with too few functions, they are too large to understand; with too many, algorithms can become scattered and fragmented, like a book with one word on each page. Splitting every function block into a new function seems to run a great risk of fragmenting the code in this way.
To me, function decomposition already serves three purposes: information hiding, code reuse, and divide-and-conquer problem solving. Adding a fourth criterion--of allowing comments to be encoded as function names--seems to raise the possibility that the criteria may conflict, and one may have to be sacrificed for another. Looking at the list, I know which is first on my chopping block.

--
Patrick Doyle
I mod down every jackass who puts his moderation policy in his sig. Oh, wait a sec....
Re:Unexploded bombs, not sleeping dogs by William+Tanksley · 2001-12-27 08:23 · Score: 2

I was referring to this code:
MakeNotAccidentallyClickable(widget);
That's what I meant by "example".

Yes, I wrote that -- but it's a direct translation of the comment someone else told me couldn't be replaced with a function name.

Ok, perhaps I made a straw man. The point I was trying to make is that there must be more details to the situation, whatever they are; and because your code didn't specify them, the problems you found in my commented version of the code may or may not exist in yours. One can't know without more information.

You're absolutely correct. However, this situation MUST be left to the wisdom of the programmer. There's simply nobody else to trust! If the programmer's wrong, he'll write wrong comments as well as wrong function names, and most likely wrong code as well. (See "peer review" and "pair programming" for some solutions to this.)

Perhaps I can sum up my thoughts this way. Programmers make mistakes.

Just a quick interruption: so do maintainers. The programmers shouldn't assume the maintainers are going to make all the mistakes in the book, and neither should the maintainers assume that about the programmers. A lot of common programming practice seems to assume that a horde of idiots are going to take over the code tomorrow, so anything I don't finish today will get destroyed and perverted. This results in late hours, aggressive comments (like "DON'T TOUCH THIS!"), a lack of tests (because you don't think the later programmers will have the wisdom to improve you code), and so on. Some cynicism is appropriate; after all, the later programmers will have at best a foggy memory of what you were thinking when you wrote it; however, at the same time, it's just as likely that the later programmers will understand the overall problem BETTER than you do.

It's hard to tell mistakes apart from deliberate decisions without some indication of the programmer's frame of mind.

This is true, and it's why I want code to be self-documenting. The code has to wisely and correctly make comments unneeded; you can't just delete the information contained in a comment. The advantage is huge: you go from *two* sources if information which may contradict each other (don't tell me that you've never seen out-of-sync comments ;-) to one source which is automatically tested.

Thus, comments are only unnecessary in perfect code.

Not really true. There's no such thing as perfect code; comments are unneeded in code which tells the same story as the comments would. And such code (unlike perfect code) is not overly hard to construct.

If you choose to encode your comments using function names, rather than the features provided by the language specifically for the purpose, that's your choice;

Pardon, but function names are usually more of a language feature than comments; they're checked by the language, they're part of the syntax... Comments are just bolt-ons. And this IS the purpose of function names; but most people don't use them wisely.

but it's not yet clear to me that it is a beneficial one.

Well, I've given some reasons above. To me, the biggest two are as follows:

1. information can't get out of sync.
2. all information appears in two contexts: actual use (in the code), and prototypical use (in a unit test). Function names will look dramatically bad if they're chosen inappropriately. Comments can be ignored.

Comments can get out of sync with the code, in which case all of their "benefits" go to waste; and they can appear in only one place, and aren't checked there.

Have you ever come across (or written) any material that discuss the pros and cons of your approach?

It's a critical component of Extreme Programming, which is being discussed pro and con in a lot of places. I can also link to a few semi-balanced discussions (good paper, but biased toward the rather management-heavier SEI CMM model, which is needed for some projects). BTW, I was going to provide a con link, but the one I found is BAD -- hasty generalizations are just FLYING. Every single "danger" he cites is completely unjustified by experience and study, and many of the quotes he makes are out of context. It looks more like a Usenet post than a web article.

Oh, here's dmoz's directory page.

Anyhow, to implement what I'm describing you'll need at least Refactoring and Unit Testing from XP's arsenal. There's no reason you should have to adopt more, although once you have those in the others usually can be added at will. You also have to practice egoless programming -- don't be offended if someone changes the name of one of your functions; instead, make sure that the name is still appropriate to the use (if it's not, we have a bug being formed, since the person misunderstood the purpose of the function; you should show him a unit test in which the old name makes sense but the new one doesn't).

To me, function decomposition already serves three purposes: information hiding, code reuse, and divide-and-conquer problem solving. Adding a fourth criterion--of allowing comments to be encoded as function names--seems to raise the possibility that the criteria may conflict, and one may have to be sacrificed for another. Looking at the list, I know which is first on my chopping block.

Information hiding: true, but part of the purpose of IH is exposing only the correct information. That's part of the job of the function's name. So really, these two work together.

Code reuse: how can you reuse code if you don't know what it does? By choosing a good name, you are able to tell when things are being reused, and how appropriate they are to the situation. Note that the first time you extract a function, it'll have a very specific name (because you probably took it from a comment). The first time you reuse that function, you'll probably want to think of a little better name, and you'll probably also make some code changes -- the code worked in the one special case before, but now it has to work in two cases. As you keep reusing, your code keeps becoming easier to reuse; the name and the code keep becoming more general. So those two work together as well.

Divide-and-conquer: indeed, this is important -- but it's hard to argue that bad naming will help your strategy here.

-Billy
Re:Unexploded bombs, not sleeping dogs by p3d0 · 2001-12-28 10:11 · Score: 1

Alright, thanks for the links. I'll ponder that some more.
A few more small things...
Pardon, but function names are usually more of a language feature than comments; they're checked by the language, they're part of the syntax... Comments are just bolt-ons. And this IS the purpose of function names; but most people don't use them wisely.
A function's name is not checked against what it does, any more than a comment is. As for most people not using comments wisely, I think that relates to your point that one has to trust the programmer because there's nobody else to trust.
A dummy that lets comments get out of synch is just as likely to leave function names unchanged when their implementation changes. I'm not sure I buy your argument that poorly named functions stand out more than incorrect comments.
Code reuse: how can you reuse code if you don't know what it does?
I think now you're arguing my side. I have been advocating naming a function for what it does. You're the one arguing that it should be named for why it does it, remember? :-)
Divide-and-conquer: indeed, this is important -- but it's hard to argue that bad naming will help your strategy here.
I hope you don't actually think I'm advocating the use of bad names. I'm just advocating naming functions by "what" rather than "why".
Anyway, thanks for the discussion.

--
Patrick Doyle
I mod down every jackass who puts his moderation policy in his sig. Oh, wait a sec....
Re:Unexploded bombs, not sleeping dogs by William+Tanksley · 2001-12-28 14:37 · Score: 2

Good points, again.

A function's name is not checked against what it does, any more than a comment is.

Well, yes; it is. A function's name appears in many places; if functions are supposed to have clear names, the lack of one will be very clear at a glance. A function which is being used for the wrong purpose should be obvious, and the best way to make it obvious is to make its name describe its purpose. Imagine seeing a function named 'SetFontAngle' in the middle of a vector rotation routine. You'd KNOW it was being used in a hackish way, and later font modifications could hammer it! How would you ever notice that if the function were instead named 'SetGraphicAngle' (i.e. it changed a cell named 'GraphicAngle' which happened to be shared by both vectors and fonts)?

Note that you're recommending naming functions for what they do; but the true story of what a function does is actually told by its implementation code. What a function does should be irrelevant to its use; for example, I may have a function which sets a specific bit in a specific I/O port, but that doesn't mean I name the function SetResetBitAndThenPollFor7854ChipReady. More likely, I name it for its intent: Reset7851Chip. Later on, when a reset bug is discovered, I implement the workaround without changing the name -- your function name would have to be changed to SetResetBitAndThenPollFor7854ChipReadyThenClearAll DMARegisters.

OTOH, suppose I'm writing some export unlocking code for the same chip, and step 5 of the unlocking procedure is to SetResetBitAndThenPollFor7854ChipReady. I would naturally notice your function name and would quite possibly use that function, because the name claims to tell what the function does -- but in your mind, that function is supposed to be a Reset7854Chip. What happens when the workaround has to be put in later? The only disciplined answer is that the person who originally wrote the function has to notice that it's being used elsewhere (hopefully), and rather than changing the old function, create a new function which calls the first and then zeroes the DMA registers. The new function would have the name we mentioned above, of course. Now what happens when a third programmer has to reset the chip? Which function does he choose? There's literally no way to know -- perhaps the comments in the calling functions will tell him, but that's a lot of source digging.

Code reuse: how can you reuse code if you don't know what it does?
I think now you're arguing my side. I have been advocating naming a function for what it does. You're the one arguing that it should be named for why it does it, remember? :-)

I think we're getting confused here between 'what', 'how', and 'why'. I'm using 'what' to mean 'what is intended'; you're using it, I suspect, to mean 'what is being done'. I would not use 'what' for that purpose; 'how' is traditionally used. However, I should not use 'what' for my purpose, since 'why' is less ambiguous.

So I want to name functions based on their 'why'; you want to put the why in a comment, and name functions based on their 'how'.

I hope you don't actually think I'm advocating the use of bad names.

I apologise abjectly. That was a terrible connotation on my part.

However, you were clearly saying that function naming was a constraint which might get in the way of writing good functions. If you're going to abandon my constraint because it hinders divide-and-conquer, then you should equally consider abandoning all other naming constraints.

I'm just advocating naming functions by "what" rather than "why".

I hope you see why I believe 'what' is the wrong thing for a function name to say -- at least if the 'what' is 'what this function does'.

-Billy
Re:Unexploded bombs, not sleeping dogs by p3d0 · 2001-12-28 18:57 · Score: 1

I think we're getting confused here between 'what', 'how', and 'why'. I'm using 'what' to mean 'what is intended'; you're using it, I suspect, to mean 'what is being done'.
I think this is indeed a source of confusion. When I say a function name should indicate "what" that function does, I don't mean just what it does now. I mean what its specification indicates that it is supposed to do. This, I think, covers your bit clearing example: if the functions's spec merely claims that it resets a thing, then it should just be called resetThing.
I had perceived that your claim was that a wrapper function called resetThingToIndicateSuccess should be added, for example, to convey not only the function's spec, but also the programmer's intent. My position was that the "to indicate success" part, being indicative of the programmer's motivation for calling the function, should be in a comment rather than in the function's name.
Perhaps we are both mistaken about each other's position? Anyway, any misunderstanding aside, I think we probably still disagree on how to name functions. :-)

--
Patrick Doyle
I mod down every jackass who puts his moderation policy in his sig. Oh, wait a sec....
Re:Unexploded bombs, not sleeping dogs by William+Tanksley · 2001-12-29 07:01 · Score: 2

Perhaps we are both mistaken about each other's position? Anyway, any misunderstanding aside, I think we probably still disagree on how to name functions. :-)

I think we're on the same page, actually. If you'd name a function resetThing then you're naming it according to its intent, not according to its contents.

However, you're also missing a point -- I'm not resetting the chip to indicate success; I'm simply setting the bit which usually means 'reset' and polling the device ready (which happens to coincide with the intended way of resetting the chip). The chip doesn't actually reset; it simply finishes unlocking its cryptographic functionality. So I don't call resetChip, because my purpose isn't to reset the chip, even though I think the procedure is the same. I would be wiser to call a function propagateUnlocking (or some such name). Therefore, when the workaround I mentioned is added to resetChip(), it breaks no code -- because everyone knows that a function named resetChip will be used only when a true full reset is needed.

My point is, as you discuss, that the programmer's intent in calling a function should be the main thing conveyed by the function's name. A function should never be called if its name fails to match your intent; if you examine the implementation, and it matches your needs exactly, you should at least change the function's name so that it fits into your intent as well as into the intent of its previous use. That way, when it comes time to modify the old function, the maintainer will know unmistakably that there's someone using the function in a way which was not originally anticipated.

Of course, if you choose to do that, you MUST also create some new unit tests for the renamed function which illustrate its new use. That way the maintainer will not only know _that_ there is a new use and intent for the function; he'll know also exactly what that new use it.

-Billy

Worse is Better? by qbalus · 2001-12-21 10:50 · Score: 1

Here is an interesting link that leads to others links

http://www.dreamsongs.com/WorseIsBetter.html

If the project is in Java... by Curt+Cox · 2001-12-21 10:52 · Score: 1

I don't know how well this translates into Perl, but if the project is in Java, the steps are pretty straightforward.

1. Determine a clean architecture for how you would have solved the problem, given the benefit of hindsight.
2. Create new interfaces based on your new architecture.
3. Code facades to make your old code implement the new design.
4. One by one, re-implement your new interfaces until the new implementation has solved all of the problems that led you to re-implement in the fist place.

Of course, during development and maintenance, you should be doing all of this stuff everyday, anyway. Reimplementing everything just means the team has agreed on a global change in design, as opposed to the local design changes you don't need any co-operation to achieve.
If you follow this approach, you always can always have working code at the end of the day.

Similar Situation... by Unreal+One · 2001-12-21 10:54 · Score: 1

I've written and maintain a similarly large web application for my company, that was all done in Active Server Pages and SQL Server.

All reusable code, and authentication verification in my web app is in a single 'functions.asp' file in the root of my project. Every single page, page in the project has an include for this file which takes care of the authentication, and provides that ASP code to the rest of the app.

Also in the root, I keep a javascript-functions.asp which works the same way but is included only in pages that have html forms. This file contains common client side functions to validate forms and what-not...

On the same note, I usually use a single CSS file that is also called universally throughout the app to simplify maintenance of the look of the app.

On the database side I'd recommend using stored procedures as absolutely much as possible. Obviously try to make all your transactions atomic and reusable in stored procedures to optimize performance and simplicity.

That's my 2 cents...

buy a formatter by GutterBunny · 2001-12-21 10:56 · Score: 1

We're going to have to make some hard decisions about code style.

The best thing to do about code style is find some method or function code formatter (built into your ide) that most everyone can agree on. Basically, it should be very easy to format your code after writing it. Then code formatting is a non-issue. Everyone lives (and dies?) by that formatting.

--
managers...why god invented purgatory

Re:buy a formatter by Doctor+Memory · 2001-12-21 14:22 · Score: 1

We used to use indent(1) as part of the front-end to our SCMS. Whenever you checked in a module, it was automatically reformatted to the "official" standard. When you checked something out, you had the option of reformatting it to your personal style, or just leaving it as-is. I modified the SCMS checkout command to search for a .indent.pro file in your home directory and automatically invoke indent(1) if it found one. This by and large made everyone (except some of the old guys) pretty happy.

--
Just junk food for thought...

Small steps by ChaosDiscordSimple · 2001-12-21 10:56 · Score: 5, Insightful

Large overhauls are usually mistakes. Details in the previous code are lost. If the overhaul takes non-trivial time, people become frustrated that two weeks ago they had a working (if problematic) system and today most of the system doesn't work.

Instead, make small incremental changes. Pick something lots of code is replicating and attempt to unify it into a shared code base. Spend some time documenting key parts of the code. Pick a particularlly hairy class or function and untangle some of the worst bits. These sorts of changes can reveal minor bugs, build up to significant improvements, and leave you satisfied at the end of the day that you improved things.

If a signficant overhaul is necessary, try to overhaul portions while maintaining the existing bits.

Re:If it aint broke... by de_boer_man · 2001-12-21 10:57 · Score: 1

More realistically:

For every time you modify a part of code, ten new bugs are introduced.

--
.sig wanted. Inquire within.

Retrofit Mercilessly But Quickly by Mentifex · 2001-12-21 10:57 · Score: 1

When making a comprehensive retrofit of your AI Mind for Robots Code, you have to sacrifice the early instantiations for the sake of smooth sailing on The Road Ahead to the Technological Singularity.

A major Open Source AI initiative has no choice but to r00t out and correct mistakes ASAP as soon as they are found. History will judge us be how kind we were to our robot and cyborg mind-brethren, not holding them back but equipping them to meet the world. Therefore Mind.Forth AI is for the robots themselves, while the JavaScript AI (q.v.) caters to human users who need tutorial instruction. Welcome aboard, all AI coders of good will towards bots.

Compartmentalize & Destroy by DotComVictim · 2001-12-21 10:57 · Score: 5, Informative

1) Identify common functionality.

2) Encapsulate in libraries

3) Be sure to extract enough generality that you don't have special case functions

4) Don't extract so much generality that functional interfaces become unwieldy.

5) Write everything in the same language.

6) Find any complex pieces or algorithms. If they can be simplified or re-written, do it. If not, save it so you don't need to debug it again.

7) Throw everything else away.

Joel is Right by qbalus · 2001-12-21 10:58 · Score: 1

Check out this site. Lots of great advice

http://www.joelonsoftware.com/navLinks/fog000000 02 47.html

Re:If it aint broke... by ZaneMcAuley · 2001-12-21 10:58 · Score: 1

Has anybody got links to real world stats on coding for things like bug introduction vs modification rates etc...

--
----- Whats wrong with this picture? http://www.revoh.org:1234/whatswrong

Regression test suit by angel'o'sphere · 2001-12-21 10:59 · Score: 5, Insightful

The first thing you definitly have to do is Setting up a test suit for regressoin testing.

For those not familair with the term 'regression test':

Program a set of so called "test drivers", programms calling your code: routines/scripts.

Define test data, either in a DB or in flat files, used by those driver programs.

The test programs and test data needs to work with the old code, of course.

As the new code should behave similar you only need to adjust PATH or script names to let the test programs work with the new code.

Plan your project by defining which test cases(test program plus test data) should work at a planned milestone with the changed code.

After making changes rerun all tests.

Well, there is a lot more you could do, but that above is minimum (basic software engineering, sorry no art involved here).

Regards,
angel'o'sphere

--
Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.

The Blind Leading the Blind by Junks+Jerzey · 2001-12-21 11:01 · Score: 1, Troll

"Do such and such because I read in an interview that..."

"There's this cool buzzword called refactoring that my professor mentioned and so you should..."

"[blah blah blah] object orientation [blah blah blah]"

"I saw a book about Extreme Programming at the bookstore and..."

Little by little by Mike+Schiraldi · 2001-12-21 11:05 · Score: 3, Interesting

Don't stop development on the old tree and shift all work to the restructuring project.

Instead, leave most of the manpower working on the old tree as always. Take a small team of your best people and have them gather input from the others, while the others keep working on the old tree. Then have the small team outline the changes that need to be made.

Work on the changes while simultaneously working on the usual stuff. Say, 90% of your manpower should do what they do now, and 10% should work on restructuring things. One mouthful at a time.

When you have a mouthful that you think is ready, branch the old tree. Merge your diffs into the branch, TEST IT, and if things seem to work, land the change onto the old tree.

--

--
Mod up a post Rob doesn't like and you'll never mod again

Re:Little by little by William+Tanksley · 2001-12-21 13:03 · Score: 2

Wise, but may I suggest the obvious extension?

Don't play with code trees. Use only one "tree": the current, live one. Don't let one code base rot unmaintained while the other one is hacked to bits, untested.

Do take one mouthful at a time. Add unit tests, make sure they work. Now add a unit test for something you WANT to work, but which doesn't. Make that unit test work by implementing the feature. Release the result. Repeat.

-Billy

It is called Refactoring. by burtonator · 2001-12-21 11:05 · Score: 5, Insightful

Most of the computer industry now calls this Refactoring.

I would HIGHLY recommend the book "Refactoring" by Martin Fowler.

There is a number of things you should do here.

1. Document your plan and come up with an official PROPOSAL document. Allow others to comment on this document and incorporate fix all relevant issues.

I started using this under the Apache Jetspeed project and now a lot of other Apache projects are accepting this practice.

It really allows the community to become involved in your changes and encourages constructive feedback and involvement.

2. Break this into phases. You should NOT attempt to do this all at once. Each phase should be isolated and should consist of one unit of work.

Each phase should be branched off of CVS, worked on, stabilized, brought back into HEAD and tagged. You should then RUN this code in a semi-deployment role for a period of time to correct all issues which WILL arise with the updated code.

After this you can then start your next phase.

3. UNIT TESTS! If management (assuming you have management) has approved the time for this type of refactor then you need to take the time and write Unit Tests for each major component.

It is important that Unit Testing can sometimes be just as hard, if not harder, than the actual development itself.

In some situations you can avoid Unit Testing, some here are going to call me crazy for saying this but it is true. In a lot of high level applications, which are NOT used as libraries by other applications, you can bypass Unit Testing in order to increase development time. This is a dangerous practice but it is often outweighed by the extra functionality you will end up with in your product.

Anyway. Good luck!

Kevin

Re:It is called Refactoring. by Coulson · 2001-12-21 11:37 · Score: 1

Just to emphasize the point burton made, the importance of unit testing cannot be overstated!

Consider setting up an automated testing process which runs your test packages every hour or two minutes.

When you write code, write tests. When you check in code, check in tests. It's truly liberating! Once your code is in and tested, it's not your fault if it breaks! (Well, most of the time. :>). It's the fault of whoever checked in after you without running the tests.

Comphrehensive tests let you know the exact state of your product at any time, and when you want to test on a new machine configuration, you can just run the test suite to get immediate feedback on what works and what's broken.

Refactoring and designing new architectures is one of the best parts of the job, so good luck and enjoy it!
Re:It is called Refactoring. by gidds · 2001-12-21 12:51 · Score: 1

Break this into phases. You should NOT attempt to do this all at once. Each phase should be isolated and should consist of one unit of work.
Amen!!! The more you try to do at once, the less likely it is to ever see the light of day.
You must break this into small, manageable chunks, so that after each one you still have a working, maintainable, testable system. (If this means you have some duplicated functionality during the process, so be it.) If you have regular deliverables it keeps you focussed, reins in scope creep, demonstrates progress, and of course lets you Release Early And Release Often!

--
Ceterum censeo subscriptionem esse delendam.
Re:It is called Refactoring. by Xthlc · 2001-12-21 13:41 · Score: 1

> In some situations you can avoid Unit Testing,
> some here are going to call me crazy for saying
> this but it is true. In a lot of high level
> applications, which are NOT used as libraries
> by other applications, you can bypass Unit
> Testing in order to increase development time.
> This is a dangerous practice but it is often
> outweighed by the extra functionality you will
> end up with in your product.

Amen.

While unit testing is important for key libraries, it's not a panacea. Inexperienced programmers can parrot the maxims of XP all they like, however the fact is a unit testing policy can create a terrible drag on application development if poorly implemented.

A sensible, real-world policy DOES NOT operate on the scale that %90 of the XP acolytes out there recommend. I'm sick to death of smug self-styled gurus (who, mysteriously, don't seem to do much programming since they started writing articles for Javaworld) that admonish us sloppy developers for our unwillingness to sacrifice key features in order to make time to build unit tests for every radio button. One of the most important components of unit testing is understanding the difference between a unit test and a waste of time.

Oh, and don't bother with articles or books that discuss unit testing, unless you interpret them in the most general terms. I've never seen an example of a unit test that wasn't unbelievably trite. Unit tests for an imaginary number storage class? Please. What about when the code unit you need to test manages asynchronous input from five different sources, where that input consists of five different complex packages of information that must be unraveled, interpreted, and transduced based on the code unit's multi-tiered internal state?

Unit testing is an art, not a science -- so don't sacrifice your project and your customer on the altar of XP. Explore unit testing, but don't embark on a comprehensive policy until you understand what you're getting into. A half-assed unit-testing policy is much worse than none at all.
Re:It is called Refactoring. by KyleCordes · 2001-12-21 14:14 · Score: 2

I found a couple of things strange in this post:

[unwillingness to sacrifice key features in order to make time to build unit tests for every radio button]

Personally, I do test-intensive coding because it helps me go faster (build more functionality per unit time). If it didn't, I wouldn't. I also don't see the point in unit testing a radio button.
Re:It is called Refactoring. by KyleCordes · 2001-12-21 14:20 · Score: 2

Oops, posted early. Other strange things:

[asynchronous input from five different sources, where that input consists of five different complex packages]

I've found that it makes a lot more sense to focus on testing the hard stuff, like this, than simple things like the ones you mentioned. By the way, it's also quite useful (totally apart from XP) to decompose such a beast into pieces you can test effectively.

[so don't sacrifice your project and your customer on the altar of XP]

I've never heard of this happening. Can you point to any project which did XP (most or all of it, not just talked about it) and resulted in an unhappy customer for that reason?

I've mostly seen people do little bits of XP (ignoring most of it), fail, and blame XP. A typical story is "we did XP, but we didn't do much testing, we didn't pair program, we didn't have a customer on site, we didn't integrate very often, we didn't track our progress, and our project went badly, therefore XP is bad". Weird.
Re:It is called Refactoring. by Xthlc · 2001-12-21 18:34 · Score: 1

Re-reading my post, I guess it came across that I think XP, per se, is bad.

On the contrary, I feel that many of the assertions XP makes are quite good software engineering practices. However, I also feel that many professed adherents of XP become fascinated with the methodology and forget that, in the end, software engineering is about building a product that makes the customer (not the fake customer on your XP team, the real customer holding the checkbook) happy. This means that XP methodologies must be applied with careful judgement and an eye towards flexibility -- the most elegantly organized project in the world will still net you zero income if it doesn't get out the door on time with all of the required features.

That said . . .

> I've found that it makes a lot more sense to
> focus on testing the hard stuff, like this,
> than simple things like the ones you mentioned.

Exactly my point.
However, I think very few proponents of XP actually understand the principle outlined in this article, namely that XP is a set of best practices and NOT a color-by-numbers guide to successful software engineering. There's a great deal of XP fundamentalism out there, mostly on the part of development managers who are excited to the point of bursting by the childishly simple ABCs of XP according to [insert snide columnist here].

> By the way, it's also quite useful (totally
> apart from XP) to decompose such a beast into
> pieces you can test effectively.

Unfortunately, the actual code that inspired my description is about as atomic as you can get in an asynchronous system. A few inputs, a few outputs, and a very well-defined domain of functionality. However, both internal state and external state, as well as the timing of incoming messages, are quite complex, and non-trivial to simulate as part of a unit test.

My point being: unit testing is Hard in many situations, and most XP literature I've read glosses over that fact and downplays the necessity of picking the right things to unit test.

> Can you point to any project which did XP (most
> or all of it, not just talked about it) and
> resulted in an unhappy customer for that reason?

Well, I can't point to any project that managed to successfully implement "pure" XP, the kind that I've only seen in a training center. Most of the ones I know of that tried it gave up on it pretty quickly, usually because either:
1) They didn't have enough staffing to constantly pair program AND get all the necessary features completed
2) Their time frame wasn't sufficiently flexible to allow for the additional time incurred by XP-style unit testing (as opposed to a more traditional test suite).
3) Their Real Customer (not the XP customer) was not a nice and reasonable person with an understanding of how software development ought to work. He kept demanding additional features, some of which were negotiated away; the remainder necessitated that process be streamlined in order to deliver them on time.

While only a couple of these projects failed, they all wasted effort and money on implementing a methodology of which they had little understanding, but were led to believe was Just That Simple.

Nobody can sensibly argue that the generalities espoused by XP are bad. Testing is good. Iterative development is good. Refactoring is good. Pair programming *can* be good, in the right situations. However, the manner in which XP's principles are presented and disseminated (i.e. "the cult of XP") is, for the most part, misleading. Like most cults, The Cult of XP claims to have the one right and true solution to every problem in existence, which can be yours if you just do A, B, and C. Like many cults, The Cult of XP has some good things to say; in fact, it DOES benefit you to join up if your problem fits certain constraints. Also, unfortunately, like many cults, the Cult of XP is more interested in casting aspersions on its detractors (who are "afraid of change" or "unenlightened") than it is in engaging in serious examination of situations in which its dogma doesn't exactly hold true.
Re:It is called Refactoring. by KyleCordes · 2001-12-22 02:58 · Score: 2

[not the fake customer on your XP team, the real customer holding the checkbook]

XP says to have the real customer. If you have a fake customer, it's not "pure" XP. Many people who like XP but can't get a real customer use a "proxy customer", but they (hopefully) admit that they aren't really doing XP per se, just something similar to it.

You are of course 100% right that satisfying a fake customer is irrelevant if you don't satisfy the person with the checkbook! In my mind, customer equals check-writer.

[There's a great deal of XP fundamentalism out there, mostly on the part of development managers]

That actually doesn't match my experience at all. I see many developers who are very excited about XP, but many development managers seem to like some of the ideas but be *far* from "fundamentalists" about it. Rather they want to immediately disregard major portions of it, typically.

[about as atomic as you can get in an asynchronous system]

I've built a number of asyncronous systems, and agree they can be challenging. I do believe it makes testing even more important. If I was building the system you describe, I would probably insist on testing that piece.

Regarding choosings unit tests, the frequent advice is to test everything that could possibly break, which is pretty vague - but it definately includes this async beast.

[Well, I can't point to any project that managed to successfully implement "pure" XP]

[didn't have enough staffing to constantly pair program] etc.

My last project didn't do RUP. And it didn't do Scrum, although we no doubt did bits and pieces of each. I don't use it as an example of either of those methods succeeding or failing.

Your projects didn't do XP, it did bits and pieces of it, omitting large important portions. So why would it indicate anything about XP? Because the people talked about XP a lot while they did something different?

[The Cult of XP claims to have the one right and true solution]

I've read thousands of messages on the XP mailing list, I've been to an XP conference, I've read a few of the books, and I've seen presentations by most of "big names". In the process, I have not seen such claims.

Who is making this claim?

Complete rewrite necessary or a waste of resources by beamz · 2001-12-21 11:05 · Score: 4, Insightful

Not too long ago a link was posted to an interview with Joel Spolsky who used to work at Microsoft.

His comments about code reworking and rewriting have a lot of insight in them.

Here are some quotes from his article:

SMS: Joel, what, in your opinion, is the single greatest development sin a software company can commit?

Joel: Deciding to completely rewrite your product from scratch, on the theory that all your code is messy and bug prone and is bloated and needs to be completely rethought and rebuild from ground zero.

SMS: Uh, what's wrong with that?

Joel: Because it's almost never true. It's not like code rusts if it's not used. The idea that new code is better than old is patently absurd. Old code has been used. It has been tested. Lots of bugs have been found, and they've been fixed. There's nothing wrong with it.

The point is this... The benefits of spending time rewriting code completely may be a waste of the companies resources, this is for you to determine.

His interview is here:
http://www.softwaremarketsolution.com/index_2.ht m

and his site has more information about the concepts here:
http://joel.editthispage.com/articles/fog0000000 06 9.html

Re:that is LARGE? by Doppler00 · 2001-12-21 11:07 · Score: 1

Code is unlikely to be that large. That would mean that you would have been typing 100,000,000 characters. I don't think that existing code libraries count, but original code does. Try to write a 1,000,000 character document and see how long it takes.

modularity/incremental rollout/unit tests/iterate by tim_maroney · 2001-12-21 11:08 · Score: 5, Informative

I second all that has been said about making sure that you really need to do this and that it is worth the time and risk. One sign that you may need to do so is an excessive reopened bug rate, where fixing one bug often creates another bug due to side effects and component interactions. If you decide that it is, then the three keys to success will be modularity, incremental rollout, and unit tests.

Modularity is probably what you're already thinking about. Go over the old code base, in a code review, and find where the same thing is done over and over either with copy-and-paste code -- the bane of crap engineers -- or with different code that serves the same ends. Look for repeated sequences in particular. Create a new library that encapsulates those pieces of code.

Incremental rollout is vital. Only replace small parts of your system at a time, doing complete retests frequently. Don't write a new encapsulated routine and then roll it out to each of the three dozen places in which it appears in the whole code base. Write the whole function library, with unit tests, and then start applying it to separable modules one by one, retesting as you go. Otherwise I guarantee the whole thing will fall apart and you won't be able to tell why. Ideally, you might set a threshold on the rate of replacement of old modules and work primarily on creating new modules with the abstracted logic.

Unit tests are crucial because, as noted, the messiness of your old code probably conceals a lot of necessary logic. We had this great phenomenon on Apple's Copland where people who had never used the old OS managers were rewriting them in C or C++ from the assembly source. When they saw something in the assembly they didn't understand, they just ignored it. Guess what -- the new managers didn't have any backwards compatibility. The only answer to this is to have a thorough unit test for any module that you replace, against which you can test the new version. This also confers other quality benefits, but during a rewrite it's critical.

Finally, once you have replaced a significant number of your modules, you will find that new levels of abstraction appear. The average size of each function or method will have shrunk considerably, and now it becomes possible to see new repeated code sequences that were not visible due to the old cruft. Move these into your new library modules and start using them in continuing replacement work. In addition, start going back -- slowly and incrementally -- through the already converted modules and replacing the repeated sequences with calls to the new abstractions.

Finally, figure out how you got into this mess in the first place. The worst programmer habit I know of is copy-and-paste coding instead of using subroutines. You can tell people not to do it, but some always will. Those people should be bid farewell -- you can't afford their overhead. Other common problems include lack of planning and review, a code first and think later mentality. Start moving your organization up the levels of the CMM and you may find that you wind up with fewer modules that need replacement.

Hope this helps.

Tim

I'd like some info on this as well... by josepha48 · 2001-12-21 11:09 · Score: 2

in particular how do you write a large program? I work on a program that has over 3000 files that are over 30 to 300 pages each. The software does alot for and has been around for over 30 years and has just had code sloppily added to it. So how does one go about rewriting an application that noone knows all the functionality of?

--

Only 'flamers' flame!

Go slowly young Jedi by 0xA · 2001-12-21 11:19 · Score: 2

This is a redundant post but I think it's imortant enough to be repeated.

Don't set out to refactor your codebase as one big project. Try and split up the code by functional areas and take them on one at a time. Now, this doesn't really work too well most of the time. You're going to run into way too many places where things are interdependant but try anyways.

If you can, resist changes to your database schema while you are refactoring code. Having both of these thing happen at the same time is pretty scary.

You say CGI so I asume you are using PERL, look into OO PERL, it's worth it. Even if you don't want to go OO all the way the 2 big things that can make your life easier are packages and layering.

Using packages, especially for things like DB access can save you tons of time and headaches. You have one place where you run a query and build a hash, all of your code calls it when it needs the data. HUGE adavantages here.

Layering your design is helpful ass well. I've found that you can do a lot of good if you have designed Data Access, Logic and Presentation layers seperately. All each one of these layers needs to do is take the hash ref passed by the other layer and do X with it. You can rebuild each layer at will as long as the data structures passed between them don't change.

Solicited suggestion by ciurana · 2001-12-21 11:21 · Score: 2

chizor wrote:

My programming team is considering making some sweeping changes to our code base (150+ perl CGIs, over a meg of code)... What suggestions might readers have about tackling a large-scale retrofit?

My advise to successfully accomplish the changes:

Where possible, use Python or Java instead PERL
If there is a line-of-business piece of code (i.e. if your company's bread is on the line when this fails) make sure you can roll the original code back if the new version doesn't perform
Divide the programming team in groups of 2-4 people
Use XP techniques for development
Have an external group evaluate progress to keep people honest; this group is responsible for testing that functionality meets or exceeds that of the original system

I led the development and migration of some very large mission-critical systems in my career. Too many programmers making decisions on-the-fly, totally centralized management, or a "leave the technical folks alone until they're done" attitude are sure recipies for disaster.

Good luck with the changes.

Merry Christmas, and God bless us everyone!

E

--
http://eugeneciurana.com | http://ciurana.eu

Re:Solicited suggestion by m_ilya · 2001-12-21 12:45 · Score: 2, Funny
Where possible, use Python or Java instead PERL.
Indead. I'd like to extend this advice:
- Where possible, use Perl instead of PERL.
--
--
Ilya Martynov (http://martynov.org/)

DON'T DO IT! by mcrbids · 2001-12-21 11:26 · Score: 3, Interesting

As posted elsewhere, re-writing is painful, and generally NOT A GOOD IDEA!

I currently maintain a code base of around 120,000 lines of php and html (written by myself in a long, hard year) and have had to "retrofit" it a few times.

I find that when it's time to do an "over-haul" it's generally best to:

1) Pretend I know nothing - redesign from scratch. Write out a spec with flow charts, DB table definitions, etc. - make it VERY DETAILED. Spend lots of time at it. More time spent here saves even more time later.

2) Ignore your spec. (See step three)

3) When a bug comes up, or new functionality needs to be added to the codebase, refer to the spec built in 1, and build to it, and then put in compatability wrappers to work with the existing codebase.

Make these compatability wrappers log their calling in some way, based on a global variable. This allows you to see when they're no longer needed simply by defining a variable in a config file and waiting a while.

4) You'll be slowly bringing the application up to the new spec - eventually you'll reach a point where it's easier just to bring the remaining pieces up to snuff than to build more abstraction wrappers. When you get to that point, you'll find most of the work is already done, just finish it to the spec and remove the compatability wrappers.

This can still be a painful process, but at least it isn't a "gun to your head"! This allows you to regression test your work as it's done, resulting in a more stable deliverable, and you can still meet clients' needs in the meantime without making them wait 6 months while you re-write all your stuff.

Hope this helps...

--
I have no problem with your religion until you decide it's reason to deprive others of the truth.

Re:DON'T DO IT! by esconsult1 · 2001-12-21 11:43 · Score: 1

I know what you mean!

I've also been working on approx 7000+ lines of PHP code over the past several months http://makunu.com

After 3 months coming back to your code, even though it's well commented you start to wonder... "WTF was I doing!?". But guess what, it works! And it usually works well.

I've found with this project, its better to just go in and modify the code. Another thing... do it in with lots of little subs/functions that serve as black boxes. Because of this approach, when the time came for us to switch from Mysql to the more robust Postgresql, then we didn't have to change any code.

Another thing we did was to make the system to be like a "plug-in" type of architecture, so adding or getting rid of modules was childs play without having to rip out the heart of the application.

One drawback to our large library of functions, is that it took a fairly long time to load with each web request. This library contains massive quantities of small functions that do everthing for logging errors to database independent access. But was there a way around this besides senselessly duplicating code... I dont think so.

With such a large code base, I really have no plans to rewrite it any time soon. This system will end up being massively patched over the next few years.

--
Newsfollow.com
Re:DON'T DO IT! by cpparm · 2001-12-21 12:24 · Score: 1

One drawback to our large library of functions, is that it took a fairly long time to load with each web request. This library contains massive quantities of small functions that do everthing for logging errors to database independent access. But was there a way around this besides senselessly duplicating code... I dont think so.

How do you know that's the reason it's slow? Function calls are cheap, performance wise. If you profile your code and look at what is slowing it down, you might be surprised.

Step #1 by Sanity · 2001-12-21 11:27 · Score: 2

...don't use Perl ;-)

[-1 Flamebait]

Re:Step #1 by cscx · 2001-12-21 17:36 · Score: 1

I hear that C# and VB.NET can scale your code down a LOT ;-)...

You have a Big Ball of Mud by mahlen · 2001-12-21 11:30 · Score: 2, Interesting

First, read this essay: The Big Ball of Mud. It is an interesting look at why, when we all know that spaghetti, gnarly, twisted code is bad, that it happens anyway (hint: it may mirror your understanding of the problem).

Ignore the "don't touch it" naysayers. Even before it's done, it'll be much nicer code to deal with. You can make decisions with less nagging doubts. You'll code onward with gusto. You'll be able to accurately predict the names of methods without looking them up.

Test the current state of things at all points through the process. I'm hoping that you have lots of automated tests you can run everytime code is checked in; if not, make them FIRST, before the overhaul. You are majorly diverting the intent of the code at hundreds of points; you can run astray in so many places that the above naysayers would be correct. Constantly assure yourseleves that the code is working. Go out of your way to ensure that the code is buildable and runnable, even to the point of writing scaffolding you know will be soon thrown away.

Burning a little incense every day in obesience to the Gods can't hurt, and will make the room smell nice.

mahlen

Shantytowns are usually built from common, inexpensive materials and simple tools. Shantytowns can be built using relatively unskilled labor. Even though the labor force is "unskilled" in the customary sense, the construction and maintenance of this sort of housing can be quite labor intensive. There is little specialization. Each housing unit is constructed and maintained primarily by its inhabitants, and each inhabitant must be a jack of all the necessary trades. There is little concern for infrastructure, since infrastructure requires coordination and capital, and specialized resources, equipment, and skills. There is little overall planning or regulation of growth. Shantytowns emerge where there is a need for housing, a surplus of unskilled labor, and a dearth of capital investment. Shantytowns fulfill an immediate, local need for housing by bringing available resources to bear on the problem. Loftier architectural goals are a luxury that has to wait. -- from "The Big Ball of Mud"

Quick suggestions...not so quick solution by runswithd6s · 2001-12-21 11:31 · Score: 4, Insightful

As everyone else should be telling you, your existing codebase is a very valuable resource. It's been tested and debugged a number of times. It's mature, it's stable, it's there. Don't throw out the baby with the bathwater.

Now, on coding standards and how to incorporate them into a legacy project. Your concern is NOT format. The format of your code, such as indentation, spacing, etc, should be the least of yoru concerns. Everyone has their own style, but there are wonderful tools that you can use to force everyone to a single style, ident and astyle just to name a couple. Use a wrapper script to the CVS (or similar system) on checkins. Force the code through an automated cleanup. check the code back out and make sure it compiles/runs as expected.

What you should worry about is how much your design team has embraced the "black box" design principal. Parameters go in, results come out with no "side-effects" that impact the remainder of the code. Make your code re-entry safe, i.e. stay away from globally scoped variables as much as possible.

Someone's going to give you the whole OO-Design sales pitch. Yes it's nice on paper, but don't sell out because something looks nice on paper. I learned this the hard way. I have a tendency to overdesign things. When OO, this gets to be really scary. I waste my time writing object classes for "everything" instead of simply designing the software to its functionality spec. Make things more "object oriented", "functional", or "blackboxed" when you find yourself repeating code elsewhere in the application.

Don't spend a lot of time with naming standards such as Hungarian, Modified Hungarian, etc. Find a style that you and your team is comfortable with for the Interface API level. Below the Interface API, be more lenient. It's likely that portion of the code will undergo many changes anyway.

And most importantly, document! This is the singlemost important issue of any coding project. Either force your developers to write docs as they go, use embedded documentation solutions, or hire a techwriter to follow you and your team around for a few months. Documented API is the quickest way to start someone off in the project, and a great way to keep track of the flow of the program.

--
assert(expired(knowledge)); /* core dump */

Re:Quick suggestions...not so quick solution by Fubari · 2001-12-21 18:13 · Score: 1

About over design in OO - check out YAGNI (You Ain't Gonna Need It) and other XP principles here. Or, take a look at this article, Is Design Dead?.

Suggestion for rewrite method by mcpkaaos · 2001-12-21 11:36 · Score: 1

When you look ahead to a large scale rewrite, there are few things you must keep in mind before even attempting such a task. If you expect to jump in and hope to tread water, think again. Unless you sit down before you start dividing the coding tasks up amongst the engineers, you are simply begging to go down the same road twice.

Before you begin, make sure you weight the cost of a rewrite against the cost of maintenance. Many engineers (myself included) feel so ashamed for poor code that they want to dive right in and fix it. Whereas this could potentially solve the problem as far as you can see it, it won't make the financial people happy after watching your team spend 4 times the money than they would just dealing with maintenance. In other words, make sure you are adding a hell of a lot of value with such a huge task. This is a good time to analyze the risk of a rewrite as well. How critical is the system? How many users? How will this affect any users that have built systems around your interfaces? Will they need to recode portions of their systems as well? There are many more questions to ask yourself, but these are generally the ones that will come from people outside of your immediate group (and usually right away.)

If you do decide to rewrite, there are a few things you should do before analyzing the current code base. Review the original requirements and compare them with any new requirements you may have now (or later, if you have any insight into the future, such as rumors of change in business practice, etc.) Once you have worked out any inconsistencies or pitfalls, it's time to compare the original design against the original requirements. Look for inconsistencies between the two. Usually you will come across a few requirements that didn't quite make it into the design as expected (if at all.) If the two are so far apart that there is no feasible reconciliation, your hopes of a rewrite are pretty much dashed - it's time to start looking at a fresh start. Hopefully, however, the design met the requirements adequately, with no "creative designing" put in place to get around poorly thought out requirements. Note that you should do all of this before you start going through the code. Leave the code out of it for now.

Once you have established that you can safely move forward (i.e., the design followed the requirements), hold code review sessions with your entire team, targeted at the original code. Ensure that the design was followed properly and that any bug fixes/workarounds/etc are clearly documented and easy to spot. Usually this is not the case, else you wouldn't be contemplating a full rewrite. This is by far the most painful (and time consuming) task, and is usually the point of failure in rewrites. If not enough attention is paid to the caveats of what you have now (and why they are there and not resolved), you will paint yourself into the same corner with the next go 'round.

Without spending too much time on that topic (which is easy to do, believe me, I've done my fair share of code reviews with large teams), I'll assume that you are planning on continuing with the rewrite.

Once you've made it past the reviews, it's time for a new design. You won't necessarily need to completely redesign the system, but you will have to (at the very least) document a plan that will provide a solid path to the new system. Even if your functionality isn't expected to change, it will, so a mild to moderate design document will be your life-jacket. This will be difficult, as it's no easy thing to feel like you're simply writing the same words over again, but stick with it: you will find new methods in this design that will help to avoid the traps you were caught in the first time around. Have the utmost diligence with this step. Review the design often, and brutally. Bring in members from other teams for sanity-checks - a fresh set of eyes can do wonders, even if they are unfamiliar with your product.

Now, onto the coding!

Whatever you do, do NOT use the original code for anything more than a guideline. Create new projects, start with blank sources, whatever it takes to start anew. When you rewrite a system, you are actually writing a new system that simply mimics the funtionality of the old. Your performance is going to change, your stability is going to change. Don't think for a moment that you will end up with exactly what you had before, only prettier. This is a pitfall that fools many an engineer, especially the engineer who trumpeted the idea for the change. ;)

Finally, more than just about anything else, remember this: when rewriting a system, be sure to adopt a coding style that will prevent spaghetti code. Be true to a modular design and cleanly separate the functionality into logical groups. This is always easier in an object oriented language, but very achievable in the procedural world as well. Make sure that this time you document your code inline. In other words, instead of simply commenting your code to say, "This function does this, then that", say something like, "This function performs the following business rules, as dictated by requirement #x, conforming to the component design section 5.1." Tedious at best, but far more valuable than most people realize. Make sure you also document the bug fixes that were migrated into the new code base, also. Eventually, someone will want to review the code to make sure the same bugs that were once in the old system aren't going to rear their ugly little heads in the new system. Finally, unit test the living hell out of your code as you write it. This is the last ditch effort (before QA and integration testing) for ensuring stability (which is the main justification in the end for the rewrite.)

Anyway, sorry for posting a novel, I hope something I had to say here can be of some use. Best of luck should you tackle the rewrite! :)

-------
[McP]KAAOS

--
It goes from God, to Jerry, to me.

Tablize It! by Tablizer · 2001-12-21 11:39 · Score: 1, Informative

Put as much in possible into tables. That makes it easier to find, view, add new pages, menus, product categories, etc.

Of course there will be exceptionss that need code instead of tables. Have special "override" points that allow you to override stuff as needed. Make the "overrides" their own function. (Example: sub foo(row); if row.clientID=7, then color3 = blue;.....). The override points are a lot like "on_x" IED-GUI or database triggers/events.

If you store SQL in tables for a report generator for instance, there may be situations that tablized SQL cannot handle very well (or not worth adding new columns if only one or two instances differ from the majority). So make a routine to override each SQL section (SELECT, WHERE, ORDER BY, etc.), and another to override the whole SQL for the really complex stuff like correlated subqueries.

I have successfully tablized HTML forms also. It is a lot of up-front work, but is better than coding each from scratch if you have hundreds.

--
Table-ized A.I.

Don't touch the semantics by JojoLinkyBob · 2001-12-21 11:39 · Score: 1

Noticed you have a lot of perl scripts, which is similar to the situation I've been in before. Come up with some agreeable style (i.e. indenting) but stay away from the semantic issues, because a perl script speaks identity loudly. Case in point, how do you reference an item in a list: @list[0] or $list[0], Both are accepted, and in way, both are intuitive, it just depends on how you're learned the ropes. End Transmission.

--
-jc

Re:Don't touch the semantics by chromatic · 2001-12-21 13:08 · Score: 1

Sorry, that's wrong. The correct way to refer to a single item in a list is $list[0]. @list[0] refers to a list of elements. (An list of one item is still a list.) See this Slashdot article for more.
You're asking for subtle bugs by conflating them. (Whether or not this was a good design decision, it'll be fixed in Perl 6.)

--
how to invest, a novice's guide
Re:Don't touch the semantics by JojoLinkyBob · 2001-12-21 13:45 · Score: 1

I'm talking about simple cases such as:

@list = ("apple","banana","chihuahua");
print "\n@list[0] @list[1] @list[2]"; print "\n$list[0] $list[1] $list[2]";

Both of these print statements have the same results. My point is that every programmer has their own special coding nuances. Taking these away to adopt to some standard might be counter-productive, because in most cases the original developer will probably be making revisions later, and some of us probably know how hard it can be to read someone else's perl :) Speaking of semantics, maybe my original comment was worded precisely enough, and therefore misinterpreted...

--
-jc
Re:Don't touch the semantics by chromatic · 2001-12-21 15:30 · Score: 1

I still disagree. If I came across @list[0] in a code review, I'd mark it as wrong. There's a reason Perl warns about it. Any Perl developer who doesn't yet understand the idea of context needs to know it to do anything serious with the language. Any Perl developer who knows better and uses it in production code just to be cute needs a better outlet for his creativity.
If your organization eschews coding standards (especially those that follow compiler warnings!), you'll be very lucky to avoid several common pitfalls. I'd rather have every developer on my team willing and able to modify any part of the code. If that means I have to use 2-space tabs, so be it. If that means I have to teach other people how to use anonymous subroutines and the map operator, so be it.
I trust my fellow developers to follow best practices for a reason -- to make development (including maintenance) easier for everyone. They're smart enough to adapt to intelligent standards.

--
how to invest, a novice's guide
Re:Don't touch the semantics by thogard · 2001-12-22 02:15 · Score: 1

I would back that up as being wrong...
$list[0] is a single element and @list[0] is a list with a single element.

This is a common problem when coders don't understand what is going on. In this case perl will build a new list of one item and then deal with it in a list context when all that needed (and wanted) is to deal with one element. This type of problem isn't perl specifc (but the syntax is) as these types of ineffiencies show up in things like OOD data access methods and many over designed programs.

Can we change the word "Developers"? by IRNI · 2001-12-21 11:44 · Score: 1, Insightful

Ever since I saw that horrible, nightmarish Steve Ballmer video.... "DEVELOPERS DEVELOPERS DEVELOPERS DEVELOPERS DEVELOPERS DEVELOPERS... DEVELOPERS DEVELOPERS DEVELOPERS!!!!!!!!!!!!" Oh god lets call it anything but Developers! The image of a big, fat, bald, sweaty man keeps entering my head when I hear that word.

Rewrite? by stevewz · 2001-12-21 11:52 · Score: 1

Treat your existing product as one big, well developed prototype. Not many people have that advantage in a large-scale development effort. If you can somehow put a function-freeze on the existing product, that will enable to you to develop the new product to match the functionality of the old product without trying to hit a moving target. When possible, scope out and implement your improvements and efficiency improvements at the same time. In many ways, this is the best way to do a large-scale project.

Consistency by Malc · 2001-12-21 11:53 · Score: 1

Unless you want to go and reformat all of your old code, leave it as it is. Apply coding standards to all new code. With old code, use whatever style is already present in the file. If you think that somebody can automate a way of reformating all of the old code in a half a day, do it... but don't bother wasting anymore time on it. Only considering reformating old code if somebody does a lot of work (e.g. bug fixing or functionality changes) in the future on the file anyway.

Yes, I know, it is nice to have a clean slate. But nice is all it is... it's not really necessary, especially for parts of the old code that doesn't get touched very often anyway.

For the Love of all that is Holy! by 955301 · 2001-12-21 11:54 · Score: 1

Don't do it Man!

action:Leaps behind the nearest dune and holds head low in the sand:action

Just let it go! You've laid down your mine field (developed) and activated the detinators (deployed). Don't go walking back through and "sweeping" anything!

Kaaaabooooooommmmmmm! I expect to come back to your page to see camel parts everywhere and your little Nomad guy hanging from the cool purple font!

--
You are checking your backups, aren't you?

No such thing by Neubie+again · 2001-12-21 11:56 · Score: 1

R U Serious!?!

First, read all previous comments.

Second, realize that a "sweeping rewrite" entails just as much work as the first write. The only thing you will (should) get from current code is a functionalty demonstration. (I.e., the current version is a "throwaway" prototype.)

Then, take everything you know about software development, learn a lot more, write the requirements specification, get it approved, then start thinking about technology, languages, and design. Coding should be about a third of the effort - the middle third. (Final third is test/debug/test cycle. If you didn't know that already, then don't even attempt this.)

Re:Help decipher a comment from 1982 by Pinball+Wizard · 2001-12-21 11:58 · Score: 1

Its a pun of the phrase: "Absence makes the heart grow fonder".

Actually that was kind of funny, but you would have to be familiar with said phrase.

--

No, Thursday's out. How about never - is never good for you?

I think he can be the most helpful. by nkabbara · 2001-12-21 12:02 · Score: 1

Go to joel.editthispage.com. He has years of hands on experiance and when it comes to software management his advice is right on.

Re:Use framework, seperate code form display.. and by cwinters · 2001-12-21 12:03 · Score: 1

Perl has plenty of web application frameworks, and they're all approach the problem differently. See the mod_perl page for starters. <plug>OpenInteract is probably one of the most complete of them as it includes a framework for data access, security, user and group management -- all the fun stuff.</plug>

Chris

--

Chris
M-x auto-bs-mode

Does it work? by litewoheat · 2001-12-21 12:06 · Score: 2, Insightful

If it works don't break it. Pointless rewrites do nothing but feed the programmers ego and knock the company out of business. Its happened over and over and over and over again. Progress, not regress.

the obvious by archen · 2001-12-21 12:17 · Score: 1

If you're sticking with Perl make sure that all the code runs flawlessly with warnings.

#!/usr/bin/perl -wT

and

use strict;

I've actually done this on 2 large projects.. by Beatlebum · 2001-12-21 12:19 · Score: 1

..both projects were >250,000 lines of code. Assuming you don't have the resources to redesign from the ground up, my advice would be to retrofit in a piece-meal way. Try and factor out key pieces of code and rewrite if necessary. Whenever you make a change, run the old and changed system side-by-side to verify you have not introduced bugs. As you progress, it makes sense to write scripts which can automatically perform this side-by-side QA. As you rewrite sections of code, use the Facade Pattern to maintain compatibility with old code you don't have time to rewrite. If you do things right, the Facade can be stripped out later if you do decide to change internal API's.

Re:Extreme Programming - IT WORKS! by unixpro · 2001-12-21 12:22 · Score: 1

I've used this exact method (XP) for cleaning up a project as well as with new projects. It takes some work, but the rewards are far more than you have any right to expect.

Re: Design from scratch by Spud+the+Ninja · 2001-12-21 12:28 · Score: 1

This is, I think, crucial. If you're looking at retro-fitting your project, then I'd assume that it doesn't work the way you'd like it to.

If it's really just a case of redundant functionality, then it wasn't designed well in the first place, or later add-ons/bug-fixes broke the original design. That should be fixed. If you need to add more functionality, then design in important in avoiding breaking the current design.

It seemed like a pain in the ass when I first started, but a good design job save so much development time, time that could be used later for documentation!

--
You can never put too much water in a nuclear reactor.

My suggestions by Pinball+Wizard · 2001-12-21 12:32 · Score: 5, Informative

My programming team is considering making some sweeping changes to our code base (150+ perl CGIs, over a meg of code

First of all, I think its important to realize that you have a medium-sized website and not a big software project. Therefore, some of the above comments recommending refactoring, UML, and eXtreme programming may be a bit overkill.

Web programming != software development! Its usually done at a much faster pace. Even if an object-oriented approach is taken, you are still probably talking about simple function libraries rather than complex C++ or Java classes. Again, overkill.

150 files is still a small enough project to be managed by one or two decent coders. Actually, I just looked at the amount of stuff I've written over the years for my online bookstore and its more like 500 files and over 4 megs of code. I don't feel like its too much of a job to manage this codebase by myself.

So, here are my recommendations.

You probably have gotten better at programming since the time you started your project. Take a few of the most recent CGIs you have written and compare them to the first ones you wrote. You just might notice a glaring difference in the quality. Also, the first pages you wrote are likely to be among the most important in your project, yet they are also likely the worst quality-wise.

Regardless of what language you program in, I think its important that you can tell whats going on in the program by reading the comments. If a manager can understand what a program does by reading the English bit, there's a good chance other programmers will be able to jump in and help as well. One specific rule I also follow: if you do regexes, say IN ENGLISH what those regexes do. I say this because regexes are one of the hardest things to read.

Look for any code that can be "factored out" of your scripts and put those into function libraries. Then include those in your program. The only problem with this occurs when you have huge function libraries that slow down your scripts when you include them. In that case you would logically separate your functions into different files. I have included very common functions in different include files, so I can make the actual code compiled or interpreted as small as possible.

Consider using a flowcharting tool as an aid to programming and/or documenting your code.

Standardize how you name variables and functions, write comments, identation, and spacing.

Be sure and include the date you write your scripts in the comments, in case the filesystem wipes this out.

I'm sure theres other things I've left out, but following the above guidelines have helped me do exactly what you are trying to do: manage a growing codebase. But don't forget, this is web programming, not rocket science, and some of the above suggestions may be more trouble than they are worth. Keep it simple.

--

No, Thursday's out. How about never - is never good for you?

Re:My suggestions by angel'o'sphere · 2001-12-22 05:08 · Score: 1

Hm,

The coment is valid in some points but not so good in others.

<i>
Therefore, some of the above comments recommending refactoring, UML, and eXtreme programming may be a bit overkill.
</i>

XP is a very good method for this project size.

Also it *IS* a big project. According to Brown et al. "Anti Patterns" the perfect project size is 4 month done by 4 people. Is any property greater, then the project can be considered big, or wrong approached (1 man working 2 years is WRONG, except you did it in 6 month and then maintaned it for 18)

In conjunction with the allready mentioned book by Fowler: "Refactoring" XP would give you:
o Pair Programming
4 eyes glancing over the code while making changes(applying refactorings)
o Unit or Regression Testing, allready mentioned by a lot of posters
o Other XP properties you get for free(customer in your house, you are your own customer)

The poster above, is also contradicting himself: No UML but later he suggests making flow charts.
Erm ... one diagram type used in UML *is* simply a flow chart, albeit its called different: Activity Diagram.

His suggestion to factor code out in libraries is of course a very good one. (Besides he thinks its overkill :-) that is called: Refactoring ...)

However, why not documenting that with UML?

A simply dependency between packages or components would tell you: this package uses that one (code in this package uses code in that package).

You do not need to use a UML tool, you can do that on paper, photocopy it and distribute it.

Nothing is faster to grap then a visual description of how your code depends on each other and how your code interacts with each other.

If you know your code very well, *YOU* do not need that. But during reworking the code, it will change faster than the team can follow (keeping the mental model in sync with the software build).

So having a picture of what your goal is and also one describing how it currently looks is very good idea.

Finaly I dissagree that there is a hughe difference in "Web Programming" and other programmings.

Depending on the risks involved you might choose different software development processes.

However XP is one of the simplest processes available at all. If you are not even able to perform that one, you surely do not perform any process. (That does not mean that switching to any process takes some time, and patience to do it!! And it does not mean that everything to be done while switching is EASY)

Instead on relying on english comments another poster pointed out, the code should be the comment, speaking function names like:
makeBig(x);
instead of:
f(x); // this makes x big

Thats perfectly right!
For regexp, you only have english comments of course!!

If your manger can not read that, he will also not understand plain written english comments.

Another poster also pointed out: If the management agreed on unit tests(regression tests) you should use the time planned/alloacated for that for describing tests and conducting them.

Hu homm ....

Somehow you will test your software. The only point is software engineers might it not call testing what you do while you 'do testing'.

There is nothing a manager has to descide and shedule or to account on in a special way.

If you do planned testing, the management CAN account.
If you do not plan, it can't.

If you test your web application with a dummy users clicking in a browser on pages ... you waste FAR more money and resources then if you do standard software practice and write unit tests.

Sidenote: about 10% of your effords should go into those unit tests(writing and conducting!).

Its realy strage to me that a lot of posters (I remember about 8) said: no UML!

I can not imagine to start a project without spending at least a man day for absolut minimum designing.

If you use UML and a CASE system you spend 60% of the whole project time with making and refactroing your designs.

But your whole project time might be 50% to 80% shorter than doing it without CASE and UML.

As there are not many CASE tools generatng perl code its of course not worth to spend that amount of time in a PERL environment.

However a day for makng some descriptive diagrams is allways worth it.

angel'o'sphere

--
Cost free eBook I read (by iBook/Kobo/Amazon/ObookO/Gutenberg etc.): "The Green Odyssey" by Philip Jose Farmer.

IBM's (re-) write of OS/400 by Degrees · 2001-12-21 12:38 · Score: 3, Interesting

IBM was faced with a simlar problem when they came out with the RISC version of the AS/400 hardware. They needed to re-write, but they had to maintain absolute compatibility. I think the articles describing this were in Dr. Dobb's Journal (although I might be wrong on this). Unfortunately, I do not have URL's for this.

One of the success factors they found was documenting the interfaces for each and every call between modules. The documentation turned out to be excruciatingly precise - but this led to zero ambiguity (and thus 100% interoperability). It also required meetings (sometimes arguments) between programmers to hash out what was actually going to happen. Another factor was that they decided to allow zero 'overloading' of functions by different modules. A programmer was not allowed to duplicate someone else's work, nor create a second, incompatible version of a function provided in a different module. If the function was provided by someone else's module, the programmer had to call it (properly). The result was that they reaped the benefit of object oriented programming - reuse and refinement of modular libraries.

It would be better if you could get the real scoop from the real programmers - but this might give you something to think about.

--
"The most sensible request of government we make is not, "Do something!" But "Quit it!"

Set up a message board by dilute · 2001-12-21 12:43 · Score: 1

I would set up a Slashdot-like Weblog in order to collect people's learning during code review, and be able to browse and search the results later.

Microsoft, paragon of quality code by dmoen · 2001-12-21 12:47 · Score: 2

This advice makes sense if you are embedded in a culture like Microsoft, where you have a huge quantity of legacy code, and a really shitty software process. Microsoft has no record of what design decisions were responsible for making their code the way it is; they just have the code itself. They can't change it because they don't know what they might break by doing this.

If you enforce a good software process for every line of code that is written, then you have more flexibility. For example, suppose that you use Extreme Programming. Then you will have a unit tests for every bug that was fixed. Each time you re-run the unit tests, you find out if any of your previous bug fixes have become undone. This gives you the freedom to refactor your code whenever it needs to be refactored.

Doug Moen.

--
I have written a truly remarkable program which this sig is too small to contain.

Re:Microsoft, paragon of quality code by William+Tanksley · 2001-12-21 13:05 · Score: 2

Microsoft doesn't have quality code, but it DOES have reams of documentation. They have, and follow sporadically, a really strong process -- in fact, a huge number of really strong processes.

In many of their products, the design decisions are documented, and documented well.

As with all documentation at that level of detail, good luck finding the right page...

Other than that, you're right.

-Billy
Re:Microsoft, paragon of quality code by dmoen · 2001-12-21 13:46 · Score: 2

Interesting. I think that a software process can be "strong" (tedious, bureaucratic) without necessary being "good" (ie, effective). Does Microsoft's current process actually deal with the problem of not being able to refactor your code without losing all of your bug fixes? Keeping all of your design history in labrynthine documentation that is difficult to find anything in is expensive, but doesn't solve this particular problem.

Doug Moen.

--
I have written a truly remarkable program which this sig is too small to contain.
Re:Microsoft, paragon of quality code by William+Tanksley · 2001-12-21 14:16 · Score: 2

Well said, on all points. Microsoft's problem is that they have too many processes, and IMO most of them are too heavy (and therefore they aren't really being well-applied). So they don't have A current process; they have many of them.

I personally have a bias against strong processes. I do admit that they can work, and work well; but they also cost a LOT, in every way; it's very hard to tell whether you're applying enough of the process to make it work. I'm definitely an XP fan; there, your process is light (although it IS hard), and feedback is immediate and automatic.

-Billy

Perl is a *tool* by tmoertel · 2001-12-21 12:47 · Score: 3, Insightful

Perl is for system administrators and system administrators-cum-developers, not real software development.

Baloney.

Contrary to what lots of people around here seem to think, especially those who like to make wide-sweeping declarations about what things are and what they are not, Perl is a tool. Nothing less, nothing more. Like all tools, it can be used to create well-engineered systems, and it can be used to create crap.

The community that grew up around Perl is all-welcoming and generally free of elitism. That's why a lot of newbie programmers and "system administrators-cum-developers" use Perl -- because they can without getting crapped on by others who think they know better. As a result, there is a lot of ametaurish-looking Perl code out there, but that's not a result of the language, that's a result of the all-inclusive set of people who use Perl.

Let's be clear: If you write code in any language and the code sucks, it's your fault, not the language's -- the language is just a tool.

Don't blame the problems of programmers on their tools.

--
Easy, automatic testing for Perl.

Re:Perl is a *tool* by markj02 · 2001-12-21 17:01 · Score: 1

Let's be clear: If you write code in any language and the code sucks, it's your fault, not the language's -- the language is just a tool. Don't blame the problems of programmers on their tools.
There is no problem. Good programmers can produce good code in any language. However, even good programmers take longer to produce good code in some languages than in others. Perl is a language in which, in my experience, it takes a lot of time to create good code. Perl's strength is that producing quick one-of solutions is fast, which is also good, which isn't bad either.
Re:Perl is a *tool* by derobert · 2001-12-22 04:00 · Score: 1

Use the correct tool for the job. Perl, for example, would not be my tool of choice for a compiler. INTERCAL would not be my tool of choice for anything except torture, and job security.

The important thing is to use the right tool for the job. If you use the wrong tool, the code will suck more than it otherwise would. And that, of course, is the programmers fault for choosing the wrong tool.

Learn from those who have succeeded at this... by William+Tanksley · 2001-12-21 12:48 · Score: 4, Interesting

Some here are warning you that major changes always require a total rewrite; yet in real life, total rewrites result in inability to compete (look how long the Netscape rewrite paralysed Netscape, unable to meet Microsoft's challenge!). There's some good discussion of the danger of rewriting at a former MS software engineer's site, and some limited advice about how to get away without doing it.

But you've decided to rework rather than rewrite, you say, so I have no doubt you'll ignore the naysayers here. So what CAN you do? After all, as you recognise, reworking is dangerous!

The following rules have worked for me; I've refined my own experience with advice from Fowler's Refactoring, a book as useful as Design Patterns, and with study of Extreme Programming, a design methodology forged in the traditions of Smalltalk, and in the knowledge that maintainance, the most important and expensive part of software engineering, is also the least studied.

First, do the simplest thing that could possibly work. Don't EVER take your program out of commission for more than a day; make sure it runs at the end of each day. If you're doing something and at the end of the day your code base is broken, STRONGLY consider throwing away your changes and going back to the design stages.

Second, rely on unit tests extensively. Start every change by writing as extensive of a unit test as possible. Unit test every function you touch, BEFORE you touch it, and after. Unit test every change you make, and run the unit test BEFORE you make the change to ensure that it fails (i.e. it detects the change). Write your unit tests BEFORE you write code, whenever possible; you'll objectively know your code is done when your unit tests pass.

Third, don't design too far ahead; you don't know what tricks the old code is going to throw at you. Implement one feature at a time, bringing the code into compliance. Once everything has a unit test (thanks to your following the above principles), THEN you can safely embark on larger design changes -- and in the meantime, you have working code with new features, a win even if your customer/boss/manager decides not to continue.

Fourth, don't be afraid to redesign your own code. The stuff you wrote has more tests, so it's safer to change, but it's more likely than the old code to lack some critical understanding only age can give.

Fifth, use the principles of refactoring. Whenever possible split each code change into two parts: first, a part which changes the structure of the code without changing its function (and which therefore allows you to run the same unit tests); and second, a part which uses the new structure to perform a new function (thereby requiring new unit tests).

Good luck. If you want more advice, read up on Extreme Programming.

-Billy

Re:Learn from those who have succeeded at this... by startled · 2001-12-21 14:45 · Score: 2

Much agreed. We've found, though, that sometimes it's tough to write as many tests as you really should. Before you even start changing code, write a pile of tests-- it's best if you can just allocate a week or three for it.

Beyond that, it's tough to stay motivated about writing tests; people want to write new code, not test the old. So mix it in-- we've decided to require a test to be checked in with every bug fix, figuring that if it broke once, it'll break again.

While I'm at it, I totally disagree that a total rewrite is always the wrong choice-- you just have to keep significant resources going ahead on the old stuff. If your old stuff really was that bad, the new will catch up with it without too much trouble. If it doesn't, well, you didn't really need the rewrite.
Re:Learn from those who have succeeded at this... by William+Tanksley · 2001-12-22 10:31 · Score: 2

Well said. It really, really helps, in my experience, to develop the habit of coding to the tests.

That is, you first write the tests; do the design while you're writing the tests (and have the tests express the design). Then write the code, trying constantly to run it against the tests. AS SOON as all your tests pass, commit the code; you're done. Don't refine anything until you have written a test which fails under the current code.

You see what happens? Pretty soon, instead of tests being your enemies which break your code, they're your friends which tell you when your job is done. You'll quickly find that they're reliable friends -- and long-lasting ones, too.

-Billy

Re:modularity/incremental rollout/unit tests/itera by Tomster · 2001-12-21 12:48 · Score: 1

Me too!

Kidding aside, I wish I could mod the parent up to 6. Tim's right on. Probably the biggest mistake you can make in this kind of task is to bite off more than you can chew. And the more crud there is in the codebase, the easier it is to do! So take it slow. Refactoring is like orbital mechanics; you slow down to speed up :).

I recommend Martin Fowler's "Refactoring" book. It's geared towards object-oriented Java development, but the basic principles will apply across languages.

My company made the mistake of trying to rewrite our product from scratch. The "Tiger Team" effort got about halfway completed. Nobody knows if it would have actually resulted in a better product.

-Thomas

Perlmonks.org by consumer · 2001-12-21 12:52 · Score: 2, Informative

You might want to reconsider perl as the language of choice for a large scale application. I realize I'm posting this comment to a Perl system, but Perl hangs together like an immense kludge of a language.

What a monstrous Christmas troll you are. What qualifies you to make this judgement? Perl, like any other mature language, has people who write kludges with it and people who write clean, elegant code with it. Your lousy Perl code is not indicative of a language problem.

That said, you're probably stuck with it, and AFAIK, you may be forging new paths in programming for reusability by applying the above concepts to Perl.

And this shows how much you know, since the Perl community is full of activity around design patterns, refactoring tools, unit-testing, and other practices which are in favor among experienced people trying to write solid, maintainable code.

My suggestion for those who are looking for actual useful advice rather than this kind of "throw away all your work and learn Java" crap, would be to head straight for http://perlmonks.org/ and read up. There's tons of advice there for serious Perl coders. You would also do well to start reading the mod_perl mailing list, which often has informative discussions about these issues.

Re:Perlmonks.org by benedict · 2001-12-21 13:35 · Score: 2

I know some serious perl heads who say they
wouldn't suggest using it for a large project.

--
Ben "You have your mind on computers, it seems."

Minimise Untested Documentation! by William+Tanksley · 2001-12-21 12:57 · Score: 3, Interesting

Depend on tests, not documents. Documents lie. Tests don't.

Use as little documentation as possible, BUT NO LESS. (In other words, for heaven's sake don't ever try to get away without any documentation.) Documentation should state fundamental premises -- things like "The customer wants X." and "This code checks that I'm fulfilling the customer's requirement of X." Documentation should not state intrinsic properties -- the statement "this code does X" should be made as a test, not a document.

-Billy

Re:Minimise Untested Documentation! by Arandir · 2001-12-21 14:55 · Score: 3, Informative

I'll have to disagree. Document as much as possible, BUT NO MORE! But this documentation must be meaningful and relevant. Otherwise it is worse than useless.

Document every function, listing the purpose of every parameter and the meaning of the return value. Document why you are doing something if there is more than one way to do it. If a section of code fixes a bug, document what it does and do not just document a bug number. Use self-documenting code whenever possible (ei. name your variables and functions meaningfully). Use a document generation tool if possible (javadoc, doxygen, etc). Write the user docs at the same time you're writing the code. Incorporate the user docs into the code if at all possible.

Here is a bad comment: // store x2 in fu
Here is a good comment: // save the index because we will use it later
Here is a bad comment: // this is not meant for you to understand
Here is a good comment: // please see Smith's "The Black Magic of Filesystems" for details on this algorithm

The most important part of commenting is realizing who is going to read it. It may be you. But in all likelihood it will be someone you never met long after you have left the project or even the company. It may be code or design reviewers who don't know programming but do know how to block projects they can't understand. If it's Open Source, it may be some brilliant programmer wanting to fix a bug but without the time to puzzle over your constructs.

In every code review I have ever been in, someone has made some silly assumption about the code, with the final recommendation that that section of code be commented better to avoid future silly assumptions.

--
A Government Is a Body of People, Usually Notably Ungoverned

A Code Nazi Disagrees by dmoen · 2001-12-21 13:02 · Score: 2

Your tangled mass of spaghetti code paths are probably full of almost incomprehensible little design decisions and seemingly out of place declarations and functions, but most of those were probably added as specific fixes for bugs encountered under real-world use.

If you need to make big changes to the code, then you are already hosed. If you have no record of what bugs you fixed, and no way of testing if those bug fixes are still working after you make code changes, then keeping all of your spagetti in place is no guarantee that you won't re-introduce bugs later on.

The Code Nazi approach is to write a unit test for each bug you fix. And you have an easy way to re-run all of your unit tests. After you make a big change, you can test if all of your bug fixes are still working. And now you have the flexibility to refactor your code whenever you want. Which means you can keep your code base clean and elegant, and you'll never reach the crisis point that this group has reached. This is the approach advocated by Extreme Programming, as well as by other software disciplines.

Doug Moen

--
I have written a truly remarkable program which this sig is too small to contain.

Re:A Code Nazi Disagrees by xah · 2001-12-21 14:06 · Score: 1

Great. Now "Nazi" becomes a good word in some cases. Just wonderful.

--
I am not a lawyer. Do not take my words as legal advice. If you need legal advice, consult an attorney.

PerlDoc anyone? by Bat_Masterson · 2001-12-21 13:07 · Score: 1

If you're going to refactor your Perl code, I would suggest pushing to have the code well documented with a tool like PerlDoc. One of the key problems that new people have with code is trying to understand the decisions that previous people made in the design of the code. Pushing more documentation into the code will help that in the long run. Plus, making documentation a standard part of your code will help make the code better as the developers will tend to throw out and/or rewrite code that they can't explain in the documentation.

Hardware isn't cheap by drig · 2001-12-21 13:07 · Score: 2

I agree with your point and I'd like to expand on it.

Yeah, adding a gig of ram or increasing the CPU from 800Mhz to 1.8Ghz isn't expensive. But, if you go beyond what a singl-processor machine can handle, you run into another host of problems.

Adding a second CPU means *MUCH* higher chances of race conditions and other threading bugs. If you know you're coding for a single processor, you can often use a single-threaded model which makes life so much easier.

Adding clustering brings a whole hoist of data synchronization problems. It's *ALWAYS* easier to code for a single-machine than to code for a cluster. There are tools you can use to make shared memory easier, but those often flood the network.

--
Citizens Against Plate Tectonics

Re:Hardware isn't cheap by Arandir · 2001-12-21 15:08 · Score: 1

Yeah, adding a gig of ram or increasing the CPU from 800Mhz to 1.8Ghz isn't expensive.

Pardon me, I just sprayed Jolt Cola out my nose. Next time we need to upgrade 500 machines, I'll come to you for the cash.

--
A Government Is a Body of People, Usually Notably Ungoverned
Re:Hardware isn't cheap by augustz · 2001-12-21 16:29 · Score: 1

Idiot.

If you have any reasonable kind of rotation for hardware (ie 25% per year) you would realize what everyone else has. Hardware has gotten enourmously cheaper, and the development time and money and investment you spend developing for that hardware now makes up a greater proportion of cost on a % basis then it has in the past.

So while you drip cola out your nose and try and clean your keyboard, remember: If you have 500 machines and can't support the cost of an upgrade as small as this guy is talking about, you are just another .com that overexpanded == .bomb
Re:Hardware isn't cheap by Arandir · 2001-12-21 19:28 · Score: 2

If the machine is obsolete, I'll upgrade it. If it can't meet the demand that the clients put on it, I'll upgrade.

But I don't upgrade the machine just because I'm a lazy programmer.

Now on to the rant: I've been with the PC since day one. I learned how to program on a machine with only 8K memory. I remember when if you couldn't fit a program into 64K you were a wus. But over the years we have been taken over by lazy programmers. Their programs get larger and larger. Bloat is king. And the needs of a GUI is no excuse. I remember running a full Motif based GUI desktop on 640K (Geoworks).

And don't talk to me about speed. You're obsolete 800Mhz machine was state-of-the-art the last time you changed your underwear. You don't throw away last year's machine just because there's a new review at Tom's Hardware.

While we're busy upgrading our machines like there's no tomorrow, our landfills are filling up with perfectly functional machines, and leeching enough toxic heavy metals into the ground water to make Love Canal jealous. People like you are the vanguard of the throw-away society.

If you can't dish out webpages on a 800Mhz machine, it's time you looked for a career more suited to your abilities.

--
A Government Is a Body of People, Usually Notably Ungoverned
Re:Hardware isn't cheap by augustz · 2001-12-22 11:31 · Score: 1

Yes... while I run my stuff on fewer machines that use a finer mfg processs (.13) and consume less power, you keep on running your 500 power hungry machines that consume enough power to light a small city, and require enough cooling to freeze a volcano.

I seen your "old time" racks and cabinates, and the amount of energy alone these things waste makes me shudder. I highly recommend folks walk through a datacenter and actually look at the 500 machines this guy is in love with, and realize what a fantastic waste it is.

The real cost is not the cost of the hardware, which is cheap as I have been saying. Look at your rack fees for *500* machines. Multiply that by 12 months/year * 5 year lifespan of the machines. What fits in a 4U 2 years ago fits in a 1U today.

Before talking about the careers others should be taking, I'd take a long hard long at the one you are following.

Those old computers can go to places that don't have to worry about the fact that something takes 4U instead of 1U, or 900W vs 100W. Things like your office, schools etc.

Listen, this is a pretty stupid conversation. EVERY single CIO you talk to at any level will tell you the cost of hardware relative to even the TCO is tiny. That has only gotten smaller recently. On a % basis to IT budgets it is smaller yet. I mean some of this is ancient news.

The original poster asked a question about throwing out his entire program and starting from scratch. I said hardware is cheap in context... So flame away, but you're looking the idiot.

Consider bringing in outside help by Oroborus · 2001-12-21 13:14 · Score: 1

As someone who's been through what you're describing more often than I would have liked, I'll strongly advise you to bring in a consultant to oversee and direct the refactoring process.

The reasons for this are many.

You bring in a fresh viewpoint, someone who isn't married to any particular design decision or strategy. This is one of the biggest stumbling blocks in re-implementing existing code.

You bring in rationalized realism. A consultant to oversee such a process is able to make rational and reasoned projections about timeframes and milestones, and can be held to them. Without a voice of authority it will often end up being a battle of wills between you and your manager(s), each side having it's own ideas about what is, and isn't, possible. You might tell your manager something will take 2 months, and you may or may not be right, and he may or may not believe you. But if a percieved "expert" says it will take 2 months, the manager will have no choice but to agree, and he's likely right to boot.

You'll learn something. You really will. My first interactions with consultants tended to be dismissive, after all, who knew my code better than I? But what they have to teach isn't technical so much as it is organizational and structural. I learned a great deal about how to get things done, skills which served me very well as an employee and now as a consultant myself.

Inviting in an outside professional is a difficult thing to do, it requires admitting that you need help, and freeing up additional funds to pay for it. But in a situation such as yours it will not only pay for itself, but will make your job much easier and stress-free. The only real problem is the difficulty of finding a truly qualified consultant with experience and insight into your particular situation. But that's a topic for a different reply. ;)

Don't rewrite, and don't refactor by splattertrousers · 2001-12-21 13:28 · Score: 2, Insightful

Don't rewrite your code. That's a waste of time.

Don't spend a big chunk of time refactoring it either. Waste of time too.

Instead, make slight refactorings as you go. But make sure you are doing what you are really being paid for: implementing business value.

And you'll find that you'll have much more courage to refactor if you have a full set of automated tests, so maybe you should work on tests first.

Having done complete redesign of dynamic sites by f00zbll · 2001-12-21 13:30 · Score: 2, Informative

I've rebuilt dynamic sites from scratch twice. First and foremost, if you're rewriting because of serious scalability or design weakneses, then it is unavoidable. If it's just to get rid of annoying things, then I would say don't even try it. I consulted at a fairly big E-Commerce site that was crawling and couldn't handle the traffic. The original site was built by a programmer who scaled examples provided by MS. After it was done the whole site was a dog and would crash constantly. They finally brought in a programmer who was able to rewrite parts of it and make it work. After 7 months of intensive work, they 2 people stabilized the site. They decided to completely rewrite the site and I was contracted to help.

In this particular case, it was necessary because the site was right at the max. If the traffic increased, it would kill the site. Since it was an E-Commerce site, rewriting it was fairly straight forward. The old code kept running, until we were able to finish the new system and make sure it was stable and ready.

As a consultant, one of the most important aspects is detailed documentation that explains both the high and low level details. Often I will include very specific details about why a design was chosen and what limitations it has. When applicable, I will also describe how to extend, or modify the code to support additional features. This means you spend a lot of time doing documentation, but it forces you to think about a design more thoroughly and will expose weaknesses. Always keep an open mind and never fall in love with your design. There is no right way to build something, only right for the situation you are given.

look forward by djarum72 · 2001-12-21 13:32 · Score: 1

You're probably planning on working solidly for a couple of weeks or months and coming out with the same product, but cleaner.

Well, consider all the (big or little) features you might like to add, and when you do your rewrite, consider HOW those features would work. That will improve your refactoring and also keep some fun in the project.

I think you should concentrate on improving your processes so that you never have to do this again. So, basically, catch up on all the unit tests and reviews you've never done!

Others talk about tests. That is a valuable practice, but keep it limited to things that are easy to test. If your tests are ten times flakier than the code because its really really hard to write a good test, well, fuckit!

I'd recommend a checkin-review process. Every should be reviewed for:
1. comments-missing-from-unclear-code,
2. violations-of-coding-standard,
3. corner-cases-not-covered.

Those things are pretty easy to discuss with developers, they wont be taken as criticism.

Checkin review REALLY DOES improve your code and it DOESN'T have to be onerous!

Don't break the #1 Cardinal Rule by Pedrito · 2001-12-21 13:32 · Score: 2

Obviously you didn't read this slashdot article about Joel on Software.

What is the #1 thing he says causes software companies to fail? Rewriting from scratch!!!

I won't deny that there are times it has to be done. Joel points out some of those time (and yours isn't one of them even from the little you've written). Ours happened to fit everything he said, and we did rewrite, and not only was it the right thing to do, it was the only option for us, but the company just barely survived the process.

Don't take that lightly. I speak with experience and Joel is right. You don't rewrite from scratch unless there is no conceivable alternative. Joel describes it well, so I won't "rewrite" it. You can just click the link and read it yourself.

Paradigm Caution by Tablizer · 2001-12-21 13:48 · Score: 2, Insightful

There is NO decent evidence that OOP factors better, or does *anything* else better than procedural/realational programming (or other non-OO paradigms).

IOW, don't switch paradigms just because something is in fashion right now.

Look carefully before leaping into something that may just turn out to be a fad or only shines in specific domains.

--
Table-ized A.I.

Re:Paradigm Caution by jslag · 2001-12-21 19:22 · Score: 1

There is NO decent evidence that OOP factors better, or does *anything* else better than procedural/realational programming (or other non-OO paradigms).

On the contrary, there is plenty of anecdotal evidence that relentlessly hyping OO has the effect of getting Joe Average Programmer to start thinking a little about modularity and abstraction.

The best way to clean the stove ... by cpparm · 2001-12-21 13:56 · Score: 1

is to move every other year. When you find yourself in the position where you need to rewrite a whole system, get a new job.

php project mngt app by gol64738 · 2001-12-21 13:58 · Score: 2, Informative

a good project management application is important for any development team. usually, these are hard to come by unless you plunk down $10,000 or more, although these come with a gazillion features that you probably won't end up using.

i discovered a new tool on sourceforge which is an open project written in php.

i'm impressed with it. the code is also well documented.
the homepage can be found here.
i recommend checking out the screenshots as well.

99% Planning, 1% Coding by freality · 2001-12-21 14:22 · Score: 2, Informative

When you first design and implement some module, a
lot of time is involved in cycling between "ok, I
know what to do" and "huh, maybe not". I've found
this crucial, esp. in team work, in order to gain
a good conception of the scope of the task. Also,
many external issues, e.g. how the module interacts
with the system, efficiency, etc. that aren't pure
functional issues, are first grappled with here.

Refactoring is different from this, in that you're
probably very comfortable with the "state of mind"
of the code. Instead of creating, you'll be
clarifying. So, most of the refactoring is in
your head (99%). All the external issues have been
addressed before (or else this probably isn't really
refactoring), so just work at a white board with
your team until writing the code will basically
be transcription (1%).

I've found this to yield the best code.

Don't do it at all! by Anonymous Coward · 2001-12-21 14:37 · Score: 1, Interesting

Per my professional experience with an obsolete, out-of-date PASCAL on DOS system that keeps beeting the daylights out of would-be replacement systems, leave the convoluted, undocumented pile of spaghetti as-is. If your people have too much time and money, they could productively spend it figuring out what everything does and documenting it properly, but the mess that works is priceless compared to the alternative!

Re:Complete rewrite necessary or a waste of resour by informer · 2001-12-21 14:38 · Score: 1

Certainly one of the keys to Microsoft products "working" is when they make well defined interfaces and well documented interfaces. In my experience, if you define your interfaces well and document exactly what they are supposed to do, no matter how terrible your code is, you can make it work, and make it flexible to later changes.

This is where a language like perl is not extremely conducive to well designed systems, however you can of course design a system well in perl.

- Adam

--

If a penguin dies in the woods, and nobody is around to hear it, what sound does it make?

Joel Spolsky's Take... by ijx · 2001-12-21 14:40 · Score: 2

Joel Spolsky has an article up on his blog site that speaks to this point.

He uses Netscape's decision to rewrite Netscape 6 from scratch as an example, and expands upon many of the points mentioned above.

Retrofit? Redesign? by Anonymous Coward · 2001-12-21 15:17 · Score: 1, Interesting

Do it better! Do it faster! Do it cheaper!

Just don't forget to convert from inches to centimeters, and vice versa. (:

Nasa jokes aside, reconsider doing it in Perl. Perl, while a jewelled spectacle upon the crown of Language, is glue. It sticks things together. It fills in cracks and niches. But, like glue, if you just sit there with a handful of it, it's going to be sticky.

Before you touch *any* code, make sure you read the existing code. Go over it. For weeks, or even months. Understand why programmer X added a comment saying, /* Jeebus H. Christ, I quit! */ on top of an odd looking hack. Understand why things were done they way they were done, because, no matter how well your design, you too, will have to make clever hacks and changes upon encountering real world impasses.

Designs always look great on paper. It's when they hit the real world that they go to hell. :)

Why are you rewriting? by Anonymous Coward · 2001-12-21 15:18 · Score: 1, Insightful

You should never do something that will require a long time/resource investment unless you have a good reason. Is this just some clever way for management to justify firing the developers because they won't have accomplished anything at the end of this rewrite?

If you don't have a reason, you can try to find one. You might identify things that are truly broken about the system, and try to fix them. A typical problem with a lot of code is that its not as general as it could be (not portable, or you can only run one instance at a time because of some common resource or something like that.) By finding a concrete reason you will be able to direct your efforts and have a way of assessing your results when you are done.

Tablize It using macros by bcaulf · 2001-12-21 15:40 · Score: 1

Well of course you are quite right: it is a catastrophe to hide tens or hundreds of cases in a nest of code. The logic of a program has to be visible to the maintainer and this means using a language that is appropriate to the problem at hand. A tabular language might be the right one for a particular problem.

It should be observed that there are unpleasant performance implications to doing things this way. For performance reasons it would be nice to have that stuff hard coded! But how do you resolve these two wishes?

You use a code generator that builds code (which is later compiled) using the tables. Now everyone is happy. Speed plus maintainability. What approach should be used for code generation?

You could build your own code generator by hand, spitting out program text by generating strings. But that is not very nice. There are many ways to screw that up both in terms of correctness and in terms of the power you provide to the programmer who uses the code generator.

And so we arrive at the point of this little screed: a macro system is the correct approach for creating a little language inside of a large program. This is one reason why programmers should care about the macro facilities of a language. Perl and Java have none built in. C has the unpleasant cpp. Lisps have lovely macros, making it absurdly easy to do tabular programming, complete with breaking out special cases easily.

Rewriting from scratch by horza · 2001-12-21 15:40 · Score: 2

I have just carried out a similar project, and one thing I learned to beware of is don't suggest any timelines until you know *all* the functionality. You may think you've explored the site but there may be a lot of hidden functionality you don't know about, eg xml exports to affiliates.

I disagree with the "code doesn't get rusty" comment. Often there is a lot of pressure to get a project out the door and the initial launch version contains many compromises the designers didn't want to make. I don't think there is any project I could look back on and think "hey, the next time I would do it *this* way". People learn from other people's (or their own) mistakes, it's how we move forwards.

In response to the person that said don't rewrite as you will have to rediscover all the little quirks that the old site fixed: by doing a rewrite and documenting the little quirks as you find them means you will have a record of them rather than staring blankly at a screen of code wondering why on *earth* the previous coder did *that*. By rewriting from scratch you won't miss any quirks, which then come back and bite you inevitably at the least convenient time.

As for the Perl vs Java vs PHP debate, I'd say use the one you all know best. No language is any better or worse than any other. Some have libraries to make life more convenient, others have cleaner OO support, but in the end if you all stick to the accepted coding standards then half decent programmers can write maintainable code in any of the languages. Just bear in mind that your replacement will need to to know the same language so don't pick something obscure or dying where there will be no pool of available talent.

Phillip.

--
Property for sale in Nice, France

Consistancy is the key... by CTalkobt · 2001-12-21 15:47 · Score: 1

The term "re-write" scared me when reading your query. In large scale systems, especially of the size you're talking about it's best to re-implement certain portions / areas that can be broken apart and handled independantly.

If you wish to convert into a differant environment then treat it as a seperate project from the re-write. If the conversion to a differant environment is not straight-forward ( I don't have a lot of experiance with CGI / Perl.. ) then write implementation wrappers for the missing / alternate functionality so it's merely a simplified change in the function naming rather than a re-implementation.

If you need to do both - handle each independantly. You'll thank yourself when an important requirement that marketing promised pops up when you're in the middle of it.

Seperate, Seperate, Seperate - There are no huge projects - only people who are too all-encompassing to break them down into achievable sub-goals.

Keep breaking them down until each is achievable in a months, 2 months time. None should go over 3 months even for the worst case. Ideal time-spans would be 1 month periods.

--
There's a gorilla from Manilla whose a fella that stinks of vanilla and has salmonella.

Unnecessarily long variable names by i_am_nitrogen · 2001-12-21 15:55 · Score: 2, Insightful

I saw a quote once that said something to the effect of you can always tell a computer scientist's code because he won't have any comments: his variables are his comments (something like that, not sure of the exact words). While descriptive variable naming is important, excessive variable naming is annoying. For one thing, it's a lot easier to type no_acc_click() than it is MakeNotAccidentallyClickable(). Comments are necessary. When code does anything of even a slightly complex nature, there's no such thing as intuitive code. There will always be a need for comments in anything beyond the simplest programs. The best code is properly indented, case is consistent, variables have descriptive but not lenghty names (i is sufficient for an iteration variable, malloc is sufficient for a memory allocation function, etc.), a description at the beginning of the function of all expected uses and known bugs, and last but not least, comments of each change in the code, ChangeLog, and CVS logs.

--
A solution to the problem with music today

Re:Unnecessarily long variable names by Dwonis · 2001-12-21 16:22 · Score: 2, Informative

The best code is properly indented
I think this point needs clarifying: If a project is using one indentation style (this includes tabs vs spaces), for $DIETY's sake, use it!
Also, if you are selecting the coding style style yourself, use K&R style. I don't care if you think it's not 100% logical -- it's readable, and every programmer worth his/her stuff is familiar with it. The only deviation that is really tolerable is one particular hack on function declarations (i.e. some projects use the hack so the regexp /^funcname/ will work, and this is usually OK with most people).
Yes, I'm an asshole, but if you follow these rules, you'll find that your successors will curse your name much less.
Re:Unnecessarily long variable names by i_am_nitrogen · 2001-12-21 18:57 · Score: 1

I think K&R style is pretty good. Personally I code to the Linux kernel standard, which is quite similar to K&R.

Speaking as one who is about to undergo a major rewrite of a minor program (actually, not even a program.. [shameless plug]Palace is my XMMS plugin), which was based on XPLSISNJASP, I can say I understand and appreciate all the comments shared in this discussion. If the XPLSISNJASP source code had been cleaner, non-leaky, commented, etc., it wouldn't have taken me a month to figure out how it works, just so I could pick up where the authors left off over a year ago. The code in releases up to 0.1.3 is in likely worse state, but 0.2.0 should be the "killer app" of parallel port light shows. I plan to incorporate all the applicable advice from this article. Anyone who's interested can e-mail me (my address is in the Palace readme) and see how my rewrite works, suggest features for the new Palace, and learn from my mistakes.

What a waste of space that post was...

--
A solution to the problem with music today
Re:Unnecessarily long variable names by Anonymous+Brave+Guy · 2001-12-22 06:10 · Score: 2

For one thing, it's a lot easier to type no_acc_click() than it is MakeNotAccidentallyClickable().

For every time you type a variable name, it will be read hundreds of times. MakeNotAccidentallyClickable() isn't a great name for a function, but its meaning is clear and unambiguous. It's infinitely better than the meaningless no_acc_click().

Comments are necessary.

Yes, but far less often if your other names are well thought out.

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
Re:Unnecessarily long variable names by p3d0 · 2001-12-22 12:41 · Score: 2
Here are my laws of naming:
1. Names should be as long as necessary to describe what a thing is.
2. The shorter a name is, the better.
The ability to satisfy these two rules simultaneously is a symptom of simple, clear abstractions.
--
Patrick Doyle
I mod down every jackass who puts his moderation policy in his sig. Oh, wait a sec....

Re:modularity/incremental rollout/unit tests/itera by bcaulf · 2001-12-21 16:01 · Score: 1

Word to your mother. Code reuse via copy-n-paste is one of many basic software engineering blunders that is committed constantly in, I would guess, most software development organizations.

Ultimately it's a culture problem. The more junior software engineers in a lot of places just don't care about quality, so they don't pay attention and learn from each other and from more senior people. They don't criticize the flaws that they are no doubt observing in each other's work. A few senior staff can't do all the teaching and critiquing; it has to be an effort from all the developers.

At some point you have to blame management for creating perverse incentives that ignore or discourage high quality results from developers.

Ahhh Refactoring the Sport of Kings... by j-jahnke · 2001-12-21 16:02 · Score: 2, Insightful

I want to start by saying, before you do ANYTHING make it a priority for your team to start writing test cases and implementing them. If you are going to change your system you want your tests to find the problems not your users. Once you have a test suite that effectively tests your system (you will be surpised how quickly these tests will lead to enhancements in your software BTW) then you need to decide if you really want to maintain two seperate teams.

One team will be doing maintainence on your old system that is in use. You can't not keep this team in place, you have to keep customers happy while you do the new thang. You need the second team developing the new sytem (their job will be difficult with good tests, but impossible with out them.) Without tests they will have something that looks kinda like the old thing, but behave differently in a million small ways, some without a doubt will be better, many will not.

The other thing, and more sensible IMHO, is to tell pairs of programmars, to spend some hours each week working on the system together. Pairs becuase this is an excellent opportunity for people with deep knowledge of their systems to share it with other memebers of the team as they ferret out ways to refactor the code so more of it can be shared.

One day a week you can have a team working on some part, documenting spaghetti code, or pulling out examples of where code was cut and pasted and create a function which is called from both places. The stated goal should be fore each team to remove some number of lines of overall code from the project each week.

Remember the metric is the more lines of code you have the more errors you have, make it a mandate for your team to be removing code as well as adding, actually on our team we don't brag about how many lines we added in a week, we brag about how many we removed.
Anyway, the idea is by getting teams to do pair programming looking a specfic areas with good tests you can whip your code into shape pretty fast, and spread the knowledge of the system around as well. AND instead of a jarring change when you unveil the new system you can have gradual changes as fixes are tested and added.

Obviously if your overall architecture is broken and can't move forward this won't work. But it seems to me you are not fully convinced of that fact yet. At the very least by having teams look at code they may not be overaly famililar with you might find that team members can cross pollinate and share good ideas.

Jer,

It's called depreciation by JohnsonWax · 2001-12-21 16:17 · Score: 4, Interesting

There's a concept in the world of stuff you can touch called 'depreciation'. Why the software world hasn't caught onto this is beyond me, but from my experience it seems to apply reasonably well.

The value of your code goes down in value over time. Now, don't confuse that with the value of your design - your design could be grand, but the code is part of your equipment, like your hardware. Depreciate it over time to reflect the increasing cost of maintenance and integration costs in a migrating business.

Reworking your code allows you to make adjustments to your design to reflect a new environment, or to move away from languages/APIs/toolkits that might be hard to maintain.

I depreciate my code over about 3 years. It's all modular, and I replace code about as frequently as I add. A number of years ago, I had tons of bandwidth but not much CPU power, so I tended to push data rather than compute. The reverse is now true, so I made some design changes as part of a standard rewrite - no need to wait until it broke. Overall, most of my code hasn't radically changed in it's design, but it has been rewritten several times. I've had code cut back to 10% of it's original size by adopting a new toolkit, etc. I've made it more robust, faster, cleaner, better documented. I can't think of a case where it's gotten worse, and I can't think of a case where the rewrite took much longer to write than the original code - and more often than not took much less time. It's worked well enough that I've added a considerable amount of functionality, but spend no more time reworking code because it becomes increasingly efficient and is never too far removed from future additions.

Many people suggest that code rewrites are a waste of time, but it's a maintenance function. People that only budget time to write new code often find those extra work hours devoted to maintenance. Budget it in - and the best way is through rewrites.

Re:It's called depreciation by rabidcow · 2001-12-22 06:39 · Score: 1

It's all modular

I think this is the most important thing. Design your code to be modular so you can rewrite bits without it breaking.

Massive retrofit? by chaeron · 2001-12-21 16:24 · Score: 1

Why?

What is pushing you towards such a drastic move, and what do you hope to gain from the effort?

The answers to these questions might make answering Chizor's inquiry easier.

--
.....Andrzej

Chaeron Corporation

Re:Complete rewrite necessary or a waste of resour by pturley · 2001-12-21 16:24 · Score: 1

I entirely agree with Joel's claim and I think the poster was correct to quote him. But notice that Joel doesn't say it's always a bad thing to rewrite your system - it's only bad to do so if your motivation arises from the specific reasons he mentions. IMHO, the original poster has more fundamental reasons for wanting to get rid of his current code base.

RYou can tell he works at microsoft by kronsrepus · 2001-12-21 16:28 · Score: 1

If the orginal developers of the code (be it yourself with less knowledge than you have
now or someone else) wrote the programme in such a way that its absurbly bloated, poor designed and
all the rest of it then theres nothing wrong
with the 'start from scratch' mentaility. If the
people at microsoft decided to redo dos in the early days and get rid of that 640k limit, if they
redid a lot of there less successful product lines
(there is a lot of them, like ms publisher, etc)
then these products wouldn't be useless.

The entire prototype theory comes from starting
from scatch. Its important cycle in software development

Some advice: by Anonymous Coward · 2001-12-21 16:29 · Score: 2, Insightful

I have been through this before - with a different language, but the same principles apply. Here is some adivce, and some ideas:

1. Don't do a 'big bang' rewrite. A Big Bang rewrite is one where you totally rewrite all the code, but have no functional - or very little functional difference.

2. Do refactor or rework small portions of the code. Work on the code as you implement new functionality that the system needs. In other words don't waste time on rewriting working code unless it needs additional features or is broken.

3. Do Introduce Unit Tests and Acceptance Tests. This means you will know quickly when changes you make break code. Don't do a massive "unit test" building exercise however. Write a Unit Test for every peice of new code you do, and every peice of code that needs fixing.

4. Identify "Problem Code". Problem code is the bits which seem to take up most of the maintenance time. Identify that code, and try to work out ways to fix it.

Note : Some people think they are 'clever' by writing complex code which is hard to understand, but may perform better etc. This is the kind of code which usually ends up being a problem. As a professional programmer I try to be clear to understand - not only for the benefit of others, but for my benefit when trying to maintain it.

5. Read about how to manage projects. Extreme Programming is popular now, and I can say that it has some very good ideas. However don't become religious about it, and take a look at some of the 'classics'. I can recommend:

- The Mythical Man Month (tad dated, but a classic).
- Code Complete (also a little dated, but still good advice)
- Extreme Programming Explained

To take a look at my Open Source Resource, which has many articles around the subject of project management, take a look at:

Devcentre

I have to agree with the concensus... by Codifex+Maximus · 2001-12-21 16:40 · Score: 3, Insightful

if it works... don't fix it.

If you feel you have to fix it, then prioritise the most problematic parts and fix them according to a set plan/policy. Use a naming and calling convention. Break functions that do more than one thing up into component functions that can be tested, verified and reused by other parts of your program. Fix it incrementally not all at once. Try using an interface contract when you make objects; that way, your new functions can call new methods and the old code can depend on old methods to be there. Deprecate the old methods when there is no code that depends on it. Don't forget to comment - comment the code then come back the next day and read your own comments. Make changes to the comments so they make sense today.

Blah... blah... blah... Etc... etc... etc...

--
Codifex Maximus ~ In search of... a shorter sig.

I think this speaks for itself... by markj02 · 2001-12-21 16:53 · Score: 1

Not too long ago a link was posted to an interview with Joel Spolsky who used to work at Microsoft.

Yes, so if you want to write code like Microsoft, you follow their principles. However, unless you have the legal team and monopoly position to go with it, that strategy will probably not work as well for you as it does for Microsoft.

Deciding to completely rewrite your product from scratch, on the theory that all your code is messy and bug prone and is bloated and needs to be completely rethought and rebuild from ground zero.

Well, he is right to the extent that if you use the same tools and the same skills, you are going to end up with the same mess. Given that Microsoft has been using the same primitive tools for the last decade, it's not surprising that rewriting doesn't work for them.

It's not like code rusts if it's not used. The idea that new code is better than old is patently absurd.

Again, that's Microsoft-think. Sure, for Microsoft, things don't change unless they change them. For the rest of us, code does "rust", because the environment changes. And for the rest of us, there are new and useful tools and programming languages that we can use.

None of us know your situation by LoveMe2Times · 2001-12-21 17:38 · Score: 4, Insightful

There's a few things that I'll say. One, not a single person posting here really knows what your situation is (well, unless your coworkers read /. and are posting advice for you...). I don't want to be mean, but most of what's been posted here is likely not applicable to your situation. For starters, ignore all the threads arguing over languages: they'll never agree, and in the meantime you have work to get done. Next, I would suggest ignore all of the posts from people who aren't talking about web sites. There's a lot of methodology recommendations there, and it takes a long time to really understand *any* methodology. So if you don't already know it, it won't help for *this* project. If you're interested in that kind of stuff, read up on it; maybe it'll be useful on a future project.
Next, no matter what you decide, from your current position, there's substantial risk involved. If you don't have a good way of estimating the costs and benefits of your alternatives, then you will wind up shooting in the dark no matter what you choose. This isn't really unusual, but it can be stressful! Somebody in management has to decide if they want to play high stakes poker with this project. This will establish how many nice risk mitigating activities you can budget (like unit tests, code reviews, documentation, etc). One thing a lot of technology people have difficulty grasping (this isn't directed at anybody in particular), is that if your management decides on a high risk course of action, it's not WRONG because it's technically inferior. Now, if management doesn't understand that it's a high risk proposition, then there's a communication failure somewhere that will screw you no matter what gets chosen.
Ok, now that you've decided where you're going, I've got 3 pieces of useless advice (all advice is useless advice, btw...)

1) Know your priorities. This is the most important thing to getting ANY solution.
2) Understand your requirements(I don't care how you do this). This is the most important thing to getting (close to) a CORRECT solution.
3) Remember it's only a job. This can be very important for retaining sanity :)

Re: Unexploded bombs, never truer words... by kuma · 2001-12-21 17:51 · Score: 1

when you inherit 70000 lines of code that do not work and the lead developer was logging only one system update per month but actually changing production code every few hours...

those pearls of wisdom will really hit you.

practical matters, bullet points and examples:

1) write code to be read

and if means going against the culture, fuck the culture of perl or c or whatever... believe in karma--got that urge to be clever, you'll get yours someday, trust in the dark side.

2) comment your readable code

but strip out old, boring comments (after they are petrified, before they are eroded by time)

3) put interface documentation in the interface

4) minimize documentation

face it, documentation is great for training new users and system roll-out, but it falls out of step fast and will be of minimal help to coders when the shit hits the fan.

if you cannot afford to have developers who think they should be managers sitting around keeping the documents ship-shape, or (miracle-of-miracles) have a writer on staff, your docs will be a joke.

LAMENESS FILTER AREA

** big fuck you to everyone who thinks shit like this is too cool... (declarations omitted)

$done:=False

Repeat

$position:=Position($3;$4->)
If ($position#0)
$2->:=Substring($4->;1;$position-1)
$4->:=Substring($4->;($position+1))
$done:=True
$0:=1
Else

RECEIVE PACKET($1;$receive;(32000-Length($4->)))
If (ok=0)
If (Length($4->)=0)
$2->:=""
$0:=0
Else
$2->:=$4->
$0:=1
End if
$done:=True
$4->:=""
Else
$4->:=$4->+$receive
End if
End if
Until ($done)

** refactoring can be fun, let everyone in on the party. very few people will know this syntax, but nobody should be mystified...

` spb20011109 standardize on vl_JRDC_position (kill vl_JRDC_Fposition)
` spb20011109 old cycles (2001) gone, remove $simplified_crossouts
` ... switches and vastly simplify logic of price line assembly
` spb20011112 add Crossout Mode and Format interface menus
` spb20011119 fix bug where $second_price not initialized
` ... and ensure no crossout when Just Reduced third price
` ... is blank (no sale beyond discount)
` spb20011127 add redtag format menu
` sbp20011127 update redtag logic to consider case of price
` ... modification pull number being reverted by offer or
` ... exporting option (via check box)
` spb20011128 simplification of sale price method resulted
` ... in stripping some strings from psuedo-price-line, so go
` ... back to promotion record for Just Reduced flag
` spb20011212 fix tagging issue with Just Reduced where
` ... Discount price does not get proper style sheet, remove
` ... placement of unnecessary tabs, and kill $space_char
` spb20011218 fix tab placement for Just Reduced (good
` ... for other types, but mistakenly put two tabs after
` ... full price and none after discount, oops)
C_BOOLEAN($discount_or_sale_exists)
C_BOOLEAN($discount_and_sale_exist)
C_BOOLEAN($crossout_required)
C_BOOLEAN($just_reduced_selection)
C_BOOLEAN($redtag_selection)
C_STRING(16;$crossout_mode)
C_BOOLEAN($crossout_under)
C_BOOLEAN($redtag_graphic_required)
$redtag_selection:=(Position("RTAG";[Promotion]A va ilable)>0)
$redtag_selection:=$redtag_selection&(Not(Offer_ Override_Price_Mod))
$just_reduced_selection:=(Position("JRDC";[Promo ti on]Available)>0)
$discount_or_sale_exists:=(vp_Priceline_Second_P ri ce#"")|(vp_Priceline_Third_Price#"")
$discount_and_sale_exist:=(vp_Priceline_Second_P ri ce#"")&(vp_Priceline_Third_Price#"")
$crossout_mode:=vDigs_Crossout_Mode{vDigs_Crosso ut _Mode}
$crossout_required:=(($crossout_mode="Redtags Only")&$redtag_selection)
$crossout_required:=$crossout_required|($crossou t_mode="All Selections")
$crossout_required:=$crossout_required|($just_re duced_selection)
$crossout_required:=$crossout_required&$discount _or_sale_exists
$crossout_under:=(vDigs_Crossout_Format{vDigs_Cr os sout_Format}="Under Pricing")

Reminds me of something... by jadavis · 2001-12-21 17:53 · Score: 1

Hmmm... reminded me of an article about code rewriting that slashdot posted a while back.

Slashdot Story: How To Make Software Projects Fail

Relevant interview link

--
Social scientists are inspired by theories; scientists are humbled by facts.

Re:Perl vs. PHP ? by tmoertel · 2001-12-21 17:53 · Score: 3, Informative

Could I possibly ask for a fairly realistic little example of something that Perl does well that PHP can't?

Middleware.

For example, most enterprise applications do something important -- trade shares, manage accounts, track patient records, etc. How it's done is governed by business logic -- the company rules, policies, regulations, procedures, and so on. Now, you can spread this logic across your web site (as one might do using PHP, which is tied to the web site). Or you can bundle it up into an independent application, keeping all of the business logic in sensible, cohesive compartments that run on an application server (e.g., using one of the existing Perl app servers or one you've rolled yourself via, say, POE). This not only makes the business logic easier to understand and manage but also makes the logic independent of and accesssible from any number of front ends that you might need. Simultaneous web, client-server, and even command-line interfaces become possible, and for enterprise projects, multiple simultaneous interfaces is often a requirement for backward compatibility with older interfaces.

In summary, for shallow web-only stuff, PHP is a reasonable tool. For stuff beyond web work, PHP is out of its design envelope. However, Perl works here just as well as it does for web work.

--
Easy, automatic testing for Perl.

Re:modularity/incremental rollout/unit tests/itera by Fubari · 2001-12-21 18:05 · Score: 1

You have been getting some really good advice from other people here. You have a huge amount of existing code. How do you choose the first thing to handle?

Let your next feature show you where.

The project I'm on now is doing exactly the same thing you are contemplating.

We are letting new features drive the cleanup. When it is time to add something else, change the existing system to make it easy to add the new piece. Maybe this means touching a substantial number of modules to tweak them to use a new helper function. Fine, you're not changing routines for the fun of it - you have a definate goal in mind. Maybe everyone needs to use a new html formatter. Maybe everyone needs to use a common databse interface. Whatever. When consolidate something, really consolidate - hunt down every thing that is like it and adapt the existing code to use your consolidated routine.

I would strongly encourage you to do it this way, because if you embark to the Great Cleanup it might be a long time (many many months) before you can put new business value on the table.

If you do it this way, it can become part of your team's culture - you should always stop to clean up code every now and then. It is an ongoing process, and it gets easier the more you do it (kind of like laundry or dishes... why wait until life is unbearable to clean up a little?).

Letting features drive your clean up does the following for you:

helps keep you focused (reduces the chance you'll get lost in the woods and wonder around forever).

cleans up areas of the system that are likely to change again soon

finds the most important code

makes it easier to change next time

make it easy for you management to have faith in you.

This last one is really important. If they get regular improvements to the system, they stop worrying/fearing what you are doing and start fighting to shield you so you can keep doing it. At least this matters in a for-profit environment. Make it easy for you manager to believe in you and you will make your jobs easier.

And scope it small - small bites, easy steps, and release often.

And buy the Fowler Refactoring book, and the DesignPatterns book wouldn't hurt... but that helps C++ and Java people cope with brittle type systems... this isn't such a problem for you in Perl. You would benefit more from the principles in the Refactoring book, even though it is in Java.

my suggestions by medcalf · 2001-12-21 18:15 · Score: 2

1. review all of your existing code and list the things that each script does

2. decide how to consolidate and divide out these tasks, and make sure that any functions or data that needs to be shared, or may in the future need to be shared, is separated out into modules

3. document all of the programs to be written, their interfaces and their effects

4. then, and only then, begin coding

-jeff

--
-- Two men say they're Jesus. One of them must be wrong. - Dire Straits

Having attempted this before... by druxton · 2001-12-21 18:49 · Score: 1

The most important factor is having a ruthless project director. You don't accept slippage, you don't allow mistakes.

However, a good project director understands the project and the time it will take, and doesn't curry favour with management by suggesting impossible time-lines in the first place.

Beware the project director who comes in in mid-stream.

Duncan (not the MacBeth one)

some friends by jslag · 2001-12-21 19:15 · Score: 1

I know some serious perl heads who say they
wouldn't suggest using it for a large project.

And I know some serious NT system administrators who ftp from arbitrary terminals to their production servers, using their Administrator accounts.

Re:some friends by benedict · 2001-12-22 05:19 · Score: 2

When I said "serious perl heads", I didn't mean
"drunks I met on the street". I meant "people
involved with the perl design process".

--
Ben "You have your mind on computers, it seems."

Rewrite!! by Scott+Bender · 2001-12-21 19:49 · Score: 1

Sounds to me like someone did a bad job of designing your system. Although I obviously don't know the details of your project, you may want to consider starting over. If you need to do a retrofit, then some bad mistakes were made at the start of this project. Doing any kind of retrofit can only make things worse. If you have time to do a retrofit, you'd be better off taking what you've learned from this failed project and doing it right this time. I'd start with losing perl. I wouldn't touch it with a thousand foot pole for a large project. Then go back to the design phase, and do it right!

real world vs cs101 by thogard · 2001-12-21 20:37 · Score: 1

One example is one client wants a column in a report added up. fine. no problem. It was desigened to do that. Problem is that column is user supplied and not numeric. So the rule is something like an orange counts one and an apple counts 2 if its one type and three if its another.
Your saying you can plan for this kind of crap and design it in? This is a one off and perl makes that easy. In fact the orginal code spawns a 'report' and if checks in ~luser/bin/final_report and runs that if its needed.

You can never underestimate (or plan for) the lame request that some paying customers will want.

Re:real world vs cs101 by DrSkwid · 2001-12-21 22:54 · Score: 1

buy you can plan for the fact that paying customers will make lame requests

--
There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter

Re:Coding standards are important by Nyh · 2001-12-21 23:28 · Score: 1

But de important part isn't the indentation style. The important is that it describes the prefered constructions to solve certaint types of problems. It describes the rules for a good, uniform implementation.

And DO NOT make it a book of rules. For every rule there has to be argumentation why it has to be this way, including examples and they main advantages of the rule (improves robustness, maintainability, clarity, portability, uniformity).

The coding standard will help new and unexperienced to build good solid code. The professionals will like it because after reading the coding standard they know what is kind of code is expected from them.

And, maybe most important, will keep the hackers under control. Hackers won't adhere to rules just because they are the rules. But if they know why the rules are as they are they will use them.

I was a hacker who was coding only his way too, I didn't like rules because they made no sense to me. Right until I read Patrick Savalle's (Synalyse (sorry guys, dutch only)) C coding standard, that one made sense.

I am still using it, both for private and professional work (on most jobs there isn't a coding standard, the one I bring is very welcome and if there is an coding standard it is mostley only about identation so those are replaced just as easy).

Just my EUR 0.02, Hans

comment unneeded? by volpe · 2001-12-22 01:00 · Score: 2

...and name the function so that the comment is unneeded

The only function whose name makes the comment unneeded is "exit()".

Re:comment unneeded? by William+Tanksley · 2001-12-22 10:27 · Score: 2

The only function whose name makes the comment unneeded is "exit()".

That's a stupid universal; I've already shown one example, which therefore disproves it. (exit() sometimes still needs a clear name, anyhow -- you may need to make it clear why you're exiting.)

But is there a form of your statement which has value? I say NO. Comments are only needed when clear code is impossible. Clear code is ALWAYS better. Sometimes it's unattainable, in which case comments are the only thing that can be done.

-Billy

You didn't read his pages did you... by alfadir · 2001-12-22 01:25 · Score: 1

"my first software released under GPL (November 1988) -- now the Slashdot kiddies have to shut up; I was writing GPLed software before they were born, bwa ha ha!"

Posted Thursday, December 13, 2001 on his site..

He has alot of good thing to say about software management and programming.. read and learn instead of getting caught in Microsoft bashing..

Re:You didn't read his pages did you... by markj02 · 2001-12-22 06:12 · Score: 1

He has alot of good thing to say about software management and programming..
I disagree.
read and learn instead of getting caught in Microsoft bashing..
Instead of repeating what other people tell you, turn on your brain and think for yourself. Most of the programming and management advice that self-proclaimed software experts give you is junk.
"my first software released under GPL (November 1988) -- now the Slashdot kiddies have to shut up; I was writing GPLed software before they were born, bwa ha ha!"
That makes him pretty young and inexperienced in my book.

My analogous situation by Chris+Johnson · 2001-12-22 02:15 · Score: 3, Interesting

I'm currently doing something similar- the program I maintain is Mastering Tools, which has long been an over-featured, densely packed program operating on a text input basis with a certain amount of visual feedback on the text input.

However, I've long known that there are two things my real users (ideal users? the serious mastering engineers) want: familiar interface and realtime processing. I can't deliver realtime processing without literally doing the whole thing over again in a language I don't know (it's done in REALbasic, which is little better than a scripting language for speed, though it's got really, really nice prototyping abilities and GUI support.

However, the time's come to completely overhaul the interface, partly because I have some ideas for mid/side processing and don't have any room in the current layout to fit them! The ideas have to do with rectifying the side channel and using it to either enhance or remove signals that aren't in both channels equally- where regular mono is a 'node' that totally eliminates out-of phase content, it's also possible to completely eliminate R and L-only content and keep only R+L and also the out-of-phase content.

That part's the easy part- it's simply signal processing (and will be duly released under the GPL as soon as it's done, as always). However, the interface is asking for a total, complete overhaul in several ways, and that's what's taking all my effort currently. Here's the situation and how I'm handling it...

Layering. It's no longer possible to fit the whole app in a 640x480 area, even with small print. There are several possible answers to this: one would be having separate windows. This would be more adaptable to larger screens, but it's untidy and there are issues with closing windows and still referring to controls they contain- so that's out. What's looking more reasonable is tabbed panels- RB implements a nice little drag and droppable tabbed panel control that appears quite easy (though I had some trouble attempting to do nested tabbed panels- after one experience of having all the nested panels (at identical coordinates) switch to the top panel of their parent panel, I quickly gave up on that concept. Instead I'm using more panel real estate and trying to divide the controls into logical categories. That is, of course, a real headache- doing interface properly is hard! (says 'interface is hard Barbie') It's only somewhat easier with the additional space. Complicating matters further, is the expectation of the intended audience here. It has to both be organized and look and feel like a mixing board, amplifier or rackmount box of some sort.

Solution: implementing controls like knobs and meters. It's actually quite fun to code a knob appearance out of graphics primitives- and surprisingly hard to get mouse gestures to work on the damn thing- using a two-arg arctangent routine in RB that I don't fully understand. I also have a single meter control already implemented for azimuth display- think that it might be best to tear that apart and re-implement it in a more general sense.

That's because the Knob class turned out to be the right thing to do- it's implemented almost totally separate from the main body of the program, as if it were a RB control or something. Knowing I was going to be using it in different sizes, possibly different colors etc, I wrote the knob code as completely scalable- from maybe 20 pixels to over 100. It reads its size from the control width and height, runs itself as far as handling mouse input and storing a control value, and does not inherit comparable interfaces to the controls it will be replacing- so when it's time to plug 'em in I can run the program and see what routines crash and burn (and need to be rewritten for the new control interface). I'm thinking meters need to also be handled in the same way- somehow- not clear on the form yet.

So: dunno what else to tell you, but it seems like the things I'm doing that are helpful are: compartmentalize new interfaces, make them adaptable and get them working independently of the existing code, while leaving the existing code in working condition. Then when the new stuff is brought online do it in such a way that you could do it one control at a time or in small batches.

Dunno if that's relevant to where you're coming from- but it's what I've found necessary when facing a major re-implementation.

My solid advice. by mindstrm · 2001-12-22 02:23 · Score: 2

First, decide WHY you want to re-write all of it.
Second, document, in detail, what it is you want to accomplish (what should each script do, etc).
Thirdly, write up a project plan, IN DETAIL, so you have something to work from
Lastly, actually translate that into code. Do not start writing code until you have the modules fully documented.

Programmer buy-in by arabbit · 2001-12-22 02:25 · Score: 2, Interesting

There are lots of good reasons to refactor, but "consistency and reducing redundancy" doesn't seem very compelling. Anyway.

A lot of folks have talked about process and tools, and those are important and I assume you know what you're doing. The real issue is morale. Let's face it, at this point you're requirements are probably stable and your team is familiar with the system and everyone's probably been thinking about how they might do differently in the back of their head. And code is easy to write.

If you have people on your team see what they've done and know they can do better and want to give it a try, things will go smoothly. If you have people on your team who regard this as a needless rewite, on the other hand, morale will suffer and production will be surprisingly low.

If you have some of both, take the people who want to rewrite it and do it from scratch in another branch. If you don't have buy-in from all of the developers and try to fold changes into the original product and it doesn't go smoothly, company politics will turn against you and you'll probably be looking for a new job.

My advice: Start Entirely Over by Myoot · 2001-12-22 02:45 · Score: 1

Yes, it'll probably be much easier if you just hash out a set of rules and style guidelines and then rewrite everything from scratch, to those guidelines.

You're going to end up rewriting everything along the way anyway, if you're doing a coding-style-level rewrite rather than just a bugfix, and it'll be a lot easier if you start from scratch.

Josh

Perl.... coding style??? by Glabrezu · 2001-12-22 02:56 · Score: 1

bwaaaahahahaha!!!

Even if its probably possible, its a bit hard to define a coding style for a language whose motto is "There's more than one way to do it". Probably the amount of things you should define to have code that looks similar its BIG (where should you use fors instead of maps? should i use $_ or assign it to something else? etc., etc., etc.)...

Its easy to define *your* coding style, but imposse it to your fellow coders might be harder (always speaking about perl). Anyway, you might like to read the "Common practices" chapter in Programming in Perl where Larry gives some good advices about coding style in perl....

My suggestion, if you think the code might start to get bigger... start slowly to switch to another language, 30K lines of perl code just gives me the creeps...

--
Santiago

finding and keeping a competant staff by KyleCordes · 2001-12-22 03:04 · Score: 3, Insightful

[There's only so much a methodology can bring to the table -- after that it's about finding and keeping a competant staff who understand the value of what they're doing.]

You are so utterly right. The key idea is "people over process". I've rather have 5 great people with any old process, than 25 mediocre people with the perfect process.

[look at the console gaming industry for tips]

I have always been impressed with the gaming industry for this reason. Sometime people erroneously think of game programming as being somehow less "heavy duty" than business apps, etc. But games are usually under incredible schedule pressure (for vital marketing reasons) yet also need to ship very few bugs, because buyers are prone to return a game that doesn't work, not become a source of upgrade revenues.

Re:finding and keeping a competant staff by Anonymous+Brave+Guy · 2001-12-23 06:27 · Score: 2

The key idea is "people over process". I've rather have 5 great people with any old process, than 25 mediocre people with the perfect process.

Absolutely. 5 great people will look at the process they've started with, and if it's ****, they'll throw it out, come up with a new one that works, and then after the five minutes that took, they'll get on with the job.

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.

Analyze first! by Flu · 2001-12-22 03:54 · Score: 1

I am not a perl programmer (just C and Java), but these are the steps I try to take whenever I need to rearrange my code:

First of all; make an analyzis of the code; Is it modular? What dependencies are there within the code? Was it thought to be modular from the start, and in that case, has the modules started to deteriate? Are there functions/variables that are just positioned at the wrong places? Functions/variables only used by one or two other functions? Functions/variables used by way too many functions?

Based on those questions; start moveing functions and variables between modules. Consider that a module may be more than a single file, but probably not more than 10 files. Try to keep the number of functions per file to about 10 or less. Also, try to keep the number of files per module to 10 or less. Since you have more than 100 files, try grouping them in at least two blocks. Why not try to use the model-view-control or (interface-control-entity if you're into UML) model?

All this can be done without endangering the code, since you do not rename anything; and thus, you do not risk to suddenly start using some other variable/function than you intended. But, remember not to change more than one thing at at time without testing!

Also, if the language can't help you protecting (file-, module- or block-) private functions/variables, use comments to mark them as non-accesible from functions not in that file, module or block.

Now that the code is logically arranged, it is easy to see if there are functions that perfom a similar thing, since they (probably) would have ended up in the same module - or even the same file.

At this stage, you might want to actually start re-writing one or several functions in order to get rid of those functions that does almost the same thing. Also, you might want to re-name some functions to better represent their actual use.

However, if you do that, first think if you want to enforce any coding-standard starting with new functions? If so, create a coding-standard that most closely matches the code written so far, thus only a smaller portion of the code will fail to comply with the standard.

/Fredrik

Consider keeping programmers too. by guitarrista · 2001-12-22 04:21 · Score: 1

What about personnel? That strange looking code that embodies solutions to real-world problems had to be figured out by SOMEBODY. Such people, as much as working code, are a valuable resource that should not be abandoned lightly.

Additionally, there are difficulties adding new people to a team. Brooks' book "The Mythical Man Month" describes some of the pitfalls.

Re:Help decipher a comment from 1982 by dgroskind · 2001-12-22 04:57 · Score: 1

Absence makes the heart grow fonder:
Isle of Beauty, fare thee well!
Isle of Beauty. , Thomas Haynes Bayly. (1797-1839)

Also the first line in an anonymous poem in Davidson' Poetical Rhapsody (1602) but probably popularized by Bayly.

Slashdot Mirror

When Making a Comprehensive Retrofit of your Code...

292 of 385 comments (clear)