Tips on Managing Concurrent Development?
"Take, for example, the extreme case of something like Linux (not only concurrent development, but geographically distributed development), how is this managed? One solution we were contemplating was to try to do an 'air traffic control' type of sequencing and conflict resolution. As early as possible in the development stage, we try to identify what will be finished when, and assign a one-up sequence number to each patch. Developers then know that they will be patching against the baseline that was patched by the patch with the previous sequence number. It is hoped that this prevents a lot of rework of patches. A potential problem with this approach is the need for a responsive central authority to assign sequence numbers. Also, such sequence numbers may have to be rearranged in the face of last minute advances and setbacks in developer progress. Despite careful scheduling and detailed design, it may be impossible to know the exact check-in sequence of patches more than a week or two in advance.
Will such an idea be successful, or is it fatally flawed? Are there better solutions to the problem with less effort? Are we treating symptoms and not the disease (i.e., should we be planning better so that we know patch sequences and dependencies early on)? Management likes to keep staff productively occupied and working up until deadlines, so this usually means a lot of checkins within a short period of time, rather than staged checkins. Can checkins be spread out over time while keeping developers productively occupied?"
mofo I like it
Maybe we're just stupid. We dont use any CVS or any other versioning type software to keep track of changes. We dont' check in our code or anything.
Luckily our development is done on the web, we just create folders, and move them up when we're done.
It looks like this may be one of the few IE-bug page widenings! It is truly a sad day!
.I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter .telling .you .that .you .don't .have .enough .charaters .per .line .that .really .sucks .when .that .happens .and .you .have .to .put .some .lame .lameness .filter .defeater .text .in .there .i .wonder .how .many .people .will .read .this .whole .comment .I .certainly .hope .it .doesnt .annoy .too .many .people .This .is .just .the .beginning .because .PAGE .WIDENING .IS .BACK .I .like .wide .pages .I .wish .all .pages .could .be .as .wide .as .this .dont .you .wide .pages .are .much .cooler .than .those .narrow .pages .you .are .used .to .reading .because .you .dont .have .to .worry .about .the .lameness .filter
I know this sounds corny, being said on a VA property such as Slashdot, but SourceForge 3.0 is easily the best concurrent development environment i have ever used. It was my love for Sourceforge which made me pursue a job at VA Software in the first place! The fully web-based administration hides all the niggling details of commandline cvs tools and makes managing huge projects a piece of cake.
In short, if you haven't been to VA Software's site, you don't know what you're missing.
http://www.bitkeeper.com/
I wouldn't, however, recommend working with anything from Microsoft. Benchmarks and real-life statistics have shown that their source control solutions are not only slower, but are also less stable and more likely to corrupt your source tree. I hope you have backups!
"I don't know that atheists should be considered citizens, nor should they be considered patriots." - George Bush
I don't think so, but then to many people this might be large...
Of course some of these problems sound like lack of planning early in the game...
For example changing headers that two developers need... The only headers that two groups need should be interface headers, these should be set early and not need a lot of change, with any change taking both developers changing the code internally...
Another note, I get really worried when people say that process problems only show up at the end crunch time. If it is crunchtime it is time to use all of your processes, because the processes should be designed to produce the best bug free code the quickest... otherwise it shouldn't be in the process...
That is just my 2c worth however
I always thought CVS could be used to do something like this. I mean, keeping programmers productively occupied.
I haven't really studied it all that much, so I could be wrong(probably am).
http://bitkeeper.com/Products.BitKeeper.html
;-)
:: All Linux. No ads.
If it's good enough for Linus & friends, it's good enough for me
MONOLINUX.com
I currently work in an organization with 75 Engineers (50 USA, 20 India, 5 Asia) - and we use CVS. It's free, easy to use, and has a simple feature set so that more than one person has enough knowledge to do things like branching and merging branches.
We nearly never have merge problems. It is standard procedure that people keep their tree up-to-date with the cvs tree, and thus conflicts rarely arrise. Even at crunch-time, I probably have one merge conflict every 2 weeks, and with CVS - you are notified of the conflict and it is wrapped with CVS comments.
To put this in perspective - while at Oracle with 1000s of engineers working on the same tree, we used ClearCase and it was awesome. The difference here is that there was much steeper a learning curve, and no normal engineers could actually do complex tasks - i.e. create branches etc. We had a complete groud dedicated to ClearCase.
Conclusion:
Educate your engineers - and politely have the senior engineers tell them when they mess up - enforce a policy that people must update the source every day that they plan on checking-in files.
Also - I don't know what CVS versio you are using, but the latest free WinCVS client will not allow you to check in a file with a conflict! It will force you to update/merge/resolve the conflict before updating the tree. I highly recommend CVS and WinCVS due to the ease of use and cost.
Check out subversion.
It's CVS, but better and based on WebDAV for RPC and BerkeleyDB for storage.
Cheers,
pointer
[%- PROCESS life -%]
The only thing I know that really works (i.e. "is simple") is to lessen the conflicts through design... that is, two people shouldn't HAVE to edit the same modules. Or at least not the same lines of the same modules (those are the only merges that are really painful). Similarly, if you have well understood specification for modules then there should not be a problem when the lines edited don't overlap, because the functions and modules will continue to behave to the spec, which is all the other code can expect.
I know this isn't really easy to do (can't be done retroactively), and doesn't really fit all cases (such as near a release when there is a lot of chaos), but it's the only elegant solution I know of, all the rest are more brute force.
-pyrrho
Most version control systems have an automatic merge utility which makes it easy to handle simple concurrent changes, such people independently adding #defines and typedefs in headers. For .c and .cpp files, you should break them up to minimize developer thrashing (I apologize if you've heard that 100 times already).
-- Nobody should take away Microsoft's freedom to innovate, particularly since they haven't used it yet
One solution to avoid patching problem is to use continuous integration. It's an integration technique that builds your source multiple times a day, getting all the latest source code from the CVS tree, and building from that code. If anything fails, the offending developer gets warned. Mozilla uses the same thing, calling it TinderBox. It's one of the principles of Extreme Programming, and seems to work quite well at our company.
Well, I've been on projects in which people say "oh, you rearrange that code first, then I'll do my thing" (more informally than your "air traffic controller" example, but kind of the same thing). It can sometimes be helpful, but all too often it gives an excuse for inaction ("well, I couldn't do xxx because yyy had to happen first"). One thing which can help a bit is continuous integration - run "cvs update" often and checkin as often as you can (say, whenever your automated tests pass). Oh, and of course trying to not do everything right before the deadline (although it can be hard to change a culture and I'm not sure it is worth it). Other than that, I don't know of any magic bullets - just to say that dealing with the pain of continuous (or at least frequent) integration is better than the alternatives (such as having everyone do their patch against a build from last week and then having an overworked release manager try to assemble them into a working program).
One of the strategies we use at my company (We also use plain ol' CVS, but we don't have many branches on our development tree) is to team-code and to write tests for every script, so that you can tell if/where a problem has been created by someone else editing a header file. A test should simulate a user using the file/program/script/etc and should double-check the values entered against any values that are stored. By running the testing suite after a file changes, you might still have a merge conflict, but after it's resolved you won't have a bad piece of code.
--
Vote for your hopes, not for your fears - Vote Third Party
Check out the development techniques of Extreme Programming (just search Google, silly, and buy a book or three). They have a real solid handle on concurrent rapid development.
The real heart of Extreme Programming is "test-first" programming. The entire development process revolves around unit and integration tests, for extremely fine-grained control over code quality. Any changes that might impact other code should break a test. You fix the stuff that breaks, check in your changes, and move on.
Multiple programmers touching the same C files many times a day sounds like you have either design issues, structural issues, or both. That just should not happen, crunch time or not. Heck, crunch shouldn't happen if you're managing your development correctly.
If you're using cvs, conflicts with source checkins should be very easy to resolve. Even if two programmers touch the same file, they shouldn't be in the same function. If they are, you're back to management and architecture problems, and you need to fix those NOW before work grinds to a complete halt.
Hand me that airplane glue and I'll tell you another story.
Why in the world should developers be applying the same patch multiple times? You've just said that the problem is not with developers needing to touch the same lines of code -- so, once a patch is in, shouldn't the next person be merging their code with what's already there?
If your problem is with people overwriting the changes that previous submitters made, then you've got a very different kind of mgt problem -- one that can be solved by getting people to use the tools they already have. CVS, for example, lets you merge the current branch head with your working copy, incorporating any changes that may have been made since you checked things out.
Submitters should always diff their current code with the head before commiting a check-in, to see if they are breaking previous changes. This kind of practice is more important when schedules are tight, and you shouldn't let people off the hook because they were in a rush or some other lame excuse.
--tsw
These are the ingredients to make large projects successful from a technical point of view. At the company I work for, we have literally hundreds of people working in the same source tree using P4. It manages merges, versioning, and works flawlessly over the internet (well VPN anyway). It is also much, much faster at syncing to the the depot than CVS because the server keeps track of those files that you are editing and does not need to do diffs with the local filesystem. This is very helpful during crunchtime where you might want to sync serveral times a day (and you have about 10000 files in the system). Also, until your locally edited files are resolved with changes in the depot you cannot submit them, so you don't have the problem of ordering patches properly.
For the second part, I highly reccommend that you have automated build and tests that run after changes have been submitted. You can see how this is done en mass on the mozilla.org site. Also, developers should have access to the same build and test infrastructure on their machines so they can do the build and test before they check in their code.
Finally, you need a good bug tracking system. You might try Bugzilla.
Good luck,
Sam
"If I can see farther it is because I am surrounded by dwarves." -- Murray Gell-Mann
nerdz...developerz...anyone reading or partipating in this discussion needs to seriously get laid!!!! gpl this, cvs that, what the f#*$ ever, get a damn life!!!
I admit I haven't programmed part of a big project, so I haven't really experienced the issue at hand. But assuming different parts of the same file are what is being modified concurrently, what about using tags for the different logical parts of the file. So if I modified "ThisFunction" and someone else modified "ThatFunction" they could be tagged as ... and so on. Then, when it is checked in changes made withing those tags are taken and put in a central copy of the file.
Perhaps is too simplistic or just way off base, but that's my thought.
No really, I do. Check it out. Just because
you think you are too cool for words and
give out your email address to any webpage that
asks for it makes you immune from spam well hoss
just forget it. What is the topic of the story
I'm replying to? I forget!
I'm a ClearCase admin at a big company. I might be biased but I think it'll handle everything you're worried about. CC can already keep track of who's doing what and when. Branches keep track of the "who" and CC keeps loads of metadata to account for the "when" amongst other things.
One thing you can do if you really want to keep things straight is to write a trigger that writes "who" and "when" and anything else to a log file whenever someone does a checkin.
Blaze a trail to the New World
This is a common problem most easily managed by a human, not automated tools like CVS. On a per-tree, per-branch or even per-module basis, each chunk has a person responsible for managing changes to the tree, which I traditionally call the "Patch Master". This alleviates the common problem of multiple patches wiping each other out, as described.
Patches are either sent directly to the patch master, diff'd against a base or branch, or are committed on a per-developer branch, after which the patch master is notified either by built-in CVS mechanism or email. In both cases, it is the Patch Master's responsibility to merge changes from diff or from branches. Merging is a tedious process, but this alleviates the productivity problems affecting everyone on the devteam, limiting it to just one person and allowing everyone else to progress with further development.
Some people complain that having one person manage patches does not scale (i.e. "Linus does not scale"), but what I'm suggesting is a more collaborative, distributed, team-oriented approach -- perhaps you have a team of 10 developers with 5 "modules" in active development; each module is assigned a "team lead" as patch master and they are responsible for managing commits.
--jordan
I manage a somewhat larger project (35 developers at two sites), and we have very few issues of the type you describe, despite aggressive development schedules. The key is in architecture and planning. If developers are really performing tasks so different from each other that they aren't already in close communication with one another, you need to ask yourself how they happen to be working in the same source file. With proper architecture and modularization, discrete tasks will naturally land in different parts of your source tree.
There will always be some cases where there's a little overlap, but if your architecture makes these overlaps rare, it becomes relatively easy to see them coming when you lay out your schedule and plan around them.
I'm really surprised that CVS doesn't work for you. I've used it on single person and 15+ person projects, and we never had situations where people had to re-apply their work.
The only thing that I wish CVS had is transactional based updates. That is, either all check-ins work, or they all fail, as a single transaction. And that transaction logs (all files that are part of a check in are remembered as a group, rather than as individual checkins).
And 12 developers isn't large at all. 400, that's large.
The simple fix? Back out any check in that fails a merge or fails to compile, and publicize it!
"Ha ha! You broke the baseline again" is a wonderful motivator for doing your fucking job correctly!
It will only happen once or twice to your good programmers before they learn, and if you piss off those who don't learn and they quit on you, do you really want programmers around who are continually breaking your baseline product?
If you have several people changing the same file in a given day, then one of two things (probably both) is wrong:
- Coordination between features/projects. Somebody should be keeping an eye on the list of fixes/enhancements that are coming down the line and making sure you don't get too many in the same neck of the woods. This person doesn't necessarily have to be a developer (but should be able to speak developerese), and their whole job is to tell people, 'No, I'm sorry, but there's no room in the schedule for feature X that you want. Can I interest you in feature Y, which is in a different part of the code?'
- Code Architecture. If several functionally-unrelated features end up needing to change the same file, then something is wrong with that file. There's too much going on in it--you've got to be dilligent about keeping your components small and keeping each component in a separate file. See the excellent Lakos book for tips on how and why to do that.
Most likely, your organization went way too fast at some point in the course of setting up the core code architecture and the processes by which you decide what does and does not go into a release. You need to get started fixing both--or this problem will keep getting worse and worse until you're unable to move forward through your own inertia.Also, if your code is fairly big (more than a few hundred thousand lines), you need to break it into logical chunks and assign somebody to watch every checkin to each chunk. That person is a developer and responsible for making sure new code gets reviewed and unexpected changes aren't being made. If your code is smaller, one person can probably do that.
My experiences on a 80ppl project:
1) Minimize dependencies through refactoring.
2) Try to avoid branching as much as possible.
3) If branch, minimize the lifetime of it.
4) Before merging back a branch into the main, merge the main to your branch, recompile, test and then merge back.
Just some thougths...
I agree with some of the other comments about how the fact that you've ended up in such a situation probably means that some things weren't planned right to begin with.
However, since you've still got to dig out of the hole that you're currently in, I think you're sort-of on the right track. The problem with bringing in a bunch of changes at once is twofold: you have to do the merges properly, and you also have to ensure that bringing in developer A's changes doesn't break what developer B just did.
To get the merges done properly, your best bet is to stick with your current air traffic control strategy, but make sure that you have a controller who runs the show and does the merges. You need someone experienced enough that they can merge most people's code without breaking it, even if they don't quite understand all of how it works. Devoting a person full-time or almost full-time to merging at crunch time would mean that developers can focus more fully on making the right fixes, testing them, and making them easier for you to merge.
As far as verifying that your merges haven't broken something else, probably your best bet is to test rigorously with each new release via automated tests. This is going to be a pain to set up right now if you haven't planned for this, but there's really no other good solution. You can also try to enforce peer review for changes, on the hopes that developer B will notice that developer A's changes will break something that B just fixed, but peer review doesn't always catch everything, especially at crunch time.
I think you've pointed out an interesting failure of current SCM tools - they make it fairly easy to keep things organized, but they trade off speed for that organizing power. I don't think we have very good tools yet that will allow you both speed and organizational power.
Your right to not believe: Americans United for Separation of Church and
Sorry, but that's a TERRIBLE idea. Things will always happen out of order. The best baseline is a fresh cvs checkout. Programmers are responsible for not checking things in until they know it doesn't break anything. How do they know it doesn't break anything? All the tests pass. Where do the tests come from? (Wahwahwah, we're too busy coding to write tests!) You write tests BEFORE you write code. When you write the test, it's broken. Then you fix it. It works.
Another useful Extreme Programming idea is "stories". Programmers work on problems small enough that they can fit on a notecard. If it won't fit on a notecard, break it into smaller problems that will. Pretty soon, you have a huge stack of "stories". Sit down with your customer and triage them. Release early, release often, and the code is ALWAYS correct. Or at least all the tests run, which is more than can be said for 99% of all the code out there.
Hand me that airplane glue and I'll tell you another story.
We use a couple of methodologies in my work place. Granted we are a Java shop so some of this stuff doesn't apply across the board, but the concepts work for almost any language/development platform.
First, XP Style testing. Test first, and test often. Write a test case for every class you make, test everything, unit test, regression test, integration test you name it just TEST it.
Second, simplify your development process. There really should not be the need for multiple people to be working in the same file/class/header etc... Assing pieces of the project to different developers and model it out, have them work in there rescpective pieces, if you MUST assign multiple people to the same header, thats okay, but make sure they work together closely to not step on each others toes. This is really a planning issue.
Third, I assume you are following a build process (nightly, weekly etc..), we use Ant to help with this. Granted it doesn't help for the problems of developers stepping on each other during the day. But it forces everybody to check in there code and make sure it works, everyday (we use nightly builds).
Okay with all of this stuff, we rarly EVER have problems. Our code is usually close to bullet proof (the constant testing), each developer really knows the portions of the code they worked on, and can quickly make fixes if needed (the simplification of the development process), and we are constantly aware of our timeline and progress (nightly / weekly builds).
Anyways, thats just how we do it ;)
-ryanFor example changing headers that two developers need... The only headers that two groups need should be interface headers, these should be set early and not need a lot of change, with any change taking both developers changing the code internally... I work on a core technology team whom are all very experienced programmers originating from many different software universes (embedded, games, DB, etc). Based on the evil of past experiences, we set out to develop the interface system before starting *any* actual project (or projects). This took a little over a year of design and development evolution with no actual product releases (some protos). In spite of such a long "generation" process it took us only 3 months to release (spit out) one particular product... I truely believe in taking time to dream and design--construction is the easy part.
When I worked with CVS I always had problems of people overwriting my changes or incompatible changes.
In the Linux world there are usually modules maintainers. Often only one maintainer is responsible for the ci/co of the source tree and more often people pick a branch to work on then port their stuff to the branch it gets checked in to.
The Linux kernel does not use CVS, right now they are moving to (or moved) bit keeper.
Where I work we use a custom program that does locking so that only one person can work on a file at a time and this eliminates all conflicts. PVCS also does this and run under both Windows and various UNIX flavors. Locking is not as bad as it may sound and in fact it is good in the case where you have several files that are frequently accessed.
CVS is good when you have less bumping heads. Of course this has been my experience, others may have other opinions.
One thing you could do is build a small program layer on top of cvs, or maybe some scripts to do some locking so that people are less likely to bump heads.
Only 'flamers' flame!
The reason it isn't concurrent is because you have people separately working on patches, but those patches are dependend on one another, so their ordering matters. When ordering matters, the situation is sequential, not parallel. The whole bit about air traffic control and sequence numbers is about serializing the parallel development.
The trick is to decompose the development task into chunks that are in fact parallelizable. In turn, those chunks may have sub-chunks which are not parallelizable; those chunks should be perhaps done by one developer, in the correct order.
No developer should wait around for another's patch, and nobody should develop anything that he or she knows will soon be invalidated by a forthcoming patch so badly that it will have to be substantially reworked. If a unit of work depends on some forthcoming patch so badly, a developer should find something else to work on until that patch arrives. How you know that the patch has arrived is by monitoring your e-mail, or scanning the version control system for changes. The other developers should know that someone is waiting for their patch in order to do the next, dependent part of the change, and broadcast it to the team when they are done.
I work with a number of other software developers on a fairly large project where things are constantly changing. We currently use Clearcase. To avoid multiple submissions in order to have changes accepted once, each developer has their own branch, which is a copy of the main one. Developers work in their branch changing files until they feel they're ready to sumbit changes to the baseline. They merge the current baseline into their branch, grabbing all changes so far, and then merge back into the main branch. This effectively checks out the files, so only one person can make and submit changes at a time.
Then I put on a suit, because you can get away with anything if you're wearing a suit. Suits lie.
I worked on a large project (20+ developers) where this situation occured from time to time. There were specific interdependencies between some sources files for various parts of the development effort, and it was easy to step on other peoples feet unless specific steps were taken to prevent this.
Before I describe how we handled this situation, I want to stress the fact that if someone intelligently devides the labor according to how the changes will affect other parts of the code, the need for developers to sqabble over specific changes in specific files should be eliminated.
Labor should be devided at well defined "interface points" where the additions/changes to the interface can be done quickly, satisfying the needs of other developers requiring those interfaces to build against, and then completion of the underlying code can be done with little interference or effect on others.
In short, devide work along interface boundaries, and stub out interfaces with enough code to allow compiling, while developer(s) continue to actually implement the code behind the scenes. Thus, swapping in the actual code has no effect on any one else code, exept that the stubs are now full implementations and work correctly.
Ok, so what happens if the devision isnt clean and you have two people working on the same file?
NOTE: When I am talking about file granularity, or developers "owning" specific files, you can also substiture "subsystem", etc. Sometimes developers are working in entirely different areas of the source tree for the most part, and it makes sense to assign an entire sub tree to a specific developer. This is the devision by interface, which is the usual case.
What we did was assign specific files to specific developers, who have the most work to add/modify to the file. When other developers require changes to a file "owned" by another, they perform the merge, which is verified by both sides to work correctly, and then it is checked in by the "owner".
This was all accomplished using locks (file checked out, ClearCase) and multiple views. The locking of files was a benifit, as it prevented accidental overwrites of other peoples code. Once you check out a file and lock it, no one else (short of the administrator) can check in a modified version and clobber your changes.
A short scenario:
Alice: owns file/tree "something.cpp"
Bob: owns file/tree "modified.cpp"
Tree is something like this:
RootBranch
|
|-- Devel
|
|-- AliceView ---- BobView
| |
[code] [code]
Alice and Bob are both working in a development branch. Alice has the files she is modifying and "owns" checked out and locked. Same for bob.
Bob realizes he needs to make changes to "something.cpp" to support some changes he is making in "modified.cpp".
He checks out a temporary version, unlocked, of "something.cpp" and makes the required changes.
He then notifies alice of the changes, and using the automated merge features she adds his changes, manually resolving conflicts if necessary.
Bob changes his view to use Alice's version of the file with the rest of the code from his view. He builds and verifies that everything is working correctly. Once this is verified, Alice can check in the changes, and Bob can now use the most recently committed version and continue on his merry way.
What this boils down to is basically enforcing ownership through locks to prevent accidental overwrites of others code, and defining clear lines of ownership so that a change is only accepted and merged when the person responsible for that code has tested it (in addition to the developer desiring his minor modifications be included)
I've admined both extensively, and I can make a couple of comments here. First, Clearcase is licensed software. Understand that when you get locked out because all of the licenses are in use, you cannot touch your source-code (though someone with a license can copy it into a sandbox for you). Also, Clearcase is a resource pig. It wants a pretty beefy central machine to run on, and if lots of people compile at the same time, the virtual filesystem is not very efficient.
Now on to CVS. CVS is most everything you want from revision control. It's biggest shortcomings are in branch management and the ease with which changes can be made incorrectly. Its ability to interface with well known and standard protocols like rsh, ssh and gzip (which is a format more than a protocol) make it painful to move to anything that's overly proprietary. Its use of your local diff is wonderful ("cvs diff -u" was a revelation for me).
Clearcase manages branches better and can handle non-realtime latency in updates (e.g. you can have two Clearcase repositories at different sites and you can connect them by mailing tapes around or by dialing up once a day). This can be invaluable when you're working in high-security environments, but is otherwise mostly a moot point.
Clearcase has improved in the last few years. They've added some local-checkout features where you don't have to work off of the virtual filesystem, and that helps.
Overall, I'd say CVS is the better system, but Clearcase will sometimes get jammed down your throat, and there are definitely worse fates than to have to get your project working under it.
It sounds like you may need to make your projects more atomic if you're having problems like this. Make classes (or whatever organazational unit that you're using) smaller and put less in files. Finalize APIs for components to work together and tell people not to touch what itsn't theirs.
Assign a maintainer to each portion of your code and make it such that if others want to change that, they have to send patches to the maintainer.
There are lots of ways around this, they just require better organazation than Ok, everybody make everything work!
How they communicate is a separate issue. IRC works for some projects; IM is another choice. CVS also offers file-level locking and that can be used to communicate that someone is preparing a patch for a file, but it requires planning ahead and splitting up files.
Most companies find (the ones that actually DO it rather than pay lip service to it) that designing a project PREVENTS the "patches on patches".
Another thing that many people say they do (but rarely do) is actually have meetings that accomplish REAL goals rather than perceived ones. I have been in "design" meetings that were merely CYA meetings - nothing was designed and it was all a waste of time. On the other hand, I have been in meetings that I was invited to (but really had no business being in) that actually SOLVED problems a) BEFORE they happened or b) reworked the nature of the beast so that it was not nearly so intractable design.
Communication. Not just CYA, but actually TALKING and LISTENING (you wouldn't believe the number of software engineers that just will talk and talk and never hear a damn thing).
Making it a death penalty to break someones locks helps too...
IANAL, but I've seen actors play them on TV
even if if think i know who sent this message to make free advertising on it's product, CVS is still the better choice for standard development ...
(I prefer white paste) US...
Meaning let the user see realtime results instead of through the web.
An IDE interface which is sorta like IRC or something
If you use Linux, please help development of Autopac
Where I work, we have ~15 developers working on a web-based application. There are never any problems with multiple developers working on the same code at the same time. Why? We have a policy whereby all checkouts are exclusive, and there are no merges. That way, you get in, change the file you need to change, and then check it back in again.
This may not work for everyone, but if you modularize your code it really isn't that bad. Heck, we use Visual SourceSafe, which has to be the crappiest revision control system on the planet, but we can still make it work.
www.timcoleman.com is a total waste of your time. Never go there.
I was architect for a project with a team of 25 folks. 15 in Hyderabad, India and 10 in Boston. We used StarTeam as the source repository. I have to say it kicks ass!!!
Starteam has got a built in bug tracking system, provides extended support for build labels, branching etc. They provide two clients (1) Windows client (2) Web client (HTML, Javascript etc.). So basically, the server can be configured to listen on port xxxx or can be accessed by a web server which will run the web client.
The downside to it is that it has piss-poor diff-merge tools. But we used Araxis (which really really kicks ass) for merging files.
This setup and some basic procedures let us do continuous builds without too much of a headache.
Conicidence? That's Your call.
I disable sigs...do you?
Perforce has a daemon that is run, and it's got clients for Windows and all the Unix clones, free and non-free.
Now, perforce itself isn't free, but some things are just worth the money.
I have used sourcesafe for about 3 years now, it uses this menthod, and it has always worked out pretty well (although there are some issues with databse corruption when projects get very large!). It really forces discipline, if you don't checkin your code as soon as you are done then other developers will complain. You might think that you lose some flexibility here because two developers cannot work on the same file at the same time, in practice though we have found that there is very little loss. If another developer is working on a file then there is always something else that you can do on another part of the source until they are finished. Not having merge problems or anything like that is a pain.
Does anyone know of any good open source solutions that allow for this type of source management? Sourcesafe may work ok but it is far out of date now
Everyone needs to stay up to date all the time.
If you have a regular (hourly/daily) build that smoke-tests and reports the results to everyone, people will be more willing to sync to the latest and trust that they won't lose lots of time with problems. Embarass anyone that breaks the build, make sure that everyone understands it gets fixed ASAP when it breaks, checking in broken code is NOT ok; and then people can sync every few hours or every day, and the problem simply goes away.
is competition good, or is duplication of effort bad?
- Smaller. You only need one executable on your client. And one more for your server. No kernel patches, no drivers, no installation, just the binaries. Ubercool.
- Multi-platform. Perforce has binaries for practically every platform in use out there. Find me another Version Control System with BeOS, QNX, AIX, SCO, MacOS9 and MacOSX support.
- Fast, fast fast. Because of the low communication overhead, it works extremely well across slow/high latency links.
- Ease of use. It's really easy to configure and setup.
- Great support. We've had to go to perforce support twice and both times they've been awesome, with quick responses and knowledgeable people.
- Price. The single server, two client setup is FREE! And per seat licensing + support is very very reasonable. I use the free one on my laptop to version files.
- Plug-ins. Perforce publishes their API, and they have perl, python, ruby and tcl utilities galore.
Now my process recommendation. If you have paid for ClearCase then pay for some more education. ClearCase/Perforce can do lots of what you need automatically. You should practically NEVER have to repeat the same change on a file. You may want to look at some branching/merging techniques which can eliminate the need for colliding checkin's also. Rational has a bunch of whitepapers up on their site, as does Perforce.Most of all, I would advise you to educate yourself on the options/methods of version control. 12 isn't that big. Wait 'til you get to 1200.
There is no trap so deadly as the trap you set for yourself
-Raymond Chandler, The Long Goodbye
Perforce has them, ClearCase and CVS doesn't. You can check in a list of changed files as one atomic operation and it either succeeds or doesn't. So you can't get a partial check in which breaks the build. This is a totally kickass feature which has saved my bacon a couple of times.
There is no trap so deadly as the trap you set for yourself
-Raymond Chandler, The Long Goodbye
Learn to use OO (or any other methodology which lets you segment code based on likelihood of change). e.g. in Java you do your analysis and design, and your interfaces are more or less 'set'. Then you just code the implementations, and you don't generally tromp all over eachother nearly as much.
good lord
NICE AD
(My screen got jumbled... i hope this isn't a duplicate post.)
My job has been to create a version control system that solves exactly this problem and automates others. Right now, it uses CVS, but is based off of a system that used PVCS and ClearCase.
Our revision system starts with release labels. Each directory gets tagged to a particular release label. Baselines, then are groups of release labels.
When a developer releases, their changes are branched off of the release label the directory started at. Then, the latest release is merged into their changes (if a later release from the one the branch is based off of occured). If everything still checks out, the branch is merged to the main development tree.
We have two scripts that make this process very simple: a commit and a release. The commit will take the current directory and create a users branch. This is for moving between trees - the developer now has a sandbox label they can update to. The release, which calls commit to make the user branch, handles both passes of merging.
I've noticed that CVS is not the greatest system for handling branches. It's ability to detect common ancestors is, well, flaky. I'm eagerly awaiting the release of Subversion, since my company really doesn't have the ooodles of $$ for ClearCase.
Also, note that when a developer releases, we also collect information which can link this release to a bug tracking slip. We also collate the release notes for releases. Ergo, you want to make a patch for your product, it's easy to collect all of the notes of changes and then create a single ChangeLog (which you usually want to edit a bit).
The system actually has been extended to handle multiple streams of development. This allows us to bugfix released versions of the product while making later changes, and smoothing the bugfix integration between releases. Adding this feature is both an enormously complex task and a huge timesaver for a large development team. You'd definitely push the bounds of what CVS can handle at this point.
Just some thoughts. This isn't a simple issue, but can have a solution that is simple to use.
-T
That's the a few points that I've found to be helpful in my professional work. Your mileage may vary.
Good luck,
--Mid
The Largest project I worked on was when I was leader of a team of 6 people, which was in a team of 20 people, in a sub-project of 100 people, in a project of 1,000 people. And that was just development.
Tools are great, but for large projects the hammer rules.... don't know the hammer ? The hammer says "Fuck up the build, I nail your testicals to the floor, fuck up the release and I nail your eye-lids to your testicals first."
An Eye for an Eye will make the whole world blind - Gandhi
Test first....
But tests are written by the developers, so good developers = good tests, bad developers = bad tests.
Don't do XP, anything in engineering that says Requirements and Design don't come before implementation lose the right to be considered engineering.
An Eye for an Eye will make the whole world blind - Gandhi
The way I understand the text of this article is that the patches have to be reworked several times. This is integration work, not submission work. The developer makes a change which depends on some other change, but rather than waiting for that other change, just goes ahead and does it. Then when the dependent change comes in, the patch is wrong and has to be reworked. Am I close?
Believe me, I have experienced your pain, and we weren't even using multiple branches.
It sounds to me as though your developers are NOT updating their local ("sandbox") versions before they merge their changes back in.
Developer A: checks out file, starts work.
Dev B: checks out same file, starts working.
Dev B: finishes, commits changes to CVS.
Dev A: finishes, commits changes to CVS without updating first.
Developer B's committed changes are GONE!
Make it easy on yourself, and do NOT let multiple developers check out the same file.
or having it mod'd up to 5, while competitor (bitkeeper) suggestions mod'd down?
If you want to improve parallelism when people work on different pieces of the same program, try to stabilize the interfaces early. Then two or more developers can walk away and do their thing independently of the others: they are all dependent on the interfaces which change very little, and not on each other.
This is not quite the same thing as merely decomposing a program into modules. It's decomposing a program into modules which have well-defined interfaces that have an identity of their own. But there are other things that are interfaces.
An interface is convention by which two parts of an application relate to each other: a set of functions with given arguments and semantics; messages exchanged over a network; file formats or database schemas, directory structure layouts, names of files, language syntaxes, and so on. Essentially anything which has structure and which is formed in one place, and interpreted in another. All such structures create dependencies, and the dependencies must be identified and managed if you want to know how to parallelize development.
There's a well known answer to this (and many other problems that amateurs run across time and time again) - see subject.
Don't do XP, anything in engineering that says Requirements and Design don't come before implementation lose the right to be considered engineering.
Bingo. XP is just a spiral lifecycle stuck in maintenance mode.
A Government Is a Body of People, Usually Notably Ungoverned
The problems mount as the source files get longer. On the project I'm currently working on, we have an average of 80-100 lines per file. Are some files abnormally huge? Yeah, but for the most part the files are small. Some times a file has to be a little bigger than that to perform its function. Of course with only a few lines of code per file, it will be very rare that more than one developer will have to touch it at a time, and furthermore, there's only room for so many bugs in 80-100 lines before it becomes more worthwhile to rewrite it. If the code is really that buggy, then the small source files will make it easier to gradually reimplement sections and make the task seem less daunting to the developers. Also, If the interfaces are intelligently designed and well documented in advance, then you have the ability to reimplement files without causing troubles. When I open up a piece of someone else's code and find a 7000 line source file (I've seen 20000 line source files in some projects...one piece of control software had an entire user interface implemented in xlib and network communication layer in one 20,000 line file), It becomes a formidable task to understand it well enough to fix or improve it. If the source files are small and their purposes well defined, then everything gets along better.
Brian
In one company I worked for we used Perforce and checkin token file. Only the person who had the token was allowed to check in, also he had to smoke test the project before releasing the token. We also had a tradition of adding haikus to the checkin token file. You can read them in one of easter eggs :)
The last two companies I worked for used Perforce, and it solves this quite nicely. It will tell you if a file you want to check in has been changed by someone else in the mean time, and will help you merge in that person's changes before you make your checkin. Perforce is extremely powerful and easy. The only reason *not* to use Perforce is if you're hung up on using an open source product for your source-repository needs. If you just want to get the job done, you use Perforce. For home use, they even have a limited (2-user) version. For open-source projects, they offer free licenses as well.
I have used both CVS, Clearcase, MKS, straight RCS, and a bit of Visual Source-Safe (Ha!).
What I have found makes the most difference in reducing merge issues is (1), have developers merge on checkin (more on that in a moment), and (2) develop a branching strategy that reduces the need to merge.
On (1). When using ClearCase, have people set checkouts to be "unreserved". When they check out a file it won't lock it so everyone else can use it, and when they check something in ClearCase will force them to perform a merge if it has been updated since they checked it out. Make everyone learn how to use the henious merge tool. Also (and I'm not sure if this is fixed now or if I never figured out how to configure this right) I have had major issues with EOL markers in files and clearcase merge - if a developer edits a file that changes the EOL character(s), then ClearCase merge will claim the whole file differs.
For CVS, it pretty much works naturally the way a system should - the default is to check out something "unreserved" (or at least that's how I remember it!). You can either update a file while you're making changes to keep up to date an make merges smaller, or just wait until you're done - before you can check it in CVS will make you update the file and merge your changes.
In both systems, an approach of having the developer merge the file means the person who really knows what's going on can resolve any conflicts with the merge. Most merges are automatically handled and so often no work is required. I'm not sure how much this really addresses your issue, but it can't hurt to rely more on the source control system helping you manage merges.
For option (2), think carefully about how you want to structure releases. One approach I've used before is having dfferent "levels" of development - you have a production branch, fix branch, development 1, dev 2, etc. You start out code at some given level (say development 2) and as it is completed and tested it moves up through the ranks until it reaches the production level. That worked pretty well and meant a fixed number of branches.
In more recent projects I've been working with a monthly release structure - each months release requires a new branch. This was not my idea, believe me... it might seem nice in theory but in practice you might have three or more months of development at the same time. This leads to something of a merging cross-nightmare. Some sort of branching structure might well help solve your problem, if only in thinking up how you can more clearly seperate changes to a file or set of files.
"There is more worth loving than we have strength to love." - Brian Jay Stanley
I'm especially curious as to what language VA Whatever picked to write the web-based admin in.
Perforce has atomic transactions. Perforce has a slightly biased (but not much) comparison of p4 to cvs. Atomic transactions plus easy administration makes it worth the price for my company.
There is no trap so deadly as the trap you set for yourself
-Raymond Chandler, The Long Goodbye
Sheesh. What editor is modding up all these VA Software advertisements as "Interesting" or "Informative"? And there's a cute little Microsoft bash thrown in at the bottom too, with absolutely no evidence thrown in (or even the name of the Microsoft software that this person used!) Disgusting.
[I am not a CVS guru, I just use it.]
If you have to apply patches multiple times, then you're probably patching branches, and developing in the branches. The "CVS Way" seems to be (corrections welcome) to develop in the default branch, and to tag the tree at drop-points -- when you ship the code. If you need to support an old code-drop, you turn the tag into a branch, and then patch the branch.
If you have too many delivered branches being supported at the same time, perhaps you should upgrade those customers to a newer version of the software. They'll appreciate, and it will simplify your situation.
The develop-in-a-branch-and-ship-the-default is appealing, but troublesome.
Otherwise, it sounds like your developers aren't playing nice... developer A patches the tree, and developer B goes to commit his changes, but gets told that there are conflicts and that he needs to update. Not wanting to deal with the conflicts, he copies his important files to a save spot, updates, copies his "important" files back over the top of the conflicted files, and then commits the whole thing, effectively "rolling back" the patch.
If this is what is going on, you need to educate your developers. With a stick.
Over the years, I've discovered that a significant amount of heartburn I may have with CVS comes not from any deficiency of CVS, but from the fact that I frequently fail to use CVS "properly" -- meaning, of course, "as intended".
Pick One: http://www-rohan.sdsu.edu/~stremler/sigs/sigs.html (Note - disable Javascript first!)
We all know that BitKeeper isn't free software, and I don't really know why it was chosen by Linus to manage Linux development since we have a much better solution in arch.
I think it is great. It follows the *nix spirit, spliting a problem in lots of little parts as addressing each part with one pre-existing tool or some home-made bash scripts. For the diff/patch lovers (like myself), it is great to find another uses for these tools.
I used to use CVS... but it doesn't scale so good. When Linus move to BitKeeper, I give it a try... but it was so disappointing. So I decided to give arch a try.... I am using it 'til now.
You can check it here.
Anyone that says you can do a complete, and working, requirements and design phase _before_ you do any kind of implementation is living in some other reality.
Even when you get to the level of complete Z specifications, all you've done is implement the solution in a language that only humans currently understand. Way to go!
ustr: Managed string API with ave. 44% overhead over strdup(), for 0-20B
I am dead serious, this is how we handle it.
No revision control system. The code is just on a server that all the stations have mounted. Programmers edit what they need to. If you notice some of your work just disappeared, you shout out, "hey, who's working on foo?" Then one of you two decides to work on something else for a while.
The real trick is this: we don't have "crunch time" so we don't bump into each other. If you've got 12 programmers all working on the same code in a hurry, then you have management problems, not technical problems with concurrency. Threads inside a computer needs to deal with concurrency. But people aren't computers, we're above having to worry about that stuff.
I don't think Microsoft has any offerings in the serious source control space. They do have a toy called "Visual SourceSafe (?)" but I don't think even they take it seriously. The leader in source control is ClearCase. The ClearCase VOB server is normally run on SPARC/Solaris. The client runs on everything common, including of course NT, Linux and Solaris.
So I'd leave Microsoft out of this discussion and ask whether Sourceforge.* can compete with ClearCase. Or what Sourceforge.* offers beyond ordinary CVS.
So this is the only sort of development that we do. A team with 20 engineers is actually pretty small here.
The first thing to do is to divide the functionality among engineers and have the engineers work in their own modules, talking with other engineers to find out what their needs are. That way, one person is responsible for changes to any given file. In addition, make sure that files are *locked* while somebody has them checked out -- ie you can't change it while I do.
Next, people should know about what they're going to do to a module before they check it out -- if it's checked out for more than 1-2 days, it's too long.
And, finally, there needs to be a mechanism to group changes together. So, if you're making a change in the code that requires changes in modules A, B and C, you check out A, B and C, and then you check them back in. That way, later on when somebody wants to compile the entire thing, they know they're getting your full change and not just your changes to A & B.
In any sufficiently-large software project, 10% of the code will receive 90% of the changes. The key is to find a way to serialize changes to that 10% -- the other 90% of the code will probably never be an issue.
Um, how can source control be really fast? I'm not sure how speed is an issue...also, I'm sure you're talking out of your ass, what with the lack of factual data in your post.
As the author of the original 386BSD 0.1 PatchKit software, I have to say that your "air traffic control" approach will not work.
The 386BSD 0.1 patchkit used a serialization of patch numbers, with central assignment. The reason for this was that the patch dependency management was done by manually applying patches posted to Usenet, and then diffing the modified version of the code against a version with the previous N-1 applied.
Effectively, it was a "human CVS repository" system.
Ir was necessary, because the latency in the Usenet system meant that you couldn't "lock down" a file or set of files for some major change: you had to do what you wanted to do against what you had, which was almost never "the most currnet concensual version" of the code, and then hope someone else didn't win the race to "the repository" (at the time, terry@cs.weber.edu's incoming email, and then, later, Rod Grimes', Nate Williams', and then Jordan Hubbard's... no one wanted it for very long).
This led to all sorts of problems; the major one was that the patch kit format was "reverse engineered" (not hard; the patch tools, except the creation software itself, were widely distributed), and a group started releasing patches in the "1000+" ID range, under the incorrect impression that the concern was over the patch namespace collision, not topological application problems. This eventually led to a big argument, and other people going off to play in their own sandbox.
You've probably heard of "NetBSD". A couple (not all, of course) were motivated by communit rejection of the 1000+ numbered patches, which, while they were not colliding in serial number space, seriously blew out topological dependency space for modified files.
In any case, that's exactly what you are doing with your code, when you plan on assigning patch numbers based on expectation of completion.
With the number of people you have, the comments about contested interfaces being agreed to beforehand, and the comments about you having no real problem here in the first place are probably accurate.
You can basically take a couple of approaches.
The first is: don't accumulate patches, just check the code in. This respolves the problem of stale patches by not permitting them to become stale in the first place.
The second is: "cvs tag" before any major commits, so that there is a baseline from which to work to resolve conflicts.
Really, you should not be accumulating large patch sets, with as few people as are involved.
If you have a huge offline latency from a developer or group of developers (e.g. you send a CDROM to Antarctia, and two months later the send back a CDROM with their patches on it), or if you have a huge number of developers, you should reconsider your chioce of tools.
The 386BSD patchkit serialization of patch sequence numbers through a couple of human beings was a serious mistake. It had the emergent property of having a tiered set of priviledge. I'm convinced that this is what resulted in the current "core team/committer/less-than-dirt" striation in the BSD camps today.
I mention this, because CVS has a similar, though somewhat less profound, emergent property of "The One True HEAD Branch". By its nature, it encourages a single direction for all experimentation and all forward looking thought, denying nourishment to any contradictory lines of inquiry, by chopping off the roots. CVS is, in a nutshell, anti-research. It prevents people from going off 90 degrees from where everyone else is headed, and discovering new territory.
Perhaps you've heard of OpenBSD. It emerged because there was "One True HEAD Branch" in NetBSD (an early adopter of CVS, in Open Source-land), and several people felt strongly enough that the focus of the project should be secure systems research, that the resulting code directions were incompatible.
Tools issues are at the base of nearly any strong divide you can name in an Open Source community.
Linux currently has issue, where Linus is investigating the use of Larry McVoy's BitKeeper (Larry was smart, in that early on, he recognized the emergent properties tools choices force onto projects, and tried to design around the problem). It turns out that a single human CVS repository doesn't scale infinitely.
FreeBSD is in the throes of a "To use Perforce or not to use Perforce" decision. Perforce supports seperate lines of concurrent developement.
It fosters, as my former boss' boss, Ray Noorda, used to say, "coopetition": help each other make the best implementation according to their design, and then may the best design win.
Perforce lets this happen, but it also tends to balkanize developement, if not everyone is using the tool. There are complaints in FreeBSD that significant work is taking place in Perforce branches that aren't visible to normal CVS users. The Perforce users complain back that there would be no need for Perforce, if the develeopement were permitted in the main CVS tree -- along with the breakage that would entail. Both arguments have merit. Right now, there is a truce... more of an agreement to disagree, and not force the issue today, but a promise that the battle will be fought to the death at some later date.
For your project, a tool which supports multiple concurrent "One True HEAD Branches" seems like it fitys the bill (though as I wrote that, I still asked myself why, with so few people, it was an issue for you in the first place).
Whether the tool you pick is Perforce, Bitkeeper, or some other tool that can support that developement model is irrelevent.
What is relevent is that you understand that our tools shape the way we think about solving problems, and if you have already arrived at an approach that doesn't -- or *can't* -- fit into the shape dictated by CVS, then it's probably time to look at another tool.
Not matter what you do, I can guarantee you that layering another, less adequate, tool on top of an already inadequate tool, will not fix your problem.
I can also guarantee you that if you can't change your model to fit an existing tool, you're going to find yourself in the source code control tools business, instead of the business you intended to be in.
Probably, you should rethink whatever premise it is that's resulting in large, infrequently integrated patch sets. If it's just your release engineering department not wanting to do their work on a branch, well, that's tough. Branch tag for releases as a matter of policy, and move on. If on the other hand, it's something more profound, perhaps you need to rethink your assumptions in favor of what the tools can do, vs. what you would like them to be able to do.
Alternately, welcome to the source code control tool business.
-- Terry
Okay, I think 90% of the people responding are missing the heart of the question. The original question was about parallel development, not just working on small changes in the same source tree and then synchronizing in the changes.
The best way to solve this problem is by better design of the software. If your software is well-designed, you the only time you really need to do parallel development is when you're maintaining multiple versions of the software. (i.e. service pack 1 and 2 to windows 95, while you work on windows 98 elsewhere).
The way we solve the problems above is by using a task-based change management solution. We use a commercial product Continuus/CM (now Telelogic/CM) to handle both the parallel release maintenance problem, and the manangement-didn't-enforce-good-design problem. This problem can be exacerbated by having almost random changes in the requirements combined with very tight deadlines. Fortunately, having a task-based CM tool lets you roll with the punches.
In task based change management, groups of file changes are grouped into a task. This task is one unit of work, something like "change product name string from a character string to a unicode string" which may involve touching several files. These file changes are brought in (or excluded) as a whole - the whole being a "task". Integration test approves "tasks", and if a developer wants a task before it has been integration tested, s/he can bring it in manually, and get all the updates.
This allows a group to work in parallel with the main effort, by including groups of tasks themselves that haven't gone through integration test (perhaps because they don't work yet, other developers' tasks are needed, or it's just a large change requiring more people to work on it before it can be tested).
Merging is done when needed, this way. One thing you can do in this program is "show conflicts", to show you what merges need to be done - on your parallel development effort, not the rest of the tree.
The merges never really get confused if you use a decent merge tool (we use tkdiff). The only time you would have problems is if everyone is rewriting the file to be merged from scratch every time... And in that case, the sofware design problems are so bad you really can't do anything about it.
The Continuus/CM software, however, is very costly. I think BitKeepr does some of the same things, but unless your company is one which doesn't mind its company secrets being posted to the web, you will have to pay for that too.
aegis is another free tool you might look into. It doesn't have a GUI, though.
Source control needs to be inherent in the development environment you are using. Relying on third-party tools or add-ins to successfully manage multiple developers in the same project is a mine-field.
The IDE we use is called Jade (www.jade.co.nz) and is a OO database. The beauty is that the IDE is written in Jade which means that the classes, properties and methods are actually objects in the database. Handling for mutliple developers at once is a breeze as the system does it all for you. We have 10 developers updating and managing a 1,000,000 lines of code project with ease.
I've been the lead techical achitect on a corporate systems integration project (fluctuating between 20 and 80 developers) for almost 2 years now, and though I'm not the version control expert, (we use ClearCase and I've learned to become pretty competent in release management) I've definitely understood the pain that comes when you have merge conflicts. Because we use exlusive locking in our files, the issue only comes up most prominently at bug-fix time when we branch the tree to do production fixes and have to re-merge.
The best solution we have come to is in pure communication. We divide all developer actions into developer-defined ClearCase activites, and for every activity we not only log our changes in CC, but also send out emails to the group mailing list detailing the nature of the activity and the files changed. This definitely has encouraged active communication within our developer group and resulted in far fewer migration/release issues.
If the developers are talking to each other through a structured process, they can resolve issues informally and ensure that code meshes. Currently, are comm. processes are manually, but we are looking at CC triggers to automate the process.
At work we normally use Visual SourceSafe, which, at least in my eyes, has a lot of problems and is definitely not suited for distributed programming.
Recently, however, Ive been using Perforce quite a bit, for a project in which we have three geographically distributed teams (Costa Rica, Utah and California). It is really nice, especially compared to VSS. It is completly multiplatform, very stable, has a nice interface (both winP4 and the command line client), manages branches and merges really well, etc. The only downside I see is that access times are a bit on the slow side, but then again, all of them are here in Costa Rica ;)
We also used CVS a little bit before switching to P4, and it worked well, but I think P4 is a more polished product, specially since not only developers where checking in and out stuff.
-.
I disagree. Big time.
If you seperate your development team into these modular areas, then no one is sharing knowledge in your team. That is a VERY dangerous situation for a development team.
The result it that you have these little areas of code that only one person knows about. If that area of code happens to be vital, then you better pray that the one person who knows about it doesn't get a better job. Or doesn't get run over by a truck. It DOES happen!
Setting up a development team in such a fashion is not only dangerous, I think it is irresponsible. There is too much potential for the company to have to spend extra money to make up for losses of these "towers of knowledge."
Not only that, you also state that you should do all this planning in advance. Very rarely are you able to completely plan everything out about a project in advance. And even if you do, things change, or there are things you missed. How are these little bumps in the road handled if you have such a restricted paradigm.
I have been working in XP (extreme programming) environments for 2 years now on 4 different projects, and I find it very refreshing to be able to go fix a bug that I run across in the code, regardless of who wrote it or is working on it at the moment. Who cares if that file happens to be being worked on at that moment. There are some great tools out there for merging in the fixes. CVS, ClearCase, BitKeeper...
The key is having a disciplined team who is constantly updating their local version of the code base and checking in their changes frequently. Merges do come along, but they come along less often, and they are smaller if you do frequent checkins and updates.
My 2 cents
Has anyone seen ENVY for something other than Smalltalk or VA Java?
This is a dumb way to manage a project. It barely works for Linus. And he's a genius.
The right thing to do is to give your 12 developers access to the CVS repository. If they are geographically separated, use a VPN or ssh to connect to the repository. When they finish their work, they first update their local workspace, compile and test, and if it passes, commit their changes back to the repository. Other developers get the changes as soon as they are available.
Twelve developers is the leading edge of a small project. You can have a single team, and everyone can be aware of what everyone else is doing. The best process is the one with the minumum of overhead that suffices for the project at hand. Don't add process for the sake of process.
On the other hand, don't sacrifice the basics.
Version control is not 'Advanced', it's fundamental. You might as well say you're using advanced visual editors, such as vi. SCCS is from '75. RCS is from '82. CVS is from '86. This isn't new stuff.
I've used clearcase (great tool for many things, but slow and expensive), cvs (cheap, functional, but frustrating at times), and perforce. For those that don't know, perforce is commercial but not that expensive.
perforce makes use of sequence numbers for just about everything.. It uses RCS as the backend, and berkley hashtables for a database. Unlike CVS, all files are initially read-only, and you have to "edit" a file (register an edit with the server). This along with the database provides very extremely fast revision-control operations (diffs, checkins, updates, submits, etc).
perforce has a really sophisiticated branching system (I found a few nifty advanced uses for it, though I wouldn't recommend getting too creative like I did). There is a free version of perforce that only allows a single user. Doesn't allow multiple branches or anything either... But if you're a business, then I'd definately suggest that it's worth it.. It's all the advantages of RCS/CVS with only the problem of cost. (I think it's $200/head / year).
In general, however, you really should establish a commision process... We typically have multiple branches dedicated to different aspects of development.
Each developer gets his/her own branch.. Then there's an integration branch.. A single human being has access to the integration branch.. It is his/her responsibility to take in the changes (from a posted commit number) from each developer's branch and bring into the integration branch.. That branch should then run through sanity testing (to verify more than freedom from conflicts). Finally when everybody is happy, the integration branch gets merged into the main branch (as a form of release).. At each stage, commit labels shbould be applied (for further backtracking).
-Michael
-Michael
XP says that you should pick a subset of the requirements that are known at the present time, and design exactly as much as needed to implement those requirements. Then implement that design.
Then you pick some additional requirements, and modify the design and the code to meet the combined set of requirements.
Repeat until management says to stop (or you run out of requirements.)
If you have no requirements, XP says you're finished.
And you update your design as requirements are added and changed, which frequently doesn't happen in a design up front approach. No one wants to go back up the waterfall, and then start over with the code. (And without requirements and design expressed as executable tests, you do have to start over with the code every time you make major changes in the design.)
generally two developers shouldnt be modifying the same function or even the same file, but if you use cvs properly then you can be notified via email if someone else checks out a version of the file you are working on. cvs watch does this, look it up in the manual.
Then all you do is contact the person and ask them what they are doing.
It's all about communication and good project management.
cheers
peter
I heard that Linux Kernal doesn't use CVS, they use a simple tree. Linus has his tree, Cox has his. They share their diffs. Very Zen.
It seems XP is taking over masses of programmers that love to code. Rational is for pseudo programmers and managerial types who have weak analytical thinking and often like to fall back onto diagrams. UML is no slouch, but it is often used to conceal ignorance and laziness to undestand technical underpinnings of a system.
UML is good, but as many things it is often misused.
Rational software not one I would use, but it should work considering all developers are on the same page. Tools should be used to get understanding of system being built, not building system like lego toys, slapping a schedule on imlpementation and stilling back and relaxing.
Revisions to diagrams are often painful, because you often have to explain just basic things, and gut feelings to managers who do not care, and can't understand what you really mean. Often enegery is spent on changing diagrams around and talking to pseudo programmers, than implementing quality code.
Any methodology can be great, but only if management believs it in, keeps everyone on same page and makes process a development and understanding tool, rather than programmer control tool - which often happens.
I suggest using bitkepper or CVS in combination with automatic documentation generators(Doxygen?).
Microsoft follows the "merge early, merge often" policy detailed quite clearly in the book Microsoft secrets. Its a 2 step process - do a merge (as often as possible), and immediately after doing a merge, check it back in and see if your merge has not broken the build. The person who breaks the build then remains the build manager who maintains "shippable" version, until someone else breaks the build...
If the developers are modifying the same header (*.h) files often, the solution is to "fire your design architect". Your chief architect obviously has little experience partitioning a software system and dividing up the workload.
Which shopping mall do customers frequent? Large software projects are like building a mall. Do customers goto the mall built by skilled brick layers (programmers)? No. They shop at the mall designed by a qualified architect AND skilled brick layers! Fire your architect before it is too late.
What it sounds like you're missing is referred to as a 3-way merge.
When merging 2 different code branches, you need the parent from which the two branches started, and the final version in both branches. Then you take the changes between the parent and one of the new versions, and push it into the final version of the other codeline.
The tool we used at my last job for version control was perforce (www.perforce.com). A company called Araxis makes a great 3-way merge tool for Windows that integrates with it nicely. Perforce prevents you from checking in a change until you resolve the merge with the latest version checked into the system. And it doesn't allow any of the files to be checked in until all of the files are merged. (Atomic Changelists.)
If someone has to re-apply a patch multiple times, something is seriously wrong. The other developers are not correctly applying their patch to the latest version of the code, and are instead overwriting it. This is malpractice on the part of the other developers.
Which is correct? The following same line of source code exists in two copies of the same program file. Resolve the ambiguity!
Line 10 - "The TRUCK is RED"
Line 10 - "The CAR is BLUE"
Resolve line 10 with "automerging". Cannot be done. Automerge sucks!
All developers should work on the same branch to take advantage of winkins and to reduce merging. Read the man pages on clearcase config specs and learn about time stamps to prevent your code from shifting beneath you while you work.
First. If there's only a dozen of you, and you have a branch, perhaps you should talk. Now if you were like some of my former client base who does follow-the-sun development around the world, this gets a little hairy. By that I mean a company like a TI or BT or Philips is passing code around the world and flexibly allocating resources from multiple sites to a project. Serious commercial SCM systems like Continuus/CM, err.. Telelogic CM/Synergy with DCM and CC with Multisite can be used to coordinate. This is one place that open source has failed to penetrate due to the small size of the potential market and the general antipathy that the average developer has to the CM people. Until, that is, they get religion by losing something very important at a crucial moment ;-) Even then most shops can get by with VSS or PVCS.
OBTW: I'm an ex-developler, system admin, desktop video weenie and computer consultant that backed into SCM. I got religion back around the late 80's but it was pre-emptive. Anyway, concurrent development actually comes in three flavors: concurrent by collision, concurrent by release (patches versus next generation) and finally concurrent by platform (Windows vs Unix, etc). Each is handled differently and under different timeframes (including permanently in the latter).
Your post alludes to the first case of a "popular" file that needs checking out a lot like a header file. In a shop your size, this should be handled by parallel checkout notification (a common feature in commercial high end systems) and hopefully a bit of coordination by phone or shouting over the pod wall. If you are in severe RAD mode, I suggest using a shared work area to enforce sequential checkout. Your SCM tool does build a work area doesn't it? If not, hit yourself on the head and ponder why people like me are now on the dole.
Okay, so you decide to branch anyway since you or the other developer are really going to break things with a rewrite. Hey, wait a minute! Shouldn't a big feature change (or small) be considered as a single unit of work where all of the files changed by the feature get moved through the mill at once. You have just re-invented the wheel known as Change Packages, Task Based CM, ChangeSets and whatever else the marketing wonks are using nowadays. Once again the high end tools have this (or should). Continuus had it back around 1997. Rat got it into CC much more recently, but there it is.
If you have a task based system, it makes life much easier to do merges and to put things in or take things out of the build. Being ex-Continuus, I could pound the pulpit about how build managers can check the configuration and its set of included tasks for consistency, but I won't. Instead I'll just mention that graphic history views and a decent merge tool that takes into account ancestor versions should take care of any merging issues.
One more thing. The sooner you catch the branch and do the merge the better. Don't punt it to CM unless they are ex-developers who know the product as well as or better than you. You may also come to find that an integrated Change Request Management system like ChangeSynergy for Continuus (err Telelogic) and Clearquest for CC become very important in your life.
Our Corporate Mission: Strive to slack off much more efficiently
I use to be a Sysadmin. The deal is with CVS is that as many other people have noted merges are not so much an issue if you do your updates regularily in your workspace. Unless the project is really small 2-5 programmers or so most people know that they can working on files in the same module as someone else. When the programmer is ready to unit test it is vitally important that they update. Also regular check in's along the development path is also very important.
Remember also, that the use of tags for marking the progress of files in development from a code ready to test ready to integration ready status is very important and as for remembering cryptic commands that is why you need a technical person and not a corporate type as your CM because all it takes is a few shell scripts, perl scripts and web interfaces to simplify tagging files and looking over the CVS structure. Hack CVSweb that is why its there.
_________________________________
ACK
I don't have much experience with concurrent development involving many programmers, but the problem seems to be similar to that of parallel computing. In other words, the difficult part is probably in how (and whether) the tasks are split up. If each programmer is given a task that doesn't rely on (or interfere with) anybody else's task, you have a more efficient parallel process. In fact, it's probably possible for some projects to have a "coarser grain" than others, and trying to parallelize such projects might just make it more inefficient.
So the question really should be "how to split up projects efficiently" and not "what tools will magically streamline a poorly managed project?"
---
Open Source Shirts
That is NOT a big project. Try 80 devs working in the same source code with multiple builds from the same common source base on both IRIX and NT/2000?
:).
Source control on large projects is TRIVIAL. You just need very specific check in procedures... That's it. People who don't follow the check in procedures just need to get hammered
Loading...
upon checking out a file from a CVS repository, require the person checking them out leave a little note as to why. probly applicable to entire modules as well.
-- justin
--Justin
Changes in Continuus are grouped into tasks. So a task might be "add a back button to the browser". Each developer in the group checks out the baseline code and works on his tasks. This environment is then protected from anyone elses work. When a task is completed, it is checked in as a group of file changes but does NOT immediately go to other developers. Instead, a build manager comes along at some later time and merges the tasks and verifies that they pass some quality assurance tests. Only then are the tasks available to the rest of the programmers in the group.
It doesn't need to be as rigid as I've described here. For example, I can include your task in my development even before the build manager approves it. Or I don't have to update my environment with the new tasks even if the build manager has approved them.
But that's the basic philosophy. Essentially, separate the development and quality assurance into two different jobs. This keeps environments clean until you can trust them enough to move along.
Devon
As pointed out by others, your problem is clearly that your developers are not doing a 'cvs update' before they 'cvs ci'. The tutorials and HOWTO's recommend this consistantly. Otherwise, you end up patching *out* what another developer patched *in*.
Have your developers read the CVS Book: http://cvsbook.red-bean.com/. If it helps, consider buying a copy or several. Helps the authors, and makes good reference for your developers library.
i've searched through the threads on this article, and i'm ABSOLUTELY SHOCKED that not one person has mentioned Aegis. i have used cvs since it's earliest days, and rcs prior to that (not counting a number of commercial offerings), and i must say, i despise it greatly. i find it to be EXCEEDINGLY clunky at managing concurrent development of FEATURES THAT YOU AREN'T CERTAIN ARE GOING TO MIGRATE TOGETHER INTO PRODUCTION.
see, in my experiences, when dealing with huge software systems, it's rather tough to get all DEPENDENT changes to move in lock-step with your own project's changes when the problem is rather large. given that, we end up having to create branches & associated tags for EVERY FEATURE which we then merge all around into release-branches, which all just becomes a rather kludgy mess. the time spent on source code control begins to grow exponentially, and the skill required to do this safely grows with it.
Aegis solves these problems like the commercial configuration management tools (clearcase, PCMS/PVCS, etc, etc) in that it allows for deltas to code to be aggregated together and considered as one atomic change (many times relating these together as implementing a specific managed change (say from a bug-tracking system, etc)).
this is ABSOLUTELY NECESSARY when you have hundreds of software developers on a project in my (admittedly limited in the grand scale of things) experience.
i've always been shocked that so many open source developers were simply willing to put up with cvs given all its warts.
hopefully this article and the discussions surrounding it force some folks to stand up and demand that the "state of the art" be advanced. (i realize that the state of the art is really far beyond what cvs does, but cvs HAS MIND SHARE).
btw: aegis is GPL'ed and has been around for a LONG time. in addition, it's core concepts are similar to almost every other CM tool i've used (even including cvs, not counting the advanced features) so most people can get up to speed with it quickly.
i'm also confident that the SIMPLER aspects of source code control / configuration managment could be integrated into most IDE's by building cvs-command-compliant wrappers around Aegis if the time-to-market for the integration were deemed more important than a native integration...
oh yeah, last thing: aegis heavily entices you to check in test cases for your code. i've found this to be a simple but effective mechanism to aid people in building good regression test suites...
i'm curious to see whether i'm alone in my opinions on these topics or not...
cheers
Peter
For my disertation I wrote a system that locked specific lines in files and then used set theory to manage locking.
The next step from there was to detect when one function/method relied on another holding still and then deny locks on the later until the former had finished development.
There were other rules with timeout on locks and stuff but I hope that explains it.
Oh boy, where to start...
First of all, let me make clear that I don't have any real experience with CVS. My main practical experience is with Rational ClearCase. Free plug: this is a very powerful and sophisticated tool, which will allow you to set up your own customized CM procedures. Branching, merging, complex checkin/checkout rules, triggers to that can be fired on any action, it has it all. It has this extension called MultiSite, which will allow you to do geographically distributed development. Basically, it has all you can imagine. Especially the UNIX version is very stable. So much for the good news. The bad news is twofold. One, it's kind of expensive, especially when compared to CVS. Second, because it is so powerful, you need a ClearCase expert on your team. Quite an investment, especially for small teams.
Which leads me to conclude that for a team of a dozen developers, it is probably overkill. Unless your project has money to burn, you probably can't justify buying ClearCase licenses.
Aside from that, reading your problem description, I'm afraid you have a culture problem on your team, that can't be solved with CM tools only.
First of all, a dozen developers isn't all that many. If you have many checkin and synchronization conflicts, it seems to me that your developers aren't talking to each other. Some years ago, I worked on a control system for the European Space Agency. We had about a dozen developers (a bit fewer in the beginning, a bit more at delivery time), and we managed to keep schedule slip to 2 months on a 24 month schedule. I think this was mainly achieved through intense communication between developers. We used a very primitive CM system (only checkin/checkout, no branching/merging), so that was not the silver bullet.
Tangent: some years ago I read a story about product development at Borland (I think the product was some version of Paradox), where 15 developers successfully developed the product almost on budget and schedule. The interesting part about the story was that the team spent approximately half of their time on meetings!
Anyway, the second problem (I think) that you have is that you are probably working from a poor design. A proper design, whether object-oriented or not, will partition the system into clean, cohesive, loosely coupled units/modules. Developers should not get in each other's way in that manner.
A third problem is universal: us programmers are free spirits. "We don't need no stinkin' control, we know what we are doing!" Wrong! It continues to amaze me that people easily accept the fact that they have to come to work every day, but they don't seem to think they need to follow CM procedures (or coding standards, for that matter). Part of this problem is that you'll get sacked for not showing up, but not for violating procedures.
The fact that you hint that checkins happen at the last minute before the deadline is sort of worrying. The attitude should be: checkout, make change, test change (!), and checkin as quickly as possible. Small, quick changes. This has the side effect that you have more checkpoints to revert to. Also, any change you make is invisible to others until you check it in! Want your changes to be in the baseline? Check in as soon as possible, or someone else will beat you to it.
I could go on and on, but bottom line is this:
- have a CM procedure
- make sure everyone is aware of it
- assign (only) 1 or 2 persons to make baseline changes
- let the CM system enforce the rules.
- and of course, have a design...
Just my $0.02...
MSN 8: Now Microsoft even has bugs in their ad campaigns.
If you want a graphical shell for CVS under Windows, I have heard good things about TortoiseCVS. I have no personal experience with TortoiseCVS however, so whether it is actually better I really cannot say.
Best thing is of course if you try 'em both out and see for yourself.
Installed the Bubblemon yet?
Perhaps by large company standards, $30k is modest. You can even justify it with arguments of how much employee time it will theoretically save. OTOH, it's still, in absolute terms, a lot of money for a small company. The question is not just what benefits would you derive from investing that $30k in source control, but also what alternative benefits could you derive if you went for a cheaper option and invested the remainder elsewhere?
If you went with, say, CVS+WinCVS (which also hides much of the command-line complexity cited earlier in this subthread) then you have a total cost of pretty much zip by comparison. That leaves $30k to spend on something else. Would hiring a part-time office assistant to deal with the paperwork also save your developers at least a week of time over the year, for example?
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
/me points out that you can do such three-way merges in many other tools as well, albeit without the nice buzzwordy title for the feature. We routinely do exactly what you describe with CVS and accompanying tools, for example.
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
As far as I can tell, CVS supports, or provides a direct equivalent to, everything you mentioned.
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
Make sure that your process includes the following, and your development system will scale well (assuming that all else is ok):
1. Your source control system should warn you if you "check out" a file that is also checked out by someone else. You can then judge how much you are going to change it, and if necessary contact the other person for coordination.
2. Update often. Keep an eye on what is changing. If things are changing in areas that you are playing in, skim through the changes to make sure they don't affect you.
3. Update to the latest version of everything, merge any conflicts, build clean, do a quick test of the product (especially in the areas you were working on), get a code review from someone else, and THEN check in. You should probably get somebody else to update, build, and test immediately after your checkin, in case a random disturbance in the force allowed things to work on your machine, but it really has a problem.
4. Set up a nightly "official" build. In the morning, run through a standard set of tests on the "official" build. This helps avoid regressions.
5. Provide lots of pizza.
Time flies like an arrow. Fruit flies like a banana.
My Names's M@, and I use Source Safe... *Hi M@!*
<flame reard>
Yes, we are a microsoft shop. Hence source safe. I am curious about all the talk about TOOLS in this thread and not as much attention to process.
</flame retard>
I imagine it is possible to manage the process without any real off the shelf tools... keeping a backup or huge levels of cascading directories. Couple with a REALLY good, well defined, and well followed process, couldn't this work? I'm not suggesting it be done, but explored as a hypothetical allowing one to discover the nature of the beast much better.
And RE: "Source Safe Sucks", it's just a tool... it should be possible to use it correctly coupled with process to solve the problems.
BTW: I'm very glad to see this thread... I dunno how many times I've searched for this information.
I'll be checking out (no pun intended) SourceForge, but I'd like to hear mention of maybe more books (chekcing out the extreme programming as well)
M@
Krispy Cream is people
But that doesn't scale... So you've got to modularize the project so that each team works against their own baseline. Any changes that you do to another teams part, must be given to them, for them to check out, and integrate themselves. The important thing is communication, it often happens that they are working on fixing the same problem, but in another and better way, and can give you back their fix instead.
So when you're having problems with your revision control system, what you are really experiencing most of the time, is problems with communication.
Ideally, all developers should be in the same building, they should work at (mostly) the same time (normal office hours), and they should all be friends, and keep a list of each others telephone-numbers, and eat lunch at the same time. Development should be split into projects (having 10-50 people on each project) that are mosty independent. All the people on the project should meet weekly, and discuss their project, important changes, etc... On each project there can be one or more teams, each consisting of four to eight people, who should have offices really close to each other. On each team, development should be split into sub-teams, consisting of 1-3 persons (depending on difficulty and experience), who should share an office, and thus communicate even further. And just as important: this should not be formalized (at least not too much). People should rotate around somewhat, not being stuck with the same people all the time, to get them to know other parts of the project, and other people to communicte with.
The important lesson I am trying to give is that the most beneficial communication, is often the informal. While having a tool help you with managing deltas is surely helpful, it can't solve every problem in the world. You need to work together, but you also need to modularize, and most importantly, you all need to be friends...
At the places I've worked that have the multiple developer problem several things were apparent:
1. Use of versioning mandatory.
2. Multiple hourly builds present too many problems to other builders. It usually shows an environment where insufficient testing is done.
The only way out is to control checkins. Generally speaking, a team leader / build leader approves or disapproves checkins. Note that this is only necessary in crunch times but checking with others in yhour team is still a requirement. Don't forget, in teams of under 7 or so, you generally know whats happening.
In several embedded development worksites, it worked pretty well if certain developers "owned" resources -- noone else accessed thos devices. If I needed a special access routine to X's floppy, X would provide it.
The leader in source control is ClearCase.
ClearCase? Perhaps they are the leader in raising TCO in SCM solutions (and no, I meant maximizing TCO, not lowering it) - ClearCase is great if you are a Software Configuration Manager (which I am) and you are trying to build your 'empire'. ClearCase typically requires a full time administrator, and for many shops, and entire team - Compare this to other toolsets (perforce) that are much lower maintenance. ClearCase requires expensive hardware, fast networks, and bloated clients.
ClearCase does allow some amazing things in the world of branching / merging, but if you are in that 80% of typical CM users, you don't really need all the bells and whistles. Yes, they are nice, but is it really worth the TCO?
-jerdenn
Alas it's not mentioned in the manpage. But you can have an overview here: http://www.cvshome.org/docs/ref.html
Yes, this is valid in some situations. However, there's no guarantee that an incremental requirement has an equally 'incremental' cost - adding security or resilience or some other pervasive aspect to an existing code base rather than doing up-front design cost increase costs substantially.
Agility requires Abstraction and Automation - if you're forced to mix different implementation aspects in the same piece of code (persistence, security, communication...) then be prepared for some cost bumps on the development path.
I insisted on this because developers and testers are at cross-purposes: The developer tries to prove the code works; the tester tries to prove it doesn't. You can't do both in one brain.
Oh, and we were able to have the tests ready at the time the code was ready, primarily because developers and testers coordinated between themselves exactly which new features would be worked on when. It worked both ways. Sometimes the testers followed developers' leads, other times the developers coded what the testers were more ready to test.
Note: I'm no XP evangelist. It has some neat ideas, but many of those don't apply well to really large projects, or mature projects with a lot of legacy cruft to work around, and it doesn't work well when you've got one guru for every two slackers and five rookies.
And the name made me immediately think of those guys on snowboards in freefall, hacking on their laptops while doing all those loops and spins.
The only big resource ClearCase really needs that a two-year-old setup wouldn't have is low network latency. We use ClearCase and when we moved our source repository off of a hub in a room full of NT servers and out to the far side of a T3 in a different building, things got much faster. If the droids don't make you put your "server" in a "server room" with crappy network between you and there, it's not nearly as bad as it used to be. (Matter of fact, ClearCase with us in Chicagoland and the server in Akron wasn't as bad as having the server in the NT server room...)
Note that we don't use derived objects (DOs). But we don't have a full-time (or even part-time) ClearCase tuner; I'm not even 100% sure what there is that they'd need to spend all day tuning. The only thing we do beyond 100baseT and an otherwise decent piece of network is cp -R the source to a local disk so we can search for content (find |xargs grep, vs. label/branch/attribute searches) faster when the content hasn't changed recently.
I've heard the stories, but except for clearmake/derived objects I'm not quite sure what the fuss is nowadays.
Developers I work with are asked to do, at least, an update on each file they are about to edit. We almost never have conflicts with CVS.
Interesting. At my current job we just dodged the ClearCase bullet by implementing CVS. Although there is companywide pressure to use ClearCaes, we met management's objectives with CVS. We sensed a bit of 'empire building' on the part of the ClearCase team, and were worried about adding a dependency and possible bottleneck to our small and self-contained team.
However, I still think that ClearCase is the leading source control system. Every big place where I've worked or contracted uses ClearCase for their primary software product. Leading product != lowest TCO. Windows and Oracle are examples. In fact it seems as if the more resource-intensive and troublesome an application is, the higher its visibility and prestige within the enterprise.