Ease Into Subversion From CVS

Re:Summary? by Hamster+Of+Death · 2004-03-07 16:30 · Score: 2, Informative

See the project front page
Subversion

Re:Is there demand? by nosferatu-man · 2004-03-07 16:35 · Score: 5, Informative

We're switching. CVS is crufty, buggy, and slow. That alone is reason enough to switch, but atomic commits and faster and more transparent branching will be, in the long run, a more fundamental win.

'jfb

--
To spur "enterprise Linux," Big Bang, the distributed two-phase commit.

Re:Is there demand? by aurum42 · 2004-03-07 16:40 · Score: 4, Informative

I don't know what your development model is, but branching and tagging are often some of the most frequent (and slowest, in CVS) operations.

Many projects follow the "make branch, fix bug in branch, test branch and then merge" cycle, which makes a lot of sense.

--
"The slave who knows his master's will and does not get ready...will be be beaten with many blows."Luke 12:47-48

Re:Is there demand? by Endive4Ever · 2004-03-07 17:02 · Score: 5, Informative

Do developers out there voice the need to store binaries? I can imagine this being needed for web developers and such, but I think programmers can just build their binaries from CVS.

Yes, developers definitely need to store binaries. I worked on a project awhile back where the boot block code was a finished binary. Because CVS was used to house the project, a horrible kludge involving UUENCODE had to be used to store the binary commits. Sometimes the binary was created by a totally different tool that the main build machine doesn't have. In the case I speak of, the binary was built with an expensive licensed assembler for an Analog Devices DSP chip, and contained as a body of the 'build' because it was dynamically 'injected' into the dsp processor from the native processor, which happened to be an 80196.

There are always cases where a binary needs to be committed. Think about bitmaps and other resources. It doesn't make sense to 'generate them from source' every time a build is done.

Given all this, it's my understanding that with newer versions of CVS binaries can be committed safely. Is this even an instance where 'Subversion' is needed?

--
---

Re:Windows server? by Anonymous Coward · 2004-03-07 17:18 · Score: 1, Informative

You can't be serious. Most serious shops with large development teams, like as over 50 programmers use other source control software. Very few mid size firms use SourceSafe, which sucks big time. Even hardcore MS people I know say it sucks big time.

Re:Is there demand? by Anonymous Coward · 2004-03-07 17:29 · Score: 2, Informative

Do developers out there voice the need to store binaries?

It's a useful feature. Many companies like to store versions of binaries alongside sources. That way, if some customer has a bug with version 2.1.2.4 of Foofware, the company can just check that out, instead of figuring out (and hoping to get it right) how to build it.

And atomic commits are very useful. I wondere how CVS got so popular without them, but I think it is that people don't have them and didn't know what they were missing.

Subversion seems to be provide a lot more of the things I expect from SCM tools.

CVS seems to me to be a layer on top of RCCS. Now, I don't use either. I'm in a PhD program, and I use ClearCase LT thanks to the IBM scholar program. Sure, it's heavyweight, but I got used to it at HP and I like it. Feels solid.

Some answers by magnum3065 · 2004-03-07 17:30 · Score: 5, Informative

Ok, I saw some questions about why people should switch from CVS to Subversion. The article does a nice job of covering what features Subversion adds, but people still seem to wonder why these are important.

Atomic Commits:
As stated in the article, if something goes wrong in the middle of a CVS commit (e.g. network goes down) it can leave the commit only partially complete. This can be a problem if changes in multiple files are dependent upon each other. Say I add a function to an API, then call it in other file. If the call gets committed and the API change doesn't, now the code in CVS won't compile. With atomic commits if the connection was dropped the commit would simply roll back. Then when my network came back up I could try to commit again, but the repository would never be left in a state where it didn't compile.

Constant Time Tagging/Branching:
In Subversion tagging and branching are fundamentally the same, they're both executed as a "copy" command. I'm not sure what the execution time is for these operations in CVS, though I believe it's linear to the size of the repository. In Subversion this is an O(1) operation. While one of the posts commented on tagging being an infrequent operation, this may be true, but why not let it be fast anyways? However, no matter how often you do tags, constant time branching is nice. I can at any time quickly create my own branch of a project to work from. Working in my own branch means that I can keep very granular track of my changes by committing frequently, without worrying about breaking something else. Once I'm satisfied with my changes I can merge my branch with the main code.

Storing Binaries:
"Binaries" does not necessarilly mean compiled code. There are plenty of things that can benefit from this. Anywhere you use graphics: web programming, GUI programming, or say game or other 3D programming andy you want to store your models. Or, you can store documentation in the repository: PDFs, Word docs, spreadsheets, etc.

Finally, the barrier to switching isn't all that high. The command line program has quite similar syntax, so switching is pretty easy, and the other interfaces such as the web viewer, TortoiseCVS, and IDE integrations generally have counterparts for Subversion.

Well, that's all I can think of for now. I'm actually going to try to get my company to switch over to Subversion from a commercial software they were using when we start on our new product. We're using a Java applet to interface with the repository now, and it's not very nice. CVS would work, since the main thing I want is integration with Eclipse and IntelliJ Idea, but there are plugins to support this with Subversion as well. However, Subversion has nice feature CVS doesn't, so I don't see any reason to use CVS over Subversion.

Consider GCC by devphil · 2004-03-07 18:09 · Score: 5, Informative

Once a week, a snapshot release is made. That means a tag is added. This operation takes, on average, 40 minutes, because the GCC source tree is large.

Every time someome makes a branch, they create a tag just before branching (for use later on, with diffs and merging). 40 minutes to tag, another 40 minutes to branch.

All because these are, stupidly, O(n) operations instead of O(1). We'd like to move to Subversion, but can't, until they get annotate ('svn blame') fully working, because GCC developers spend a lot of time doing "revision-control archaeology".

--
You cannot apply a technological solution to a sociological problem. (Edwards' Law)

Re:Consider GCC by nthomas · 2004-03-07 19:16 · Score: 5, Informative

We'd like to move to Subversion, but can't, until they get annotate ('svn blame') fully working, because GCC developers spend a lot of time doing "revision-control archaeology".
Just curious, 'svn blame' was added 2003-10. What about it is not working for you?
Thomas

Re:All your files are belong to us by Anonymous Coward · 2004-03-07 18:35 · Score: 2, Informative

Not only is it in a database, it's in a Berkeley DB. Some thoughts on this:

1) there is absolutely nothing about a version control system that requires a key/value database like berkeley DB. I think they just use it to get free locking and transactions. Strange.

2) berekeley DB is ultra-sensitive. Ever had to deal with a locked Berk DB, when no process was running that had it locked? You have to manually break the locks. Fun. This hasn't happened to me with subversion (yet), but I expect it to be a problem.

3) the *filesystem* already gives you atomic operations and so forth. They could've used that, and then written a thin compatibility layer for windows, which doesn't have posix filesystem semantics.

*grumble* *grumble* overengineering *grumble*

Re:Windows server? by ogre57 · 2004-03-07 18:36 · Score: 2, Informative

In a Microsoft Shop developers will use Microsoft SourceSafe. period.

No, they won't. Can think of several shops/teams using PVCS, plus a handful on other products, but none using MSS. Up front (purchase) cost isn't much of an issue. Time cost (TCO) very much is. MSS is simply much too slow to be competitive.

Re:All your files are belong to us by magnum3065 · 2004-03-07 18:49 · Score: 4, Informative

Someone else already mentioned the ability for live backups with Subversion. Another benefit of the database is built-in journaling support. BerkelyDB logs any changes before making them, so if your system crashes or something, the DB will be restored to a stable point. This is MORE reliable than what CVS offers, even with a journaling filesystem. Also I'm pretty sure that if you REALLY need to hack the DB, there are utilities that will let you do this. However, most of the scenarios that CVS admins needed to hack the ,v files for are no longer a problem in Subversion.

I've tried both Subversion and Arch by dozer · 2004-03-07 20:14 · Score: 4, Informative

Subversion good points:

Finger feel is very similar to CVS
Flexible directory layout & tagging
Extremely stable development.

Subversion Bad Points:

Database & log files take up a LOT of space.
Quite hard to share repositories
No way to mark your branches (if you accidentally check out the directory containing your branches, you just got 50 gigs of 99.9% identical files...)
No distributed development
Pretty weak merging

Arch Good Points:

Extremely good distributed development
Super easy to share repositories
Pretty strong merging.
Very stable development

Arch Bad Points:

Forces you to give your projects weird names ("my-project--branch-1--1.1").
Forces each branch into a different top-level directory in your archive ("my-project--branch-2--1.1").
Doesn't feel anything like CVS.
Pretty slow (but they're working on it).
Somewhat difficult to resolve merge conflicts

I wish I could love Arch because distributed development absolutely rules. I could tolerate its bizarre command set, but I simply won't accept arbitrary (and ugly) constraints on what I name my projects and branches.

Verdict: I'm still using CVS. Subversion is very close to pleasing me enough to switch... I'll probably ditch CVS some time this year.

Re:I've tried both Subversion and Arch by natmsincome.com · 2004-03-07 21:54 · Score: 4, Informative

Some of your Bad points for Subvresion don't sound quite right:

*Quite hard to share repositories

The repositories can be read using any WebDAV complient software. If your talking about on the web the articles says you can use viewcvs as a web interface. If you want poeple to connect to the server then it should be setup by default as it's client server.

*No distributed development

If your talking about multiple servers like bitkeeper then I can't help you *I know nothing* but if your talking about client server then there's a misunderstanding as it's been designed to be client server.

I may have misunderstood what you were saying but the comments were a bit vague.
Re:I've tried both Subversion and Arch by dozer · 2004-03-07 23:22 · Score: 3, Informative

Quite hard to share repositories
The repositories can be read using any WebDAV complient software.
Ever tried setting up a WebDAV server? That fits anybody's definition of hard. The Subversion team recognize this, so they allow you to access the repository over ssh too (thank goodness!). Problem is, everyone using ssh must log in to the same user account or the permissions get screwed up. So, yes, it's quite hard to share repositories in Subversion.
No distributed development
If your talking about multiple servers like bitkeeper...
Um, yeah. OK, allow me to be slightly clearer: Subversion does not support decentralized development. Not at all. It's a major limitation.
Re:I've tried both Subversion and Arch by W2k · 2004-03-08 00:24 · Score: 2, Informative

Ever tried setting up a WebDAV server? That fits anybody's definition of hard.

I strongly disagree. Setting up a Subversion repository to be accessible over the 'net was PISS EASY, even for me, a first-time user. You can use the included light-weight server (svnserve) or Apache2 if you need options like complex authentication. It's very easy to set up and very nice to look at if you enable XML output. :)

There are howtos in the Subversion book. Happy reading.

--
Quality, performance, value; you get only two, and you don't always get to pick.
Re:I've tried both Subversion and Arch by Anonymous Coward · 2004-03-08 00:57 · Score: 3, Informative

Problem is, everyone using ssh must log in to the same user account or the permissions get screwed up. So, yes, it's quite hard to share repositories in Subversion.

i do believe that is wrong. using ssh for access the users need to be in the same group, and the repository directory needs to be sticky and writable to that group.

once setup correctly there is no problems with ssh access by multiple users.

Re:Windows server? by cyborch · 2004-03-07 20:55 · Score: 2, Informative

Also, and much more importantly: MSS only does file locking - not merging file content. It can hardly be called a "real" versioning system.

Binary files by ggeens · 2004-03-07 21:26 · Score: 4, Informative

Do developers out there voice the need to store binaries?

There are definitely reasons for storing binary (non-text) files in a version control system:

Images: quite obvious. You want to version all your artwork. For web-based projects, this can be a large part of your system.
External libraries: if you use third-party libraries, it makes sense to store them in the version control system. If you need a particular build, you check out the correct revision. This allows you to build the exact same binary as it was delivered before. (Of course, if you have the sources to the library, you might want to import them into your project. But if you don't change the sources, that might be overkill.)
Compiled files: some people like to store all object files into version control. Again, this allows you to retrieve a specific version faster (no need to recompile). Personally, I would do this only if the compilation takes too much time.
Documentation: whether you use MS Office or OpenOffice.org, documentation will be in a binary format. (OOo uses compressed XML.)
Test data: you might want to version your test cases, and those will consist of binary data.

--
WWTTD?

Re:All your files are belong to us by Adrian · 2004-03-07 21:28 · Score: 2, Informative

With a database, if things were to get corrupted enough (I have no evidence that this happens often, but still...) you are stuck. Just like with the windows registry, where if it gets messed up you lose big.

I worry more about disk crashes and accidental deletions. This is what backups are for ;-)

You can also serialise everything into a fairly human readable file to with svnadmin dump and svnadmin load if you feel you need something non-binary.

Really not a problem as far as I'm concerned.

Re:Live backups, baby by halfnerd · 2004-03-07 22:18 · Score: 2, Informative

A year?

Taken from http://subversion.tigris.org/release-history.html:

Milestone 3 (30 August 2001): Subversion is now self-hosting.

there's over two years between that, and their 1.0.0 release, without *any* data loss.

Re:Windows server? by Adrian · 2004-03-07 22:32 · Score: 3, Informative

In a Microsoft Shop developers will use Microsoft SourceSafe. period

Not in my experience. Some do and some don't. The absence of pain not using VSS can supply compensates for the lack of tool integration. Even MS doesn't use VSS internally ;-)

Subversion doesn't have a chance to compete because there is absolutely no way that it can integrate fully into the .Net development tools the way Microsoft's Own Source Storage Software is designed to do.

I think the people writing the Subway and sourcecross subversion-SCC interfaces might disagree with you there.

Re:Windows server? by spongman · 2004-03-07 23:11 · Score: 4, Informative

I'm runing svnserve on a windows box in a production environment and it works great.

If you want to start svnserve as a windows service, google for srvany.exe, it allows you to run a regular win32 exe as a service.

Re:Windows server? by spongman · 2004-03-07 23:19 · Score: 2, Informative

i should add: I'd definitely recommend installing TortoiseSVN. Having the SVN operations available as a shell extension is a godsend. For example you can use SVN from within any FileOpen dialog. The only thing it's missing is a directory-diff, but on XP you can show the SVN status of files in explorer by configuring the attribute columns in the details view.

Also, I'd recommend downloading perforce's p4win 3-way merge tool. It's a little better than the one built into TortoiseSVN.

Re:how do you migrate? by TwistedSquare · 2004-03-08 01:20 · Score: 2, Informative

I asked this last time subversion appeared on slashdot, you can go see my comment and its helpful reply

Re:how do you migrate? by mgm · 2004-03-08 01:44 · Score: 5, Informative

Yep, Subversion comes with a conversion script, cvs2svn, which is under very active development right now. It's not quite so wonderful at converting CVS repositories with complicated branches, so you'll want to double-check the conversion, but lots of people are reporting success converting huge multi-gig repositories over to Subversion.

Re:how do you migrate? by Moonbird · 2004-03-08 01:44 · Score: 5, Informative

Look here...

--

--
All extremists should be taken out and shot.

Re:Is there demand? by 0x0d0a · 2004-03-08 02:22 · Score: 2, Informative

Rolling back changes without atomic commits is a pain in fucking ass. Have you ever had to do it? You have to track down every file that you changed (somehow... hopefully you can remember), check which version was the version prior to your commit, and get all those versions of files. For example "Okay, I need version 1.7 of foo.c and version 1.8 of barf.c and version 1.13 of foo.h." It's totally annoying.

Take a look at the -D flag. You'll be pleased.

I agree that CVS was almost mind-bogglingly crufty. It may be the single most crufty piece of software that I used regularly. Everything about CVS was defined by the way RCS worked, which just didn't make that much sense for a CVS-like environment.

--
May we never see th

You want RapidSVN by Valdrax · 2004-03-08 07:48 · Score: 2, Informative

That's a pretty good question in my opinion, and TortoiseSVN's Windows shell-extension doesn't cut it. ("-1, Redundant" my ass.) If you're looking for something more like WinCVS, check out RapidSVN.

--
If it's for-profit but free, you're not the customer -- you're the product (e.g., the Slashdot Beta's "audience").

Re:All your files are belong to us by thelenm · 2004-03-08 08:09 · Score: 2, Informative

I'm not sure that being able to edit the ,v files by hand is an advantage of CVS. If anything, I see it as a disadvantage since: a) you're making changes "behind the system's back"; and b) it's easy to screw up.

The face that Subversion uses a Berkeley DB file backend doesn't mean you're hosed in case of problems, especially if you've been backing your data up. You can make a live backup anytime you want - with every commit, if you're paranoid. It's also possible to dump any or all commits to a human-readable format that can also be used to restore. But usually you won't even have to muck around with restoring from backup - if the repository gets wedged somehow, try 'svnadmin recover' and it will usually solve the problem.

There's a nice chapter in the Subversion online book that deals with all this stuff.

--
Use Ctrl-C instead of ESC in Vim!

Re:All your files are belong to us by empty · 2004-03-08 11:06 · Score: 4, Informative

...It is ok until it gets corrupted, and then you are hosed. Keeping everything in readable files CVS-style is a BIG plus point once you've been in that situation...
...I am also wary of database-based products which are tied to one particular database...

Subversion has a utility that might assuage your fears:

svnadmin dump

The dump command can do a (full or incremental) dump of your repository such that you can completely recreate its history. If you use this command for backup, you will be assured that you don't lose any data.

As a bonus, the dump file is human readable, so there should be no fear of losing data to an inscrutable binary file.

Slashdot Mirror

Ease Into Subversion From CVS

31 of 130 comments (clear)