Slashdot Mirror


Media Cataloging Software?

Rich0 asks: "I have a growing pile of CDs/DVDs holding hundreds of GB of files. I would like a linux-compatible software solution to cataloging and searching these disks. Lots of solutions exist for music/video, but not so many for files. The software should have the ability to easily scan disks: pop in disc; software reads disc; software prompts for a name (with something sensible defaulted); software ejects disc; software tells me what if any label to write/apply to the disc; and software is ready for the next disc. I've seen one or two packages out there but they usually require lots of manual disk labelling, or their search capabilities are limited. Windows-only software won't be of much use to me. What are others using to manage their media collections?"

45 comments

  1. The answer always is... by TheSHAD0W · · Score: 3, Insightful

    ...copy them all to one or two terabyte HDs and be done with it.

    1. Re:The answer always is... by Anonymous Coward · · Score: 0

      OK, so his next question would be...

      "I have a growing pile of 500G hDD's holding hundreds of GB of files. I would like a linux-compatible software solution to cataloging and searching these disks. Lots of solutions exist for music/video, but not so many for files. The software should have the ability to easily scan disks..."

      The problem is the same, regardless of scale. If i had 25 1TB USB2 drives I would have the same problem...

      Still waiting for a real answer, if any one has one.

    2. Re:The answer always is... by TheSHAD0W · · Score: 2, Insightful

      If that were his problem then yes, it'd be a whole lot different. But that's not his problem.

      I've run into variations of this time and time again, people with floppies, people with CDs and Zip drives, with a whole library of them, and needing to get them organized. People going out and buying a CD-R jukebox with 7 CD drives in it to stick on their server, when it'd be cheaper to copy the data to a hard drive and serve it that way.

      When you have a bunch of removable media you need to archive, in the last decade it has almost always been less expensive to copy that data to a big volume than to try and organize it - especially when you consider the man-hours lost to indexing and to finding the data you want. Add in the convenience factor and it's a no-brainer.

    3. Re:The answer always is... by Anonymous Coward · · Score: 0

      It doesn't have to cost anything when you start right with a proper system. Let's take a photographer. He can classify pictures by date, client ID and shoot, which is what they usually do. When a client comes asking for pictures from 5 years ago, the photographer can bring up the location of the pictures created for that client and narrow it down from there. I do this for every kind of file I create and it only takes a few seconds each time to find a file. Proper database and processes help a lot.

    4. Re:The answer always is... by Anonymous Coward · · Score: 0

      Better indexing and searching would still be nice.

    5. Re:The answer always is... by brokenin2 · · Score: 1

      Once it's on the big disk, spend about $100 on an xbox, and $10 on a duox2gs mod chip. Once it's modded, install the latest XBMC, and use it's media library management database to organize everything for you.

    6. Re:The answer always is... by J0nne · · Score: 1

      I agree with this. It's way easier to just put everything on a fileserver (or a NAS, they run Linux too but make less noise), and use locate/Beagle to find your stuff.

      Even if you manage to catalog all the data on your cd's/dvd's, you'll have to keep working on keeping it updated, and it still doesn't allow you to figure out where you left your stuff anyway (did a friend borrow it? did it fall under the sofa?). You can't use locate to find physical items yet, so even if you've catalogued everything, what do you win with it?

    7. Re:The answer always is... by Anonymous Coward · · Score: 0

      I do this for every kind of file I create and it only takes a few seconds each time to find a file. Proper database and processes help a lot.

      Uh, yeah. Exactly WHERE and HOW do you create these indexes? Are you writing them down on a yellow legal pad? Do you have a server running MySQL and a local, custom-coded client that you use? Perhaps you coded your own file system that supports all of this in custom tags?

      The issue isn't WHAT to do. It's how in the hell do you do it?

  2. typo? by Anonymous Coward · · Score: 0

    I have a growing pile of CDs/DVDs holding hundreds of GB of pr0n.

    There. Corrected typo in story.

    1. Re:typo? by ds_job · · Score: 0, Troll

      I have a growing pile of CDs/DVDs holding hundreds of GB of pr0n.

      There. Corrected typo in story.

      Now this question is applicable to me. I'll read the rest of the replies and mentally substitute "Rock" and "Country and Western" for "Straight" and "Water Games" etc.
  3. Just get 4 500GB harddisks by kosmosik · · Score: 1, Offtopic

    Man you use CDs? Really?

    You've mentioned hundreds of gigabytes - that transfers to 999GB of data - this is not THAT much. Actually right now you can get like a PC (with lowest spec - these are not important here) with 4 disks of 500GB capacity each. Use SATA2 devices - they are fastest in cheap range. When you'll get your 4 drives put them into PC. Install OpenSolaris and spawn them into Z-RAID and ZFS - you will get yourself quite cheap storage.

    You will get 1TB of redundant and self-healing data storage. Comparing to bunch of CDs you have much faster access time - like disk seek vs. your hand seeking the CD which and you don't even know which one you seek. :)

    Then transfer all your data to this system. Use some open source indexing system on it - if you want to index the contents. But if you only need to search for filename just use "find" command.

    1. Re:Just get 4 500GB harddisks by Anonymous Coward · · Score: 1

      I run Solaris on my file server too but that is not a solution for everyone and I don't consider that safe enough. I don't know why, but everytime the topic of backups comes up, there are always people who come and write stuff about DVD/CD/tapes being useless... bla bla bla.

      I have 10 years old CD's that are perfect when scanned. I don't trust HD's as much as I've had some go bad just sitting on a shelf. Not that they've gone completely bad but half a dozen corrupted files among a million is just not good enough.

      DVD's are still cheaper than HDD's. $20 for a spindle of 100 is a lot cheaper than the price one 500 GB drive.

    2. Re:Just get 4 500GB harddisks by AvitarX · · Score: 1

      Yes, but you copy CDs/DVDs to drives in folder with a number, then right the number on the CD/DVD. You hard drive index file now doubles as a CD/DVD index file. That must be backed up.

      Put in binders, put binders at someone elses house, in fire box, or even a safty deposit box if you care that much.

      Your CDs will last longer not being slid in and out of a binder, or take up less space not being in a jewel case. Your hard drive is now just a very fancy index for your archive (in that it has the full information of every file) and you hard drive index becomes what the person is looking for.

      I personally like cdcat. It will cache .NFO files and works well. Though it probably won't scale too gracefully that big (the database is XML and must be read entirly into memmory on startup).

      --
      Wow, sent an e-mail as suggested when clicking on "use classic" banner, and got a fast response that addressed my msg
    3. Re:Just get 4 500GB harddisks by Anonymous Coward · · Score: 0

      I run Solaris on my file server too It must be so exciting for the two of you now that you found eachother.

    4. Re:Just get 4 500GB harddisks by nurb432 · · Score: 1

      And if you trust mechanical harddrives without a backup, you are foolish.

      Hardware failures, viruses. I would rather not risk it.

      --
      ---- Booth was a patriot ----
    5. Re:Just get 4 500GB harddisks by kosmosik · · Score: 1

      > And if you trust mechanical harddrives without a backup, you are foolish.

      Who said about not making backups? I did? No.

      > Hardware failures, viruses. I would rather not risk it.

      Yeah because CDs that you throw around are that much safer without backup. :)

    6. Re:Just get 4 500GB harddisks by nurb432 · · Score: 1

      Myself, i have 2 copies of important data DVD's/CD's. One offsite, and one that i 'use'.

      If the 'on site' copy acts strange in the least during a restore, then the off site one gets duplicated.

      And while you may not have discounted the use of backups WITH the hardrives, many people really do think they don't have to have backups. " i have raid... "

      --
      ---- Booth was a patriot ----
    7. Re:Just get 4 500GB harddisks by kosmosik · · Score: 1

      > Myself, i have 2 copies of important data DVD's/CD's. One offsite,
      > and one that i 'use'.

      And you have no clue if the offsite copy still works.

      > If the 'on site' copy acts strange in the least during a restore, then
      > the off site one gets duplicated.

      Yes but the offsite one may not work - do you always check them if they are OK? Filesystem like ZFS has self-healing capabilities so paired with redundant data storage (meaning the data is copied over two discs) it can detect faults "on the fly" and fix them.

      But here I am talking about storage for backup you would probably want to copy important data to tape. And still it is not enough - you need to test your backup procedures sometimes. Like simulation - check if it works. It is hard to check 2000 CDs if they still work or you are left with only one copy. With harddisks it is not so hard since you don't need to swap the disks every few minutes and you can let the process to run automagically.

      > And while you may not have discounted the use of backups
      > WITH the hardrives, many people really do think they don't
      > have to have backups. " i have raid... "

      I didn't said that RAID is a backup. And also the question/problem we are trying to solve here in this topic IS NOT THE BACKUP issue.

      The person is asking about indexing a large amount of data. Not about backing them up.

      My point is that you can not efectively search/index/use a large amount (be it 1TB like OP states) of data on CDs. Well - just imagine - 1TB is like ~1500 CDs - can you even imagine how 1500 CDs look like? I can't. And you have proposed to copy them so now we have 3000 CDs - are they properly labeled? Where do you store them? How can you access the 1042nd disc? How do you know that gile foo.baz is on 301th disc and its copy is on 894th?

      So it is absurd to keep 1TB of data on CDs.

    8. Re:Just get 4 500GB harddisks by nurb432 · · Score: 1

      1 - i use DVD's for large amounts of data. 2000CDs is silly. Tho, when that was all that was available, 300+ wasnt uncommon. ( as was thousands of tapes long before that. Remember the old IBM style silos the size of buildings? I do. )
      2 - Yes, off-sites are verifed. ( on a regular schedule )
      3 - There is also a tape that goes out to different site once a month. But its never been used, thats the last case emergency.
      4 - Yup properly labeled and cataloged. ( in house softare, so it wouldn't have helped the original parent of the story )
      5 - off-site is climate controlled and secured ( as any good off-site storage should be )

      And no YOU didnt say raid 5, i took that leap as that was the attitude i was picking up over other comments ( and its a common flaw for a lot of people ). I got the impression from the story this question was about indexing backups.

      But, if as you say, we are OT, then we might as well drop the thread ..

      --
      ---- Booth was a patriot ----
  4. Pirem by Anonymous Coward · · Score: 0

    I used Pirem CD Cataloguer under Windows before but it's not updated anymore. I still haven't found an alternative that had all the features without being bulky (open source or closed source).

  5. Wine by thegrassyknowl · · Score: 0

    You need to try wine. There are three alternatives; red, white and RPM. Depending on the time of year either is acceptable. After several Wines you'll not care about your growing pile of DVDs and be much less stressed. If all that fails Wine will also allow EvilOS software to run on your X86 Linux machine. If you combine Wine in red or white with wine in RPM then you'll not be aware of the shortcomings.

    Also, seconded for copying everything to a huge disk and being done with it. If it's that important to you keep it online and backed up to a huge tape library for the (un)likely event that the disks crash.

    --
    I drink to make other people interesting!
    1. Re:Wine by Corwn+of+Amber · · Score: 1

      ... for *when* the disks *will* crash, you mean. They always do, at some point.

      --
      Making laws based on opinions that stem up from false informations leads to witch hunts.
  6. The only by El+Lobo · · Score: 0
    The only perfect solution for me was the one that I made myself. I too was searching for a catalog to store my discs, books, DVDs, et all. I reviewed 12 programs for Windows (I don't use Linuzzzzz and don't plan to use it) and none of them hwas good enough for me. So I set up a database with some tables for my media, created the relations betweeen tables, and then I created a web end for my IIS using ASP.NET.

    Now I can from any computer in the world access my database of media items, add, modify and delete them. I have even created a service that consults the free music database to get the song titles when I enter an album title. Yes, it takes some work, but believe me , it's a LOT more satisfying than using a 3rd party software with a lot of features I don't need. And acceesing the database via web i SWEEET.

    --
    It's time to realise that Abble's products are the biggest abomination these days. Just say NO to the dumb iAbble way!!
    1. Re:The only by Anonymous Coward · · Score: 0

      So did you have any helpful, relevant comments to make or was this post just swinging your dick around?

    2. Re:The only by Anonymous Coward · · Score: 0

      Give the guy a break. Maybe he has a small dick so, yes, this was his attempt at a helpful, relevant comment.

  7. Don't help the pack rat by QuantumG · · Score: 2, Funny

    He needs to learn to throw things away.

    --
    How we know is more important than what we know.
    1. Re:Don't help the pack rat by bladesjester · · Score: 1

      Right. Because nobody ever has a legitimate reason for having a few hundred GB worth of data and files.

      Keep telling yourself that.

      --
      Everything I need to know I learned by killing smart people and eating their brains.
    2. Re:Don't help the pack rat by Anonymous Coward · · Score: 0

      Prosecution: So, Mr. Rich0, where *are* these copious financial records which support your claim that you were framed for this Enron-style fraud?

      Rich0: Er, some fool on Slashdot told me to throw them away.

      Judge: <reaches for black cap>

  8. just use find by cstdenis · · Score: 1

    find /mount/cdrom > cd_name.txt

    wrap it in a shell script if you want to be prompted.

    or under windows: dir /s /b e:\ > cd_name.txt

    It's not that hard, you don't really need any special software.

    --
    1984 was not supposed to be an instruction manual.
    1. Re:just use find by Simon80 · · Score: 1

      Hmm, but imagine if someone automated this, and also stored some metadata, like mimetype and last modified date, etc, so you only have to swap disks when it tells you to? That would be pretty convenient.

  9. Seconded by Mendy · · Score: 1

    While I realise this isn't answering the question based on my experiences with disk catalog software I would say that unless you are very short of money copying the drives to hard disk is the way forward.

    The problems I found were...
    If you have been unorganised when assembling these CDs full of files then a catalog of the contents doesn't help much. If you don't know a filename (and the filename is obfuscated) then you're going to have a hard time finding it.
    If your search brings back several potential results then having to insert all the CDs to find the correct one, rather than opening or previewing from the network is very inconvenient.
    If you decide you want to make things easier to find and do some organising you can't do this while you're still storing things on the same cluttered media.

    I copied everything to an array and it's much better. As a bonus I'm able to do backups of the files and am not at the mercy of bitrot on the CDs. I do still keep the occasional CD or DVD but only where there are only one or two things on the disk and so the disk can manually be labeled with the contents.

  10. You forgot the sensible default! by Anonymous Coward · · Score: 2, Funny

    #!/bin/sh
    while
    do
    # software prompts for a name
    echo "insert next disk and type in a name"
    read x
    if [[ $x == "" ]]
    then
    x=midget # sensible default
    fi
    # software reads disc
    ls /dev/cd > pr0n.$x
    # software ejects disc eject
    # software tells you what if any label to write/apply to the disc
    echo "Write '$x pr0n' on the ejected disk."
    # software is ready for the next disc
    done

    1. Re:You forgot the sensible default! by Simon80 · · Score: 1

      hahahaha, thanks for a good laugh, I'm still laughing as I type this.

  11. CD Offline by Anonymous Coward · · Score: 0

    I used CD Offline, a Windows based freeware for quite some time.
    http://cdoffline.webz.cz/download/index.html

    It is not polished and even has some spelling mistakes in the UI. But it does what it is supposed to do and I have not found any other alternatives. I have almost completely moved to Linux these days but manage to run it fine with Wine. I am looking for native Linux alternatives though.

    1. Re:CD Offline by jma05 · · Score: 1

      Never mind, I just found

      1. cdcat
      2. gwhere
      3. gtktalog
      4. katalog

      already.

    2. Re:CD Offline by szo · · Score: 1

      gtktalog has been abandoned for years now, katalog is not there yet :(

      --
      Red Leader Standing By!
    3. Re:CD Offline by Ashrain · · Score: 1

      Maybe CDCollect (http://cdcollect.sourceforge.net/ - Gnome) or cdcat (http://cdcat.sourceforge.net/ - Qt) as they both feature searching, though I don't know whether they'll fit for you (and I didn't test them extensively).

      I'm looking for one too, that truly supports Unicode filenames on Win32, and still searching...

  12. Translation by Anonymous Coward · · Score: 2, Funny

    Rich0 asks: "I have a growing pile of CDs/DVDs holding hundreds of GB of porn. I would like a linux-compatible software solution to cataloging and searching this porn. Lots of solutions exist for music/video, but not so many for porn. The software should have the ability to easily scan disks: pop in disc; software reads disc; software prompts for a name (with something sensible defaulted [porn]); software ejects disc; software tells me what if any label to write/apply to the disc ("Now That's What I Call Porn 56"); and software is ready for the next disc. I've seen one or two packages out there but they usually require lots of manual disk labelling, or their search capabilities are limited. Windows-only software won't be of much use to me. What are others using to manage their porn collections?"

  13. MediaPortal is what you want by Shabbs · · Score: 1, Offtopic

    It ain't Linux, but it's a very good piece of software that will do everything you want, and more.

    http://sourceforge.net/projects/mediaportal/

    If you are really set on Linux, XBMC is being ported to Linux but you will have to wait a while.

    http://www.xboxmediacenter.com/wiki/?title=Linux_p ort_project

    I run an old Gen 1 XBOX modded with XBMC and it does everying I need for CD, DVD, media management. The only draw back is the low end hardware of the XBOX. There are limitations with running HD video etc... It may address your needs too.

    Cheers.

    --
    Mark
  14. gtktalog by fwarren · · Score: 2, Interesting

    It works for me, I have been using it for at least 4 or 5 years.

    The nice thing about it, is on debian based distros it is never any farther away than apt-get install gtktalog

    --
    vi + /etc over regedit any day of the week.
  15. Windows Only isnt an option? by nurb432 · · Score: 0

    Ever hear of WINE?

    --
    ---- Booth was a patriot ----
  16. unlimited size / client-server / multi-platform by Anonymous Coward · · Score: 0

    I have built my own:

    http://www.os10000.net/fs/java/app_discindexer/ind ex.html

    * backed by relational database management system (Apache DERBY RDMS)
    * java (multi-platform)
    * client-server (index several computers at once)

    I hope you like it.

    Oliver

  17. dont do this by thinsoldier · · Score: 1

    what i started doing was I got this little utility that generated a text file of all filenames/paths on a dvd.

    I've got apache, mysql and php running

    The utility saves the text files to a folder under the web servers documents and i run a php script that renames the text files with a random number/letter combo, enters every line of text as a unique record in the database parented to the unique name of the dvd itself and I write that unique string on the dvd label.

    Its pretty straight forward and only took 2 weekends to get 75% of my stuff labelled.

    so now I can just use some basic sql queries in phpmyadmin to find out what which has what I'm looking for.

    But in the future I'm just going to buy a bunch of external drives and hook them up to a mac mini.

  18. What I do by socz · · Score: 1

    My system isn't a system at all. Because of the LARGE amount of legal optical media I own and my "custom" discs, i've run into this problem lot's of times!

    Usually, i start sorting things out on my HDD. Once that is at capacity, i've moved them on to CD's. Now with DVDs cheap as dirt, i've been using those for a while. My problem has always been the different DVDs.

    So what I do now, is have about 5 logictech cases of 320/420 disc capacity. Each one is broken down into a "theme." I have 3 other smaller cases, maybe 120 disc capacity. One is for my Mixed Martial Arts, another is for my "video," another for my music videos, another for my "audio" and books on tape, another for software. Then in my smaller one's i have my necessary tools -- software i throw on every install i do.

    That doesn't sound very helpful, but it really is! Each disc is labeled by folder name/type and number. So for example, MMA d1 would have say, early pride fighting vol 1. MMA d2 would have early pride fighting, vol 2. MMA d14 could have pride 26.

    I try to keep these in order, because it's much easier after i create a list of what each folder contains to keep track of. I've seen list managers, but none do what i want, and most are for HDD solutions anyways. This allows me to constantly add to my collection easily without having to make it a big deal. Stick disc in, ls and print to file, append to "master list" and voila! It's updated.

    When i am looking for something in particular, i just search for the title and it shows up, telling me what folder/volume it's located on. It may not be pretty, but it works really well, for me.

    At one point i was making a website i could share with "friends" so they could see what i had, if it was of any use to them, but it was very time consuming. So i gave it up and never found a better solution.

    Right now i'm at 750GB on my main system, and under 500 on my bsd box. I've been burning to dvd like crazy because i like to binge on the news groups! And as of late i've been into torrents because i found some interesting Japanese animation (Go Baki and Hajimi no Ippo!) So they'll be added to my collection soon.

    Soon, i hope to have the time and considerable money to buy some large HHD's, a low watt-efficient cpu/PS setup that'll allow me to throw everything on a few TB HDD's and allow me to have it directly hooked up to my tv.


    Good luck!

    --
    My abilities are only limited by my imagination