Internet Book Database?

To keep track of my books? by Schwamm · 2002-04-16 10:19 · Score: 2, Informative

I use a bookcase...

What would be the point of a book database? The databases for DVDs and CDs allow for players on a machine to spit out relevant track/title information. I'm having a hard time coming up with a reason to have a book database.

Re:To keep track of my books? by Lemmy+Caution · 2002-04-16 10:24 · Score: 5, Insightful

In conjunction with a barcode scanner/CueCat, it could make it a lot easier to start private libraries. I have a couple thousand books, about a hundred of which I lend out at any given time. Be nice to make a catalog, and a freely available cddb-like ISBN-based system would make that a lot easier.
Re:To keep track of my books? by Venotar · 2002-04-16 10:50 · Score: 2, Informative

ISBN based may be innefficient. If you're library has any real depth you run into the same problem that prevents used bookstores from using a similar system: ISBN's were only widely implemented about 20 years ago(some margin for error there - I don't recall the exact date). Title/Author/Publisher is the only way to id books predating ISBN's.
Re:To keep track of my books? by Lemmy+Caution · 2002-04-16 10:56 · Score: 3, Interesting

Fair enough, the key value would be T/A/P/E (Edition information) + ISDN - but a system that returned T/A/P for ISDN when it's there and vice versa (or null when it isn't) would still be very, very helpful. Reducing data input by the percentage of books in a library that are 20 years old is still a gain.
Re:To keep track of my books? by rsclient · 2002-04-16 10:58 · Score: 1

For those of us who are computer history book geeks, knowing which books refer to which computers would be a big help. For example, if I'm interested in the "System/23", which book actually says anything about it? And at what level is the information? Really technical? Totally dishes the politics? Pro-system/23, or does it mention the flaws? Any pictures?

--
Want a sig like mine? Join ACM's SigSig today!
Re:To keep track of my books? by Jonathan · 2002-04-16 12:01 · Score: 3, Insightful

Well, if you only own a couple hundred books or so, keeping track of them isn't hard. If you own thousands, having some system is needed. I have about as many bookshelves as I have wallspace, and even then books are stacked 2 or three deep on them.
Re:To keep track of my books? by LazyBoy · 2002-04-16 14:03 · Score: 1

I used CueCat to bounce ISBN #s off of the B&N database and make a list of a 50-100 books that I was trying to give away. It was some simple mods of an available Perl based CueCat ISBN/UPC program. (I forget the name.)
The problem is that not all books have bar codes. Older ones don't. Not even all newer ones do.

--
If Chaos Theory has taught us anything, it's that we must kill all the butterflies.
Re:To keep track of my books? by the_radix · 2002-04-16 14:22 · Score: 2

Works nicely, I've considered using that for my own book collection. But, how would you catalog the enormous amount of books that do not have ISBNs? I've got books from the eighteenth and nineteenth centuries, and I can't assign numbers to them quite so easily. So, while this database would be great, you would need a separate primary key from the ISBN.

Granted, almost any book today has an ISBN. But, there's a huge chunk of books that do not. And having another field in the database (for unique IDs of books) bloats up the necessary size. And, by not requiring an ISBN, you allow duplicate entries far too easily. Don't get me wrong, I would love something like this. But, it's more difficult to implement than a unified slashdot opinion.

--
This .sig is either false or a paradox.
Re:To keep track of my books? by Tuonenkielo · 2002-04-17 00:50 · Score: 1

Add to this that most of us are likely to possess other printed material than just books. Like periodicals for our hobbies/technical journals/whatever papers. They have their ISBN-comparable numbering, but they don't have ISSUE numbers in the code (Don't remember right now what it's named like, but I am sure it didn't have issue number) so you'd need to 'bind' stuff.

Of course, with stuff like these, it'd be nice to be able to get the list of articles in the issue on the DB as well.Wouldmake it easier to dig up stuff. Ditto for various non-prose books and their ToC's...
Re:To keep track of my books? by JohnMunsch · 2002-04-17 02:06 · Score: 1

A real, shared book database offers opportunities beyond simply the book titles, isbn numbers, etc.

One thing that bothers me endlessly is that there is no shared database of reviews available. Everybody who wants to review a book goes to where all the reviews are already (i.e. Amazon). If you go to a third party to look at books to see if prices might be cheaper (e.g. Half.com, BookPool) you can't get any reviews for anything. But if there was a shared resource that all those book sellers could access, ala the CDDB, then you could have reviews available at any of those sites. And they could pay a small fee to get access to the data on a commercial basis.

--
Sigs are for people who started using the net _after_ '86.

Have you tried the Dewey Decimal System? by Ummagumma · 2002-04-16 10:20 · Score: 3, Informative

I put all my books in order on my shelves, and make 3.5" index cards for each, organized by the Dewey Decimal System.

That way, when the power goes out, I can still find the right book by candlelight. ;)

--
"The natural progress of things is for liberty to yield and government to gain ground." - Thomas Jefferson

Re:Have you tried the Dewey Decimal System? by Schwamm · 2002-04-16 10:26 · Score: 2, Interesting

I've actually arranged my books based on the Library of Congress call numbers. It worked rather nicely. Of course, it was a pain in the butt to do it at first. For some reason, you only ever come up with an idea for organizing something *after* you have too many to deal with quickly.
Re:Have you tried the Dewey Decimal System? by parc · 2002-04-16 11:19 · Score: 3, Informative

I'm in the process of doing the same thing, but I hit on something nice: the computer's great at putting things in order. So I scan or type in the ISBN, a perl script grabs the books information from the LOC(via z3950), and when I'm done, the system spits out a list of books in LOC order with the Title/Author next to it. As I go down the list of books, it naturally gets put in the right order on the shelf.

At least that's the theory. I'm over 200 books so far, and I've just finished about 1/3 of the books.

This is gonna take some time.
Re:Have you tried the Dewey Decimal System? by danny · 2002-04-16 13:14 · Score: 2

Any chance you could make that perl script available somewhere?
Danny.

--
I have written over 900 book reviews

number of pages! by edrugtrader · 2002-04-16 10:21 · Score: 1, Offtopic

yes... i would definately put that in...

.............. I NEVER LEARNED TO READ!!

nevermind.

--
MARIJUANA, SHROOMS, X: ONLINE?! - E

Wrote my own by smoore · 2002-04-16 10:21 · Score: 2, Interesting

Wrote my own Mysql/PHP. Not very good just enough to keep track of them. http://www.teuse.net/books

--
Shawn Moore http://www.teuse.net

Re:Wrote my own by ChazeFroy · 2002-04-16 13:42 · Score: 4, Interesting

Carnegie Mellon University has been working on the ulib project for a number of years now.

This is also a shameless plug for one of my IRC friends responsible for this. Hi Latinum.
Re:Wrote my own by ShdwStkr · 2002-04-17 00:34 · Score: 2

Yeah, me too.
Re:Wrote my own by ijediknight · 2002-04-17 11:51 · Score: 1

I wrote my own with a friend. Written in Java and IMDB (when it was free--need to migrate to Hypersonic one of these days). It also tracks movies I own and any home inventory stuff I want to as well. My friend did the movie stuff and I did the book stuff. Has a section for comments, notes, authors, editors, etc. You can click on a button and have it display the book info in amazon or the library of congress.

Format issue by Metropolitan · 2002-04-16 10:21 · Score: 2, Insightful

The most likely reason for this is, at least so far, the difference in format between books and digital discs of any kind. It's very easy and direct to examine the structure of a disc, but until books become digital as well this won't be as simple.

Books In Print is a great resource, if you have access to it. Amazon works well as a poor man's version.

What would be the point? by tongue · 2002-04-16 10:21 · Score: 2, Interesting

I fail to see the usefulness of such a database, outside the traditional search engine uses. CDDB and freedb both serve a function in that they identify some electronic data for me so I don't have to--a CD i've inserted into my drive. DVD Profiler presumably performs a similar function (I've not used it so I can't say for certain). But books don't have an analogue in this area. If you had an electronic version of a book, presumably it would also have whatever index you needed with it. And if you wanted an index across titles, you would use some search engine like google. But there aren't enough of these kinds of titles to warrant such an application, and i'm afraid I don't see the advent of that time approaching. Between incompatible proprietary formats and the DMCA, I think it'll be quite a long time before we have a standard "book cd" format that is used in generic book appliances a la' Rocketbook.

Re:What would be the point? by jcwren · 2002-04-16 10:25 · Score: 4, Insightful

My use for such a database is partly for to make sure I haven't already read something (I just love buying the same book twice because the cover changed), and for insurance reasons.

I want to be able to use a barcode scanner (or even type the ISBN by hand), and pull all the relevant information from a DB to my local machine. This is exactly the point of CDDB, as I see it.

If I don't have to enter all the information by hand for a CD, why should I have to do it for a book?

--jcwren (owner of about 2700 books)
Re:What would be the point? by hoggoth · 2002-04-16 10:39 · Score: 5, Informative

Then just stick the ISBN numbers into MySQL, an Excel spreadsheet, or an Access database.
Then write a quicky Perl script to scan through the records and any that don't have all the information filled in, go scrub it off of Amazon's web site.
I've already written several Perl scripts that scrub data from Amazon. It's pretty simple.

(hint:

use LWP::Simple;
$page = get http://www.amazon.com/exec/obidos/ASIN/$isbn;
($d esc, $pgs, $price, $other) = $page =~ /use regex to find (desc) and (pgs) and (price) and (other) usefull stuff/;

)

--
- For the complete works of Shakespeare: cat /dev/random (may take some time)
Re:What would be the point? by sdpinpdx · 2002-04-16 11:05 · Score: 1

For insurance records, certainly.

Also, once you had the database, you (or anyone on the net) could add stuff like scanned images or ASCII versions of the TOC and bibiography (so you can check from work whether that book at home is worth driving back for at lunch), reviews, recomendations for related books, links to or copies of related papers, etc. This makes it feasible to automatically check for updated editions, or anything that refers to it (literary criticism, related work, etc.). It then becomes a tool for finding more relevant books, articles, etc.
Re:What would be the point? by graveyhead · 2002-04-16 11:09 · Score: 2

I just love buying the same book twice because the cover changed
Eew, even worse about six years ago I bought two titles from a certain unnamed snail-mail based bookseller, one titled "Power Macintosh Programming Starter Kit". I forget the title of the other one, 'cause I gave it to a friend a long time ago, but the topic was similar, Power Macintosh programming around the time when the PowerPC was first released. Here's the kicker: the content of the books was identical, although they listed different authors! Access to a database like this would've saved me $29.95 + shipping. At least I had a friend who was interested in the same topic so I could at least give it to someone as a present.

--
std::disclaimer<std::legalese> sig=new std::disclaimer; sig->dump(); delete sig;
Re:What would be the point? by tongue · 2002-04-16 11:14 · Score: 2

Ok, I'll give you the ISBN/bar code bit, although I think its usefulness is probably hampered by the lack of widespread bar-code scanners; anybody got a spare cuecat laying around?

anybody think there'd be copyright issues in snagging the info from amazon?
Re:What would be the point? by geekoid · 2002-04-16 11:16 · Score: 2

personal libbraries.
I owuld love to scan my books barcode, and have all the relevant information out into a local database. Then I can track my books, use a wireless connection to log on at home and see if I already own a book I fine in a local used book store.
I have over 1000 books, this could be tremendously useful. My wife could use it a a quick and effecient means to see if I already have a collector book.

--
The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
Re:What would be the point? by rmohr02 · 2002-04-16 12:20 · Score: 4, Insightful

So you also fail to see the usefulness of the Internet Movie Database? I, personally, visit the IMDb almost as much as I visit /.
Re:What would be the point? by Anonymous Coward · 2002-04-16 16:11 · Score: 1

I would love a good book database for checking out all of the books by a favorite author reading short reviews and getting recommendations of similar books I might like - just like allmovie.com for films / actors or allmusic.com for records / musicians both of which I visit very regularly.

Amazon isn't bad but it's not much good for long out of print stuff.

allbooks.com anyone?? I'd envisage it just covering novels rather than reference.
Re:What would be the point? by Tuonenkielo · 2002-04-17 00:58 · Score: 1

Amazon also is somewhat limited with the more esoteric literature. Like in some of the smaller European languages... And don't tell me people in USA read stuff only in english. I happen to know few people from the States who actually enjoy learning new languages, as unlikely as that sounds...
Re:What would be the point? by n-baxley · 2002-04-17 01:45 · Score: 2

While it may seem like a good idea to scam information off of Amazon, one day that will catch up with you. If Amazon changes their page layout, you're screwed, if Amazon changes their URL format, you're screwed, if Amazon finds out what you're doing, and if you make any money off it, you're sued. If we want a book database that is to be used widely , it must be something that is clear of the pitfalls of stealing from someone else.

Yikes, what does this say about the state of MP3s?

--

THIS SPACE FOR RENT
Re:What would be the point? by hoggoth · 2002-04-17 03:14 · Score: 2

He didn't ask for "a book database that is widely used". He simply wanted to enter his 2500+ books into a database without typing in lots of information. If Amazon changes their format he won't be "screwed", he'll just update his Perl script.

A publically available book database is an entirely different matter.

--
- For the complete works of Shakespeare: cat /dev/random (may take some time)
Re:What would be the point? by bubonick · 2002-04-17 16:17 · Score: 1

I set up a mySQL database much like you mentioned. I am not all that great wiuth perl and would really appreciate if you open source me a copy of the parsing code that extracts author, description, en carta from the amazon.com website.
Re:What would be the point? by hoggoth · 2002-04-18 02:18 · Score: 2

Well, you don't have an email address and the lameness filter, in all it's lameness, won't let me post REGEX or Perl snippets.

--
- For the complete works of Shakespeare: cat /dev/random (may take some time)

you may be asking the wrong crowd... by happyclam · 2002-04-16 10:22 · Score: 2, Funny

While I understand some /. posters actually do know how to read, I suspect that the closest they get to a book is the title, which tells them all they need to know in order to hold definitive opinions on the book's author, subject, publisher, and political position.

All kidding aside, the resource my wife regularly uses is google to find pages regarding books she reads for her book groups.

I would love to see an internet book database, though I know of none. In fact, I would be interested in contributing to such a project.

--
He looked at me and said, "Kid, we don't like your kind, and we're gonna send your fingerprints off to Washington."

Re:you may be asking the wrong crowd... by david614 · 2002-04-17 12:51 · Score: 1

I agree with this. I would be willing to contribute to an internet book database. My wife and I have significant science fiction holdings, and could probably populate entire sub-genres.

Cyberpunk, naturally.

D

--
ELITISM: It's always lonely at the top. Uninvited company is rarely welcome.

The Example of CDDB by Alien54 · 2002-04-16 10:22 · Score: 5, Interesting

I am concerned about the prior example of the CDDB, where all of these people contributed tyo this great resource, only to have the resource get sold off and commercialized and turned into a tool to track users, etc.

So While I really like the idea of the database, I do not like the possibility of the thievery of honest work by generous people.

Is there someway so that this could be donated into the public domain or something from day one?

(just trying to wrap my mushy mind around this for the moment.)

--
"It is a greater offense to steal men's labor, than their clothes"

Re:The Example of CDDB by Xerithane · 2002-04-16 11:42 · Score: 2

A great point, and I think that freeDB is doing a fantastic job.

I'm thinking that if someone took the freeDB model and translated it to handle books or movies, or whatever, it would be a very good thing. One thing that I have wanted for ever is similar to MoodLogic, to categorize and index MP3s. Having that open source, coupling with a database would rock. Even with books, it's very useful.

Soon as my life calms down with work (Oh in say 3 years) that will be the first open source project I start if there isn't already one. Something to manage movies, mp3s, cds, and books. My goal is to have it tie into robotics so it links to a DVD changer to play the song, or loads the mp3 on the computer, or pulls the book out of a bookshelf and monitors whats out on loan, etc.

Just my little perfect world.. i'm also building a router in a tree pot so take my suggestions with a large grain of salt :)

--
Dacels Jewelers can't be trusted.
Re:The Example of CDDB by ewhac · 2002-04-16 11:48 · Score: 3, Interesting

Form a 501(c)(3) non-profit corporation that owns and operates the database. Draw up the corporate charter such that the database must be maintained for the sole benefit of the community, that users' activity will never be tracked, etc.
My (limited) understanding is that the law makes 501(c)(3) charters very hard to change. As such, new management can't just waltz in and "sell out" the company and its resources.
The only remaining danger is that the organization becomes politically influential and either leverages that influence to the detriment of the community, or itself comes under the influence of corrupt organizations.
Schwab

--
Editor, A1-AAA AmeriCaptions
Re:The Example of CDDB by vanyel · 2002-04-16 12:28 · Score: 1

I am concerned about the prior example of the CDDB, where all of these people contributed tyo this great resource, only to have the resource get sold off and commercialized
How do you propose paying for the hosting and bandwidth something like this would consume if it got popular, as cddb did?
Re:The Example of CDDB by mlinksva · 2002-04-16 14:26 · Score: 2

A few categorization tools in development are using the MusicBrainz catalog. Bitzi (warning: I'm an interested party) is attempting to create an open metadata catalog for all content that can be encoded in files.
Re:The Example of CDDB by Anonymous Coward · 2002-04-17 04:59 · Score: 1, Informative

I am also working on an open source project for a library catalog. I'm still in the investigation stages, although I've been mapping out the database in some early prototyping. This time I'm not taking it lightly.

As an example our collection is over 10,000 books way too many to catalog by hand. Although I've trieds several different times to start and keep a catalog going. It is a daunting task.

I know I'm going way overboard for what most people's needs are so after having reviewed many
of the book databases that are out there I came to
the conclusion I needed to try and write my own. AGAIN...sigh.

As a former librarian I feel the need to let people know that a good library database is a catalog of all type of items. Books, music, movies, all other types of media (print and non-print. I've even run into a few that catalog objects, especially works of art and mucical instruments.

There's been a lot of discussion of LOC and some mention of teh MARC record format. One should also consider the OCLC system which many colleges and universities belong to. From working with all kinds of funded catalogs I can tell you that it will be very difficult if not next to impossible to get one together that is "truely" comprehensive. Books have been around since long before Gutenberg's press and between various printings, editions, translations and the like the outporing of the printed word far outstrips the capacity of man.

But back to topic...sorry. My database is going to be for individual or private use. However I am planning ot set it up to go out and collect data to be brought back and stored for processing. As an example...go out to LOC and download the MARC record format of all the records for a particular author (batch process will be much easier than trying to do it by ISBN in our case), translate those MARC records to an XML record and then dump them into a database that is much more useful than flat file. That's the theory anyway. I haven't really started coding yet since I am determined to do it right and hopefully for the last time - this time. (yeah right - says the little voice in the back of my head)

Since this will produce more books than I own, I would then want to be able to check off the books I own and make lists of the ones I'm missng in my sets (oh did I mention that I'm a fanatic?) so I can go shopping. Ideally some thing that can be output to a simple format for shopping, not the whole ball of wax, onto a hand held. I love technology. I also want to keep track of those books I've read from borrowing (public library or friends), an if I want to own them or not.

The interface and the scripts I'll need, to do the job right, are going to have to be able to search several kinds of databases for LOC, to local and school libraries and oh yeah Amazon and B&N or Borders. And the ability to add new scritpting as desired when good sites are found. No one of these is going to have it all since my own collection includes books both pre and post ISBN. (BTW - be careful they do recycle ISBN and LC call numbers.)

I will probably be including all the fields from the MARC record format and then store that as a flat file record and pull the more mundane parts out to a relational set up.

This is all going to take quite a bit of time, so I don't expect to be done anytime soon. I want to structure it all in such a way that as technology changes I can plug in different parts that have better solutions too. GAD - what have I started? (whimper).

I will however keep an eye out for what other's have done in the meantime. Who knows maybe someone will make mine obsolete before I finish. That would be cool too.

Yikes! this has gotten way too long - I think I'd better stop now.
Re:The Example of CDDB by WNight · 2002-04-18 08:19 · Score: 2

Go ahead and charge for access. But don't try to assert ownership of data that was entered by third parties, especially when there's an implicit understanding that if you volunteer effort towards something you'll be able to partake of the fruits of your labour.

So feel free to work out a payment scheme. But don't stop other companies, or generous individuals from doing the same. If you can't offer a better (consistent and fast) service than volunteers can, don't expect payment.

My suggestion for hosting it would be a distributed system where a round-robin DNS points you to various servers based on load and time of day. They all sync up every hour or so, and changes have to come through the master server. (Prevent duplicate entries.)

Space is cheap, so there's no reason to fragment the database, only to distribute the queries to save bandwidth and CPU time.

I'd host a node of this, if it existed. Tell the master server to limit how many people it passes to me and I'd happily dedicate 25% of my bandwidth to it.

Relevant by timothy_m_smith · 2002-04-16 10:25 · Score: 1

I'm not sure that an online free book database becomes very relevant until I start reading books in a digital format. CDDB, freedb, and DVD Profiler are all for digital media while books are still primarily in paper format.

I will continue using my book shelf for the time being.

singlefile by yum · 2002-04-16 10:25 · Score: 4, Informative

==>

Re:singlefile by gadfium · 2002-04-16 10:33 · Score: 2, Informative

That's a pay service costing US$20 per year.

From its website:

Singlefile is an easy-to-use web-based service that helps you organize the books you own, the books you are reading, the books you've read and the books you want to read.

You can use it to keep track of the books you've loaned to friends, or books you haven't bought/read yet, or to find out how many non-fiction paperbacks with 275 pages you own, etc. Singlefile is also great for keeping a record of your books for insurance purposes. And, in affiliation with Amazon.com, you can discover and buy new books you might enjoy based on the authors in your collection!

I think a free service is what is wanted by the original poster!
Re:singlefile by Sir+Robin · 2002-04-17 06:06 · Score: 1

That's a pay service costing US$20 per year.
...
I think a free service is what is wanted by the original poster!
If you're right, all I can say is, heaven forbid the poor bastard should actually be willing to pay $1.67 a month for a service he might actually use and not even have to maintain or upgrade himself. That would be horrible.

--
My /. ID is only 5,210 away from Bruce Perens's.

Cue::Cat by Roadmaster · 2002-04-16 10:25 · Score: 2, Interesting

Yeah, and let's enable the database so that you can point your cue::cat at the book's barcode and up pops the relevant page with information about which book you're reading.

Ain't it easier to just look at the cover??

Re:Cue::Cat by Lemmy+Caution · 2002-04-16 10:29 · Score: 4, Insightful

What if you could scan it, it brings a copy of that record into your local database, prints out a book plate, and the can set up a borrowing schedule for it? That'd be cool. And very helpful for small libraries.
Re:Cue::Cat by curunir · 2002-04-16 10:41 · Score: 2

Ain't it easier to just look at the cover??

I think you just hit the nail on the head.

The point of CDDB like databases is to create a global database of meta data for digital content so to make it easy for people to assign consistant meta data to their content...basically, not to have to type in track info when ripping a cd to mp3.

With books, the physical medium doesn't translate well into a digital form. Sure, there will eventually be e-books, but the current form in which books are sold is hard to make a digital copy of (which is easier, ripping a cd or OCRing 500 or so pages and then spell checking it?)

So, absent digital copies of books that are without meta data, this project wouldn't be particularly useful. The only reason you'd want to search such a database is to find a book to buy. So let Amazon pay the $30k/yr and just search there.

--
"Don't blame me, I voted for Kodos!"
Re:Cue::Cat by raincrow · 2002-04-16 13:32 · Score: 4, Insightful

In the same way that people research CDDB and IMDB to see who has recorded what and who has directed what and who starred in what, people also want to see full author bibliographies, checklists for book series, edition lists (there are people who collect multiple editions and sometimes even printings of the same book). Then the hunt begins. There are more uses of catalogs and libraries than just adding metadata to your own collection. There's also research.

Take a look at the SFDB for an example.

Readerware by chennes · 2002-04-16 10:27 · Score: 2, Interesting

I use Readerware, and while I grant to you that it is "inefficient" in some sense (and yes, the interface sucks), the folks working on it are continuously updating the thing, and its ability to search about 2 dozen different sources for book information is really wonderful. Since most people don't play books by putting them in a slot in their computer, there isn't really that much demand for a really high-power archiver. I personally just scan my new books in and click "update" - Readerware finds everything I need, no problem, and I don't have to do it that often. Chris

Re:Readerware by Anonymous Coward · 2002-04-17 05:28 · Score: 1, Interesting

I love the fact there is a Palm version. I always take my PDA when I'm book shopping to prevent buying duplicates.

What's a Book? by pyrrho · 2002-04-16 10:27 · Score: 1, Funny

Oh, you mean the hardcopy of a web page?

just a bad joke! don't bother to flame me my submodernpostcomslashdotantireactionary freinds... I read bound printouts all the time.

--

-pyrrho

bip ain't that great by brechin · 2002-04-16 10:29 · Score: 2, Informative

I used to work for the local (independent) college bookstore (Illini Union Bookstore), and we had access to Books in Print in both dead tree (very old) and web-based (shared a login with our university's library) formats. While the information was usually very good and very reliable, there were many problems.

Do you have any old books? BIP can be very unreliable when trying to find books published before 1980. Even still, BIP doesn't include information on all the different editions of a particular book, so your ISBN may not yield any results.

Speaking of no results, the search feature on BIP is incredibly unreliable. You can search for an ISBN, not find a book, then search for the title and come up with a book with the ISBN you just searched for. Try putting that ISBN back into the search box and it doesn't work! Sometimes you get what you want, sometimes you don't.

Aside from searching for basic bibliographic information (title, author, illustrator if any, publisher info, etc.), pricing and availability information (available for most books in BIP's database) are not up-to-date as they report them to be. Many times we ordered books and the publisher told us the books were priced very differently from what BIP told us. Good luck getting an accurate estimate of how much your book collection is worth!

In the end, a book database like cddb's cd database or even better, like imdb's movie database including reviews and ratings would help people organize and maintain their private collections, and would help bookstore employees get their job done. If only the book database software our bookstore used had the ability to access an outside database like that!

Why do we need this? by bstadil · 2002-04-16 10:31 · Score: 3, Insightful

I have come to the conclusion that there is not a well developed internet book database.

Why do we need this? Books are not searchable by nature so making it easier to find information about a book still leaves the issue of how do we get access to it. Making an eBook DB makes some sense. The ISBN numbering has been in effect for a long time and you can find any book reference that has a write up or reference on the net via Google. Thirdly the research community has oddles of system for referencing articles and papers.

--
Help fight continental drift.

Re:Why do we need this? by BrokenHalo · 2002-04-16 13:42 · Score: 1

Books are not searchable by nature...
Yes they are.

You can flick through them, browsing as randomly or as focused as you like, read the back cover, read the contents pages, maybe even scan the index.

Seriously, I think we are missing out on a lot if we suppose that only online text is useful.
Re:Why do we need this? by dos+equis · 2002-04-16 17:47 · Score: 1

You can't always find the ISBN on Google for the
simple reason that the format of the number can
contain optional spaces and/or hyphens. Though
there is a standard the spaces and hyphens often
seem to end up in the wrong place. And for rare
books, the ones you most need to do a search for,
this is important.
I found this out a few months ago while searching
for all the translations of Douglas Hofstadter's
"Goedel, Escher, Bach; an Eternal Golden Braid".
Re:Why do we need this? by n-baxley · 2002-04-17 01:38 · Score: 2

I think that the reson that we need this is to allow for a formatted, programatically accessible source of information. If you are just trying to find one book for your own use, it's not a big deal, but if you want to store the ISBN numbers of your personal collection and pull the other information from a central database, individual searches on google won't cut it. I image that you can find all of the information about each CD and track through google as well, but it would take a lot longer.

--

THIS SPACE FOR RENT
Re:Why do we need this? by Art+Tatum · 2002-04-17 02:08 · Score: 1

Ever had to write a paper the night before something is due? Need a bibliography? Bingo.
Plus, it would make it all that much easier to maintain bibliographies. It would be nice if it would format your bibliography and print it for you, too.

Would be good for small libraries worldwide by line-bundle · 2002-04-16 10:31 · Score: 5, Insightful

Some people ask `what is the point?'.

My answer to that is the following: It would be nice to be able to lookup info about a book, given a small amount of information. Suppose you are a library and you want to catalogue books. Instead of having to type in all the information yourself you could just type in the ISBN and all the information get downloaded to the local catalogue.

I have had to make a database and enter data for a library and that would make life a lot easier!.

Re:Would be good for small libraries worldwide by vtrhps · 2002-04-16 10:54 · Score: 2, Informative

There is already a company that provides just such a service: Online Computer Library Center from which libraries can buy bibliographic records to load into their online catalogs (or print for their card catalog). OCLC recently purchased NetLibrary, a provider of e-books. NetLibrary was having financial difficulties, and OCLC jumped in to make sure all those libraries who "purchased" these e-books would still have access.
Another source of Books in Print is through Gale Group. Many local libraries are purchasing access to the Gale Group databases (Books in Print, InfoTrac, etc) for their users. For instance, Virginia residents can type in the bar code number from their library card to get access to these databases from home.
I work in a library, but I'm not a librarian.
Re:Would be good for small libraries worldwide by Anonymous Coward · 2002-04-16 10:55 · Score: 1, Interesting

Small libraries and the like can access OCLC. OCLC provides a definitive copy of the books record. Can you imagine what would happen if some one tried to enter in their own data? Not only do books which have the same title have different ISBNs the data being entered would be subject to the interpritation of the person entering it (eg St. vs Saint)

There are rules that need to be followed in order to maintain any sort of consistancy in record keeping. Remeber, a library isn't kept at all like your bookshelf.
Re:Would be good for small libraries worldwide by Thorin_ · 2002-04-16 11:46 · Score: 1

Can you imagine what would happen if some one tried to enter in their own data?
Having people enter information works quite well for CDDB. Why wouldn't it work for entering books also?
Re:Would be good for small libraries worldwide by Sloppy · 2002-04-16 12:14 · Score: 2

Perhaps mentioning stuff like freedb isn't nearly as useful as mentioning IMDB. IMDB is a really convenient thing if, say, you want to know who the great actors were in Attack of the The Eye Creatures or what George Romero has directed.

Something like that for books would rock.

--
As copyright owner of this comment, I authorize everyone to defeat any technological measure which limits access to it.
Re:Would be good for small libraries worldwide by makohund · 2002-04-16 12:21 · Score: 2, Informative

As a matter of fact... that is actually not that far off from how OCLC works.

From the site...

At the center of OCLC services is the WorldCat database, which:

* Is the most consulted database in higher education
* Holds over 47 million cataloging records created by libraries around the world, with a new record added every 15 seconds
* Spans over 4,000 years of recorded knowledge with 400 languages represented
* Includes 840,637,829 location listings

I'm not a librarian (I'm the sysadmin... the technical services librarian just left for the day or I'd just ask her) but I work in one and I believe the records are all submitted by member libraries.

Anyway, go to the site for more info. I gotta get back to work. :)

http://www.oclc.com/about/
Re:Would be good for small libraries worldwide by Thorin_ · 2002-04-18 09:41 · Score: 1

Is it possible for individuals to querry this database?
Re:Would be good for small libraries worldwide by makohund · 2002-04-24 11:52 · Score: 1

I'm not sure.

It's actually not intended for querying so much as it is for "standardizing" records, and saving libraries the time it takes to create a new record from scratch.

Downloading a complete record (to be loaded into your own catalog/database) actually costs 50 cents a pop, I believe. There are other price plans, too. I'm pretty sure it is a non-profit cooperative, and the charges fund the operation. (Equipment, location, staff, and everything.)

It sounds expensive, but the charges to use this "shared" system don't come close to what it would cost any library (in staff time) to do complete cataloging on their own. Multiply that worktime by the number of member libraries, and it's pretty obvious why it is done that way.

Probably not something usable for individuals, but if you really do have a library (and have to pay employees to maintain it) it's a pretty good way to go.

Watch Out... by Aknaton · 2002-04-16 10:32 · Score: 1

The company responsible for Books In Print probably has a patent on a "list of books". :)

would be nice by ermineshay · 2002-04-16 10:33 · Score: 1

isbn.nu is something of a step in the right sirection...it doesn't sell books directly, so it's pretty much a disinterested party (even imdb isn't really that anymore, since it's an Amazon site). but It would be nice to have something more comprehensive, that covered also books not in print that were important, that would have comprehensive (as best as possible) listings of every edition (in English only) of, say, The Aeneid.

maybe there isn't a widespread interest in this, and that's why it never developed. alt.rec.movies (or whichever usenet group) imdb grew out of obviously filled a need for lotsa people...why didn't this happen with books? maybe it's too late to start now.

still, I'd be interested. as it stands now, people discuss movies, actors, etc in their blogs and link w/out thinking to imdb pages. for books, they end up linking to amazon (or B&N or, even more rearely, booksense). none of those sites give quite the same depth of info on a book as imdb does on a movie.

Re:would be nice by eggboard · 2002-04-17 02:14 · Score: 1

Isbn.nu is my site; thanks for reffing it. Part of my goal in the site was to take a commercial source of book info (Baker and Taylor, in my case) and turn it into something that could be quickly used to find books.

The biggest problem with book information is that there's so damn much of it. The set is too large for individuals to contriubte enough. It grows rapidly and in a distributed fashion (at least 40,000 publishers in the US, and possibly more when you count every single-book publisher; 25,000 new books per year, or perhaps more, depending on how you count it). There are entrenched interests who actually spend millions per year collecting and correcting book info, too.

I'd be more interested in developing a resource that could be shared except that I know how many millions are spent.

The best bet is that publishers could be persuaded to form a non-profit group, the goal of which would be to collect and maintain information on books and disseminate it accurately to all the places that use book info. But that's unlikely, again, because of the cost.

(Formerly, I ran the catalog department at Amazon.com in 96-97, and have consulted for a couple of bookstores, including Powells and Half.com. The problem with book info is a Problem with a capital P.)

--
Freelance tech journalist for the Economist, MIT Technology Review, Macworld, and others
Re:would be nice by Koos · 2002-04-17 21:31 · Score: 2

I'm trying to gather information about books on my site, The Virtual Bookcase. It grew out of trying to make a database of my own books and at the same time doing projects at work regarding books and information about books. I try to gather reviews but I'm ofcourse limited to reviews that people enter (please enter more reviews!) or reviews that I can reuse (such as those from amazon). The amazon-affiliate linking helps recoup a bit of the costs (bandwidth, domain name) but the amount of time invested in the software for the site and the maintainance of the databases is of course never repayed. I do learn a lot in the process :) I'm now at a stage where I think the technical stuff can take care of itself for a while and I need to learn more about site design and usability and how to get other types of information on books such as press releases and general book news.

--
The Virtual Bookcase: book reviews

What's the Purpose? by rubinson · 2002-04-16 10:34 · Score: 3, Interesting

What programs, if any, do those of you reading /. use to keep track of your books? If you were to start an open source internet book database project, what features would you include in it?

What purpose would such a database serve? CDDB/freedb, for example, allow us to automatically download the album titles automatically. Saves everyone a lot of tedious work. Obviously, you're not going to be doing this for books.

As a graduate student, I maintain a single text file of all articles and texts that I've ever referenced. Each entry has a unique identified which I use the UIDs in my own articles instead of typing the full reference. A shell script then updates then updates the references and BibTeX automatically generates the bibliography.

I could see where it could be useful to have a centralized resource that could automatically download those references - but only if it was quicker/easier than typing it in myself (and that only takes a couple of seconds).

What other purposes would such a database serve? How would it make my life easier?

Re:What's the Purpose? by geekoid · 2002-04-16 11:21 · Score: 2

it would allow used book stores a more effecient means of cataloging there inventory.
I perfer small mom and pop shops, and would like to do things to help them.

--
The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
Re:What's the Purpose? by codeartist · 2002-04-16 11:28 · Score: 1

Everyone keeps comparing a system like this to CDDB or the like. I think a much better comparison is iMDB. Yes I never do automatic lookups into iMDB, but there are tons of times when the information on that site is useful. I think a full catalog of a book would be very helpful:
Title, Author, Editor, Publisher, Publishing Date, etc.
Characters, Plot Summary, links to movie adapations (for those who can't stand reading)

And then do what iMDB has also done and allow searches other than by title. Remember the character name in a favorite story, but not the title -- then search by character name. Want to find any other books by the same author?

Don't think of it as a lookup tool for ISBN information, but a lookup tool for all the other meta data on a book

A start by JasonMaggini · 2002-04-16 10:34 · Score: 3, Informative

This site has some stuff on using barcode scanners (including the ever-popular cuecat) to catalog books...
I personally would like to catalog my collection with a relatively decent amount of information, but who wants to sit there and type all that stuff in?
I agree that the trick would to keep a database from going to the Dark Side like CDDB did...

Like this? by danro · 2002-04-16 10:34 · Score: 3, Interesting

Is there someway so that this could be donated into the public domain or something from day one?

Maybe by making the source available under the GPL, and making the ability for different instances of the database to exchange information with each other be a part of the project?
That way anyone with a T1 and a fairly large disc could have his own bookDb.

That way, no single entity would be in exclusive control of the data.

On the other hand no two databasers would be exactly the same.
Hmm...
Database design is not my field really, maybe I should shut up, and just write a few frontends to the db once someone has dreamt one up...

--

"First lesson," Jon said. "Stick them with the pointy end."

Re:Like this? by Phoex · 2002-04-16 10:47 · Score: 1

Maybe have each db query another set of dbs it is connected to if the book is not in the first one? A la Napster et al?

--
00110100 00110010
Re:Like this? by danro · 2002-04-16 10:57 · Score: 2, Interesting

Sure, but the problem is making sure the data is consistent.
Just because the ISBN is in both the querying database and the databases it uses as a reference doesn't mean the entry contains the same data.
And if it doesn't, how should the db know which post is the more correct?
Not a trivial problem to solve, you can't have the databases trust each other too much since you don't want som lame script kiddie getting pleasure from injecting lots of false data and watching it spread...

But, like I said, this is not my field of expertise, I'm sure there are a lot of people on slashdot that know a lot more about the subject...

--

"First lesson," Jon said. "Stick them with the pointy end."

Nice idea, but... by macrom · 2002-04-16 10:35 · Score: 1

Too bad the Cue::Cat makers are no longer in business. Good idea, bad timing I suppose.

I would be interested in writing one by line-bundle · 2002-04-16 10:36 · Score: 1

Reply to this comment with contact info and we could start working on the details.

Re:I would be interested in writing one by niftyzero · 2002-04-16 10:58 · Score: 1

See my other comment. Drop me a note to sdathyperdotto.
Re:I would be interested in writing one by ermineshay · 2002-04-16 11:55 · Score: 1

I'll bite...mailto: is ermineshay at yahoo dot com.

Free Library Databases - and a protocol by outlier · 2002-04-16 10:38 · Score: 5, Informative

Most large university libraries have free (beer) databases that typically contain huge numbers of books (many that are not held by the library).

For example, see mirlyn.web.lib.umich.edu and sign in as a guest and you can do all sorts of searches.

These libraries typically use the Z39.50 standard to connect. Z39.50 is a pretty decent standard, and it is widely used, standardized, and allows you to connect to many many databases.

Sounds like this could be what you're looking for.

Re:Free Library Databases - and a protocol by paynter · 2002-04-16 14:03 · Score: 1

A trivial point, but I can't let this pass:

Z39.50 is a pretty decent standard,...

Actually, Z39.50 is one of the ugliest, stupidest standards I have ever had the misfourtune to use.

You are correct that it is in widespread use.

However, it is barely "standardized": it was created, as near as I can tell, by taking two huge ugly committee-designed standards, munging them together as a hideous compromise, and making everything optional. Thus not every Z39.50 client and server can communicate.

Generally you do okay though if you copy the parts the LoC uses.

Here are some useful links... by sailracer6 · 2002-04-16 10:43 · Score: 5, Informative

The UPC Database

You can add entries here for ANYTHING with a standard UPC, so some books are in here. Very useful.

The Book-Scanning Project

This guy wrote some Python scripts to convert UPC's to ISBN's - it can be done - and then feed them into Amazon's search engine. Very interesting, and he's already done it, so he has some experience.

Here's a use I want to see by ebonic+plague · 2002-04-16 10:43 · Score: 1

You dead right about not needin no index or nothin online that is already in the book.

But peep this: What if there was some kinda database that had the whole book in digital form, but was only searchable instead of readable. That way when fools need to find out instances of a certain word of phrase in a book, they could find out on the web, and get page numbers to look up in they hard copy. It'a keep the publishers happy cause you wouldn't be able to pirate they material, and it'a keep readers happy cause they'd have some of the digital benefits that they cain't get wit a paper book.

--
Na'am sayin?

For items out of copywrite... by V_drive · 2002-04-16 10:43 · Score: 1

Although it's probably just a small subset of what you're looking for, Project Guttenburg is a database of books which are out of copywrite. Since it only contains books out of copywrite, it is able to give you the book contents as well. Not useful for looking up the NY Times bestseller list, but if you need to look up something from The Art of War or Macbeth, see http://promo.net/pg/ and download the whole book as a text file.

--
char *mySig;

Re:For items out of copywrite... by lkaos · 2002-04-16 11:02 · Score: 5, Interesting

http://www.gutenberg.org

is the official url IIRC

absolutely wonderful resource. they have a ton of books and the transcriptions are of pretty high quality--the have an excellent qa process.

--
int func(int a);
func((b += 3, b));

Even More Interesting Application by ltsmash · 2002-04-16 10:46 · Score: 1

What would really be great is having every book ever written on the internet in full-text (not just a summary), stored in a database (like google does for webpages.). Just imagine being able to type in a search phrase and being able to search the text of every book that was ever printed.

I wonder about the possibility of this based on:
1. storage space
2. database efficiency for all that information
3. most importantly: copyright laws.

Re:Even More Interesting Application by niftyzero · 2002-04-16 11:01 · Score: 1

You could make it searchable without making the text available. An indexed text document is not necessarily convertible to the original document.

Anonymous Coward, huh? by sharkey · 2002-04-16 10:48 · Score: 2

Probably an agent provacateur for the Author's Guild trying to incite the Open Source community into writing a book tracking list for them to use to keep track the livelyhood-stealing activities of that awful Jeff Bezos and those bastards at Half-Price Books.

--

--
"Outlook not so good." That magic 8-ball knows everything! I'll ask about Exchange Server next.

What about freedb? by frantzdb · 2002-04-16 10:53 · Score: 2

I have not followed freedb much, but I suspect the software that runs it could be modified to work with books. The Freedb software is under the GPL and uses a MySQL database backend. Someone looking to procrastinate for a few days could probably have a working book database within a few days.

--Ben

37signals by fm6 · 2002-04-16 10:55 · Score: 3, Insightful

Impressive web design. Pretty and usable and minimalist use of HTML. Rare to see all that in one place.

Seems to be a project of 37signals. Some interesting work in their portfolio.

Automated spine recognition? by niftyzero · 2002-04-16 10:55 · Score: 1

Wouldn't it be nice if you could take a digital photo of your bookcase and have it be automatically converted into a list of books?

If such a database also had spine/cover info that let's a program do automated recognition, this would be possible. Then you could put them all up for sale. Or you could look up a book without having to keep your bookcase organized.

The One Major Issue by Etriaph · 2002-04-16 10:56 · Score: 1

You can put your CD and your DVD into your computer, which is why it's handy to have an online database to auto-lookup the details. It's hard to put my copy of Good Omens into my 3 1/2 inch floppy drive, and who the hell would want to type in an ISBN number? And who would even remember an ISBN number? :)

--
"It's here, but no one wants it." - The Sugar Speaker

Bookcrossing.com by chris_mahan · 2002-04-16 10:57 · Score: 2, Interesting

Check out bookcrossing.com. You can have your own bookshelf. Just type the ISBN, it retrieves the cover art, the author and all that. You can fix it too.

I use it, I like it.

--

"Piter, too, is dead."

If you only have one bookcase... by fm6 · 2002-04-16 10:58 · Score: 2

...then I guess there is no point. I have 5 at home and 2 at work.

feed this page an isbn, get XML out by Pinball+Wizard · 2002-04-16 10:59 · Score: 5, Informative

if you are looking for XML data, feel free to use this page(asp at the moment, but it will soon be redone in perl).

The important thing is it outputs XML, so if you want to build an interface to it for your own application, you can. Its not a 100% complete database, but it should give you basic information on any book available.

I wrote this specifically for external search engines back when XML was the new hot thing. Funny thing is, the sites that search us usually want an FTP data feed, so this doesn't really get used much. But again, feel free(be reasonable if you use a bot - maybe limit your bot to a search every 5-10 seconds, please).

--

No, Thursday's out. How about never - is never good for you?

Re:feed this page an isbn, get XML out by geekoid · 2002-04-16 11:26 · Score: 2

is "book id" an internal number?

--
The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
Re:feed this page an isbn, get XML out by Pinball+Wizard · 2002-04-16 11:35 · Score: 2

yes. Its there because an ISBN is not reliable as a unique ID. Ignore it, it only means anything to us.

--
No, Thursday's out. How about never - is never good for you?
Re:feed this page an isbn, get XML out by Rorschach1 · 2002-04-18 09:08 · Score: 2

Most excellent! Many thanks for providing a useful service in an intelligent format.

Though it still seems like you ought to be able to obtain an authoritative database of ISBN numbers. I couldn't find one last time I looked, though.

Writing my own by Thekim · 2002-04-16 11:04 · Score: 2, Interesting

I am writing my own catalog with MySQL/Perl for several reasons.

1) I don't have enough space in my tiny room to fit all my books into bookcases, but with the db I can put some books in boxes in the closet and easily find out in which box a certain book is.
2) I want my books sorted according to a standard classification system but still be able to have them in my own way in the bookcase. Currently I use a heavily outdated (1987) Swedish classification system that the kind folks at my school library lent me. So I'll definitely take look at the Dewey Decimal system mentioned earlier.
3) I have books in several languages and with a db I can have the same kind of information on different books in different languages in the same place. Thus I don't have to look up the romanization for the Kanji (Chinese charachters in Japanese) more than once. But of course it will store the original Kanji-titles as well.
4) I can easily create lists of books that I want to buy and, that friends have borrowed from me or books that I have borrowed.

When it's finished I want it to handle 2-bit languages in a nice way, be compliant with existing standards for book classification, both Swedish and international, allow for easy list creation and have a nice interface.

Use Z39.50 by Anonymous Coward · 2002-04-16 11:04 · Score: 1, Informative

Many large library databases are searchable with a protocol called Z39.50. There is a Perl module implementing this protocol (among many others). Check out http://perl.z3950.org/ for full docs. The reason you get back complex stuff when you do a search should be obvious if you ever read the cataloging information about a book in a library catalog. There's a lot of stuff there. If you're using this for making a catalog of your private library, do a "known item search", for example using ISBN.

Project Gutenberg by jumex · 2002-04-16 11:05 · Score: 2, Interesting

Have you ever heard of Project Gutenberg? It is basically doing what you are talking about and has been since the 1970's. They have a pretty good collection, and I would totally suggest anyone interested in an internet book DB to help them out with their cause. Although I see your point that a full index of all books (without content) would be a pretty cool thing to have.

--
"Your 'Gin n'tonic Futon Brain' sure makes you smart!"
"That's 'Positronic-photon Brain', you idiot!"

Re:Project Gutenberg by bcrowell · 2002-04-16 12:12 · Score: 2

PG is great for old public-domain books that volunteers have digitized. For newer books whose authors have intentionally made them free-as-in-something, see my own site, The Assayer. (The Assayer runs on open-source Perl with DBI, but I doubt the code would be useful to anyone else --- it was my very first Perl app, and it shows...)

--
Find free books.
Re:Project Gutenberg by rusty0101 · 2002-04-16 12:47 · Score: 1

Actually a slightly different purpose here. Project Gutenberg (PG) is oriented around putting public domain books into electronic form. Considering that the timeframe to put a book into the public domain these days exceeds the age at which ISBN's are applicable, it would be a little bit out of their leauge.

The other side of this is that for older books that have value based upon their age, if the book is in PG, someone who owns the book may feel it's value will be better retained if they read the electronic edition from PG, rather than the paper copy they possess.

Hmm. One other feature of a personal bookdb program would be to be able to keep track of the replacement value, and current condition of the books you own. This may have insurance value if you maintain a paper report with your insurance agent.

It might also be usefull to build this into a "domestic property" database, where you would record artwork, craft work, and other property. Generally this will be a much smaller collection of items, but may have a greater general value. I generally won't buy a book for over $100, but if I felt the quality was there, might be comfortable paying over $300 for a piece of art.

This could also work for any collectable, such as comic books or baseball cards.

Obviously baseball cards, artwork, etc won't have ISBN, DD, or LOCBNs, so the keys for these databases would have to be otherwise assigned.

Just some thoughts.

-Rusty

--
You never know...

Does anyone know of a sports statistics database? by Apoptosis66 · 2002-04-16 11:08 · Score: 1

Does anyone know where I could get a Sports Statistics Database, either free or reasonably priced? In particular NBA game by game Stats?

Readerware does use a database...They just hide it by Ted+Cabeen · 2002-04-16 11:11 · Score: 1

Readerware uses the Hsqldb Java Database as its backend. The author doesn't document the fact very well, but you can access the database directly, if you have the client-server edition. I've used the SQuirreL Java-based sql client to browse through the database on a number of occasions, and I'm planning on using it soon to do some data cleanup. (Sometimes the python interpreter that Readerware uses to parse the bookseller's webpages gets confused)

Alternately, the Readerware interface does provide an export capability, so if you really wanted to, you could just export all of your data and then reimport it into a SQL database you're more familiar with.

IMHO, the guy who develops Readerware puts a lot of work into it, and I'm happy to pay him a little for such a useful program. Although an open source version of Readerware would be nice, it'd take a lot of work, and Readerware serves my purposes fine.

Books in Print doesn't cost $30,000 by Anonymous Coward · 2002-04-16 11:20 · Score: 1, Informative

I was suspicious of this price because every little bookstore I go into seems to have online access to BIP, so checked on the Books in Print web site. They have a sliding rate, even a free trial. Unfortunately I couldn't get price details because their web site crashed my Netscape browser.

Less expensive alternative to Books in Print by CaptainStormfield · 2002-04-16 11:21 · Score: 1

OCLC offers much less expensive databases of books. Their WorldCat database includes 47 million bibligraphic records. Based on a quick look at their site, it that only member libraries who share their databases with OCLC have access to to WorldCat. However, I suspect that free, publicly available book database could negotiate membership.

Note: for participating libraries, the cost of WorldCat is much less than $30K. (I don't know how much, but I know that the public library where I used to work could never afford a $30K subscription to anything, but we did have WorldCat access.)

--
"The dinosaurs died because they didn't have a space program." - Niven

Re:Less expensive alternative to Books in Print by CaptainStormfield · 2002-04-16 11:27 · Score: 1

Second sentence should read "it appears that only", not "it that only". Sorry.

--
"The dinosaurs died because they didn't have a space program." - Niven

Re:Does anyone know of a sports statistics databas by outlier · 2002-04-16 11:23 · Score: 2

I'm not sure about NBA, but the kick-ass free Lahman MLB database is available at baseball1.com. It's got stats going back to 1871...

ibsn.nu offers something similar by jbc · 2002-04-16 11:24 · Score: 1

The isbn.nu site offers some of the functionality you're describing. You can input an ISBN, and get a set of standard information about the title. Granted, it's not the same thing as having a public API to the underlying data, but maybe with sufficient encouragement they could see the wisdom of following the Google model in that regard.

One useful thing about it in its current form, by the way, is that it will do a realtime search of various book sites (those evil patent-wielders at Amazon, BN.com, etc.), and display a table letting you comparison by price or reported delivery time. So that's pretty cool.

John
lies.com

[OT] Re:37signals by codeartist · 2002-04-16 11:24 · Score: 1

I agree that this is definitely very pretty web design -- but both singlefile.com and 37signals.com have one fundamental flaw which just drives me up the wall -- there is no way to visually find hyperlinks in the page. I was forced into scanning my mouse cursor over the page to find them (or tabbing through them all, which is even worse). Enough to drive me right off the website...

Re:[OT] Re:37signals by fm6 · 2002-04-16 13:09 · Score: 2

So set your browser to underline links. I prefer that mode anyway. Even when web designer use conventional gimmicks to indicate links, it's not always obvious what they are.

I had to write one myself... by John+Q.+Public · 2002-04-16 11:28 · Score: 1

When we were moving, I needed to catalog my books since they're living in boxes until I can afford new bookshelves. There was nothing really out there except a bunch of pieces, so I assembled them and re-wrote some into a cold fusion app that scrapes Amazon, BN, and AmazonUK for titles and cover images and such into a SQL database. Input comes from a cuecat scan or typed barcode or ISBN. Pretty basic, just for me and my family, and yet I considered sharing it with the world... but I can't afford to have my bandwidth hit. If you have a volunteer to host or want a copy let me know at booklist @ webplumbers dot [skip this] com.

--
UserAdvocate: The voice of the user

Re:I had to write one myself... by rusty0101 · 2002-04-16 12:32 · Score: 1

I almost hate to suggest this, but you might consider making this a sourceforge or Savanah project. The advantage of this is that your code starts as the root of the project. Features can be added, such as a borrowed to and when, book reviews, ratings, target age group. Also if the code is sufficiently portable, which I don't know if cold fusion apps are, it could be made available in a number of languages.

Part of the long term project would be to share the data that each user has, either through a central repository, as cddb/freedb do, or through a distributed repository.

It would be nice if the repository were independantly searchable and browsable, as you could then find out that your co-worker, or friend in your local 0097 club has a copy of that $90 book on cryptography you need for a project you just got involved in, and perhaps can ask either through the program, or via e-mail if you could borrow it vs. buying it, or even if they have already loaned it to someone. That would require a friend/co-worker feature.

One feature might be a cross-conversion utility that would convert from ISBN to LOCBN and or DD and vice versa. If your local public library is online, you could cross-reference their catalog for books as well.

-Rusty

--
You never know...

Hey, Aussies! by technos · 2002-04-16 11:30 · Score: 2

Try the Australian mirror as well. Because of discrepancies in the way countries expire copyright, many books are listed on the Australian site that are not available on the main repository.

And before you go bagging fullscale on the US; There are many books listed on the US site that are not on the British. We're not the worst! =)

http://www.gutenberg.net.au

--
.sig: Now legally binding!

Two Words... by OverDrive33 · 2002-04-16 11:37 · Score: 1

Ugh, Hardcopy....

--
Find Escorts, Strippers, Massage Parlours, Swingers

And books at work... by xixax · 2002-04-16 11:45 · Score: 2

We had a room full of tech-books that had accumilated since before I started. When the PHBs would have dumped the whole lot in a skip, I spent the better part of a week sorting through that lot, Addison-Wesley X11 reference, Adobe Postscript reference, C, Fortran, a complete set of A/UX manuals, etc. etc. etc. No-one in the organisation even knew they existed anymore. Being able to zap the barcodes through a reader and generate a catalogue on our intranet would be cool, there are another three depertments that have similar geek rooms that are always one zealous PHB away from the dumpster.

Hmmm... I am sure there must be ISBN search facilities I can screen-scrape.... Coworker types ISBN of their newly received book into a front end and voila! I can grep for it.

Xix.

--
"Everything is adjustable, provided you have the right tools"

isbn.nu is useful by Anonymous Coward · 2002-04-16 11:47 · Score: 1, Informative

The nice folks at isbn.nu have a database you must check out. Try http://www.isbn.nu/0201563177 for example.

ISBN.NU? by winterdrm · 2002-04-16 11:50 · Score: 1

Hasn't anyone heard of isbn.nu? I use the site almost daily, it's also got links to buy the books on retailer's sites. That, of course, is their source of income (referral fees).

Forget CDDB/Cue::Cat - think IMDB by UsonianAutomatic · 2002-04-16 12:02 · Score: 2

I think a book database could be pretty interesting just as a central ISBN/publisher/year/author reference. (Yes, Google is wonderful, but you never know what context an ISBN match is going to be in; the whole point of having a central resource in consistency.) But then, my wife and I have a living room lined with bookcases, and the bookcases are starting to encroach on our hallway and bedroom too. :)

But you could do some pretty interesting stuff with an IMDB-style book database, at least for fiction. I'm picturing entries for fictional characters and locations, along with birth and death dates, even user-moderated (Wiki?) biographical sketches where available, cross-referenced by author. Instant encyclopedia of Arkham/Castle Rock... cool!

But even outside of a single author's oeuvre, there would be great cross referencing stuff you could do.

Say I read and really liked a detective novel that takes place in Los Angeles in the 1940's.
It would be pretty cool to have a reliable database where I could plug in the ISBN of the book I just read, and get a cross-referenced list of other books set in the same time/place/genre - without the busy, sales-oriented "You might also like" mess you get from a site like Amazon.

Maybe include a user comments section, if there's some sort of meta-moderation available - point-missing/inane/poorly written Amazon user reviews instantly send me into a blind rage :)

-Oh, and you could do automated metasearches with the new Google API, too :)

EndNote by Darth+Cider · 2002-04-16 12:05 · Score: 1

Check out EndNote. You can search the Library of Congress and dozens of university libraries. Very handy.

Re:EndNote by Linuxathome · 2002-04-16 16:57 · Score: 1

Also check out pybliographer (as I have already mentioned in an earlier post) for those of you who strictly run linux (like I do). Hopefully Z39.50 functionality will be integrated soon so that you can search the LoC and other libraries.

--
Linux at home

hmmm, not searchable... by Srin+Tuar · 2002-04-16 12:06 · Score: 2

Books are not searchable by nature so making it easier to find information about a book still leaves the issue of how do we get access to it.

Did I read that right? You mean that title, author, subject, date, and category are not searchable fields? Its impossible to search the contents of a book for patterns? Its not easy to store/index a book's content in the database itself?

Perhaps you typed that wrong, or im misunderstanding you, but that statement sounds profoundly false.

You might want to look into project gutenberg. Because they do that. (If copy restrictions were shorter, they would have tons more stuff too)

Re:hmmm, not searchable... by bstadil · 2002-04-16 18:15 · Score: 1

You misunderstood me or I wasn't clear.

I meant the physical nature of books. ie the paper. Gutenberg, ebooks etc are fine but you do not need any special DB to get to it. As an example take a string of text 6-10 words from Hamlet and put into Google. Sling and arrows of outrageous and entry number 6 is Hamlet. Same with most books on the net.

--
Help fight continental drift.

This would be useful by fishrokka · 2002-04-16 12:15 · Score: 1

I have often asked this same question: basically what I want is an All Music Guide for books.

The site would allow you to look up an author see his or her released books in chronological order by year published. You can't get that from Amazon because it's too clogged with marketing crap and duplicate listings of the same books.

Google and other tools are useful if you know what you are looking for, but for browsing a list of an author's collection there is no reliable source.

Hmmm, if this was 1995 I would smell a business plan brewing...

Actually, there's work being done on one... by ExplodingTeakettle · 2002-04-16 12:28 · Score: 4, Informative

What you're basically proposing is a way to share bibliographic metadata -- not the book itself, but table of contents information, library holdings, etc. There are standards amongst libraries for doing this (ISO Z39.50 and AACR2--both of which are horribly abstruse and generally a pain to deal with). Dr. Rob Cameron, along with a small group of Simon Fraser University students, has been working on the seeds of a system for sharing bibliographic metadata -- see http://www.usin.org. This basically extends the URI standard to support ISBN and ISSNs, initially to support scholarly communication, but also making it possible to create what we call "personal bibhosts" with support for annotations, shared notes, etc. Among other things, we've implemented searches across various worldwide libraries to obtain and compare bits of bibliographic info, and so forth. Yes, you still run into the problems of inconsistent data for a given ISBN/ISSN (as a previous poster pointed out), but hey...you have to start somewhere!

check out the OpenCritic.org project by smallclaims · 2002-04-16 12:33 · Score: 1

I've been working on a project, OpenCritic, which aims to build an IMDB-style, open platform for the collection and cross-referencing of book/movie/music/etc. data.
The goal is to be a) cross-media, i.e. supporting lists and articles that interrelate different media; and b) open content, which is to say, done with a GPL-style license to guarantee open access to it in the future (unlike IMBD or CDDB).

You can check it out and see the project mailing list at http://www.opencritic.org

spdinpdx wrote:
Also, once you had the database, you (or anyone on the net) could add stuff like scanned images or ASCII versions of the TOC and bibiography (so you can check from work whether that book at home is worth driving back for at lunch), reviews, recomendations for related books, links to or copies of related papers...

--
---- Tim McCormick http://www.tjm.org

Don't exclude business by The+Cat · 2002-04-16 12:40 · Score: 2

Something like this is going to have initial and ongoing costs. Even if it is developed under an open license, there should be some provision made for commercial use and licensing, but not ownership, of the database once completed.

The alternative is to have the project run out of money, and be bought, probably by a business, and then commercialized anyway.

The best projects will always be those that balance the commercial aspects with the public interest aspects.

It is an excellent idea, however.

Another use for a Cue Cat by ScottBob · 2002-04-16 13:22 · Score: 3, Interesting

So I scan or type in the ISBN, a perl script grabs the books information from the LOC(via z3950), and when I'm done, the system spits out a list of books in LOC order with the Title/Author next to it.

And what better to scan the ISBN with than a Cue Cat. My mother has about 400 paperback romance novels, and every time she goes to the bookstore, she can't figure out if she's read that book yet or not. She picks a book up, reads two pages, and says "I can't tell if I've read that one before or not." (Of course, I ask her how can she tell?) A Cue Cat and a CDDB style book database would allow me to scan the barcode and catalog every one of her books very quickly so she can bring a printout to the bookstore with her.

re Books In Print by BrokenHalo · 2002-04-16 13:30 · Score: 1

Books In Print is a great reference, but it is far from definitive.

There are lots of publishers whose catalogs never make it into the listing. Also, as we might gather from the title, it only covers books in print, which obviously excludes a huge range of publications.

A database would be a great idea for this reason... (though it doesn't necessarily have to be centralised).

need publicly sponsored book bbs--not Amazon! by rjnagle · 2002-04-16 13:59 · Score: 1

This idea about a master book database is fine, but what we REALLY need is a nonprofit entity that hosts and preserves reader comments.

Amazon now absolutely dominates the book industry because it got there first. Book critics (like
me use amazon to publicly share our comments and criticism.

Amazon does a great service, but it resides in the commercial sphere. What if they decide that you need to pay a subscription to have access to the reviews? What if they decide that the author can't republish the reviews elsewhere? What if Amazon goes out of business? Eventually competitors will spring up, and amazon will no longer wield the influence it does. But we need a centralized solution to store these reviews.

I do agree that this would pose certain programming challenges, as well as legal challenges (i.e., how to moderate postings? how to limit space? how to protect against libel?).

Ideally this sort of database should exist for all the arts, but I suspect that there are advantages to separating them.

--
Robert Nagle, Idiotprogrammer, Houston

I recently started a project by blisspix · 2002-04-16 14:06 · Score: 1

I have several thousand books, and I want to catalogue them for insurance purposes (they're not worth anything, I just want a list in case something happened to them, like fire).

I'm not particularly competent with scripting so I've just been using endnote (www.endnote.com) which is a bibliographic tool, to look up book information and download it to my computer. I then export that information to an Access database. I've also done a similar thing for my CDs.

It is extremely tedious. At work in our library we catalogue everything ourselves because we can't afford access to the big databases like Kinetica.

An open source system would be extremely expensive, because acquiring all that data from LOC, BIP, etc will cost you money. Not to mention the fact that it is hardly comprehensive. Unless you only own run of the mill paperbacks, you will find that there is almost no information left about your books. Knowledge about classic texts from the 60s even, is starting to disappear as last copies are discarded.

I would like to think that the American Library Association or similar could get involved with such a project if it started up but i doubt that they would.

ftp.imdb.com by lostchicken · 2002-04-16 14:12 · Score: 1

Well, one could just use the very mature IMDb engine and just modify it. This would work for all kinds of databases.

You can get the source at ftp.imdb.com. Little known fact. You can get all the files used there, including the database. I don't know the license used, though.

--
-twb

Problems by paynter · 2002-04-16 14:12 · Score: 1

Reasons why a book database is much harder than a CD database:

1. There is generally only one editiion of every CD. There must be hundreds of Editions of (say) Pride and Prejudice. Do you keep one record in your database, or many.

2. How do you uniquely identify a book? CDs have track number and lengths (and maybe digital IDs?) which are always the same. Some books even change their titles between editions. Loc Control numbers and ISBNs only apply on a per-edition basis

3. Performing lookups will be much harder because you have to figure out the ID beforehand and enter it manually, as opposed to just popping a CD in the tray and letting the computer figure out the ID. This will mean fewer people adopt the the syatem. (Bar cide scanners are your friend, in this case.)

This is why library science is such a huge discipline.

Re:*Here's* why we need this. by kiscica · 2002-04-16 14:12 · Score: 2, Interesting

Oh, I get it all right. I have more than 20 thousand books -- no idea of the actual number, that's just based on multiplying the number of packed-full shelves by the number of books on an average shelf. Many of them are old, as in pre-ISBN. Many of them were published in other countries and/or in other languages and don't show up in your typical database. I have numerous Hungarian books, for example, that aren't in the online catalog of any United States library.

I'm working on a catalog of my books (and my etexts, and my tens of thousands of physical and digitized sound recordings, and small quantities of miscellaneous other media -- I'm not really into video). Indeed, bibliography is an interest of mine, and I've long had ideas for very nontraditional, loosely-structured, multiply-hierarchical hypertextual catalogs. I've been implementing small parts of these ideas for over ten years.

But actually getting any reasonable fraction of my library into a database strikes me, on even my most optimistic days, as be a Herculean task. It's hard to get started, because when I do have any free time, I prefer actually reading the books to cataloguing them. Oh, when I actually get out of postdoctoral research hell and get a real job, I might have enough money to hire someone to do data entry (then again, I'm likely to want to spend the extra money on books -- fortunately I just got married and my wife might act as a braking force against that tendency).

With a little luck, I'll have the structural framework for my catalog coded in a year or two. But actually getting the data into a database will be a huge task, and one which my CueCat (or the more professional barcode scanners I recently dumpster-dived) will hardly begin to help with. (Only comparatively-recently published books have bar codes, and not even all of them).

A unified catalog with all the records from Library of Congress, Books in Print, and university/state libraries around the world would be fantastic, though, if only to "fill in the blanks" with a minimum of manual entry for any given book. (I do have access through my university to some things that help, though, the unified bibliographical catalogs that librarians use. But I have to write glue code to automate access to them, and that's a pain in the butt).

Why do I want to catalogue my library? Well, there are a couple of reasons. The main one is probably that I want to build the hypertextual database that I alluded to above. When I read books, I make notes (mentally or otherwise). The notes usually make reference to other books. It would be nice to record these notes in the database; eventually it would be a web reflecting what I've thought about various books throughout time. I'm a fairly disorganized person, and if I just jot something down somewhere I'll lose track of it. And if I try to keep it all in mind, I'll inevitably start to forget.

Being disorganized also justifies a catalog on purely practical terms -- it would be nice to know for sure, when for instance I see a book that I've already read and liked in a used bookstore, whether I already have the book (in which case I certainly don't want a duplicate), or read it somewhere else (in which case I certainly do want to buy it). And, since my books are not shelved according to any rational system, a catalog might help me find them (though I don't usually have much trouble with this). Note that I have no intention of significantly rationalizing the shelving even if I do catalogue the books. I'm much more likely to simply record my idiosyncratic locations in the database.

A final reason for cataloguing is that my collection is fairly comprehensive in a few specialized areas and I definitely do have a few books, at least, that would be very hard to find in this country. I'd be willing to lend out such books to (trustworthy) people. But people need to be able to find out that I have the books, and I need to be able to keep track of any loans as I'd be loath to lose even a single book. A catalog would be absolutely indispensable for this.

Kiscica

Read this by Eloquence · 2002-04-16 14:19 · Score: 2

SearchDay - Curling Up with a Good Book Search Engine.

Re:Read this by rubble · 2002-04-17 01:57 · Score: 1

The largest database of free, full-text books available online and searchable from the University of Pennsylvania.

The Online Books Page
http://digital.library.upenn.edu/books/

noteworthy PHP programs by tshieh · 2002-04-16 15:03 · Score: 1

These PHP programs are noteworthy.

--
sig: BeanShell: lightweight scripting for Ja

I have one in development right now... by ChrisKnight · 2002-04-16 15:04 · Score: 3, Informative

I am currently building a database if ISBN numbers with the following records: Title, Author, Publisher and Media.

It hadn't really occurred to me that others might like access to this kind of data as well.

Seriously, is there enough interest that it might be worth the effort to add a request interface that returned an XML object of the data that I have? Would others contribute to it?

I currently have 294,652 completed entries in my database. I'm out of work and bored, and I'll make it publicly accessible if I get some feeback indicating that it would be worth the effort.

-Chris

--
-- This sig is only a test. If this were a real sig it would say something witty. --

Re:I have one in development right now... by tutal · 2002-04-16 18:04 · Score: 1

Not sure but is the ISBN db proprietary and if so, what ramifications would that have on you distributing this info. (I looked for about 15 min, alas to no success)
Re:I have one in development right now... by ChrisKnight · 2002-04-17 03:02 · Score: 2

I honestly don't know. I am 'scraping' my info from public sources, and the ISBN number is printed on the back of books, so I am just gathering together into one place data that is available elsewhere. I am not just taking a licensed ISBN database and making it public.

My post isn't generating much interest though, so I'll probably just keep my database private and not have to worry about legal issues.

-Chris

--
-- This sig is only a test. If this were a real sig it would say something witty. --

Exchange formats and common pools for free data by mlinksva · 2002-04-16 15:10 · Score: 2

See related article/discussion on Advogato from a few days ago.

my personal solution by sugardaddyano · 2002-04-16 15:19 · Score: 1

I put together a simple mysql/asp (i know, sorry, it was before i learned php) to categorize my books. Still in progress, and ive only gotten around to adding about 1 shelf out of the roomful of books. It primarily uses author/title info for storing books, but also ISBN, publisher name, and pub date. For multiple copies, it keeps track of individual copies. It also keeps track of condition, and copyright date vs. printing date. It is VERY time intensive to add books in however.

my books site

Built one by mattr · 2002-04-16 16:37 · Score: 2

I built one but it is very free-form and may not be what you want.

My objective was to do a quick keyword search on a list of 100,000 records from several different sources. Generally I have one line per book, and while some of the indices provide more information that is all I use.

I didn't want to spend the time to do a real database job and I wanted to use Perl regular expressions to do a quick keyword search within author and title text. So I keep recent indices next to the search program compressed variously with zip, gzip, or bzip2. I can direct the system to make a single text file which contains the unpacked text all appended together and compressed again. It will also list stats for each file.

Its main function is to wait for a keyword to be typed in, and it will immediately (PIII/450MHz Linux Inspiron 7.5K) display a numbered list of matching books, in alphabetical order grouped by indice. You can then select certain numbers from the list, or reduce the number of records by adding more keywords. This is sufficent for me and has helped me discover unknown titles and new authors because of its way of narrowing down on information. Perhaps if I had more structured files I would have used Perl's BoulderIO which has solved huger problems of library science in merging genome data files, see bio.perl.org.

A first? by Sivar · 2002-04-16 16:49 · Score: 1

OMG! An "Ask Slashdot" question in which the submitter checked search engines first!? :-)

--
Computer Science is no more about computers than astronomy is about telescopes. --E. W. Dijkstra

Shameless plug to a very useful program by Linuxathome · 2002-04-16 16:51 · Score: 1

There is an open source bibliography management program called pybliographer that will eventually (I hope, as I am the one who is working on part of it) have Z39.50 client functionality. If you are a programmer who knows Python (or wish to learn), please stop by the website, read the discussions and source code (it's really not that bad, it only took me a few months to pick it up), and maybe help out? From what I've been reading in the mailing list is that the main developers are working on a more robust way to store the data--eventually leading to perhaps a share-able database, maybe even adaptable enough to become the program that the parent post is alluding to.

--
Linux at home

Multilingual book database by dos+equis · 2002-04-16 17:52 · Score: 1

I would love a multilingual book database.
As an amateur linguist I collect books in all the
languages of the world. I often try to get the
original language version of a novel that has
become famous in its English translation. This
type of information is very difficult to find on
the internet currently - especially for non-Latin
script languages such as Chinese, Arabic, Thai.
Such a database would require proper unicode
support, standard romanization methods, and
understand that Author's names don't work the same
in all languages. How many times have I had to
look under both "G" and "M" in bookshops for the
works of Gabriel Garcia Marquez!

Think other countries/languages; old books, &c by CandyMan · 2002-04-16 18:21 · Score: 1

Books in print only covers books currently carried by commercial distrtibutors, and only those in English (I am assuming it covers US, ENglish, Australian, Canadian etc, be it in several volumes or in one fat subscription).

We need something for books in other languages/countries (I am Spanish and own a sizeable number of South American Books).

On other news, Andrew Plotkin (Zarf of Inform fame) has a nice tale of his project to digitize his book collection catalogue: The Book-Scanning Project. Sorry if this is redundant, one really doesn't have the time to read aaaall other contributions.

--
http://barrapunto.com/ - News for nerds, en español

Python + barcode scanner by mmarcos · 2002-04-16 18:39 · Score: 1

This kind gentlman has posted a solution for owners of books that have barcodes. Scan in to a text file, run the script and it gets the info from Amazon.
http://www.eblong.com/zarf/bookscan/

--
Are you spontaneously enthusiastic about everyone having everything you can have? - Buckminster Fuller

To keep track of your references by SgtChaireBourne · 2002-04-16 19:05 · Score: 3, Interesting

BIBTEX and MARC are two format for managing bibliographic data. But if you're thinking of rolling your own reference manager, then you'll quickly find out that it's not just a flat file and then you'll also need to integrate it with your data source and with your editor/wordprocessor.

If you just want to import citations, the Z39.50 search and retrieval protocol is the way to import from yor library catalog and many online databases. Indexdata has number of multiplatform tools that you can use, such as YAZ (a z39.50 client) and PHPYAZ. Three commercial packages import from Z39.50 sources nicely (Bookwhere, Procite and Endnote) both Procite and Endnot work well at managing your footnotes during workprocessing, taking care of numbering and layout (e.g. APA or Chicago Manual of Style, etc.).

If you want something under GPL and more oriented to managing web sites and other Internet resources, then you may want to try hypatia. You'll have to ask special for it, but it's available. Here are the parts I've seen so far:

Web-based interface, both end users and maintainers.
Fully multi-lingual, including both interface and content. (It is very easy to add another language to the interfaces. Right now English and Spanish are complete, Norwegian and Finnish are being translated.) Support for Unicode (Which means you're free to add interfaces in or ).
Useable on many different platforms, including Linux, Unix, and Windows.
Individual installations can exchange records, allowing federated content and service providers to work together seamlessly. (Haven't tried it yet.)
Compatible with relevant standards, including MARC, Dublin Core, and the Networked Reference standard currently under development by NISO.
Special features for digital collections, such as automatic URL checking.
Authority control over names (e.g. People and Organizations).
Uses perl/MySQL/javascript

You can see the end user interface in production at the IPL in the serials, newspapers, or online texts collections. The collection managment interfaces are even nicer and very useful. I'm sure it can be tweaked for data on legacy media as well.

--
Beta is broken and the link to classic doesn't work. Stop wasting our time or there won't be anybody left here.

Re:To keep track of your references by follower-fillet · 2002-04-17 00:07 · Score: 1

I'm amazed the parent post is one of only a couple which mention MARC records--they're pretty central in the library world and any solution should probably include them.
Re:To keep track of your references by grunthos · 2002-04-17 07:10 · Score: 1

There is another Web-based card catalog project on SourceForge (GPL) called glibs. It needs some work but may be a good starting point for a personal catalog. It has a MARC file import facility.
Also, a previous /. on library software

--

My son's 5th grade teacher actually assigned them "write a limerick about a planet". I'm not kidding.

GnuteMberg! Free Documentation Database by sama · 2002-04-16 20:16 · Score: 1

You might want to have a look at project GnuteMberg! (not to be confused with project Gutenberg) subproject, GnuteMberg! Free Documentation Database. It's an effort to catalog all documentation covered by free (as in "free speech") licenses. It might as well be a starting point, and all the code is released under the GPL.

why one might need 11x17 by Crab+with+Boobies · 2002-04-16 23:30 · Score: 1

If you do OCR on books, and don't want to rip them apart, you have to put both pages on the scanner at once, or screw around an emormous amount. 11x17 works OK for this, these new scanners are really junky for doing this.

GPL won't help by Aapje · 2002-04-17 00:36 · Score: 2

As a server-side app, it doesn't have to be distributed to others. So the GPL won't do any good. I like the interchange of info between databases, but that can be very hard to implement correctly. You'll run into all kinds of mismatches where you need to do the right thing (when do records denote the same book?). You can fix a lot of these problems by focusing on the ISBN, but you'll still run into inconsistent data and the like. There will also be security issues, you have to make sure that the guy who you synchronize with hasn't polluted his database with wrong data. Lastly, copying an entire database takes a lot of horsepower. You don't want people doing this to your database during busy hours.

Apart from these technical considerations, it's very likely that one database will become the standard that everyone uses, even when there is more than one choice (everyone will use the most extensive, most correct database). I doubt that there will be many syncronizations into this database, most people will be interested in making a copy of it.

All in all I think the best bet is to incorporate a feature that makes it easy to automatically dump the database to a mirror at a certain time (midnight or so). There's no good way to make sure that this actually get's done though, but you can always write a screenscrape application to punish the organization that doesn't supply you with a dump (scraping the entire database will hurt them bad). When the organization goes nasty, you can switch over to a copy and try to beat them with an open alternative.

--

The Drowned and the Saved - Primo Levi

Fully Index the Books!! by RhettLivingston · 2002-04-17 01:46 · Score: 1

What would be really valuable is a Google style index that pretended that every page of every print version of a book was a different web page and fully indexed the content. The access would have to be a little different, but I'd love to have it.

Basically, you could enter a search phrase, get a list of books it hit with excerpts, click on one of those and get a list of print versions, choose the print version you have and be told what page the hit is on.

The design could be bolstered by allowing you to enter the particular books that you have in your library and automatically narrowing down the lists.

It could also contain online material so as to allow you to search both the Internet and book libraries at the same time.

Where The Libraries Get Cataloging Info by rubble · 2002-04-17 01:48 · Score: 1

Most libraries buy cataloging information from the largest database of records in the world (books, serials, cds, dvds, etc) called WorldCat.
47 million records. Cataloging comes from member libraries from around the world and is shared by all.

Many public libraries are making the WorldCat database available to patrons via remote access
very often through another OCLC service called FirstSearch.

Info about WorldCat at
http://www.oclc.com/

cheers,
rubble (a librarian from d.c.)

Low cost available service by metafeather · 2002-04-17 02:00 · Score: 1

I dont know the means that this site is using but :

http://www.spinfree.com/singlefile/

is an online book cataloguing service that uses ISBN to fill form-data fields, and they have an Export facility

These guys used to have an app that had similar functionality.

[Disclaimer: I am not affiliated with this service in any way - just a happy user]

--

"Everything in moderation, including moderation"

Ultimate Search Engine for Books - Allreaders.com by Allreaders.com · 2002-04-17 02:01 · Score: 1

Try Allreaders.com. (http://www.allreaders.com) They let you search by very specific elements of plot, setting, theme, and character to find exactly the book you're looking for. It's the only "Browsable" engine for books that lets you search for kinds of books, instead of a straight title/author search.

Alexlit by ek_adam · 2002-04-17 02:05 · Score: 1

The Library of Alexandria is a book recommender database. I joined when they were still in their pure data gathering stages. Nowadays you have to click on the Departments:Recommender link to get past the online fiction store.

You rate several stories [Dreadful, Boring, So-so, Enjoyable, Really Good, Excellent, Fabulous]. Then you can ask for recommendations. The database correlates your ratings with everyone elses ratings and finds the people with ratings closest to yours, your "neighbors". Then it uses your neighbors' ratings to recommend books that you haven't rated yet. The recommendations each have a confidence rating [Pure Speculation, Wild Guess, Extremely Low, Very Low, Low, Medium-Low, Medium, Medium-High, High, Very High, Extremely High, Almost Positive] based on how many neighbors recommended the book, what the range of ratings are, and how "close" each neighbor is. Obviously, the more books you rate, the more accurate the system can be. With this system I've discovered lots of books that I love, but never would have picked while browsing in a bookstore.

Dragging this post somewhat back on-topic, users can enter in story title's and authors that are not in the database yet. Similar to CDDB and freeDB, most stories were entered by the users, not the administrators.

I am not connected with the Library of Alexandria website except as an occasional customer of their online store and as a long time user of their database; over the past four years or so I've entered 2314 ratings.

Why a central book db by minx · 2002-04-17 02:17 · Score: 1

For two good reasons:
1.
I am currently developing an online application for college students to list their books for sale to each other. It would be nice to ask for only an ISBN and populate my listings with CORRECT info, it's also be nice to have one record per book. Not one for "john doe, title" and "johhn doee, titlle". Easier to validate, good data coherency and accuracy.

2.
it costs frickin $30,000 a year for the definitive resource, I cannot afford it, my free service will not use it. So there!

My brain mostly by egad_man · 2002-04-17 03:56 · Score: 1

Most of the books I've read I can remeber most of it (or at least the general plot/themes/ideas of the book). I've almost gotten to the point where I could go into a library and just ask for the books that came out in the past week. Or what I do is download new books (legally even at Project Gutenberg, run it through some text-to-speech program, grab the audio and burn it onto CD to listen to while I drive.

--
Hmmm, I have 5 mod pts, its time to metamod, and on top of that I have to meta-metamod? When do I get to read slashdot?

Book Databases by SonOfGodfrey · 2002-04-17 07:19 · Score: 1

This is just goes to show how little people pay attention to books and libraries. Libraries have had an *open* standard, called Z39.50, to access databases containing information about books, for years. It is a standard based on MARC records, and just about every commercial library cataloging program is based on it, and can read/write these records. The Library of Congress provides full access via the Z39.50 interface to two databases, one for people to test with, and the other containing the entire LOC.

I Volunteer by dsoltesz · 2002-04-17 14:38 · Score: 1

For years I've intended to get just my Asimov collection into a database of some sort, but the idea of pulling down all the books and typing the info by hand is daunting (or maybe ridiculous). So, I've spent the last hour or so playing with some ideas I picked up from this article's discussion earlier. After hunting around in vain to see if there was a decent book database out there, I started thinking "ya know, if we /.ers all got together, we could probably build and populate a database..."

...IOW, I'd love to contribute to such a project. If anyone is seriously thinking of starting such a thing, let me know.

please help in my quest! by bubonick · 2002-04-17 16:31 · Score: 1

What timing you have!.. Just 2 days prior to you posting this I was searching on the net for a service similiar to this! I'm trying to set up a friend's bookstore database and was trying to create a script that will automatically import the upc on the back of the book which would automatically populate the database with pertanent info on the paticular book. ie. description, author. Yet I am without luck in this mission,.. If you could please tell me of any info developing on behalf of this post... Please.. please.. tell me! Otherwise I'm gonna have to parse thru amazon.com and that is unstable and possibly illegal!... .. Don't make me break the law! hahaha josh_moch@hotmail.com

endnote by dsoltesz · 2002-04-17 18:31 · Score: 1

Hey, thanks for the tip on endnote. Very nice export feature that will output any format you can imagine! In a matter of a couple hours of playing around, I managed to tweak a custom output template to export the data to XML, and create a rudimentary style sheet to transform what I wanted to see into a table. The biggest pain was catching all the "special" characters, but a few global-replaces and I was done. This is perfect for my own need to track not only what books I do have but what I want to get.

This seems to be a potentially good tool for what the author might need. It will be a two step process -- 1) find what you want using endnote and export the results in XML (or some other easy to manipulate format), 2) an automated procedure for importing and storing the information in the local database.

OCLC by Dubber · 2002-04-19 04:52 · Score: 1

Um...
Libraries already do this via OCLC (and actually there are now vendors/jobbers out there that can and/or will do this for about a US$1.50 per book)

--
Your complaints about being offended offend me.

171 of 231 comments (clear)