Proposal: Put Library of Congress' Contents Online

Er by DrMrLordX · 2004-10-09 17:00 · Score: 5, Insightful

Pardon me for sounding like an eegnoramoose, but isn't at least some of the material in the Library of Congress copyrighted material? Putting it all online would let people get copies of it for *gasp* FREE.

Can't have that, now can we?

Re:Er by ravenspear · 2004-10-09 17:02 · Score: 1

Yes, in fact the vast majority of it is copyrighted.

I can't see how this could be done with any kind of public access to most of the content.
Re:Er by DrMrLordX · 2004-10-09 17:04 · Score: 1

Bugger all.

Does being a visionary mean that you get to ignore inconvinient elements of practical reality? Or, in this case, impractical reality?
Re:Er by Anonymous Coward · 2004-10-09 17:06 · Score: 0

Very simply. Every tax payer in the country would be required to pay an additional tax which would go directly to reimbursing the RIAA/MPAA/Book Publishers - whether or not you ever actually use the library.

I haven't used a public library in about fifteen years (since I was a little kid). It's too much of a hassle. But I would gladly use it if they put the material online. There are many sci fi books I've wanted to read but being unemployed now, I can't afford $8 for a paperback (nor can I afford to pay $3.50 PER BOOK to ship a 75 cent used book through Amazon.com).

Even used book stores charge at least half the cover prive for a used book, which means you're looking at paying a good $4 for a paper back!
Re:Er by silentbozo · 2004-10-09 17:08 · Score: 5, Insightful

Many of the libraries in the country carry copyrighted material. You can walk in and peruse the books at your leisure, for free. Same idea, only you grant access to a lot more people. Scholars routinely pay to get copies of rare items from libraries for research, and every time a query comes in, they have to haul the book out, and run it through a copier. It would be a lot more intelligent to scan once, store it, and make it available on demand.

The chief benefit? Even if the original is lost or destroyed, the digital version lives on - a big issue, assuming that ANY item ever enters the public domain from now on, the way that they were supposed to. Hell, I'd lay out money for a copy of the Library of Congress on a set of blue-ray DVDs, and so would many large corporations (those that still have research labs, that is), universities and colleges, as well as other organizations and governmental entities around the world.
Re:Er by jrockway · 2004-10-09 17:11 · Score: 4, Insightful

Interesting concept, though. It's okay if I go to the Library and look it, but not if I look at it online? Why? ( I guess I know the answer; in real life only one person can see it at a time. Online, everyone on Earth can see it at the same time. Oh well. Information wants to be free. Don't want someone to know it? Don't write a book about it! )

--
My other car is first.
Re:Er by loraksus · 2004-10-09 17:12 · Score: 1

I might umm, sound insensitive, but are you missing legs or something? Libraries are one of the easiest places to get to in pretty much every community that I have been to. Public transport runs by most, or there are smaller libraries scattered around.
BTW, check out half.com (shipping is a bit better, usually) or your local goodwill. I don't go there much, but I've found hardcovers that were out for less than a week on the shelves in mine for around $3. (not quite sure how this works, but whatever)

--
1q2w3e4r5t6y7u8i9o0pqawsedrftgthyjukilo;p'azsxdcfv gbhnjmk,l.;/
Re:Er by Jerf · 2004-10-09 17:14 · Score: 1, Troll

Putting it all online would let people get copies of it for *gasp* FREE.

Who said anything about "free"?

Although this would potentially take dialog about the public domain out of obscurity and into the LoC mainstream, and the LoC does have some influence in the copyright debate. Certainly once the data exists, anywhere, it is going to be harder to make the argument that we should just throw it away, no matter what the reason.
Re:Er by Anonymous Coward · 2004-10-09 17:27 · Score: 1, Funny

If they started now, I wonder how many dog years it would take.
Re:Er by metlin · 2004-10-09 17:28 · Score: 1

Hmm, actually from what I heard, the proposal wasn't suggesting that it would all be made free.

Merely that the contents that could be made free would be, the rest would be up for a price (based on what you need) -- something like a huge searchable database, where you pay for what you access.

I can't bother looking for it, but I think it's more likely that is the case. Especially considering the point that you brought up.
Re:Er by siriuskase · 2004-10-09 17:30 · Score: 4, Interesting

This is one more reason that the whole basis behind IP law needs to be reevaluated. Although we do want authors, inventors, and other creative types to be rewarded for their efforts, it is also true that what they create becomes more valuable the more it gets out into the world. Any academic knows that the more a paper gets cited, the more valuable it is. Likewise, the more a book is read, the more likely it will wind up in the canon of culturally significant books.
Creating primarily for money is shortsighted when a work has the chance to impact the larger culture. Just look at Michael Moore (ooh, isn't he ugly, but that's not the point), he's more interested in people seeing and being influenced by his movies than in getting richer off them. Enough money to be comfortable is great, but then, barriers to free movement of ideas should be relaxed.

--
If you must moderate, please moderate as irrelevent, not something bad, because I'm sure someone will find this interest
Re:Er by sonamchauhan · 2004-10-09 17:34 · Score: 4, Interesting
Putting it all online would let people get copies of it for *gasp* FREE.

Can't have that, now can we?

No, we can't... it not be fair to lots of people whose copyrights haven't yet lapsed.

But scanning the materials is _still_ a good idea. It allows for automated OCR that allows searching for text _within_ a book (like A9.com does, and as Google plans to do.) The difference is that all books published in the US could be searched.

It would also make this scenario possible:
- I walk into a public library
- On a library computer, I enter keywords that search the new "library of congress book text search database".
- Based on the results (matching text snippets from _within_ books), I decide to buy two books.
- I walk to the librarian and pay the purchase price
- She fires up a local print run on the library's new laser book printer
- 500 automatically laser-printed-punched-and-bound pages later, I have my new two books.
Since this process is handled by people trained to respect copyright (i.e. the librarians), it is a win-win for everyone.
Re:Er by DrMrLordX · 2004-10-09 17:52 · Score: 2, Informative

However, if you have an entire library's contents available in digital format, it's possible to make perfect copies of it an infinite number of times. In contrast, there are restrictions as to how and how often copyrighted materials in a physical library can be run through a copy machine.

I can't see publishers liking the idea of an online Library of Congress at all. Viewers would be able to make their own e-books at a whim. Not that *I* would mind, but . . .
Re:Er by mcrbids · 2004-10-09 17:52 · Score: 4, Insightful

One of the greatest catastrophes in human history was the burning of the great library at Alexandria, Egypt.

See, the ancient world had many items of great wisdom, and many of the only copies of these works were contained there. The burning of the great library was the end for countless such works.

Today, however, our knowledge is much more widely spread. We all owe a tremendous debt to Gutenburg, for his printing press (removable type press, 1436) for making this possible.

It's quite arguable that the dawn of the renaissance stemmed not from Galileo, or Kepler, but from the widespread nature of books in general after the removable type printing press made this possible.

How many of these works are unique or very rare? I'd consider that a large percentage of these works fall into this category - in which, it would be a wonderful thing to build in some redundancy into the preservation of not only these works, but the wisdom, insight, and humor contained therein!

Warm up the scanner, says I!

--
I have no problem with your religion until you decide it's reason to deprive others of the truth.
Re:Er by DrMrLordX · 2004-10-09 17:57 · Score: 1

Nobody said anything about it being free, but nobody said anything about charging access for it, either. Seeing as how the article in question detailed the costs of scanning and storing all the data without making any mention of how one might recoup those costs, it would seem that the initial suggestion did not include any plan to charge for access to the online LoC.
Re:Er by mehu · 2004-10-09 18:13 · Score: 1

Putting it all online would let people get copies of it for *gasp* FREE.

Not only have the other comments pointed out that most libraries allow you to check out anything for free anyway (provided they have a copy), most university libraries also allow you to access a huge amount of content ONLINE. There are hundreds of academic databases full of scholarly magazine & journal articles (some w/ just the abstract, some w/ the full text) that can be accessed at a university library computer, or even from home (may require a student login). One of my classes uses this system for all of its reading assignments: go to the library web page, go to the corresponding academic database, look up the article, read the PDF. Beats the hell out of buying books & "course readers", or having 20 people waiting for the one print copy of the 1970s magazine with that article in it.

Having digital copies of large amounts of printed material isn't a new thing, and it doesn't necessarily have to mean it will all be freely available. Some of it might be, but for the rest, you might need to either pay by content, or have a student or subscriber account. Nothing that can't be & hasn't already been done before.
Re:Er by 1u3hr · 2004-10-09 18:19 · Score: 2, Informative

I might umm, sound insensitive, but are you missing legs or something? Libraries are one of the easiest places to get to in pretty much every community
We're specifically talking about the Library of Congress, which has millions of books, not your local library with maybe 100k or so (I rememeber my university had about 800k books, probably a million by now). The idea is not to give access to the NYT bestsellers, but rare books that you would have a hard time finding anywhere else.
Re:Er by Jeff+DeMaagd · 2004-10-09 18:39 · Score: 4, Insightful

Even if the original is lost or destroyed, the digital version lives on

Assuming a large sum of money is spent maintaining the digital versions. Computers lose and destroy data, even good computers fail. So it would require good backups done on a regular basis. File formats tend to change too.
Re:Er by ChuckSchwab · 2004-10-09 18:40 · Score: 0

I totally agree. This will destroy any semblance of IP rights. People forget that the Library of Congress is one place you're going to want your book if it gets published.

And then, if this system ever comes out, anyone can put together the text of an entire book just by downloading it. (How much work would depend on whether it was just scanned or actually typed in.) And once that happens it will be legally possible to pirate a book, at least if you have a team working on it. Then the team could just flood the p2p networks with their legally-gotten gains and then there'd be no point for anyone to buy the book!

Then we're back to square one, with zero intellectual property rights. Why should anyone produce any great literature if others are just going to be able to get it for free? What's the incentive?

This is a full-frontal assault on IP rights, and on culture for that matter.
Re:Er by jcr · 2004-10-09 19:07 · Score: 1

One of the greatest catastrophes in human history was the burning of the great library at Alexandria, Egypt.

I would describe that as a crime, since it was done deliberately.

-jcr

--
The only title of honor that a tyrant can grant is "Enemy of the State."
Re:Er by mrgreen4242 · 2004-10-09 19:53 · Score: 2, Funny

assuming that ANY item ever enters the public domain from now on
The ghost of Sonny Bono with haunt you forever, being sure that you know nothing will ever reach the public domain again...
Re:Er by Anonymous Coward · 2004-10-09 20:01 · Score: 2, Informative

A librarian that respects copyright I can understand. A library capable of maintaing a secure client I think not.

There are a number of issues at play in libraries:
o Funding is scarce enough for books, let alone computers and networks.
o End-Client network security requirements are considered a hinderance and parasitic cost for web service devleopment and deployment.
o IT is considered to be the art of reinstalling Win9x.
o Egress filtering and authentication is largely unknown.
o Public access PCs are invariable on the same network as the "trusted" Librarian PC.
o Most libraries have public access PCs using obsolete and invariabley unpatched software.

Or maybe Bill and DRM will save us all from running unpaid, sorry unsigned, code.
Re:Er by Anonymous Coward · 2004-10-09 20:15 · Score: 1, Interesting

No, we can't... it not be fair to lots of people whose copyrights haven't yet lapsed.
A major problem with copyright law is that congress makes sure that copyrights never lapse. Everytime anything Disney is about to enter the public domain, Disney and the large companies that control all creative material get together and lobby (bribe) congress to extend the term, ensuring nothing created after about 1930 will ever be in the public domain.
That isn't fair to mankind.
Re:Er by Anonymous Coward · 2004-10-09 20:33 · Score: 1, Funny

Some information wants to be free. I happen to have some informations that does not want to be free. Unfortunately the very nature of this information means I cannot tell you what it is.
Re:Er by slashdot.org · 2004-10-09 20:34 · Score: 4, Insightful

I guess I know the answer; in real life only one person can see it at a time.

And that's exactly the biggest mistake people keep making; analogies don't work. The stuff we are dealing with is *new*. A library != Internet. There is no analogy.

I'm not saying that I have a solution to any of this, but I think the first thing people will have to realize is that things have changed in a dramatic way. The traditional way of thinking about IP (or really, information) no longer works.

There is no simple answer to any of this, and it makes no sense to come up with analogies and try to justify or make judgement based on that.

Fact of the matter is, all of a sudden it is possible for people to view/copy information pretty much instantly. What we need to realize is that _we_ are the ones that can/will put together the foundation of how to deal with this. No current laws really are suitable. Look at the mess with P2P networks and the music industry. Surely P2P networks _should_ be perfectly legal, but on the other hand if copying music would become so easy that you could listen to any song you'd like, at any given time without paying for it, it's hard to imagine how artists will be paid (and please don't give me the "they'll have to do live performances to make money" bs).

The people that will be able to figure out what the _real_ answers are to these issues are the ones that will do really well. Think about it. /rant
Re:Er by RWerp · 2004-10-09 20:45 · Score: 1

The idea of putting all contents of Library of Congress (or any other BIG library) sound good at first glance, but its practical realization may prove to be very difficult. Those "rare books" you were referring to may well be books from the XVIIth, XVIIIth or XIXth century. While not printed on the trashy acid paper from the XXth century, they are nevertheless fragile. Scanning them may be impossible, due to the mechanical or chemical (to scan something, you shed bright light on it --- some manuscripts are viewed only by special light, sometimes by... moonlight, because normal light may be damaging to them) damage it may cause. Or very, very impossible and expensive.

--
"Long run is a misleading guide to current affairs. In the long run we are all dead." (John Maynard Keynes)
Re:Er by serutan · 2004-10-09 20:50 · Score: 1

If you are very quiet and listen really hard, you can almost hear the Content Industry picking up the phone to call Orrin Hatch.
Re:Er by mefus · 2004-10-09 20:58 · Score: 1

The Lewis and Clark Expedition notes are public domain.

--
mefus
In Open Society, GPL Software frees YOU!
Re:Er by jrockway · 2004-10-09 20:58 · Score: 1

Nice sig. And username for that matter... I personally got a job and worked hard to get an iPod, and I think that's the way things should be ;)

Now regarding your reply...

You are right. The Internet is the Internet. I think institutional licensing might be a way to satisfy the content providers. My local library pays $someone so I can come and look at everything in the LoC. Seems like a good use of tax dollars / tuition money to me.

My University library already has most journals back to the 1800s online ("JSTOR"), they pay for that but it's worth whatever the percentage of my tuition I pay for it.

--
My other car is first.
Re:Er by sonamchauhan · 2004-10-09 21:07 · Score: 1

[This is a good point - I'll quote it in full since it's rated 0 due to AC posting...]

> No, we can't... it not be fair to lots of people whose copyrights haven't yet lapsed.

A major problem with copyright law is that congress makes sure that copyrights never lapse. Everytime anything Disney is about to enter the public domain, Disney and the large companies that control all creative material get together and lobby (bribe) congress to extend the term, ensuring nothing created after about 1930 will ever be in the public domain.

That isn't fair to mankind.

No, Disney's arrangements don't seem fair because they seem to have one law created just for them. I thought about Disney when I first posted -- that's why I said that seizing all copyrighted materials and putting them in the public domain "won't be fair to _lots_" of copyright holders, not "all" copyright holders.

Settling scores with Disney this way does evil to most other copyright holders... two wrongs don't make a right. The Disney problem should be addressed separately, and through laws.
Re:Er by rjstephens · 2004-10-09 21:13 · Score: 2, Interesting

Maybe you should have a look at How the music biz can live forever, get even richer, and be loved.
Re:Er by arose · 2004-10-09 21:19 · Score: 1

All the more reason to get them digital ASAP, even if the originals are destroyed in the process.

--
Analogies don't equal equalities, they are merely somewhat analogous.
Re:Er by Anonymous Coward · 2004-10-09 21:24 · Score: 0

And there you have finaly found a legal use for warez dudez.
Re:Er by sonamchauhan · 2004-10-09 21:34 · Score: 2, Interesting

The library does not have to be capable of maintaining the client, just of funding the infrastructure. The client may even be a thin client run by an external company. After all, network connectivity would probably be essential to access the Congress book database (for copyright reasons, the entire database would probably run out of a Google-like government contracted facility somewhere.)

About funding being scarce: after initial seed funding by the government, a library should easily be able to fund infrastructure in the same manner the internet funds itself.

That is because giving the local library the ability to sell any of _260 million books_ to anyone who walks in their door, on demand, with effectively zero inventory costs, adds a _HUGE_ improvement to their basic information dissemination activity which is very valuable to the library's customers. The commission on book sales would earn easily recoup the investment on infrastructure. A library could become a sort of a low cost competitor to Barnes and Noble. Barnes and Noble would probably do the same thing as the library, but differentiate themselves by having a better quality printer, and being able to grant 24x7 access to a bn.com server where electronic copy of the book could be accessed by the purchaser. Ah, and the nice coffee shop, where you can read the electronic copy of your book on a nice LCD screen as it is being printed.

Other advantages:
- knowledge hidden in books would be suddenly visible and searchable
- for most people, reading a book is more natural than reading a screen
- when people buy a printed book, they retain first sale property rights
(unlike DRM'd ebook software and music liceses)
- the library could become a focal point for paper recycling efforts. For eg: as part of a loyalty program, it could issue credits for old printed books that people turned in.

Disadvantages:
- paper consumption
- printer consumption

I think the only loser will be online bookstores that have no mortar component, like amazon.com
Re:Er by loveandpeace · 2004-10-09 21:43 · Score: 1

in fact, the copyright governing library information costs all libraries, public and private, many thousands of dollars every year in subscription fees. The problems inherent in current copyright laws are well summarized in a piece written by Duane Webster, Executive Director of the Association of Research Libraries. Though it is written from a librarian's point of view, the analysis of the impact of the extension of copyright by the Sonny Bono Act is applicable to anyone interested in fair use and matters of public domain and privacy.
Re:Er by quax · 2004-10-09 21:46 · Score: 1

Actually it is not quite clear when and how the libary was destroyed.

It either was - as you suggest - intentionally and therefore truly criminal or merely what we tend to euphemistically call collateral damage these days.
Re:Er by Anonymous Coward · 2004-10-09 22:04 · Score: 0

Okay, so the one percent --an extreme figure to be sure-- in this category can be exempt.
Re:Er by 1u3hr · 2004-10-09 22:09 · Score: 1

but its practical realization may prove to be very difficult. Those "rare books" you were referring to may well be books from the XVIIth, XVIIIth or XIXth century. While not printed on the trashy acid paper from the XXth century, they are nevertheless fragile. Scanning them may be impossible...
A librarian's job is the care of books. They can digitise them without destroying them. They've done it for Egyptian papyrus and the Dead Sea Scrolls, I think they can manage. Basically, if a book can be read it can be scanned (maybe not on your handy home USB scanner though).
some manuscripts are viewed only by special light,
I really doubt whether a single exposure to bright light for a few seconds will produce any harm. But if it were thought to do so, a longer exposre at low light levels (or maybe infra red, UV or whatever gives the best result) would do the job.
Re:Er by hidden · 2004-10-09 23:23 · Score: 1

I agree that Gutenburg's press and all those that followed are important. the jump to the scanner *may* not be so obvious.

The original version of the Domesday book, written in 1086, presumably on some form of parchment, is still more-or-less readable.

The digital version prepared...17 years ago, on 12" video disk is, at least for all practical purposes, unreadable.

don't believe me? check it out:
http://observer.guardian.co.uk/uk_news/story /0,690 3,661093,00.html
Re:Er by caseydk · 2004-10-10 00:04 · Score: 1

This is nothing new.... it's called the Audio Visual Prototyping Project and I worked on it from July '01 until March '03.

Yes, EVERYTHING was going to go online and copyrighted stuff was limited to those who were physically present in the library. They public stuff - like Thomas Edison's first motion pictures - are already online through the American Memory project.
Re:Er by cortana · 2004-10-10 00:15 · Score: 1

People always seem tout the Domsday Book as an example of how digital technology can never replace good old pen and paper.

By the time the original Domesday book was written, the technology used had been under refinement for what, six, seven thousand years? For how long had the BBC micro been around?

If you wanted to repeat the project today, would you pick an obscure new data storage format that might not be around for five years, let alone a thousand (say, Blu-ray[0] DVD?

I don't think so. You'd transcribe the data into plain old ASCII, perhaps UTF-8 if you wanted to preserve the original characters. Maybe make a version available that's marked up in XML if you want computers to parse/reason about the data within. Cryptographically sign the data, so that people can verify that their copy they hasn't been modified by some prankster, and make it available for download!

[0] That's another really annoying thing. The word "blue" has an E!! Damned marketing departments...
Re:Er by Anonymous Coward · 2004-10-10 00:30 · Score: 0

Why should anyone produce any great literature if others are just going to be able to get it for free? What's the incentive?

Some people would consider being able to reach the widest audience possible with your message a great incentive. I think your small-minded attitude reflects more on you than on society.
Re:Er by Gadzinka · 2004-10-10 01:03 · Score: 1

This is one more reason that the whole basis behind IP law needs to be reevaluated.

No.

The basis for copyright in US[1] is the greatest piece of philosophy regarding the development of knowledge around the world.

It's the current implementation of "IP laws"[2] that perverted those ideas, deprived all society from access even to public domain knowledge and made whores from people who actually create copyrightable works.

Robert

[1] the letter and the spirit of the original copyright regulations, including constitutional "to promote progress in useful arts and science...", as well as the whole discussion by Founding Fathers[3] regarding copyrights and related laws
[2] there's actually no such thing as "Intelectual Property": copyrights, patents and trademark aren't "property", they are limited-time monopoly
[3] the way I udnerstand it, events and discussions leading to creating a law, constitutional amandment etc in US are as important as the letter of the law in order to decide the hard, borderline cases

--
Bastard Operator From 193.219.28.162
Re:Er by 4of12 · 2004-10-10 01:45 · Score: 1

it's hard to imagine how artists will be paid

I agree.

The magnitude of the artistic catastrophe, the drop-off in the production of great works, the cultural legacies we leave for further generations, would all suffer immeasurable and irreparable harm if the most highly-paid artists, such as Britney Spears, were to suffer getting paid like the other 95% of performers.

--
"Provided by the management for your protection."
Re:Er by Anonymous Coward · 2004-10-10 01:55 · Score: 0

We're specifically talking about the Library of Congress

If you read the grandparent, you'll see that the person was refering to public libraries in general, not the LOC.
Re:Er by The+Only+Druid · 2004-10-10 02:35 · Score: 1

Just a small quibble: you can say there's no such thing as intellectual property all you want, but countless cases quite explicitly refer to trademarks, copyrights and patents as such. Not to mention the fact that they're taught as such in law school.

I think its time to stop protesting the phrase "intellectual property" and just start dealing with the actual law...

--
"Stumble before you crawl"
Re:Er by the_2nd_coming · 2004-10-10 02:56 · Score: 1

so put a lock on the content. when some one is reading it then no one else can. you can have a wait list and when your turn comes up it can email you an ebook formated version and after 30 days, you can't read it any more.

--

I am the Alpha and the Omega-3
Re:Er by antiMStroll · 2004-10-10 03:13 · Score: 1

Past societies passed laws, debated for centuries and enacted far more extreme measures regarding devils and witches. Quite likely even the Druids. :) Those concepts weren't validated by it either.
Re:Er by Anonymous Coward · 2004-10-10 03:42 · Score: 0

True, but...

(1) Analog and digital is better than analog alone. In this simple scenario, the failure of both would be catastropgic, but the failure of both is less likely than the failure of one or the other.

(2) Once you convert from analog to digital, it's a whole lot easier to copy the digital, assuming DRM is not an issue.

(3) Different groups, individuals, etc. will be keeping copies for their own purposes. If we lose the "official" original and digital, for many things, somebody somewhere may have a copy (to be checked against a library of md5sums, or something better). I believed the BBC was able to recover a few episodes of Doctor Who because extra copies were floating around in Australia, and/or the US, etc. (I'm not sure of the details.)

As for file formats changing, we simply need to pick a well-documented commonly known simple file format. Heck, for those materials that use a lot of storage with a simple file format, and thus may require more complex compression, keeping the file format with the document (in each instance, or even one out of n instances, for some not too big n), might be worth the overhead.
Re:Er by khallow · 2004-10-10 03:43 · Score: 1

Since this process is handled by people trained to respect copyright (i.e. the librarians), it is a win-win for everyone.
A professional copy center (eg, Kinko's) would be even better since they receive the sufficient copyright training and have better publishing infrastructure.
Re:Er by Anonymous Coward · 2004-10-10 04:03 · Score: 0

Don't want someone to know it? Don't write a book about it!

I agree with this statement to a certain extent, however there seems to be a disconnect brewing.

We are reaching a point in our "information age" where people's knowledge is their only source of competitive advantage. There is simply too little "margin" left in product development, manufacturing, and distribution for the niche player to stand very long before some behemoth with political connections stomps on the market. Yet we continue to freely share the value, while borrowing to fill our financial gaps. That just doesn't make sense. Don't give away the farm to feed the chickens!

Ever wonder why the patent system is so messed up during the IP rush, why everyone is writing books instead of "doing" what they say is right, and coffee shops are full of people drawing ideas on napkins, while drinking $10 lattes (only the coffee purveyor is "doing" well)? Why is everyone getting a real estate brokers license? Recruiting for multi-level marketing garbage? Trying to be your "financial advisor" (hey folks, they're salesmen, plain and simple - they don't know shit about the financial markets)?

When was the last time someone asked you for some advice, knowing full well they had no attention of paying for it? Someone asked you to write a piece of code to solve a "brick wall" type problem, you emailed them that solution and never heard from them again! You did a bunch of due diligence for a "friend" on a business idea, then they took that work, obtained some financing, then asked you if you wanted to invest. Didn't you ALREADY invest?

You are the smart ones, the ones with the ideas, the ones with the value! Embrace it, and execute on it!
Re:Er by The_Bad_Bob · 2004-10-10 04:20 · Score: 1

Maybe libraries are easy to get to where you live, but not where I do. Not every one lives in a big city. The nearest library is about 30 minutes away for me. And that's a rather small one, too. To find a big library that has what I want to find, it's at least 2 to 3 hours away.
Re:Er by 1u3hr · 2004-10-10 04:23 · Score: 1

I wrote:
>>We're specifically talking about the Library of Congress

Somne AC wrote:
>If you read the grandparent, you'll see that the person was refering to public libraries in general, not the LOC.

I was talking about the original post. What we are talking about. OK?
Re:Er by Dun+Malg · 2004-10-10 04:39 · Score: 1

Just a small quibble: you can say there's no such thing as intellectual property all you want, but countless cases quite explicitly refer to trademarks, copyrights and patents as such. Not to mention the fact that they're taught as such in law school.
He means "no such thing as IP" as in "fails the definition of property". It's obvious to all of us that the term has a legal meaning. The point is, the term is intentionally mileading. It was concoted in the 19th century as a catch-phrase for those arguing for further extension of copyright terms.
I think its time to stop protesting the phrase "intellectual property" and just start dealing with the actual law...
So long as some people still think it's possible to own a song, a story, or a cartoon mouse, it will remain necessary to remind them that the term "intellectual property" is misleading. Copyright is simply a system that allows us to treat something which is clearly not property as if it was property. The works will always be essentially free because there's no way to cage an idea. For example: Michael Jackson didn't buy all those Beatles songs; the songs were already available to essentially everyone. What he bought was the right to copy, which is a government-granted monopoly.

--
If a job's not worth doing, it's not worth doing right.
Re:Er by EvilSporkMan · 2004-10-10 04:45 · Score: 1

Oh, come on...you know this would make the greatest torrent ever.

--
-insert a witty something-
Re:Er by Anonymous Coward · 2004-10-10 05:09 · Score: 0

Copyrighting material has nothing to do with giving away copies for free. Any personal works, including web sites or blogs, are automatically copyrighted by law. The problem exists when people take the copyrighted material and reuse or modify it without the approval of the author or copyright holder. As far as the Library of Congress is concerned, however, this is not really an issue. If any copyright holder of material in the Library does object, however, it could simply be removed from the online catalog.
Re:Er by shalla · 2004-10-10 05:12 · Score: 1

A library could become a sort of a low cost competitor to Barnes and Noble.

I work for a public library and even I think that's a terrible idea. Barnes and Noble has a right to make money without having to compete with government-subsidized pseudo-businesses.

Libraries exist as a sort of co-op. They purchase items or access to information that is shared by the public. If you want information or to read a book, go to the library. If you want to OWN the book, go to a bookstore.

I'd also say that the concept of really cheap books because of lack of physical inventory isn't guaranteed. It certainly hasn't panned out with magazines or academic journals because the largest costs associated with publishing are for editing and marketing. If you doubt this, read up on the cost of electronic science journal subscriptions for academic libraries.

Regarding the tech support aspect, many libraries have full time tech staff who maintain their computers and equipment (mine does), but many do not and limp by with one staff person who knows how to fix a few computer problems. It generally depends on the affluence of the area. I can tell you from our luck with photocopy machines and self checkout machines though that if you expect an external company to provide and maintain a machine, don't expect it to function well or often. We have more luck with computers because if something goes wrong, there are 4 of us who can fix it.
There's also a big copyright issue with the whole concept of scanning in the LoC collection. With physical library items, only one person may have the item at a time, so there's no copyright issue. (No copies are being made.) With a digital version, multiple people can access it at one time. With electronic databases, libraries generally pay enough for access to cover copyright issues and have a limit for access at the same time. With LoC, that doesn't sound possible.

Finally, the upkeep cost for scanned items is huge. A lot of academic sites have been unable to make a go of funding and disappeared. I'm not sure this would work well.
Re:Er by LandPremo · 2004-10-10 05:58 · Score: 1

What are you talking about? library material : limited by copies :: internet material : limited by connections library : xerox :: internet : cut & paste The analogies works fine.
Re:Er by mcrbids · 2004-10-10 06:13 · Score: 1

The digital version prepared...17 years ago, on 12" video disk is, at least for all practical purposes, unreadable.

Don't confuse the issue of replication of vital information with the methods by which this replication is done. The Internet has created a giant, cross-platform information replication device, making the issues of portability and transmission all but disappear, so, on the immediate, it's not the issue that it used to be.

If you RTFA, Britain developed custom 12" video disks for this application. Developing custom, high-tech solutions for long term archiving is a dubious idea, at best.

From the article:
'That means we have to find a way to emulate this data, in other words to turn into a form that can be used no matter what is the computer format of the future. That is the real goal of this project.'
Somebody's thinking about using a standardized document format...

--
I have no problem with your religion until you decide it's reason to deprive others of the truth.
Re:Er by 2old2rockNroll · 2004-10-10 07:08 · Score: 1

A librarian's job is the care of books. They can digitise them without destroying them. They've done it for Egyptian papyrus and the Dead Sea Scrolls, I think they can manage. Basically, if a book can be read it can be scanned (maybe not on your handy home USB scanner though).

While true, this wouldn't be done by the robotic scanners proposed in the article, which makes the $260 million less realistic. Searchability is another factor not mentioned. What format for the documents? PDFs, while probably best, don't lend themselves to easy searching and indexing.
Re:Er by Anonymous Coward · 2004-10-10 07:17 · Score: 1, Informative

I have an idea.

This idea is mine, I own it. I invented it. It took a lot of effort for me to produce. When other people use my idea to create something else based on my hard work, I'd like something in return. Money, perhaps.

What is wrong with this? Why do you feel that I should give my idea away for free? When I produce a physical object, such as a chair, I am not obligated to share that chair with the world without compensation. Why do you think I should give away my ideas without compensation?

I think IP law is important. Getting rid of it is a step backwards for humanity.
Re:Er by 2old2rockNroll · 2004-10-10 07:24 · Score: 1

Interesting concept, though. It's okay if I go to the Library and look it, but not if I look at it online? Why? ( I guess I know the answer; in real life only one person can see it at a time. Online, everyone on Earth can see it at the same time.

Really, it's only a matter of degree. There are thousands of libraries all serving up the same books to many users. Web servers can only serve so many connections at once. The real difference is the ability to copy the digital content and redistribute it. However, the vast majority of books in the LOC should be past the copyright period anyway. Perhaps they could make the majority of people happy by placing the works that are out of copyright online.
Re:Er by sonamchauhan · 2004-10-10 07:58 · Score: 2, Interesting

> Barnes and Noble has a right to make money without having to compete
> with government-subsidized pseudo-businesses.

Sure they do. But I imagine Kinkos and B&N would have access to the same LoC book database, and would be able to print books for purchase too (otherwise it would be unfair to them). This just puts libraries on an even footing - libraries that don't want to sell books to the public could stay that way.

> If you want information or to read a book, go to the library.
> If you want to OWN the book, go to a bookstore.

Yup, that's the current model. Lets break down what you said:

Go to a library for this:
1. information about a book
2. to read a book

Go to a bookstore for this:
3. To own a book

One of the key reasons people use libraries is the library database, and the assurance that a book in the library database is probably "in stock" for lending out. Now if this proposal goes ahead, both Kinko's and B&N will suddenly get #1 - the best library database there can be. With #3 becoming more attractive, (book price reduction due to a larger market -- see below), one of the USPs of libraries simply isn't so anymore.

> I'd also say that the concept of really cheap books
> because of lack of physical inventory isn't guaranteed.
A book typically has a single fixed cost at the start (the authoring). After that, the more copies you sell, the more the profit.

> It certainly hasn't panned out with magazines or academic journals
That's because a journal is different from a typical book - each journal issue is like a book that comes out each month with the authoring costs paid each time, but sold to a very limited market.

Since this proposal would broaden the market immensely both books and journals:
- size of the print run is no longer an issue
- inventory is no longer an issue
- royalties keep flowing in for longer durations ... the costs for all categories of information - both books and journals - would come down.

> There's also a big copyright issue with the whole concept of scanning in the LoC collection.
> With physical library items, only one person may have the item at a time,
> so there's no copyright issue. (No copies are being made.)
> With a digital version, multiple people can access it at one time.

Many copyright holders _want_ this to happen and are already doing this.
For instance, you can go to Amazon's A9.com site and search on Gandhi's wife:
http://a9.com/gandhi%20kasturba
(be sure to click on the books button on the left - this returns matches within a book)

Now if entire books were scanned into the LoC database, a canny person could type in the name of a book, and then "page 1", "page 2"... and so... to essentially read the book without paying for it.

One way to secure copyright against behavior like this is by restrictions that can be imposed on both the server and the clients that are searching the book (say, the client cannot view more than 30 words surrounding each match). Amazon's restrictions seem to be that they just scan in the table of contents, not the entire book.

> Finally, the upkeep cost for scanned items is huge.
Well, that scanning would only be done once, using government funds to scan it into the LoC database. The only thing a library would need would be network access to the LoC database (just like they currently do with some electronic journals and databases.)
Re:Er by BitterOak · 2004-10-10 08:59 · Score: 1

Many of the libraries in the country carry copyrighted material. You can walk in and peruse the books at your leisure, for free.
Yes, but what you are forgetting is that despite what many media companies would have you believe, copyright is exactly what its name suggests: a right to copy. When you walk into a library, borrow a book, read it, then return it, no copies are being made, so copyright law is inapplicable. Of course, you can also photocopy the book, which may or may not be legal depending on the circumstances, but it is you the individual that is making the copy, and assuming any legal risks.
If a library offered books for digital download, then each time someone clicks the appropriate link on their browser, a copy of the work is made and transmitted to your computer. Millions of people can thus read their own personal copy simultaneously. In effect, the library is acting as a publisher/distributor of content, not a lender. Lending is not covered by copyright law, distribution is.

--
If I can be modded down for being a troll, can I be modded up for being an orc, or a balrog?
Re:Er by Reziac · 2004-10-10 11:15 · Score: 1

So you set it up so the server only allows one connection at a time to any given work. This can be done, yes? that way, only one person could view any given book at any given moment, exactly like in a hardcopy library.

--
~REZ~ #43301. Who'd fake being me anyway?
Re:Er by Anonymous Coward · 2004-10-10 11:37 · Score: 0

Surely P2P networks _should_ be perfectly legal, but on the other hand if copying music would become so easy that you could listen to any song you'd like, at any given time without paying for it, it's hard to imagine how artists will be paid (and please don't give me the "they'll have to do live performances to make money" bs).

Okay. I have much worse bs: What makes you think they should be paid? Perhaps the time of music commercialization as we know it, is going to be over. Is it a bad thing?
First, we aren't going to run out of people who do it for free. Second, with progress, it's natural that some jobs just cease to exist. Time to look for a new one.
Re:Er by Kiryat+Malachi · 2004-10-10 13:05 · Score: 1

Honestly, the important searchability is author/title, not so much full text searching. I'd bet on PDF files.

--

---
Mod me down, you fucking twits. Go ahead. I dare you.
(I read with sigs off.)
Re:Er by siriuskase · 2004-10-10 14:13 · Score: 1

Firstly, you can't patent, trademark, or copyright an idea. The only way to keep it a secret is to keep it in your head or hidden somewhere. You can only patent a process or a device that uses your idea, and you can only copyright an expression of your idea.
Now that we've got that out of the way, we can get to what I think is your real point - fair compensation for something that most of us agree is valuable. Your ideas aren't valuable until they are used (inventions) or expressed (in writing or as a performance). Even then they become valuable only when someone other than yourself is exposed to them and values them. See, the value isn't derived by the creator's labor, but by how much other people appreciate it. Otherwise, I'd take http://slashdot.org/~siriuskase/journal/ and my garageband files, present them to the public and collect my compensation.
As more people are exposed to your idea and appreciate it, the more it spreads, the more it increases in value, whether or not you get compensated. Our tradition has been to grant a limited monopoly that you can use to get compensation from those who value your work. At some point, It was more or less arbitrarily decided that after X years, you've been compensated. But that isn't the only way it can be done.
BTW, if you clicked on the link to my journal, you owe me!!! Why, because I said so. Similarly, if you download music, some would have you pay for it even if you decide 3 seconds into the playback that it is worthless.

--
If you must moderate, please moderate as irrelevent, not something bad, because I'm sure someone will find this interest
Re:Er by tehdaemon · 2004-10-10 14:59 · Score: 1

Because, for an individual, an idea is not ownable.
Ownership requires some form of control. If you can't excercize this minimal amount of control, then you cannot own the thing in question. (disagree? answer this question then, what does it mean to own something that you have no control over?)
For an idea, once someone else gets that same idea, whether you told him, or they developed it independantly, you have lost control. Therefore ideas are not ownable for individuals. Since a government can prevent people from using some ideas, governments can own some ideas. (usually a bad idea, but...)
Should you give your idea away for free? Not unless you want to. But just be aware, once you give it to anyone else, they now own the idea as much as you do. (as much, not less, not 'under you' etc.) The complication to this idea is IP law. But that is not individual ownership. It is the government taking ownership of the idea, by force, and granting priveledges to others regarding that idea. Make no mistake though, this is the government owning the idea, not the copyright/patent holder.
I am not saying that owing ideas is wrong here. That is a separate argument. People can't own ideas, just like things falling down, not up, or water being wet. It is simply the way things are.

--
Laws are horrible moral guides, moral guides make even worse laws.
Re:Er by 2old2rockNroll · 2004-10-11 10:44 · Score: 2, Informative

Honestly, the important searchability is author/title, not so much full text searching. I'd bet on PDF files.

I guess that depends on what you're looking for. I'd like to be able to search on quotes or keywords and authors at the same time. If I already know the exact source of the information I'm looking for, I can probably find it using other resources.
Re:Er by Kiryat+Malachi · 2004-10-11 13:05 · Score: 1

Honestly, for a lot of the stuff that's in the LoC. it isn't findable elsewhere. They have a lot of rare/unique volumes.

Keywords should be pretty doable also, since that's likely already going to be in the card catalog, but quotes would be a huge step up in effort required.

--

---
Mod me down, you fucking twits. Go ahead. I dare you.
(I read with sigs off.)

Can't do that. by Pig+Hogger · 2004-10-09 17:00 · Score: 5, Funny

This would violate the publishers' god-given right to milk their "creations" until the heat-death of the Universe.

Re:Can't do that. by craXORjack · 2004-10-09 17:20 · Score: 2, Interesting

Yes, the article does say 'all 26 million books in the US Library of Congress.' I think all the books should be scanned. Imagine if a terrorist detonated a nuclear bomb there and destroyed the largest library in the world. What a loss that would be. But just because we would have a backup of the data doesn't mean they must allow full access to copyrighted works. They could release DVDs of a subset which includes only information in the Public Domain. It would be a huge boon for Project Gutenberg though each scanned work would still need to be transcribed to text.

--
Liberals call everyone Nazis yet they are the closest thing to it.
Re:Can't do that. by Mulletproof · 2004-10-09 17:28 · Score: 1

By heat death of the universe, you must mean that whole 70 years after the author's death. I mean, God forbid they or their childern are able to make money off their own creations and ideas.

Information is a commodity just like any other good or service. I can only imagine your surprize when you realize that the internet hasn't changed that fact.

--
You need a FREE iPod Nano
Re:Can't do that. by operagost · 2004-10-09 17:33 · Score: 2, Interesting

You'll find that many here (including me - and I'm one of the most conservative) find that copyright period oppressively long. Just because you wrote one useful book shouldn't entitle you to a generation of monopoly on its art and ideas. The copyright period was once much shorter, and that encouraged derivative works.

--

Gamingmuseum.com: Give your 3D accelerator a rest.
Re:Can't do that. by InfiniteWisdom · 2004-10-09 17:38 · Score: 4, Funny

until the heat-death of the Universe.

Hey its still a finite time
- Walt Disney
Re:Can't do that. by Anonymous Coward · 2004-10-09 17:39 · Score: 1, Interesting

you must mean that whole 70 years after the author's death.

Wait, don't you mean 20 years after their death? Oh wait, yesterday it was 50 years. Today its 70 years. Every time mickey mouse is up for grabs, that number gets bigger.

And of course, we're completely ignoring that grandparent was talking about the publisher who maybe fronted some money, but otherwise didn't do jack to earn this sort of perpetual power over "their" creation.
Re:Can't do that. by Waffle+Iron · 2004-10-09 17:47 · Score: 2, Insightful

I mean, God forbid they or their childern are able to make money off their own creations and ideas.
Their children can go out and get their own damned jobs. They would then be making a productive contribution to the economy.
My grandpa was a farmer who died over 50 years ago. Since I don't get to collect royalties on the corn he grew in the 1930s, I've had to work to produce my own income. Imagine that.
Re:Can't do that. by Senjutsu · 2004-10-09 17:48 · Score: 4, Interesting

you must mean that whole 70 years after the author's death.

You must mean currently. But we all know that as soon as anything major (like Steamboat Willy) comes close to coming out of copyright, we'll see Congress extend the term of copyright yet again, thanks to 'encouragement' from Disney.

Copyright terms are nigh on infinite in fact, if not in law.
Re:Can't do that. by BenFranske · 2004-10-09 17:58 · Score: 1

Personally, I think copyrights are far too long. I think once books go out of print for some number of years they should be put in the public domain. As a collector of 1950s/60s era literature it annoys me to no end that I can't get copies of books from the period except via ebay auctions and overpriced bookstores. These are books that probably will never be reprinted and the publisher is likely to have forgotten about yet they remain protected by copyright law because Disney can't give up a stupid mouse! It would be great if Mickey became public domain like he should have, but right now I'll settle for automatic expiration of copyrights that are not renewed every X years (with a hefty fee that increases each year to discourage publishers from renewing things that aren't making them a bunch of money)
Re:Can't do that. by laejoh · 2004-10-09 18:21 · Score: 1

As punishment, Krikkit and its sun were sealed into an envelope in which time passed extremely slowly. The envelope was to reopen long in the future, when the rest of creation had died away, so that Krikkit could live out the remainder of the universe's life in the manner they wished.

We can't have that, can we? All those people from Krikkit having access to the library of congress...
Re:Can't do that. by Saeger · 2004-10-09 18:27 · Score: 2, Funny

How did you get on my /.friends-list with an attitude like that?
The internet has stripped away the convenient medium(s) that used to contain an inherently scarce message that could physically command the price you asked for. The new reality of the situation is that either you think DRM + DMCA can and should be used to keep doing things the old way, by keeping a decades-old instance of information artificially scarce, or you think-- like millions already do --that information is cheap, and the value lies in the inherently scarce service of performing or creating NEW information. New systems will emerge to fund creation and promote progress, but it won't be the "right to profit" gravytrain of the past.
--

--
Power to the Peaceful
Re:Can't do that. by UserGoogol · 2004-10-09 18:46 · Score: 1

Information is different, because it is not inherently scarce. Any scarcity it possesses is artificially imposed by the government in an attempt to ensure the profitablility of the creation of new ideas. However, you must remember to balance the benefits of the profability with the costs of this artificial scarcity.

--
"Never attribute to malice that which can be adequately explained by stupidity." -- Hanlon's Razor
Re:Can't do that. by maxpublic · 2004-10-09 19:56 · Score: 1

By heat death of the universe, you must mean that whole 70 years after the author's death.

According to the Constitution the idea of granting copyright in the first place is to allow the *author or creator* a chance to profit off his work. How exactly does an author profit off of continued copyright if he's dead?

Doesn't say a damned thing about his grandchildren profiting, by the way.

Max

--
My god carries a hammer. Your god died nailed to a tree. Any questions?
Re:Can't do that. by flyingsquid · 2004-10-09 21:22 · Score: 1

So what happens if they find a way to fix ol' walt and thaw him out?
Somehow I suspect that if that would keep the Mickey Mouse copyright out of the public domain, then that just might happen
Re:Can't do that. by geminidomino · 2004-10-09 22:43 · Score: 2, Insightful

Just because you wrote one useful book shouldn't entitle you to a generation of monopoly on its art and ideas

death+70 years would actually be about 4 generations (if you include the author as the first).
Re:Can't do that. by rice_web · 2004-10-09 23:47 · Score: 1

The Sunny Bono copyright act served to modernize the United States copyright law. The United States was matching our law to EUROPE's.

--
The Political Programmer
Re:Can't do that. by Anonymous Coward · 2004-10-10 00:21 · Score: 0

given that the entire universe appears to be accelerating away from us, I can't help but wonder if something similar already happened - to Earth. Maybe the aliens were so disgusted by our sick attitude of "information wants to be owned" that they sealed us up in a region of inflationary spacetime...
Re:Can't do that. by aboyko · 2004-10-10 01:20 · Score: 1

Imagine if a terrorist detonated a nuclear bomb there and destroyed the largest library in the world. What a loss that would be.

What? Sir, don't be absurd, sir. The Library of Congress is over one block away from the US Capitol. That's several hundred yards. Why, that's more than far enough away from any sort of potential terrorist target. Nothing to worry about!
Re:Can't do that. by Anonymous Coward · 2004-10-10 01:47 · Score: 0

According to the Constitution the idea of granting copyright in the first place is to allow the *author or creator* a chance to profit off his work.
Nope. It's to promote the Progress of Science and useful Arts. The Constitution doesn't say anything about profit.
Re:Can't do that. by khallow · 2004-10-10 03:40 · Score: 1

Right, but it would have been better for both the US and the EU to modernize their copyright law by lower the copyright duration, say to 50 years after the creation of the work no matter if the author is alive or dead.
Re:Can't do that. by jvalenzu · 2004-10-10 05:00 · Score: 1

I'm glad to see Bush has put US copyright statues to the global test.
Re:Can't do that. by cdrguru · 2004-10-10 05:45 · Score: 1

Naa. The problem is that there are two sides to this issue - the side you appear to be on, and the side of the multinational megacorp.
No, I'm not talking about Disney here. I mean a company like Wal-Mart or Sony.
What do they have to do with this? It's simple. Not everyone has your net access when they want information. When everyone has utterly ubiquitous, portable high-speed connection to the Internet this won't be an issue any longer. But, until then, physical distribution of music, books, magazines and everything else will continue. Who gets paid for this physical distribution?
Today, because of copyright laws and international treaties the creator has a chance of getting something out of it. There are flaws, and it isn't perfect, but the creator has a chance. Without this, Wal-Mart will come in an out-distribute the creator and the creator's agents. Now Wal-Mart (or Sony or pick your other favorite megacorp) gets the content for free and gets 100% of the profits from distribution.
Until that little niggling detail is sorted out, we need copyright and this artificial scarcity. The alternative is just makes Wal-Mart own everything.

well, at least to those who can read English by ForestGrump · 2004-10-09 17:00 · Score: 2, Funny

and to those who can't, they can copy and paste the text into a translator.

So yes, it would benefit society as a whole.

Grump.

--
Is it true that more people vote for the winner of American Idol, than vote for the president? -Ali G.

Re:well, at least to those who can read English by j1m+5n0w · 2004-10-09 17:22 · Score: 1

and to those who can't, they can copy and paste the text into a translator.

Or a speech synthesizer (assuming the text is available as text and not images) such as festival, if your vision isn't very good - also something you can't do with the dead tree version.
-jim
Re:well, at least to those who can read English by flyneye · 2004-10-09 17:25 · Score: 1

Dont worry about it the author of this article,Mark_Uplanguage will be happy to translate or fund the translation.I'm quite sure he didnt mean ,me as a taxpayer should have to pay for those who couldnt be bothered to learn english to foot the bill for their laziness.

--
*Repent!Quit Your Job!Slack Off!The World Ends Tomorrow and You May Die!
Re:well, at least to those who can read English by zloof · 2004-10-11 07:16 · Score: 1

So, does it confuse the crap out of anyone else why the BBC has an article about the _United States_ library of congress?
Maybe it's just me, but apparently it wasn't big enough news for the high and mighty news channels here in the US hu?

If Bill Gates by Anonymous Coward · 2004-10-09 17:01 · Score: 1, Insightful

wanted to do something really important and contributive, he would fund this.

Re:If Bill Gates by Anonymous Coward · 2004-10-09 17:05 · Score: 2, Insightful

>wanted to do something really important and
>contributive, he would fund this.

Yeah, not like funding the B&M Gates foundation is doing anything worthwhile with all that immunization, AIDS research and anti-poverty work.

Darned, useless Microsoft profits. Helping people. Imagine that!
Re:If Bill Gates by jkmiecik · 2004-10-09 17:10 · Score: 1, Informative

Let's realize, this is slashdot. MS is evil, no matter what. You have to ignore the fact that Gates is one of the most philantrophic billionaires (relatively speaking on the amount) of all time. That NEVER gets press here, because we can't say anything positive for M$!

Crap, there goes my karma.
Re:If Bill Gates by aussie_a · 2004-10-09 17:28 · Score: 1

But then he'd DRM it.
Re:If Bill Gates by shubert1966 · 2004-10-09 17:34 · Score: 2, Insightful

Don't feel slammed because they see an easy target. Your sentence is grammatically balanced; funding such a project WOULD be really important and contributive. It was your detractors who sought to insert injustice, and I think they are caffeinated, or worse, sleep deprived. You and I know Bill Gates isn't responsible for the lame-ass products his company delivers/promises. He just owns the company. It's the lame-ass M$ engineers who are to blame.

Bill Gates does plenty of worthy things with the PHAT $$$ his company has liberated from millions of (l)users and this would be a fabulous project for he and Iron Mountain.

--
Stuff that matters.
Re:If Bill Gates by mingot · 2004-10-09 19:18 · Score: 1

And this is a great case where real, working DRM would come in exremely handy. Imagine if, yes you could get a copy of a book free of charge from your government, but that it would only last two weeks. Imagine that the government has purchased a number of licenses (copies) just like every other library and that the authors are still paid for their works. A public library using DRM would not be screwing you out of anything (hey, it's FREE) and would still let creators profit from works they produce.

Before anyone says it, though: Yes, copyrights do last WAY to long. Yes, that needs to be fixed and no amount of Disney releasing patents for using air to launch fireworks is going to change the fact that they buy an extension from the US congress everytime they are in danger of losing micky.

For once by theskeptic · 2004-10-09 17:01 · Score: 2, Funny

a Library of Congress jokes will be on topic.

Storage by Neon+Spiral+Injector · 2004-10-09 17:02 · Score: 4, Funny

How data much storage would this require? Could someone give it to me in laymen's terms?

Re:Storage by jrockway · 2004-10-09 17:05 · Score: 4, Funny

About 1.0003 libraries of congress.

--
My other car is first.
Re:Storage by SunPin · 2004-10-09 17:05 · Score: 2, Funny

About 750 million cubic hogsheads.

--
Laws are for people with no friends.
Re:Storage by DrMrLordX · 2004-10-09 17:07 · Score: 1

The article itself said around 1 terrabyte. Strangely, that doesn't seem like anything terribly large these days.
Re:Storage by Anonymous Coward · 2004-10-09 17:07 · Score: 0

Sure. One Library of Congress is roughly equivalent to 2300 copies of Gigli on DVD, or 4600 episodes of M*A*S*H.
Re:Storage by Ghostgate · 2004-10-09 17:09 · Score: 4, Funny

This will require 1.28 Libraries of Congress to store. The overhead is for all of the faulty copy protection to be added, which a 13-year-old somewhere in Europe is already working on cracking.
Re:Storage by Chrispy1000000+the+2 · 2004-10-09 17:10 · Score: 0

Actually, your closer than you think. I doubt they would actually re-type all that into an online version, so can you say PDF? So can you say ~!40 LoC's worth of data? Gotta love bloat. ;)

--
Sig
Re:Storage by Anonymous Coward · 2004-10-09 17:10 · Score: 0

How data much storage would this require? Could someone give it to me in laymen's terms?
Interestingly, around 0.5 Libraries of Congress.

Consider were the entire library of Congress is converted to digital form. At that point, the Library of Congress contains both the original books PLUS the digital copies. Therefore, the entire Library of Congress has grown by a factor of two, with the digital content comprising merely 50%. QED.
Re:Storage by Big+Bob+the+Finder · 2004-10-09 17:11 · Score: 3, Informative

About ten terabytes. Or maybe 20 terabytes. Or maybe as much as 3 petabytes.
Those first two estimates are based on the text content alone. If the graphical contents of those books were rendered into digital format. The third one assumes maps, photographs, sound recordings, etc.
Re:Storage by BorgCopyeditor · 2004-10-09 17:11 · Score: 2, Funny
Let's see:
engaging "3-2-1 Contact" mode...
- If each byte of data were the size of a grain of sand, the LOC archive would be roughly the size of Laguna Beach!
- If each byte of data were the thickness of a hair on a fly's ass, the LOC's collection, laid side by side, would be over 6 feet long!
- If each byte of data were worth $0.01 U.S., the LOC database would rival the gross national product of Uzbekistan!
--
Shop as usual. And avoid panic buying.
Re:Storage by Anonymous Coward · 2004-10-09 17:13 · Score: 0

Consider were the entire library of Congress is converted to digital form. At that point, the Library of Congress contains both the original books PLUS the digital copies. Therefore, the entire Library of Congress has grown by a factor of two, with the digital content comprising merely 50%.
Ahh, but you have to consider that if you were to scan the entire Library of Congress, and the resulting material was then made PART of the Library of Congress, you have created an infinite series. You then have to scan the scanned files, in order to completely scan the Library of Congress. And then the scans of the scans... Unfortunately, this series does not converge.

The correct answer, then, is that the first scanned version of the Library of Congress will require 0.000 Libraries of Congress of storage. The complete scanned Library of Congress, however, will never be finished and will require an infinite number of Librarires of Congress.
Re:Storage by ravenspear · 2004-10-09 17:13 · Score: 1

If it were stored as text (scanned in with OCR), then you might be able to assume 5k or so per page. Sometimes it's lower than that but it depends.

Then multiply that by the number of pages in a book. I don't know what the average is but just taking 200 pages as a WAG that would mean 1 MB per book. 26 million books would then be 26 million MB == 26 TB.

If on the other hand the pages were stored as images, I think you can safely assume a 10-20x increase. If each page was say a 100kb JPEG, then that would be over 500 TB using the 200 page average for a book.
Re:Storage by Anonymous Coward · 2004-10-09 17:14 · Score: 0

I believe they used to say that the library of congress would consume 10 terrabytes, give or take a bit. Considering how fast and how often new books are published today, I wouldn't be surprised if that amount has almost doubled already.
Re:Storage by Anonymous Coward · 2004-10-09 17:16 · Score: 0

The complete scanned Library of Congress, however, will never be finished and will require an infinite number of Libraries of Congress.
Yes, but realize that the Library of Congress will also require precisely one Library of Congress, since the two series grow at the same rate and are proportional by a factor of one. Therefore, this can also be used as a proof that one equals infinity, which is false, and therefore the Library of Congress does not exist.
Re:Storage by Anonymous Coward · 2004-10-09 17:19 · Score: 0

therefore the Library of Congress does not exist.
Indeed. And if the Library of Congress does not exist, then in fact ANY number is the correct answer, because the total content is therefore zero, and zero times any number is zero!
Re:Storage by TrevorB · 2004-10-09 17:19 · Score: 1

Kudos for refraining from telling the obvious joke and going for the more subtle one... :)
Re:Storage by Anonymous Coward · 2004-10-09 17:22 · Score: 0

the Library of Congress does not exist, then in fact ANY number is the correct answer, because the total content is therefore zero
It's rather sad that it will cost a full $260 million to scan absolutely nothing, but at the same time it makes sense as a large part of that will no doubt be expensive high-speed scanning machines. And those will be a fixed cost regardless of how much needs to be scanned.

That's the government at work I guess.
Re:Storage by operagost · 2004-10-09 17:35 · Score: 4, Funny

Don't use a Pentium chip next time.

--

Gamingmuseum.com: Give your 3D accelerator a rest.
Re:Storage by value_added · 2004-10-09 18:18 · Score: 1

"About ten terabytes. Or maybe 20 terabytes. Or maybe as much as 3 petabytes."

Err ... that would make a Really Big(TM) Word(TM) document, wouldn't it?
Re:Storage by tabrisnet · 2004-10-09 18:21 · Score: 1

Given the lossiness of JPEG compression, and the way they artifact, I doubt you could use JPEG either. I don't think even B&W JPEG. This is yet another case for PNG, probably. Possibly reducible to 1 bit, with careful filtering.
Re:Storage by Lost+Dragon · 2004-10-09 19:05 · Score: 1

Three petabytes? Pfft! How many Library of Congresses is tha.. Oh.. Nevermind.
Re:Storage by 3D+Lover · 2004-10-09 19:48 · Score: 1

I tried to use the google calculator to figure this out, but they don't have "libraries of congress" defined. Too bad.

http://www.google.com/search?hl=en&lr=&q=1+libra ri es+of+congress+in+megabytes&btnG=Search
Re:Storage by zoeblade · 2004-10-09 20:00 · Score: 1

This would take about ten terabytes (according to Berkeley), twenty terabytes (according to Yahoo), or three petabytes (according to Berkeley again). The first two are based on reresenting the library in text form, whereas the last is based on representing it as full-blown images. [Paraphrased from the parent post]

That actually souds much more realistic than the article itself, which says it estimates that "the scanned images would take up about a terabyte of space and cost about $60,000 (£33,000) to store." I'm not even sure where that price comes from, seeing as a RAID of three or four hard drives that all cost less than a few hundred dollars could store well over a terabyte.
Re:Storage by quintessent · 2004-10-09 21:37 · Score: 1

It's about time we figured out exactly how many bytes are in an LOC. Let the scanning begin.

--
Donate background CPU time to fight cancer.
Re:Storage by CaptainCheese · 2004-10-09 22:54 · Score: 1

It's not a constant, it's a variable.

[taken from the text of IEEE 1944: Stupid Standard Units]

One Football Field = a,b (a being area in SI units, b being country-code of usage)

One Library Of Congress (LOC) = x,y (x being defined in number of volumes held, y being seconds since founding of library)
Note: For integer systems of eight bits or less which only use current era times, the variant Great Library of Alexandria (GLOA) may be used - this is egual to (zero,time) UFN

--
-- .sigs are a waste of data...turn them off...
Re:Storage by cortana · 2004-10-10 00:38 · Score: 1

Storing all this data on a RAID of cheap IDE devices would be a mistake similar in magnitude to that made by the digital domesday project. :)
Re:Storage by aboyko · 2004-10-10 01:31 · Score: 1

Even the third estimate is low, because it still assumes the books are stored as ASCII text instead of scanned pages, which is what Kahle's talking about (and, really, what would be the point of scanning millions of books and keeping only an ASCII representation?)

It's a great big bunch of disk space, is what it is.
Re:Storage by Pig+Hogger · 2004-10-10 03:13 · Score: 1

How data much storage would this require? Could someone give it to me in laymen's terms?
About 15 cubic football fields.
Re:Storage by Dun+Malg · 2004-10-10 03:17 · Score: 2, Interesting

About ten terabytes. Or maybe 20 terabytes. Or maybe as much as 3 petabytes.
Heh. Whichever it turns out to be, the LoC, being yet another part of the federal government, will probably make it available for viewing/downloading as a single PDF file.
PDF sucks.

--
If a job's not worth doing, it's not worth doing right.
Re:Storage by EvilSporkMan · 2004-10-10 04:50 · Score: 1

The hogshead is already a unit of volume - it's in the dimension length ^ 3 (i.e. it would convert to cubic inches or cubic centimeters). A cubic hogshead would be of dimension (length ^ 3) ^ 3 or length ^ 9 - what's being kept in the other 6 dimensions? Maybe that's where they put Jimmy Hoffa...

--
-insert a witty something-
Re:Storage by Reteo+Varala · 2004-10-10 09:54 · Score: 1

If each byte of data were the size of a grain of sand, the LOC archive would be roughly the size of Laguna Beach!

Assumptions:

one byte of data is exactly one letter, number, or character in text, one dot in a b/w image, and 1/4th of a dot in a CMYK color image.

All images would be 150 dots per inch (comfortable medium between high-quality printing and short downloading time).

A typical grain of sand measures .25 millimeters, meaning that it would take 101.6 grains of sand to make an inch.

The average book size is 1000 (8.5"x11") pages, with 1/2-inch margins and 10-point text, a quarter of which would be pictures, of which another quarter would be color. (I'm extrapolating an ultimate average, so this is DEFINITELY subject to interpretation)

And I will round the KB and MB calculations to the nearest whole number; only the bytes matter for our calculations.
-----
Now, as such, we first determine how much space on those 1000 pages is actually written on. half-inch margins mean an overall 1 inch reduction both horizontally and vertically.

Now, we have reduced the printable area to 7.5x10

Next, we determine the data density of pictures, which would take up the mass of file sizes. A quarter of 1000 pages is 250 pages, of which another quarter would result in 62.5 color pages, leaving 187.5 pages as black and white.

Now, applying the following formula, we can deduce how much space a page of pictures would take:

Black and White: 150(7.5*10)
Color: 4(150(7.5*10))

So, a B&W image page would be 11250 Bytes, or approximately 11 KB.

A color image page would then be the above result, but multiplied by four, because a CMYK image has four colors to combine, Cyan, Magenta, Yellow, and blacK. So we come up with 45000 bytes, or 44 KB.

Now, if we multiply those figures by the assumed number of pages, we come up with 2109375 bytes(2059KB, or roughly 2MB) of data for the black and white images, and 2812500 bytes(2746KB, or roughly 2MB) of data for color images.

The result so far would be roughly 4 Megabytes of information in a single book in images.

Onto the text. The average number of characters per line in a page ranges somewhere from 70-80. So, I will use an average of 75 for now.

The number of lines is pretty static; letters can take up more or less space, depending on the letter; an l takes up less space than a w does. However, all letters function on a baseline principle; meaning that the baseline of one line will be exactly the same distance from the previous as the next.

To determine the number of lines, we multiply the height of a page's writeable space with the points-per-inch measurement to come up with the total number of points vertically. Note that I'm using the Postscript points definition for simplicity's sake.

To get page hight in points: 72(10)

So, a page is 720 points from top margin to bottom margin. We're single-spacing this document, so we're using a 12-point spacing system, 10 for the letter, and 2 for the "leading," or empty space between lines.

To determine number of lines: 720/12

Ultimately, we come up with 60 lines of 75 characters.

One thing to note, this does not include spaces, so we assume an average of 6-letter words, and add one space per word.

Total number of bytes: 60*(75+(75/6))

Total number of bytes for a single page is 5250, or 5 KB

Now, multiply this by all pages: 5250*750

Total for all the pages comes out to 3937500 Bytes, or 3845 KB, or roughly 4 MB.

ow, to combine the single book, we have 4MB (text) plus 4MB (images), bringing up the total size of a single average book to 8MB

Bring that size into basic bytes, and we end up with: (11250*187.5)+(45000*62.5)+(5250*750)

Total book size in bytes: 8859375 Bytes, or 8651 Kilobytes, or 8 MB.

Now, I recall the article stating that there are approximately 26 million books.

--
The Penguin Producer

We need to get our priorities straight by davmoo · 2004-10-09 17:02 · Score: 4, Insightful

Since Congress and the President can so easily pull out a hundred billion dollars to bomb the hell out of another country, I see no reason we can't come up with a whimpy $260 million for something as worthwhile as this.

--
I want a new quote. One that won't spill. One that don't cost too much. Or come in a pill.

Re:We need to get our priorities straight by Zoop · 2004-10-09 17:31 · Score: 4, Funny

Since Congress and the President can so easily pull out a hundred billion dollars to bomb the hell out of another country, I see no reason we can't come up with a whimpy $260 million for something as worthwhile as this.

I'm sorry, I don't get it. How does your proposal bomb anybody?

Are you suggesting we should bomb libraries?

I mean, I see libraries, I see money, but I'm missing the bombs.

Tell you what, rewrite your proposal with bombs and maybe some cool submunitions and make sure they're Furin libraries, and we'll talk.
Re:We need to get our priorities straight by Anonymous Coward · 2004-10-09 17:40 · Score: 0

May I point out that any well-equipped library will have plans for building atom bombs. Draw your own conclusions.
Re:We need to get our priorities straight by Anonymous Coward · 2004-10-09 18:02 · Score: 0

Tell you what, rewrite your proposal with bombs and maybe some cool submunitions and make sure they're Furin libraries, and we'll talk.

What happens if the do-eviler terrorists attack the Library of Congress with a weapon of mass destruction? Pandemonium!!! Just let Halliburton do the scanning and everyone will be happy.
Re:We need to get our priorities straight by Anonymous Coward · 2004-10-09 18:04 · Score: 0
Priorities:
1. $100,000,000,000 + $260,000,000 for new school sports for our brave warrior jocks?
2. Or, 'underfund' athletics by giving $260,000,000 to the communist nerds to put some FREE books online?
Hmm... tough one! I vote for the manly-man option! Besides, my dad works for a company called Helmetburton that makes the highschool helmets. More money for him equals a bigger allowance for me! Hey... this isn't this kind of like how a corrupt military industrial complex works? I know I should have paid attention in class.
Re:We need to get our priorities straight by Jeff+DeMaagd · 2004-10-09 19:43 · Score: 1

Uh, yeah. You mean the same well funded military that spends $1k for a hammer?

We wouldn't need to be spending $300B+ for military operations if we weren't perpetually trying to insult these nations. Besides, the US is about spending as much as the rest of the world combined on its military. During a post-cold war situation. We've spent an obscene amount of money "disarming" an impovershed country that didn't have WMDs.
Re:We need to get our priorities straight by Skater · 2004-10-10 00:58 · Score: 1

Thank you for brightening my day. Hilarious!

--RJ
Re:We need to get our priorities straight by MikeBabcock · 2004-10-10 01:32 · Score: 1

I didn't know Dubbya had been on Slashdot long enough to get ID 59907.

--
- Michael T. Babcock (Yes, I blog)
Re:We need to get our priorities straight by drinkypoo · 2004-10-10 13:19 · Score: 1

Put another way, the war on drugs consumes over US$50B yearly, between state and local governments, and over the total time that drugs have been illegal in the US it's cost us one trillion dollars. $260M is a fucking pittance. Of course there will be a recurring cost for access but that, too, will be probably be a tiny amount of money compared just to what we spend incarcerating people for posession of marijuana each year.

--
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
Re:We need to get our priorities straight by siriuskase · 2004-10-10 14:20 · Score: 1

I've heard of dropping leaflets all over Afganistan to get out the vote, but that's nothing compared to the parent's idea of dropping the entire loc onto OBL's head.

--
If you must moderate, please moderate as irrelevent, not something bad, because I'm sure someone will find this interest
Re:We need to get our priorities straight by danila · 2004-10-11 01:38 · Score: 1

Sadly the development of a government budget is not a result of rational decision-making. There are probably hundreds of similar low-cost, low-risk, high-payoff projects that could be done using the money collected by IRS, but they do not fit the traditional lobbism scheme and so are not even considered.

--
Future Wiki -- If you don't think about the future, you cannot have one.

It'd be faster... by Anonymous Coward · 2004-10-09 17:02 · Score: 1, Funny

...just to put the whole dictionary on line. Figure it this way- you can use it to make all the other books out of it, and it gets around the whole copyright thingie, too (if you use a sufficiently old dictionary).

Re:It'd be faster... by ziggy_zero · 2004-10-09 19:31 · Score: 1

What about authors that make up words? Damn you James Joyce!

--
I belong to the ______ generation.

Units?! by skraps · 2004-10-09 17:02 · Score: 2, Funny

He estimated that the scanned images would take up about a terabyte of space [...]

Uhh.. "terabyte"? Again with these esoteric units!
Someone, please.. how much is that in LOC?

--
Karma: -2147483648 (Mostly affected by integer overflow)

Re:Units?! by Anonymous Coward · 2004-10-09 17:07 · Score: 0

Uhh.. "terabyte"? Again with these esoteric units!
Someone, please.. how much is that in LOC?
About the size of a Volkwagen Bug.
Re:Units?! by jericho4.0 · 2004-10-09 17:08 · Score: 1

1 terabyte = 1/10 LOC.

--
"A language that doesn't affect the way you think about programming, is not worth knowing" - Alan Perlis

One of the More interesting projects by randall_burns · 2004-10-09 17:02 · Score: 4, Interesting

The government has proposed recently. I would also suggest that they put in place requirements that all future material that is to be copyrighted present appropriate copies in machine readable form so this will be cheaper in the future.

Re:One of the More interesting projects by Brandybuck · 2004-10-09 17:37 · Score: 2, Informative

Tyranny! And don't laugh, I'm serious about this.

All material is copyrighted at the instant of creation. All of it. You write a love letter to your girlfriend and it's copyrighted. It's all copyrighted! Beyond that, you're requiring them to *present* copies. I'm assuming this is to the LoC.

You could make a case for this when a copyright is *registered*, but please don't make a blanket statement like that without first engaging brain.

--
Don't blame me, I didn't vote for either of them!
Re:One of the More interesting projects by brusk · 2004-10-09 19:42 · Score: 2, Interesting

That's an absurd, dangerous proposal.

If I write some poems, print them myself on my old-fashioned movable type press (not that I do this myself, but I know folks who do), and distribute 100 copies, why the heck should I have to submit an electronic copy to anyone?

--
.sig withheld by request
Re:One of the More interesting projects by Anonymous Coward · 2004-10-10 00:24 · Score: 0

The tyranny is that all material is copyrighted at the instant of creation. I never signed the Berne Convention. I'm pretty close to refusing to be bound by it.
Re:One of the More interesting projects by Gulthek · 2004-10-10 02:31 · Score: 1

Funny story:

In the early years of US copyright law, Jefferson required a submission of two copies of for registered works: one for the public record and one for his personal library. :-)

Side note: this was back when copyright lasted 14 years, and *could* be extended another 14. But that was it. None of this "milking the work of your great-great-grandfather nonsense." Just think, if that were the case then the first Jurassic Park movie's first copyright term would be running out in 2007. Hasn't it already generated pretty much as much profit as it can? Sure there could be a hyperglobalcompumega release, but the people buying that really want the extra content, so they'll pay for that even if the movie itself is in the public domain.

Repeat after me: Just because a work is in the public domain, does NOT mean that it has lost all ability to make profit. It does enable more people than the copyright holder to make profit though. Just imagine, MST3Ked versions of movies from 14 to 28 years ago. No cost to license the movie, so it could be done on the cheap and sold on the cheap.

Computer games and movies from 1980 would this year be having their maximum copyright term expire. Surely their profit potential is dryer than a Shelbyville turnip.

Now what Disney wants to protect is Mickey Mouse and other brand representation characters. Fine. Put in a special case into copyright law: if a company can show that certain copyright works are an integral part of their company image they have the craptacular 75 million year-whatever extension.

Just because Disney's cartoons are still in use, doesn't mean that ALL works from that era have to be prevented from entering the public domain -- as they should!

The founding father's argument for copyright law was this:

We want people to make art. People want money. We will allow people to make money from their art by giving them the sole right to copy.

Artistic works enrich our culture. Such works should be available to all, even those that cannot afford them, for inspiration and delight. Therefore, the sole distribution rights granted to the creator shall be limited to a "reasonable time."

I don't think Jurassic Park needs to have its copyright restricted until 2153, do you?
Re:One of the More interesting projects by randall_burns · 2004-10-11 17:26 · Score: 1

No, the question is why should anyone provide copyright protection in that case?

Damn! I'd pay tax for that! by Anonymous Coward · 2004-10-09 17:03 · Score: 1, Interesting

If you folks want out of state donations from non-taxpayers, I'll stump up happily from Canada!

Only English? by AKAImBatman · 2004-10-09 17:03 · Score: 3, Informative

well, at least to those who can read English

Correct me if I'm wrong, but doesn't the LOC contain all materials registered with the US copyright office? In which case it would have any foreign materials registered for copyright protection.

--
Javascript + Nintendo DSi = DSiCade

Re:Only English? by Anonymous Coward · 2004-10-09 17:07 · Score: 0

Yes, you're wrong.
Re:Only English? by AKAImBatman · 2004-10-09 17:10 · Score: 4, Informative

From Wikipedia:

[T]he Library assumed a role as a legal repository to guarantee copyright protection. All authors seeking American copyright had to submit two copies of the work to the Library. This requirement is no longer enforced, but copies of many books published in the US still arrive at the Library regularly.

Damn trolls.

--
Javascript + Nintendo DSi = DSiCade
Re:Only English? by Anonymous Coward · 2004-10-09 17:15 · Score: 0

1) From the FAQ: The Copyright Office, a part of the Library since 1870, is located on the 4th floor of the Madison Building. It has handled more than 20 million copyright registrations and transfers since 1790 and currently deals with some 600,000 new registrations each year.
2) The LoC is not just English language. For example the French Collections is about 1 million volumes. http://www.loc.gov/rr/european/coll/fren.html
Re:Only English? by InterGuru · 2004-10-09 17:34 · Score: 2, Interesting

Let me correct you. They do not keep all the materials from the copyright office. Some they forward to other appropriate places, such as the National Library of Medicine.
Their collections policy statement states that they only keep material specific to their very broad mission statement. This means that they will not keep a copy of a laundry list they received throught the copyright office.
Re:Only English? by Jeff+DeMaagd · 2004-10-09 18:51 · Score: 1

I thought it was a swipe at the US educational system. A lot of Americans graduate illiterate.
Re:Only English? by MikeBabcock · 2004-10-10 01:29 · Score: 1

The same is true here in Canada. A friend of mine released a few CDs and was surprised when the envelope showed up in the mail requesting a copy of the lyrics and one of the CD itself.

--
- Michael T. Babcock (Yes, I blog)

Homeland Security Savings by Anonymous Coward · 2004-10-09 17:03 · Score: 3, Funny

The idea to scan in all materials available at the U.S. Library of Congress was presented at the Web 2.0 conference this week (as just one of many ideas presented). The proposed cost of $260 million would create a huge benefit to society

It would probably pay for itself too since FBI agents would no longer have to travel to libraries to secretly gather records of who borrowed what. They can just use Carnivore to do it instead.

Why not? by Anonymous Coward · 2004-10-09 17:05 · Score: 1, Funny

Should work... its a lot of information though: about 1 library of congresses worth.

Only 260 Million? by HermanAB · 2004-10-09 17:06 · Score: 1

The Canadian gun registry cost is exceeding 2 billion dollars and climbing - 1.9 billion of which is probably wasted on corruption, but that 260 million sounds like a lowball.

--
Oh well, what the hell...

Re:Only 260 Million? by Anonymous Coward · 2004-10-09 19:09 · Score: 0

I'm sure everyone in Canada feels safe with a big database of all the guns up there. I know I'd feel safe here in America if only we had a big database of all the citizens, cross referenced with their voting histories, sexual preferences, and marketing profiles.
Re:Only 260 Million? by yofan · 2004-10-10 03:33 · Score: 1

I don't see $260M as feasible. $500M may be closer to the mark. Also, this doesn't include man-hours that'll need to do the work. The way this government does things, even if it starts today, we probably wouldn't see the finish line in at least 5 years. Then there's the copyright issue....... Lastly, what about all the "delicate" materials. It's not like you can just slap all those things on a flatbed scanner and be done with it....

Ametrica! by Doc+Ruby · 2004-10-09 17:07 · Score: 5, Funny

Finally, Slashdot can establish that for official purposes:

1 Library of Congress = $260M

And the 2004 US Federal budget can be spec'd at 0.000243754522 LoC:s (Libraries of Congress per second).

--

--
make install -not war

Library of Congress - and newpapers by Anonymous Coward · 2004-10-09 17:08 · Score: 0

I'd like to see all available newspapers on line. Old Scientific Americans are great. The NYC Public Library has them. Interesting to follow the building of the elevated railroads and later the subways. I did that years ago.

Scanning is nice but.... by FiReaNGeL · 2004-10-09 17:08 · Score: 1

OK, 260 millions US$ to scan... $60,000 of space (a terabyte) according to the article... put it online... BANDWITH costs estimates? Oops... forgot about that I guess!

"Brewster Kahle's idea is to scan as many books as possible and put them online so everyone has access to that huge amount of knowledge."

The plan IS to put it online, after all...

--
Eureka Science News - automatically updated

Re:Scanning is nice but.... by Anonymous Coward · 2004-10-09 17:19 · Score: 0

They could just make it a 1 terabyte torrent.
Re:Scanning is nice but.... by Anonymous Coward · 2004-10-09 17:26 · Score: 0

The expense is in administrative costs. You can't hire volunteers. So you have to pay employees. That means you have insurance, workman's comp, social security, OSHA guidelines, etc. Then you have to hire managers to manage them. And then managers to manage managers. Then directors, vice presidents, presidents, then promotional expenses. Then hardware. Then contractors. Anyone else could get volunteers together, some cheap scanners with good software, some donated hardware and then be done.

But nope, they'll make it as expensive as possible.
Re:Scanning is nice but.... by hool5400 · 2004-10-09 18:14 · Score: 1

If you're paying $60k for a terabyte, you're getting ripped.

A terabyte is only 1024 gigs.

--

Remember, it takes 42 muscles to frown and only 4 to pull the trigger of a sniper rifle.
Re:Scanning is nice but.... by Anonymous Coward · 2004-10-09 21:19 · Score: 0

Easy: Just auto-setup a BitTorrent per book and we'll all share the cost.
Re:Scanning is nice but.... by Anonymous Coward · 2004-10-09 23:21 · Score: 0

You're clueless. Yes, if you want to store a terabyte at home, you'd pay a lot less. If you want a fault tolerant setup with IO bandwidth enough to handle anywhere near the kind of interest an online accessible archive like this requires a tad more thought than putting 3-4 IDE disks in a cheap case.

search me by Doc+Ruby · 2004-10-09 17:09 · Score: 1

Since the Library of Congress contains mostly copyrighted data, and Amazon is already doing this for profit, this is really just a good way to market Amazon's A9 search engine and the products it sells.

--

--
make install -not war

How many Libraries of congress is that? by Anonymous Coward · 2004-10-09 17:09 · Score: 0

Huzzah! I can finally calibrate my scale to Libraries of Congress and set up a conversion factor to Bytes.

The Whole World by Nom+du+Keyboard · 2004-10-09 17:10 · Score: 1

well, at least to those who can read English

How about the whole world who can find any online translation service that goes from English to Local Dialect.

--
"It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."

Re:The Whole World by blibloblu · 2004-10-09 19:15 · Score: 0

I doubt you ever used these services. It's worth nothing when you are lloking for anything accurate.
Re:The Whole World by 12357bd · 2004-10-09 19:49 · Score: 1

It would be useful yes, but on the other side, going from a single language (english) to any other imply a lost of descriptive power that is too extreme.

We need to actively protect our languages, we are already losing too much of our own knowledge letting disappear the small ones.

--
What's in a sig?

I can see it now.... by Vash_066 · 2004-10-09 17:10 · Score: 1

Soon well see the PIAA (Publishing Industry Association of America) forming up and talking about how it's ideas like this that cause them millions a year in lost revenue. And then start in on how libraries are nothing more than book pirate hang outs....

Re:I can see it now.... by Sheetrock · 2004-10-09 17:17 · Score: 1

The future is now.

--

Try not. Do or do not, there is no try.
-- Dr. Spock, stardate 2822-3.
Re:I can see it now.... by Pantero+Blanco · 2004-10-09 19:11 · Score: 1

" "Technology people never gave their stuff away," Schroeder says. "But now folks are saying, 'You mean the New England Journal of Medicine is charging people?' ""

Misinformed or deliberately misinforming, I wonder which...

Solutions available by Anonymous Coward · 2004-10-09 17:11 · Score: 0

I bet there are solutions for this, such as making that proposed LOC website accessible from public terminals within LOC premises only. Just because it's on the Internet doesn't mean you have to provide access to anyone, does it?

The theory of everything by antikarma · 2004-10-09 17:14 · Score: 3, Funny

At long last, we shall finally know just how much one unit of Libraries of Congress is. This could quite possibly have profound effects on how we understand the universe. For example, for many years we have known that the universe is approximately 42 Libraries of Congress. Now we can fully understand its meaning.

Re:The theory of everything by Pfhorrest · 2004-10-09 19:41 · Score: 2, Funny

Oh dear God, they've found the Question.

Goodbye, Universe.

--
-Forrest Cameranesi, Geek of all Trades
"I am Sam. Sam I am. I do not like trolls, flames, or spam."
Re:The theory of everything by geminidomino · 2004-10-09 22:50 · Score: 1

I thought "The question" was "What is the..."

Nope, forget it. Not gonna do it.

(Anyone mentioning Spoons will be executed as a terrist)

Only the first step by Nom+du+Keyboard · 2004-10-09 17:14 · Score: 3, Insightful

Putting the LoC on-line is only the first step. How long before those Internet book printing stations that can create an entire book for you from an electronic image in a deciminute for $1 tap into this? I'd have to think that this would be good for everyone except B&N who are busy reprinting old classics under their own label right now.

--
"It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."

Re:Only the first step by thephotoman · 2004-10-09 18:08 · Score: 1

Actually, it's not as though everyone has a binding operation under their desks or anything. I don't know about most people, but I for one would prefer to have a high-quality binding of my favorite books. Besides, it's easier to spend hours reading off of paper than it is to read off a CRT or LCD.

--
Haec merda tauri est. Ceterum censeo Carthaginem esse delendam.
Re:Only the first step by norkakn · 2004-10-09 19:12 · Score: 1

I can't tell whether this would be good or not.

I imagine it working where you would go to Barnes and Nobels and they'd have the printer there, but you could get anything, in print or out, and they would print it and bind it in 5 minutes.

But.. I can see them selling mostly crappy cheap books and rip us off if we actually want a quality book (acid free, decent binding, etc)
Re:Only the first step by fireboy1919 · 2004-10-09 19:29 · Score: 1

Very good idea. Actually, it was a slashdot story a few years back, except they were going to put them in bookstores where books were made for you right there and sold to eliminate transport and unused book costs.

I believe that price was a limiting factor - you got a book that was lower quality than what the stores provided for roughly the same price (because people want $ for their copyrights). I don't see how this would be different now.

But someone who is less lazy could go look it up and confirm. :)

--
Mod me down and I will become more powerful than you can possibly imagine!

I disagree by ravenspear · 2004-10-09 17:17 · Score: 1

they can copy and paste the text into a translator.
So yes, it would benefit society as a whole.

Subjecting the world to the Babelfish translator would actually detract from knowledge considering the horrible linguistic bastardizations that people would then take as fact.

Re:I disagree by tftp · 2004-10-09 17:29 · Score: 1

No, it will only cause evolution of English. Imagine how nice it would be if the whole world speaks Englifish :-)

Halfbaked by n08ody · 2004-10-09 17:19 · Score: 3, Funny

you've perused the Libray of Congress, but have you perused the Library of Congress Online

Re:If Bill Gates -- Not So Fast! by Nom+du+Keyboard · 2004-10-09 17:20 · Score: 1, Interesting

the fact that Gates is one of the most philantrophic billionaires

I would reserve that honor for Andrew Carnegie, who basically sold his empire for $485M and spent the rest of his life giving away all his money to good causes. Bill Gates is a far cry from that so far.

Just as a point of /. interest, what is the conversion factor between ACMs (Andrew Carnegie Millions) and BGBs (Bill Gates Billions)?

--
"It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."

HELL NO by Anonymous Coward · 2004-10-09 17:25 · Score: 0

I will not allow no kanuck to use a resource that many generations of hard working americans have paid for.

YOU LOVE US OR HATE US MAKE UP YOUR MIND

Missing something? by ravenspear · 2004-10-09 17:27 · Score: 4, Funny

In a traditional library it's not really easy to...

1. walk in and pick up a book
2. strike the author's name from it and replace it with your own
3. replace the copyright notice with your own
4. Make one thousand perfect copies
5. Offer it for sale, start taking orders, and PROFIT!

...all within 30 minutes.

I could easily do that on the internet.

Re:Missing something? by I+don't+want+to+spen · 2004-10-09 17:50 · Score: 1

But surely it would also be easy to check what you'd done - not by searching the whole library, but by comparing with relevant matches. this could be a boon to finding plagiarism!

--
Don't go to a brothel if you want to buy broth
Re:Missing something? by ravenspear · 2004-10-09 17:54 · Score: 1

Ah but what if I sell it as an ebook under a different title and I don't make the text available to anyone accept purchasers who probably won't know better?
Re:Missing something? by thephotoman · 2004-10-09 17:59 · Score: 2, Insightful

Even still, somebody along the way will get the idea to cross-reference you to the database, perhaps when they try to find out more about you by making an inquiry to the Library of Congress (which handles copyrights in the US) about your copyright.

--
Haec merda tauri est. Ceterum censeo Carthaginem esse delendam.
Re:Missing something? by ravenspear · 2004-10-09 18:07 · Score: 1

Ah but what if I only accept payment through Paypal registered as an alias from a public terminal and then I dump the money to a maildrop, and then...

Ok I'll stop now. I do see your point. It might work, but I think the "visionary" skimped on the details a little bit.

As an author... by Anonymous Coward · 2004-10-09 17:31 · Score: 0, Troll

This is a dumb idea. I'm happy to have copies of my books in libraries (I even donate copies to libraries). One book equals one reader at a time. A library has the same rights as you or I have to lend out *the* copy they own - not to reproduce it en masse. Having all my stuff put online (against my will) would damage the value of my content.

I hear lots of arguments from people who don't create content on why it should be free. The only content producers who seem to think it should be free seem to be ones who produce content with limited commercial value.

I'm a relatively succesful independent author. Part of my success is due to the fact that if you want my content, the easiest way to get it is by buying it from me. Take away my control over my content and I'll make less profit, and ultimately produce less content. Read Atlas Shrugged to see what that leads to...

Re:As an author... by I+don't+want+to+spen · 2004-10-09 18:01 · Score: 1

Relatively successful? By my assessment, you must be incredibly prolific - a large proportion of the comments on Slashdot ard by Anonymous Coward. I can only assume that you are some relation to Noel?

--
Don't go to a brothel if you want to buy broth
Re:As an author... by WaterBreath · 2004-10-09 18:12 · Score: 3, Insightful

A 0-rated post noted that this type of free access is a big deal to people who make an honest living publishing their creations.
This invokes a big, important question. The rise and flourish of the information age has and will continue to provide unbelievable freedom of access to unbelievable amounts of information. Where and how do we draw the line between the freedom of the consumers and the rights of the creators?
I'm a software developer who loves movies: I'm a creator and a consumer, so I see both sides of this coin. And I think there needs to be a compromise between consumers and creators.
Consumers need to realize that at a certain point, amassing more music, or more books, or more movies, or more whatever, becomes a luxury, not a right. So if the price of music prevents you from having a 10,000 song collection, I'm sorry but, "so sad too bad." That's how it's always been for just about every other purchaseable product. Sometimes you have to sacrifice what you merely want to get what you really desire.
Creators need to understand that the information they produce is a drop in the bucket compared to, for example, the estimated yottabyte (1x10^24 bytes) of information on the Internet. So if you want to make money off your creation, it had better stand out, because there's a lot of noise out there to drown it out. Simply put, if you want to get paid, make something people are willing to pay for.
Re:As an author... by 12357bd · 2004-10-09 19:36 · Score: 1

Sorry but NO

Digital content means that any piece of information is just a number, and that means that perfect copy is not only possible but simply natural.

If you really want to 'protect' your work, don't go digital, is that simple, there's no way to stop math.

--
What's in a sig?
Re:As an author... by anthony_dipierro · 2004-10-10 05:32 · Score: 1

I'm a relatively succesful independent author. Part of my success is due to the fact that if you want my content, the easiest way to get it is by buying it from me. Take away my control over my content and I'll make less profit, and ultimately produce less content.

Let's see. You produce less content, which I don't read anyway, because I don't pay for books. Maybe now that the content you've already created is free, I can start reading it. Meanwhile, with your new free time, you get a job fixing potholes or curing cancer, or installing operating systems, or whatever else it is you can do.

Ya know what, sounds good to me.

a real application for internet2 by LuxFX · 2004-10-09 17:31 · Score: 3, Interesting

Right now, Internet2 can download the entire Library of Congress in about 20 seconds.

I'm not aware of any PIAA for publishers, but somebody is going to have a problem with this. And by the time this actually happens, I bet there will be an Internet4 that can do it all in 20ms.

--
Punctanym: alternate spelling of words using punctuation or numerals in place of some or all of its letters; see 'leet'

Re:a real application for internet2 by Derling+Whirvish · 2004-10-09 18:05 · Score: 0, Offtopic

Right now, Internet2 can download the entire Library of Congress in about 20 seconds.
Internet2? So how many of those Internets are there?
I guess Bush is smarter than his critics.
Re:a real application for internet2 by daniil · 2004-10-09 18:40 · Score: 1

There's an enormous number of internets, but only one (The) Internet.

--
Man is a slave because freedom is difficult, whereas slavery is easy.
Re:a real application for internet2 by Pantero+Blanco · 2004-10-09 19:23 · Score: 1

It's called the AAP, and they do.
Re:a real application for internet2 by EvilSporkMan · 2004-10-10 04:53 · Score: 1

Multiple internets would imply that they weren't connected, else they'd just be one internet.

--
-insert a witty something-
Re:a real application for internet2 by Anonymous Coward · 2004-10-11 02:43 · Score: 0

Yep,

ICMP ECHO for Internet4 will have (Complete works of Library of Congress) as data payload.

I can see it now....Out foxing FOX. by Anonymous Coward · 2004-10-09 17:34 · Score: 0

"Soon well see the PIAA (Publishing Industry Association of America) forming up and talking about how it's ideas like this that cause them millions a year in lost revenue. And then start in on how libraries are nothing more than book pirate hang outs...."

*sigh*

Welcome to slashdot, who's busily outfoxing FOX.

The Publishers vs the Libraries territory has already been raked over. Guess who won?

Anyway as someone else suggested they don't have to put it on the Internet, but an Intranet would work.

Just like you have to go to the physical library to access certain resources.

This will be DRM'd correct? by aussie_a · 2004-10-09 17:35 · Score: 1

Otherwise how will they stop me from getting every book that I want to have? The only possible way would be for publishers to not send data to the LoC.

Re:This will be DRM'd correct? by Lord+Moz · 2004-10-09 20:03 · Score: 2, Informative
In order to register your copyright you agree to send a copy to the LoC if they request it. It's the law... in other words, the LoC gets a copy of anything they want that is protected by copyright in the United States.

See below...
______________________________________________
TITLE 17 > CHAPTER 4 > 407

407. Deposit of copies or phonorecords for Library of Congress

(a) Except as provided by subsection (c), and subject to the provisions of subsection (e), the owner of copyright or of the exclusive right of publication in a work published in the United States shall deposit, within three months after the date of such publication--
- (1) two complete copies of the best edition; or
  
  (2) if the work is a sound recording, two complete phonorecords of the best edition, together with any printed or other visually perceptible material published with such phonorecords.
Neither the deposit requirements of this subsection nor the acquisition provisions of subsection (e) are conditions of copyright protection.

(b) The required copies or phonorecords shall be deposited in the Copyright Office for the use or disposition of the Library of Congress. The Register of Copyrights shall, when requested by the depositor and upon payment of the fee prescribed by section 708, issue a receipt for the deposit.

(c) The Register of Copyrights may by regulation exempt any categories of material from the deposit requirements of this section, or require deposit of only one copy or phonorecord with respect to any categories. Such regulations shall provide either for complete exemption from the deposit requirements of this section, or for alternative forms of deposit aimed at providing a satisfactory archival record of a work without imposing practical or financial hardships on the depositor, where the individual author is the owner of copyright in a pictorial, graphic, or sculptural work and
- (i) less than five copies of the work have been published, or
  
  (ii) the work has been published in a limited edition consisting of numbered copies, the monetary value of which would make the mandatory deposit of two copies of the best edition of the work burdensome, unfair, or unreasonable.
(d) At any time after publication of a work as provided by subsection (a), the Register of Copyrights may make written demand for the required deposit on any of the persons obligated to make the deposit under subsection (a). Unless deposit is made within three months after the demand is received, the person or persons on whom the demand was made are liable--
- (1) to a fine of not more than $250 for each work; and
  
  (2) to pay into a specially designated fund in the Library of Congress the total retail price of the copies or phonorecords demanded, or, if no retail price has been fixed, the reasonable cost to the Library of Congress of acquiring them; and
  
  (3) to pay a fine of $2,500, in addition to any fine or liability imposed under clauses (1) and (2), if such person willfully or repeatedly fails or refuses to comply with such a demand.
(e) With respect to transmission programs that have been fixed and transmitted to the public in the United States but have not been published, the Register of Copyrights shall, after consulting with the Librarian of Congress and other interested organizations and officials, establish regulations governing the acquisition, through deposit or otherwise, of copies or phonorecords of such programs for the collections of the Library of Congress.
- (1) The Librarian of Congress shall be permitted, under the standards and conditions set forth in such regulations, to make a fixation of a transmission program directly from a transmission to the public, and to reproduce one copy or phonorecord from such fixation for archival purposes.
  
  (2) Such re
Re:This will be DRM'd correct? by aussie_a · 2004-10-09 20:41 · Score: 1

IF you register your copyright. But (and correct me if I'm wrong) that says not registering your copyright does not mean you won't receive copyright protection. So therefore authors of books aren't required to register their copyright.

More dotcom hype... by stubear · 2004-10-09 17:39 · Score: 2, Insightful

"Despite the hype surrounding the dotcom era, many believe that the vast potential of the net to change society and business remains largely untapped."

If this is such a wonderful idea why doesn't he get a bunch of artists, musicians and writers to donate their own work to this project and actually prove the concept works?

I'm tired of all the rhetoric about business models failing and how the web is going to transform the way society learns, works, and entertains themselves. The dotcom era should have taught these so called visionaries one thing, you actually have to have a business plan before you can transform business models.

If these business models are so full of potential he should start one, with his own intellectual property, and prove that the old economy intellectual property businesses they are extinct. If his ideas work then the dinosaurs of the MPAA and RIAA will either have to adapt to the new economy or die. Forcing them to risk their entire business on a gamble like this is wrong from any perspective.

Re:More dotcom hype... by MrWa · 2004-10-09 18:35 · Score: 4, Insightful

If this is such a wonderful idea why doesn't he get a bunch of artists, musicians and writers to donate their own work to this project and actually prove the concept works?
Work for who? I think you are still confused from the dotcom era still. You must be thinking that "change society and business" means that scanning the entire LoC can make someone money (advertising??)
The important part in this case is the changing society part of the statement, which is what the vast potential of the net is capable of doing. It won't help you make money based on a bad idea (in fact, it may only help you lose money faster!) but it does have the potential to change the way a society views and deals with information.
Right now there is a vast amount of knowledge in the LoC that is effectively out of the ordinary citizen's hands. That is not how it should be. If knowledge is power, there is a storehouse of power waiting to be unleashsed by giving everyone access to what is being stockpiled. It won't happen over night, or in a few years, but eventually it will have a ripple effect. Historians lament the loss of the Great Library of Alexandria, but what difference would it have made if only a few could actually use the information that was contained?
Re:More dotcom hype... by zoeblade · 2004-10-09 19:49 · Score: 1

If this is such a wonderful idea why doesn't he get a bunch of artists, musicians and writers to donate their own work to this project and actually prove the concept works?

Because archive.org have already done this, proving it's possible?

(Shameless plug: my music at arvhive.org)
Re:More dotcom hype... by Anonymous Coward · 2004-10-10 00:59 · Score: 0

Knowledge is not power. That's a myth perpetrated by the not-so-powerful librarians.
Re:More dotcom hype... by dogod · 2004-10-10 02:48 · Score: 1

If knowledge is power, there is a storehouse of power waiting to be unleashsed by giving everyone access to what is being stockpiled.
couldn't have said it any better. i think i found a job that the UN could handle. Putting all books from around the world into one big online libary.
Re:More dotcom hype... by anthony_dipierro · 2004-10-10 05:46 · Score: 1

The dotcom era should have taught these so called visionaries one thing, you actually have to have a business plan before you can transform business models.

More like, when someone offers you a 4 million dollar investment but insists on complete control over the company, you should say "no thanks" and look for a different investor.

But hey, that's just what I learned during the dotcom era.
Re:More dotcom hype... by UnknownQ · 2004-10-10 08:33 · Score: 1

If these business models are so full of potential he should start one, with his own intellectual property, and prove that the old economy intellectual property businesses they are extinct.

Which is exactly what Magnatune is doing.

--
Wherever you go, there you are!

Fuzzy math on storage reqs by ravenspear · 2004-10-09 17:42 · Score: 3, Informative

The article claims that the LOC stored as image data would take up 1 TB.

That's wildly underestimated IMO. The LOC has 26 million books. If we conservatively assume that they each have at least 100 pages, that is 2.6 billion images. That equals 0.03 kb per image. That's some REAL good compression for an image as large as a full page of text.

Re:Fuzzy math on storage reqs by gerardrj · 2004-10-09 19:37 · Score: 1

And what do you do about the sound recordings and other media in the library? The LOC is not exclusively about books.

--
Article X: The powers not delegated... by the Constitution...are reserved...to the people
Re:Fuzzy math on storage reqs by bot24 · 2004-10-09 19:51 · Score: 1

Ascii text, however, is around 300KB for a 150 page book. I doubt that they have 2,600,000,000 pages, but that would be more than 4TB. It can be reduced to less than 2TB by the use of compression.
Compressing images of pages of text is useless. That is like compressing a bitmap rendering of an vector file. The compressed images will still dwarf the source.
Re:Fuzzy math on storage reqs by CaptnMArk · 2004-10-10 00:11 · Score: 1

Maybe not if source is a .doc or the like.
Re:Fuzzy math on storage reqs by The-Bus · 2004-10-10 02:30 · Score: 1

My guess is they meant petabyte, not terabyte. Then, adjusting your assumption to each volume having 150 pages, we've got about 3.9bn images, and about 20.48kb per image. That's still not a whole lot, but enough if you consider that for a lot of works they would simply have files with embedded fonts as opposed to image scanning a black-and-white page.

--
Small potatoes make the steak look bigger.
Re:Fuzzy math on storage reqs by anthony_dipierro · 2004-10-10 05:44 · Score: 1

I'm not convinced that OCR quality is good enough today to store the books as ASCII text. You're going to be doing a lot of work making the scan, and you don't want to do that work all over again in 10 years, when the quality of OCR will have increased drastically. Besides, in 10 years the book will have further deteriorated, making the job of reading the characters even harder, at the least.

obligatory babelfish translation by servognome · 2004-10-09 17:43 · Score: 1

and to those who can't, they can copy and paste the text into a translator.
With that who can't, they can reproduce and stick inside their language teacher.

--
D6 63 0D 70 89 81 BB 8E 7B 7C 5F 5D 54 EA AB 73

Damn YOU, idiot. by Anonymous Coward · 2004-10-09 17:48 · Score: 0

All authors seeking American copyright had to submit two copies of the work to the Library.

Authors in foreign countries don't copyright their works in America -- they copyright their books in their country, and the copyright is automatically valid in (nearly) every other country thanks to the Berne Convention.

Die of shame now, dumbass.

Re:Damn YOU, idiot. by AKAImBatman · 2004-10-09 18:08 · Score: 1

I don't normally resort to this sort of language, but I entreat you sir, to fuck off asshole.

If you had actually read my post, you would have realized that I was referring to any non-english works (foreign written or US written) that specifically sought out US copyright protection. Because, you know, it's not like foreign authors never register their copyrights in other countries.

<sarcasm>You may now begin hailing my amazing wisdom and knowledge.</sarcasm> OR, you could just be a nice guy and apologize for your rude behavior.

--
Javascript + Nintendo DSi = DSiCade
Re:Damn YOU, idiot. by julesh · 2004-10-09 20:30 · Score: 1

The AC may be being an asshole, but I believe he's (at least approximately) right. As a British author, I don't need to register my copyrights in the US to receive the same protection that US authors do. I can easily sue a US citizen for infringing my copyright in a US court and would have the full scope of possible damages available; a US citizen would have to register their copyright to gain this privelege. It's a crazy system, I know, but that's copyright registration for you...

Er-For the love of... by Anonymous Coward · 2004-10-09 17:49 · Score: 0

"This is one more reason that the whole basis behind IP law needs to be reevaluated. Although we do want authors, inventors, and other creative types to be rewarded for their efforts, it is also true that what they create becomes more valuable the more it gets out into the world."

Free the cars. Just look at how many people have seen a car? Been inspired by a car? From how to's to I love my car poems.

"Creating primarily for money is shortsighted when a work has the chance to impact the larger culture. "

Copyright really isn't about one's motivation for why one creates. Copyright is about what's premissible with copies (voluntary or involuntary). i.e. someone swiping the original from me and making copies, as well as when I voluntarily make copies.

Re:Er-For the love of... by 10101001+10101001 · 2004-10-09 18:04 · Score: 1

For all extents and purposes a car is free. What do I mean? It's free in the sense that it's possible with time and effort to find out how a car works, take the bare components to a car, and make a duplicate. The total cost is your time + free blueprints (or cheap blueprints) + the cost of parts. The cost to duplicate a book? If digitized, less than a dollar for the time and the media. I know that's not what you were trying to say, but my point is that it's not even that people want cars or books to be free. They just wish the price was much closer to the duplication cost. For virtually all copyrighted works, that's amazingly small.

--
Eurohacker European paranoia, gun rights, and h
Re:Er-For the love of... by 10101001+10101001 · 2004-10-09 18:16 · Score: 1

Sorry for responding to myself, but I just came up with a more valid point to why copyright in general is a bad idea.

Copyright has basically turned into charging relatively lots of money for the embodiment of an idea (daily writers to newspapers and the like excluded). In truth, most manufactured goods have been outsourced to other countries. Why? To make more money because there's an inherent small mark-up available when companies turn a good into a commodity. But, copyright is different and can enjoy gross mark-ups by comparison. Now, eventually this too will be outsourced (since it too will make more money), but in the meantime, the economy of the US will be based less on trading physical goods and more on trading ideas as currency (strictly speaking, copyright is the embodiment of ideas, but it's more succinct to say it that way).

But you can't trade ideas or the embodiment of ideas as some sort of actually stable basis for an economy. It's just asking for the economy to crumble and/or be taken over by countries which actually *do* make *real* goods. But, it's the obvious economic outcome as people will obviously turn to whatever is the largest profit maker per unit. The outcome is that the entertainment market is become well ballooned beyond what is healthy; this isn't good for the quasi-free market of the US. But with other countries without strong copyright terms yet able to copy the idea of mass entertainment, the entertainment bubble is sure to collapse in the future to foreign markets. And what else does the US have to trade to other countries with except maybe agricultural goods (some of which most of Europe now has banned)?

--
Eurohacker European paranoia, gun rights, and h

Re:No habla francais. by Binky+The+Oracle · 2004-10-09 17:52 · Score: 1

How dare they write in the language their majority of their potential audience is most likely to understand?

I said could understand - not necessarily write.

DAMN YOU WORD PROCESSING FOR MAKING IT SO EASY TO REVISE A STATEMENT!

--

Slashdot comments... splitting hairs since 1997.

Can't do that-Inheritance. by Anonymous Coward · 2004-10-09 18:00 · Score: 0

"My grandpa was a farmer who died over 50 years ago. Since I don't get to collect royalties on the corn he grew in the 1930s, I've had to work to produce my own income. Imagine that."

If you get an inheritance? You effectively do.

Re:Can't do that-Inheritance. by Waffle+Iron · 2004-10-09 18:22 · Score: 5, Insightful

If you get an inheritance? You effectively do.
I might inherit a portion of his farm. But that's a result of money that he saved at the time. I do not collect royalties on the *work* that he did 70 years ago.
If an author or musician wants to leave an inheritance, then they should save the money they make during a reasonable copyright term, and give that to their children. They can leave their typewriters, musical instruments, and other tools of the trade (analagous to a farm) as well.
They might have to actually forego a blowing everything they earn on cocaine and refrain from signing away most of their income on bad contracts to actually achieve this, but then so do the rest of us.
Re:Can't do that-Inheritance. by forii · 2004-10-09 20:12 · Score: 0

I might inherit a portion of his farm. But that's a result of money that he saved at the time. I do not collect royalties on the *work* that he did 70 years ago.

That's only because of the difference in business models. Consider if your grandfather, for instance, had traded in corn contracts that matured in 70 years (not very likely, but just for example), backed by his then-current crop production. Or rather than do that, he may have invested the returns from his crops in a bank savings account. Either way, you could now be reaping the rewards from 70 years ago.
Re:Can't do that-Inheritance. by linzeal · 2004-10-09 20:53 · Score: 2

Gambling is no way to run an economy.

--
An Education is the Font of All Liberty
Re:Can't do that-Inheritance. by forii · 2004-10-09 21:05 · Score: 0

Gambling is no way to run an economy.

I assume you're referring to my reference to contracts. The fact of the matter is that options and futures are an important part of any commodity market. In particular for farmers, they allow producers to manage their risk, in an industry where a large part of their capital is subject to the whims of the weather.
Re:Can't do that-Inheritance. by Anonymous Coward · 2004-10-09 23:09 · Score: 0

Are you saying that shortening copyright terms would lead to fewer artists producing new works? Because that is the ONLY consideration worth making - copyright is there with the intent of stimulating the creation of new works by making it profitable to do so.

Unless artists are actually deciding whether or not to make new works based on how long after their death their children and grand children will be receiving royalties, or a substantial part of their income is derived from selling off rights to royalties expected after their death, there is NO justification for copyrights to extend much after their death.
Re:Can't do that-Inheritance. by Dun+Malg · 2004-10-10 03:08 · Score: 1

My grandpa was a farmer who died over 50 years ago. Since I don't get to collect royalties on the corn he grew in the 1930s, I've had to work to produce my own income.
If you get an inheritance? You effectively do.
No, he doesn't. He's getting money earned by his grandfather before he died. Authors can save their earnings and pass that money on to their descendents just like the rest of us.

--
If a job's not worth doing, it's not worth doing right.
Re:Can't do that-Inheritance. by tepples · 2004-10-10 07:21 · Score: 1

Consider if your grandfather, for instance, had traded in corn contracts that matured in 70 years (not very likely, but just for example)

So in your analogy, where do you get the federal government repeatedly extending the term of futures from 50 to 70 to 90 years?

What, you want me to starve to death? by Banner · 2004-10-09 18:02 · Score: 3, Funny

Oh yeah, put 'em all online. I have a hard enough time already in libraries and book stores! If I could read any book I wanted to (even if they're only the ones already out of copyright) online, I'd probably not leave my computer until I passed out!!

Re:What, you want me to starve to death? by Anonymous Coward · 2004-10-09 19:11 · Score: 0

Difference being that you can't lug a stocked minifridge into your local Barnes and Noble...
Re:What, you want me to starve to death? by Omkar · 2004-10-09 19:28 · Score: 1

Amen. And don't forget the power of online magazines (I'm thinking the Economist).
Re:What, you want me to starve to death? by Anonymous Coward · 2004-10-10 13:29 · Score: 0

If you have passed out, how will you leave then?

Easier task by Anonymous Coward · 2004-10-09 18:05 · Score: 0

Keeping track of millions of guns across the country is a very different proposition from just sticking however many books in an auto-scanner. The registry requires an entire force of people to talk to the owners, enter their data, check the weapons themselves.... don't they even have to physically store two test bullets from each gun? Then there's getting the data to the police all over the country, which required a far more intense data network to be put in place.

The gun registry is much more like setting UP a library of congress. Also, sadly, canadian governemnt programs usually include funding for producing television programming telling people how to use the service. Don't know about this one.

The books at the LoC are already in one place, so that's easy. They're already catalogued, so that's easy. There's already a staff of librarians going through them all the time for research purposes, so you've got access.

The biggest part of the task is just moving the books to the auto-scanners and then back. Some of it is being careful to preserve the original as its done, but not much... most of the books are are excellent condition right now. It's a very good library.

An auto-scanner is a a robotic arm that turns the pages with puffs of air and takes a photo of each. Such a machine can scan a book in almost no time.

There is of course the programming effort and such... that's why it's $260 million. Just scanning the books would probably cost a tenth of that.

Again I stress: There's no reason to suspect they're lowballing because there's already a whole organization there devoted to doing things with those books. They're old hands at this. They know how hard it will be.

Great by Konster · 2004-10-09 18:09 · Score: 2, Funny

This will be great! You know all those ads that claim such and such can transmit the Library of Congress in so and so seconds?

Now we'll be able to test their notions!

Er-For the love of...Mass construction. by Anonymous Coward · 2004-10-09 18:14 · Score: 0

"The cost to duplicate a book? If digitized, less than a dollar for the time and the media. "

Since you brought it up. How much did it cost to actually bring that book, movie, or music into existance?

How do you also keep the cost of copies reasonable, while not leaving any unpaid debts behind?

How do you have enough left over for future endeavours?

Re:Er-For the love of...Mass construction. by 10101001+10101001 · 2004-10-09 18:56 · Score: 1

Ask newspapers. The print in bulk at $0.50/paper, often with many sections. They recycle paper, use pretty bad ink except on Sunday, and they don't give tons of royalties to anyone. Yet, newspapers still make enough money to stay in business (especially the bigger ones) and they share thinks like AP reports. The copyright on most newspaper isn't worth the paper it's printed on, as few people would buy a previous days newspaper (now, 30 years, maybe..). So, it's clear that it's possible to survive on such a system. Of course, for something that takes months to make and can be easily copied, you'd give your customer various "perks" without charging much above what "pirates" sell new stuff at.

Maybe people will still be d/ling things for free. But if you know you can sell a few copies at $0.10 and the publisher is selling it at $0.15, you'll sell it at $0.10 (more than likely). And the publisher will likely still get a lot of buyers because it's a more trusted source and $0.05 is piddly for that guarantee. Of course, I don't really know the price the free market will offer. But I find it hard to believe that absolutely all books and the like will fail to be produced. If anything, someone like Samuel Clemmens believed that it would just mean more profit for publishers and less for authors. But with the internet, the author can be the publisher..though maybe it won't be better for the creator without the huge available mark-up.

--
Eurohacker European paranoia, gun rights, and h
Re:Er-For the love of...Mass construction. by Kierthos · 2004-10-09 19:17 · Score: 1

One thing... Advertising.

Look at your standard paperback novel. You don't see a lot of advertisements in there, do you? (Okay, maybe at the end of the book, a plug for the next book by the same author or other books from the same publisher...)

Look at your standard newspaper. Full page advertisements, or alf page ones, or quarter page ones. Sure, the front page of each section might be free of them, but second page on is fair game.

That pays a lot of the costs involved.

Next, the classifieds. I don't think the paper here lets you put _anything_ in the classifieds for free. That also pays for a lot of the costs involved.

Kierthos

--
Mr. Hu is not a ninja.
Re:Er-For the love of...Mass construction. by 10101001+10101001 · 2004-10-09 20:32 · Score: 1

And there were magazines that used to include either short stories or a chapter of a novel in the past (do they still?). My understand is that's how Charles Dickens did at least a few of his novels. But you bring up an interesting point. People might be willing to pay less for books if they included full-page advertisements.

Of course, that idea really cheapens my idea of books. I'm sure at least *some* people would agree with that. And why can't there be ad-ware books and non ad-ware books. It works for something like Opera. I certainly wouldn't mind paying a little extra to not get fed advertisements in my books. I certainly don't believe that that will kill all books because if nothing else the local shopkeep might have a machine to print your own ad free book after the short extent of copyright.

I'm hopeful, though, that electronic books develop far enough that I'll rarely have a need for real paper. I like a solid book from time to time, but having a single pad with a ton of books is more practical. And to that, I'll probably keep my analog copy of HHGttG. In between, maybe the whole "micropayment" plan will work. Then I can get lots of books for relatively cheap and stick them on my ePad (or whatever it ends up being called).

--
Eurohacker European paranoia, gun rights, and h

Conversion please? by xRelisH · 2004-10-09 18:19 · Score: 1

How much is that in terms of Space Shuttle Fuel Tanks? What about in Weapons of Mass Destruction?

Library of Congress Transfer Rates by xombo · 2004-10-09 18:23 · Score: 2, Funny

Isn't the size of the Library of Congress what people used to use as a quantifier for the speed of high-bandwidth connections? I remember several years ago that companies would brag that they can transfer the entire Library of Congress to England or wherever in less than 2 seconds and what have you. I suppose a statement like that would indicate that there are already digital versions of the Library of Congress out there somewhere meaning it will take virtually nothing dollar-wise to put it online (since I guess it's been flowing back and forth for years).

Re:Library of Congress Transfer Rates by DrVxD · 2004-10-09 23:45 · Score: 1

Unfortunatly, some moron DROPPED the LoC in the middle of the Atlantic - which is why they need to rescan it all...

--
Not everything that can be measured matters; Not everything that matters can be measured.
Re:Library of Congress Transfer Rates by Maxite · 2004-10-10 01:34 · Score: 1

Not if they were actually transmitting photos of the Library of Congress BUILDING.

--
Ah, you found me!

Agreed, what about labour? by xRelisH · 2004-10-09 18:24 · Score: 1

I didn't really read TFA, but how do they propose to actually do the scanning? There seem to be a lot of books in there, is there some sort of book-scanning machine that I've never heard of?

Doing all of this by hand would be insane, even if it's by a large team of volunteers. Maybe 1000 monkeys on 1000 scanners copying all the books will get it done in a few months?

Re:Agreed, what about labour? by erick99 · 2004-10-09 18:35 · Score: 2, Informative

This guy has a $150,000 machine that scan 1,500 bound pages per hour. That would certainly help though it sounds expensive . . .

--
http://www.busyweather.com/
Re:Agreed, what about labour? by erick99 · 2004-10-09 18:46 · Score: 1

Amazon might have some ideas . . .
Amazon does not disclose exactly how it scans books, but a spokeswoman for the company, Jani Baker, said Amazon has already scanned 120,000 books and hopes to archive "virtually everything" in the company's catalog.

--
http://www.busyweather.com/
Re:Agreed, what about labour? by Anonymous Coward · 2004-10-09 20:17 · Score: 0

I'm pretty sure the monkeys can only work on shakespear...

That's what I've heard anyway... they refuse anything else... That's why they type it up all day long.

Re:If Bill Gates -- Not So Fast! by Anonymous Coward · 2004-10-09 18:25 · Score: 0

"Just as a point of /. interest, what is the conversion factor between ACMs (Andrew Carnegie Millions) and BGBs (Bill Gates Billions)? "

http://www.jsc.nasa.gov/bu2/inflateCPI.html
htt p://www.westegg.com/inflation/

From what I could find, Carnegi had donated $450M from 1898 to 1911. That would be around $11Billion today.

this makes news? by Anonymous Coward · 2004-10-09 18:25 · Score: 3, Insightful

Maybe im the odd duck here but somehow waay back in early net days..the 90's i thought that this was such an obvious application of internet technology that it must be part of the original design purposes for the internet (darpanet and all that funding of course)

So the only surprise to me is that were just now hearing a proposal to do this??? sheesh, if i hadnt thought it so completely obvious to every netizen at those old public library terminals i wouda lost so much seep making it happen!!!

so now who's going to do it? and while its limboing through congress can we just put together a consortium to visit thie library we aready own with our digital camera's and OCR the thing into existence... how many of us woud need to donate our gmail 1g accounts to store it all?

Er-For the love of...Economics. by Anonymous Coward · 2004-10-09 18:32 · Score: 0

*smile* I'm going to be busy tonight. I only have 10 posting slots you know.

"Sorry for responding to myself, but I just came up with a more valid point to why copyright in general is a bad idea."

"Copyright has basically turned into charging relatively lots of money for the embodiment of an idea "

And knives have turned into tools to kill people. but no one would argue that the abusing of knives is a reason to eliminate them. So why is the abusing of copyright seen differently?

"Why? To make more money because there's an inherent small mark-up available when companies turn a good into a commodity. But, copyright is different and can enjoy gross mark-ups by comparison."

I would argue that the creation costs of copyrighted "goods" are more hidden (either out of ignorance, or simple apathy) than the creation costs of physical goods.

The main difference between the two is duplication costs, but economics dictates that both costs have to be made up.

"But you can't trade ideas or the embodiment of ideas as some sort of actually stable basis for an economy. "

Remember talk of the knowledge economy? And yes as long as certain things are true, we'll need physical goods.

"And what else does the US have to trade to other countries with except maybe agricultural goods (some of which most of Europe now has banned)?"

You're short sheeting the US. Look more carefully.

Re:Er-For the love of...Economics. by 10101001+10101001 · 2004-10-09 19:10 · Score: 1

>>Copyright has basically turned into charging relatively lots of money for the embodiment of an idea

>And knives have turned into tools to kill people. but no one would argue that the abusing of knives is a reason to eliminate them. So why is the abusing of copyright seen differently?

Actually, that's not really an abuse of copyright. An abuse would be violating copyright or turning it into something that doesn't still meet the definition of copyright. I was merely pointing out what copyright is now for various companies (again, daily newspapers and the like excluded).

>>Why? To make more money because there's an inherent small mark-up available when companies turn a good into a commodity. But, copyright is different and can enjoy gross mark-ups by comparison.

>I would argue that the creation costs of copyrighted "goods" are more hidden (either out of ignorance, or simple apathy) than the creation costs of physical goods.

Last I checked, physical goods aren't created. They're constructed from raw goods. Creation would involve violating some conservation of matter/energy laws (given matter is energy of a different form, even claiming energy -> matter is creation seems as absurd to me as claiming you create iron by converting liquid iron to its solid form). The closet analogy to creation for a physical good is really the design of a good. The fact is, the design for most goods is so blatantly obvious in the good no trade secret would cover it, the trade secret for the good was lost, or the trade secret for the good still exists. But given that the copyright and the trade secret for a copyrighted work are the same, it seems clear that all copyrighted works would fall into the "blatantly obvious" category and there'd be no damages for redistribution.

>The main difference between the two is duplication costs, but economics dictates that both costs have to be made up.

Economics might dictate that costs have to be paid for, but it never dictated that copyrighted works as we know it had to exist. So, the Constitution plays an exception to allow many more copyrighted works to exist than conceivably would exist in a free market. That's not so bad, if copyright's length were proportional to the reward cycle which is proportional to the rate of communication. Copyright has gone so badly in the direction *opposite* of this, that having a breather of no or minimal copyright might be the right direction to give some perspective on what actually is good for our economy.

>>But you can't trade ideas or the embodiment of ideas as some sort of actually stable basis for an economy.

>Remember talk of the knowledge economy? And yes as long as certain things are true, we'll need physical goods.

We'll need physical goods, but if our country doesn't produce many physical goods and uses its IP as the main export for trade, then when other countries make their own IP we'll be in the situation that we no longer have a viable export nor anyone in the country to make the physical goods we need. Of course, eventually we might start making physical goods again. But the truth is that most other countries will probably still be more cost effective at it than we are. That seems to mean that the only main export we have is things like crops. I'll be glad to look more carefully, though, if you can come up with more specific examples of what other physical goods we make.

--
Eurohacker European paranoia, gun rights, and h
Re:Er-For the love of...Economics. by Anonymous Coward · 2004-10-09 19:25 · Score: 0

The Harley isn't made in America. ALL the parts are made in other countries and then it is assembled in america. great scam huh

Eat a bag of dick, moron. by Anonymous Coward · 2004-10-09 18:33 · Score: 0

Authors in foreign countries don't "seek US copyright protection", because global copyright protection is guaranteed by the Berne Convention as soon as a work is copyrighted in any country.

And FYI, non-English works account for less than 2% of the total volume of the Library of Congress so the "well, at least to those who can read English" comment was entirely appropriate, unlike your delusional and paranoid rantings and ravings.

Re:Eat a bag of dick, moron. by AKAImBatman · 2004-10-09 19:01 · Score: 1

Are you an attack dog from the GNAA or something?

Look up the Berne convention. The US didn't join it until 1989. That leaves over a hundred years of work that had to be explicitly copyrighted in the US.

And FYI, non-English works account for less than 2% of the total volume of the Library of Congress

Which is neither here nor there. And by making this statement, you are agreeing with me that the LOC has a substantial selection of non-english works.

unlike your delusional and paranoid rantings and ravings.

I'm sorry. Did I hurt your feelings? Grow up. If you flame someone, you can expect them to respond less kindly than if you had attempted to make a point in an intelligent fashion.

Now I will repeat. Please either act like a man (or woman as the case may be) and apologize for your uncalled for behavior, or get the hell out of this discussion.

Good day to you, sir.

--
Javascript + Nintendo DSi = DSiCade

Now I have a real problem... by HellYeahAutomaton · 2004-10-09 18:39 · Score: 3, Funny

I just downloaded the LoC.ps.tgz from the local WPI Internet2 tap using gnutella and my printer just ran out of ink....

blahhh by Anonymous Coward · 2004-10-09 18:40 · Score: 0

I'm hacking the planet!

Better convert.. by kcb93x · 2004-10-09 18:42 · Score: 1

...that to OpenOffice.org Text Format...much more compressed, and it natively uses XML.

--
There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.

Can't do that-Door knocker. by Anonymous Coward · 2004-10-09 18:45 · Score: 0

"You must mean currently. But we all know that as soon as anything major (like Steamboat Willy) comes close to coming out of copyright, we'll see Congress extend the term of copyright yet again, thanks to 'encouragement' from Disney."

So our intrepid hero is faced with two doors.

Behind door number one our brave hero squares off against those who would try to stiffle the public domain. Rallying legions to his cause. Pretty maidens throw rose petals at his feet.

Behind door number two our simpering hero goes on a popular geek site known as slashdot, and bemoans at how weak he is against the MIGHTY *rolling echo* business, and congressional interests, and everything is business as usual. Legions throw rotten tomatoes at our fallen hero.

Wich will he pick? Place your bets.

"Copyright terms are nigh on infinite in fact, if not in law."

Actual examples will do just fine.

Scupper copyrights by freedom_india · 2004-10-09 18:50 · Score: 2, Interesting

There's a Vulcan saying: "The needs of the many outweigh the needs of the few."
I would say, scupper copyrights for all volumes owned by LoC.Scan and put every volume on the internet.
Within few years we would witness a Renaissance of sorts once again in human knowledge and education.

--
"Doing what i can, with what i have." ~ Burt Gummer

Re:Scupper copyrights by aboyko · 2004-10-10 01:38 · Score: 1

I would say, scupper copyrights for all volumes owned by LoC.

The US Copyright Office, incidentally, is a part of the Library of Congress. The quote you offered should help, since they're well known to be heavily influenced by Vulcan (and to a lesser extent, Elvish) culture, but you might still be facing a little bit of an uphill battle. Best of luck!

Only way it will happen ... by Anonymous Coward · 2004-10-09 18:52 · Score: 0

is if it is a billion dollar award to haliburton.

As an author...Foe sandwich. by Anonymous Coward · 2004-10-09 19:03 · Score: 0

"I'm a software developer who loves movies: I'm a creator and a consumer, so I see both sides of this coin. And I think there needs to be a compromise between consumers and creators."

Well this is going to be my last post before turning in.

There already is a compromise, and has been for decades (copyright).

However what has brought us to our present state is individuals on BOTH sides (consumer and business) that have chosen to not honor this old agreement.

There is also innocents (much like Iraqi's caught between Americans and Insurgents).

The only way out is for the innocents to recognize that neither side represents their interests (extremists never do), and to drive them out. Going back to what originally worked.

"260 million" by DogDude · 2004-10-09 19:06 · Score: 1, Flamebait

Compared to the $200 billion to kill and maim tens (hundreds?) of thousands of people in the name of "terrorism", $260 million to create essentially, a Library of Alexandria is a fucking bargain.

--
I don't respond to AC's.

Re:"260 million" by supmylO · 2004-10-09 19:34 · Score: 1

I didn't think Saddam spent THAT much killing his own people...
Re:"260 million" by DogDude · 2004-10-09 19:35 · Score: 1

Hey, but the Us did it better and more efficiently! Did Saddam kill that many people so quickly?

--
I don't respond to AC's.
Re:"260 million" by Anonymous Coward · 2004-10-10 03:05 · Score: 0

Saddam killed orders of magnitude more people.

Like many, you're letting your hatred for Bush blind you to the facts.

Human's Book Pool by 12357bd · 2004-10-09 19:07 · Score: 3, Insightful

Not only the Library of Congress of the Unites States of America, we should also scan every big library in the world to create a pool of human work to freely share and preserve.

--
What's in a sig?

Re:Human's Book Pool by MikeBabcock · 2004-10-10 01:24 · Score: 1

Oh, no! What about the Copyrights
--
Rosen

--
- Michael T. Babcock (Yes, I blog)
Re:Human's Book Pool by aboyko · 2004-10-10 01:34 · Score: 1

Well, the Library of Congress includes an enormous amount of non-US content; something over half their content is not in English, if I remember correctly. Just sticking to LC content would keep one busy for a good while.
Re:Human's Book Pool by 12357bd · 2004-10-10 07:23 · Score: 1

CopyWhat? :)

Now seriously, there's no excuse, we should have by now all the free content of all over the world online, free for anyone to use and enjoy. Maybe the UN should encompass such a project, meanwhile any effort on that direction is of course welcomed.

--
What's in a sig?

Government Spending by Baseclass · 2004-10-09 19:16 · Score: 1

I want to know why the hell it would cost so much. I realize it's an incredible amount of material, but $260 million!? I'll do it for a cool $1 million no questions asked. It's not like it's hard or requires much technical expertise. Just lots of monotonous labor.

--
^^vv<><>BA

Re:Government Spending by gerardrj · 2004-10-09 19:32 · Score: 3, Insightful

I'd take you up on that offer, but it would be money wasted as you simply can not do the job for that little money.

The LOC doesn't just contain nice black and white typed texts. There are hand written documents in organic inks on animal hide and poorly constructed paper. There are paintings in every medium you can imagine and there are sound recordings on just about every media ever used: wax tubes, glass disks, wire spools, open reel, 8-track, cassette, CD, DVD, etc.

Each of these things needs to be digitized, categorized, indexed and offered in a searchable manner. A printed page, for example, will need to be photographed and transcribed/OCRed.
Much of the work needs to be done on delicate objects that may be destroyed if not handled correctly. If you were to play a wax recording disk with too much pressure, or under the wrong environmental conditions, the disk would shatter in to an irreparable pile of small bits.

What formats will you store them in? What formats will you make them available in?

--
Article X: The powers not delegated... by the Constitution...are reserved...to the people
Re:Government Spending by Anonymous Coward · 2004-10-09 19:47 · Score: 0

This sounds like the perfect job to outsource to India!
Re:Government Spending by databyss · 2004-10-09 20:29 · Score: 1

I don't see what all the big fuss is about...

Throw a webcam at the entrance to the LOC and you've got it all online.

I'll do it for a mere... 0.1615 LOC

--
Hmmm witty sig or funny sig? Maybe elitest techy sig!
Re:Government Spending by Dr_Barnowl · 2004-10-09 20:50 · Score: 2, Interesting

The best way so far of capturing wax recordings and the like is to run the disk under a high-resolution scanner and use a piece of software to render the image of the grooves as a waveform ; this involves no physical wear of the medium. In fact, I'd think that a commercial version of this could well catch on for old-timers with large vinyl collections....
Re:Government Spending by Anonymous Coward · 2004-10-10 03:54 · Score: 0

I'd think that a commercial version of this could well catch on for old-timers with large vinyl collections...
There is a turntable out there that uses a laser to read vinyl. It's quite expensive, last I heard. It was on slashdot, I believe, some time back.
Re:Government Spending by magefile · 2004-10-10 06:00 · Score: 1

No, I believe what was on /. a while back was a system to put an LP on a (commercially available) flatbed scanner and get a (crappy) wav from that. The laser thing would probably be much better quality.
Re:Government Spending by Anonymous Coward · 2004-10-10 09:34 · Score: 0

jesus christ guy, don't you think they mean the more obvious works in the library of congress. If they can do 95% of the documents for 260 million and the price increases exponentially thereafter stop at 260 million. It's not that hard to figure out; it's stupid to not at least try and see what you can get for that much money. 50% is infinitely greater than 0%
Re:Government Spending by gerardrj · 2004-10-10 15:27 · Score: 1

I'm not complaining about the cost at all. I'd say do it if it costs $2 Billion. I'm all for open access to knowledge.

Take the money from the military and "intelligence" budgets.

--
Article X: The powers not delegated... by the Constitution...are reserved...to the people

wow, I am insane, no, really, I am by Anonymous Coward · 2004-10-09 19:19 · Score: 0

now, let's just take this a little over the edge and see what happens.

First, we need to get the library of congress online
Next, we will merge it with archive.org and other relevent places
Then we buy out google and add that to it also
Now that we controle all of this data, we set it up so that when you use information from it, you automaticly pay for it
o, dont forget to allow people to be able to upload stuff to it and get paid if it is looked at
next, let corporations controle the government
o, wait, we allready do
allright, well, we will just let the corporations arm their employes
and become indipendent states

now, all you have to do is make the mafia deliver pizza's

want to try some Snow Crash?

why english? by c4thy · 2004-10-09 19:28 · Score: 0, Interesting

what makes you think all the contents of the library of congress are in english?

--

i am convinced that "/.ers" are homosexuals and imma make that my "sig"

Very true by Anonymous Coward · 2004-10-09 19:32 · Score: 0

Some of the contents are written in American "English".

I propose GNU/(Library of Congress) by Anonymous Coward · 2004-10-09 19:35 · Score: 0

If every person goes into the Library of Congress, borrows a book and scans it, the job is done! :-)

~1 dn`7 ndr57nd u~ by bot24 · 2004-10-09 19:39 · Score: 1

Only five? Would that mean that Klingon and 1337 are included? Would the books have to be written in that annoying overly abbreviated dialect that some AOL users speak? Sure, it is faster to write, but it takes for ever to make the ascii art with the tilde and degree marks.

Works great until by SuperKendall · 2004-10-09 20:06 · Score: 1

Someone googles for a paragraph from your book, and comes up with two different results...

--
"There is more worth loving than we have strength to love." - Brian Jay Stanley

I have to ask... by Civil_Disobedient · 2004-10-09 20:06 · Score: 4, Insightful

As an author, I wonder how much of your valued craft was honed by reading the work of others for education and inspiration. How many books did you buy in elementary school, or high school? Yet that's where you learned your precious language skills you now market.

Knowledge, even the limited knowledge of an author, does not exist in a vacuum. You read, you learn, you practice, then you create. You could not have done this without the beneficence of others who aren't making a dime off the education they provided you.

To unleash the vast amounts of knowledge stored up in the LOC to the world would be one of the single best things this country could do for mankind. One book, one reader my hairy ass. Why not open the floodgates so everyone can benefit?

I understand the motivation of monetary incentives, but I also know a lot of great authors who died penniless. And they were at least brave enough to sign their names to their ideas.

Plain Text by interactive_civilian · 2004-10-09 20:08 · Score: 1

Jeff DeMaagd said:

File formats tend to change too.

Well, the .txt format hasn't changed much has it?

If we are talking about text documents in the LOC, then they should be scanned as plain text (or Rich Text at the very most) to at least preserve the contents in a format that is pretty much basically standard to every computer for at least the last 30 years (not counting unicode).

Of course, I'm sure the Copyright Police would have something to say about preserving the LOC in such an open format...

--
"Empathise with stupidity, and you're halfway to thinking like an idiot." - Iain M. Banks

Re:Plain Text by Anonymous Coward · 2004-10-10 17:25 · Score: 0

Well, the .txt format hasn't changed much has it?

Oh no, not again ...
I hope you mean UTF-8, which, btw, has changed (but fortunately backward compatibly).

I, myself, would recommend (UTF-8 encoded) (x)html as it would allow pictures and (to some degree) layout to be preserved too.
Re:Plain Text by CatMan79 · 2004-10-11 03:29 · Score: 2, Insightful

But what about all the pretty pictures? I can think of a good many textbooks or art collections that would be rather worthless without the images. Including high resolution images in addition to plain text would take a TON of disk space--is this factored into the proposal?

Huge benefit to society? by a+whoabot · 2004-10-09 20:31 · Score: 1

Yeah, just what we need to keep the spiral of information addiction we all have going.

Like the Wikipediaholic who reads articles to find answers to questions no one asked, we're all writhing addicts to information systems.

"(well, at least to those who can read English)" by devhen · 2004-10-09 20:31 · Score: 1

I would imagine a database of documents would be easier to translate than their physical counterparts.

Scanning can be done for 'free' by Anonymous Coward · 2004-10-09 21:28 · Score: 0

As can be seen here at the bottom of page 1: http://www.loc.gov/fsd/fin/pdfs/fy03.pdf
the library part of the LOC costs $ 353 million annually.
So in two years max, the proposed operation could be done budget-neutral.
Anybody want to buy some spacey, sturdy storage room?

Re:well, at least to those who can read PRINT by Anonymous Coward · 2004-10-09 21:34 · Score: 0

Hey, doesn't the LOC have foreign languages?? The Chicago public library sure does, so I would expect LOC to have some too.

They even have a copy of the Gutenberg Bible, don't they? It wasn't in English. Which brings me to...

Project Gutenberg! This LOC project would be wholly redundant to Project Gutenberg's work, and might save time by cooperating with them.

hmmm.... by Atrax · 2004-10-09 21:37 · Score: 2, Funny

well, at least to those who can read English

So that leaves out most Americans. Thanks from the rest of the world!

(tongue firmly in cheek)

--
Screw you all! I'm off to the pub

not really by iamnotacrook · 2004-10-09 21:38 · Score: 1

it wasnt really such a problem in the end, since in those days they didnt have as much knowledge as we do now. i would say it is much more important to protect todays internet than books in those days. and definitely the renaissance stemmed from galileo because he broke the power of the church.

Re:not really by ahknight · 2004-10-09 22:51 · Score: 1

definitely the renaissance stemmed from galileo because he broke the power of the church.

Quit parroting bad history. Galileo's problems with the Catholic Church had nothing to do with its power at the time (which was more the power of the people in it in politics than the Church as a discrete entity) but more to do with the fact that Galileo used the concept of heliocentricism (which most people already believed at the time) to question the inerrancy of Scripture (which would also question the authority of it, which would section the Church). This would, obviously, anger any church.

Galileo did do good science, however, and his work started the scientific renaissance, but not the artistic one that is so popularly generalized as The Renaissance which had its beginnings in place 200-300 years before Galileo was even born (1200s in northern Italy vs. late 1500s for Galileo).
Re:not really by davegaramond · 2004-10-09 23:10 · Score: 1

well, how do we know how much knowledge were lost? for all we know, we could have been more advanced by say 30-40 years had the library persisted...
Re:not really by Anonymous Coward · 2004-10-10 09:45 · Score: 0

More like 300-400. At least.

LOC has more than English by Jonathan · 2004-10-09 21:54 · Score: 1

I live in Washington and often go to the LOC on Saturdays (it's closed Sundays) -- it has a large collection of books in lots of different languages -- even Esperanto and Volapuk!

That's a lot of developers .. are they off-shore? by ReidMaynard · 2004-10-09 21:56 · Score: 1, Funny

Amazon already has 65,000 developers who are working on ways to plunder information on its site for their own ends. The payback for Amazon is the selling of more stuff through its site.

WOW..from Amazon's open job page it seems most developer jobs are in Washington state. If we assume a salary of $100,000 per this results in Amazon spending 6.5 Billion dollars per year, just on developers.

Or maybe this is a typo.

Of course you could do some cost cutting and move it off-shore for less than $500,000,000.00/yr. Anyway, they seem to have a lot of openings.

--
-- www.globaltics.net

Political discussion for a new world

Just make out-of-print books copyable immediately by r6144 · 2004-10-09 22:03 · Score: 1

I think a delay after the books become out-of-print is not really necessary. On the other hand, we don't immediately make the work public-domain when it becomes unavailable, we just make it legal to copy and distribute such works in any way during the unavailable period, and if the copyright holder decide to make the world available again (within the copyright term), other distributors should stop upon notice.

In this way a website can be set up to disseminate digital copies of out-of-print works, and taking down the thing should not cost much in case the thing become in-print again (a dead-tree version of this may be risky, since printing has some considerable up-front cost, but this has at least become legal, while it is probably illegal under current law), and much much fewer valuable works will get lost. As for the "authorized" publisher, if the sales have fallen so low that it has become unprofitable to keep the printer running, they should have got most of the profit they can ever gain anyway, and even if the work happens to become popular again they can still restart printing and gain most of the profit from the exclusive copyright.

Re:That's a lot of developers .. are they off-shor by jas79 · 2004-10-09 22:23 · Score: 2

I think that they are talking about developers who use Amazon's webservice.

this is available at the french national library by Anonymous Coward · 2004-10-09 22:51 · Score: 1, Informative

you can do this at the french national library (see http://www.bnf.fr/pages/zNavigat/frame/accedocu.ht m, yes its in french)....

Re:If Bill Gates -- Not So Fast! by odin53 · 2004-10-09 23:03 · Score: 1

The endowment for the Bill Gates's philanthropic foundation is currently more than $20 billion. As of 2003, he had already donated more than $5 billion, mostly to global health organizations and education causes. He has been saying for years that he plans to give eventually 95% of his wealth away. While $5 billion (so far) in 2004 dollars is still less than Carnegie's lifetime philanthropy (about 1/2, after giving effect to inflation), I wouldn't call it a "far cry", as if $5 billion is pocket change and Gates doesn't still have many more years to donate his money.

A huge benefit to society by OutOfMyTree · 2004-10-09 23:07 · Score: 1

Hang out the flags -- this is a brilliant project. It would be a huge benefit to society, even if works still in copyright were not made freely available. And the benefits are not limited to the English-speaking world, just as that world has benefited from plenty of material not written in English originally.

I hope the Library of Congress are already scanning all books where there is reason to suspect they have the only copy. There are plenty more where this is our best hope of being able to read the contents in a reaonable timeframe. There are millions of books which are not 'rare' but for which this will provide the most convenient form of access. I hope they work out that cooperation with Operation Gutenberg should multiply the good effects of both projects.

A good companion project would be a campaign to honor those authors who voluntarily put their works into the public domain before the last tendrils of copyright law relinquish their grip.

It is good that the UK's equivalent, the British Library is involved in a project which will preserve and copy endangered archives around the world. The budget is not on the same scale though! http://www.bl.uk/cgi-bin/press.cgi?story=1418 (and the story mentions that the charity part-funding the programme is also trying to preserve endangered languages).

Pilot Program by PMuse · 2004-10-10 01:14 · Score: 2, Insightful

Putting it all online would let people get copies of it for *gasp* FREE. Can't have that, now can we?

No, we can't... it not be fair to lots of people whose copyrights haven't yet lapsed.

Let us scan only things for which the copyright has lapsed. This has several advantages.

Promotes and makes accessible works that are now free. (Project Gutenberg would be over the moon at a $1MIL grant, let alone $260MIL.)
Provides citizens a cheap method of checking that a copyright has, in fact, expired for debunking false claims of continuing copyright.
Shows the public what a public domain is and why it's valuable. Helps demonstrate that perpetual copyright is a theft from the public at large.
Is far cheaper than scanning everything, requires no legal battles, and needs no DRM.
Avoids promoting works still subject to copyright, which is the job of their owners, not the govt.

--
"We reject as false the choice between our safety and our ideals." --The American President (20.1.2009)

French National Library by Anonymous Coward · 2004-10-10 01:25 · Score: 0

The French National library has been scanning its archives (only books whose copyright has expired) and now has 70,000 books and 80,000 images scanned and available in PDF and TIFF formats for free download. The homepage for their "Biblioteque Numerique" is http://gallica.bnf.fr/

Some Jules Verne anyone?

Better Access for Everyone by azemon · 2004-10-10 02:01 · Score: 3, Interesting

What a cool idea and, even "if" the dollar estimate is too low, who cares? $260M is chump change for our gov't.

Right now, the only way to access the stuff in LoC is to go there in person. Anyone can do it but you have to travel to WashDC and pass through security and so forth to get into the LoC public reading room. Then you have to ask the librarian to pretty-please bring you the book that you want.

Now imagine that you can access any item in the LoC by simply entering the building and using a public kiosk with a browser. LoC's software would only permit use within the copyright so that is OK. But you don't have to mess with as much security because LoC isn't handing over the physical book.

Now imagine that, from any web browser, you can access any book in the LoC for which the copyright has expired. I like that idea!

My opinion... skip the buy on the next couple of cruise missiles and digitize LoC's books instead.

Oh yeah, before I forget, LoC already has tons of seriously neat stuff online. My favorite is this collection of tons photos from Russia. These were taken between about 1907 and 1915! I don't know about you, but I never dreamed that I would see color photos that are almost 100 years old.

Cheers,
-- Art Z.

Project Gutenberg by cpghost · 2004-10-10 02:13 · Score: 2, Interesting

Now imagine that, from any web browser, you can access any book in the LoC for which the copyright has expired. I like that idea!

That's the idea of Project Gutenberg. It's been around for quite some time now, and everybody is free to join their distributed proofreading network!

--
cpghost at Cordula's Web.

Re:Project Gutenberg by azemon · 2004-10-10 02:17 · Score: 1

Yes, Project Gutenberg is very cool. But LoC has different stuff so I'd like to see it on-line also. IMJ, more choice == goodness.

Cheers,
-- Art Z.

Print on demand by bitingduck · 2004-10-10 02:37 · Score: 1

I walk to the librarian and pay the purchase price. She fires up a local print run on the library's new laser book printer

I keep seeing stuff about print-on-demand coming to bookstores some time in the not to distant future (but it never seems to get here). Using LC as a source for at least some books (public domain, out of print) would be a nice extension. UMI already prints copies of theses on demand-- if you want to order a copy of a PhD thesis, you give them a credit card number and they shoot you a freshly printed and perfect-bound copy of the thesis in the mail.

Life Expectancy. by Anonymous Coward · 2004-10-10 02:50 · Score: 0

"Side note: this was back when copyright lasted 14 years, and *could* be extended another 14. But that was it. None of this "milking the work of your great-great-grandfather nonsense.""

Another side note: The life expectency was lower back then. 28 was effectively one's life.

Re:Life Expectancy. by Dun+Malg · 2004-10-10 04:02 · Score: 1

Another side note: The life expectency was lower back then. 28 was effectively one's life.
Bullshit. Thomas Jefferson lived to be 83. Average life expectancy was pulled drastically downward by infant and child mortality, but if you made it to age 21, you could expect to easily live into your 60's. Here's a quickie on the reasons why the "avg life expectancy was 25 in the stone age" theory is a load of crap based on statistical oversimplification.

--
If a job's not worth doing, it's not worth doing right.

Already Happening by Anonymous Coward · 2004-10-10 02:55 · Score: 0

I used to work in the music department of the Library of Congress, and there was already scanning going on, in fact there was about three terabytes of books all ready to go up on the web and pictures of a collection of over a thousand flutes that is kept at the library were also being prepared to go up. They also already have a large amount of audio files up for grabs. http://memory.loc.gov/ammem/browse/ListSome.php?ca tegory=Performing%20Arts,%20Music
As well, on the topic of how many books actually get in to the LOC, the copyright office is in the same building as one of the three LOC buildings, so it's pretty easy for stuff to make its way there.
-Julius

Er-Human Nature. by Anonymous Coward · 2004-10-10 03:05 · Score: 0

As usual technology isn't the problem (Echelon), but human nature. Would we really be having this long drawn out discussion if everyone played by the rules?

"The people that will be able to figure out what the _real_ answers are to these issues are the ones that will do really well. Think about it. "

How to bend human nature away from our baser side. Truly a herculean task. Look at how well we've done elsewere (drug war).

It has already been done by Russians by Uzbek · 2004-10-10 03:05 · Score: 1

www.lib.ru is a project started as a hobby by Maxim Moshkow in 1994. Today it's a comprehensive online library containing over 20,000 text and 37,000 other files. Uzbek. (they did have some issues with copyrights!)

Who needs backups? by mwa · 2004-10-10 03:08 · Score: 1

Once it's up, just put it online and let Google cache it!

The complexity of avoidance. by Anonymous Coward · 2004-10-10 03:11 · Score: 0

Compare the complexities of all the schemes to avoid paying one's fare due (ad this, no-ad that, e-paper, etc). Verses the simplicity of the present system that works.

Re:The complexity of avoidance. by 10101001+10101001 · 2004-10-10 07:55 · Score: 1

Should we give a monopoly to one soft drink company? One car manufacturer? They all have to fight with each other and have complexity to gain money. Yet they all are still able to prosper. The free market gives a fair due by guaranteeing an optimal societal and supplier surplus. And the amount of money earn decides if such a company should exist or not. In the free market, the price is a combination of the supplier's willingness to sell and the consumer's valuation of the product. What is more fair than these two forces working together to set a price? To act like copyright is the only exception and should be elevated is your overvaluation of copyrighted works which merely overprices it for the rest of, decreasing consumer surplus. Unless you can actually *prove* that copyright is an exception and is worth more in external effects than the free market accounts for, I don't see why leaving copyrighted works companies/people to find a complex system to substain themselves is a bad thing.

--
Eurohacker European paranoia, gun rights, and h

Does the money estimate add up? by sbaker · 2004-10-10 03:45 · Score: 2, Interesting

According to the LOC website, they have 119 million items in the library.

They tell us that there are:

4.5 million maps.
14 million 'images' ...so I guess we assume the rest are books and newspapers.

So in round numbers, let's say there are 50 million books and 50 million newspapers, periodicals, comic books, etc.

$260 million to scan all that stuff? $2.60 per book or newspaper? That seems a little unlikely. The book would have to be carried off the shelf to the scanning machine, mounted in the machine (which would clearly have to turn the pages and scan and index them 100% automatically), the title and such would probably have to be typed in manually, then the book carried back to the shelf and placed back in the correct place.

I find it hard to believe that a machine for scanning newspapers could be devised that could turn the pages automatically...but even without that, the project is still possible. At minimum wage, you'd need to pay people to scan a complete newspaper in maybe 20 minutes.

Then some significant fraction of the collection would probably be too fragile for the automatic page turning machines...the cost of hand-scanning those would be FAR more than the bulk of the books. Some books would be *so* fragile and valuable that scanning them would be a considerable expense.

Then there is the cost of the storage media. Suppose those 100 million books and newspapers had just 100 pages each on average. To get a readable image of the page you're going to need to scan at maybe 2000 x 2000 resolution. So we'll have something like 10^16 pixels, let's be generous and allow 100:1 compression ratios - and one byte per pixel. So we have 1000 terabytes. That's a lot - but to put it in context, it's only about a fifth of the amount
that Google is estimated to have in their main cluster. Goggle spent $250 mil to buy that - so maybe only 20% of the LOC's budget needs to be for storage.

OCR'ing and indexing all that data would be an incredibly valuable thing - the extra storage is trivial and the cost can be low if you aren't in a hurry to get the project done. Just stick a few thousand PC's in a room and wait!

Dunno - $260 mil sounds like a low end estimate to me - but it seems do-able.

--
www.sjbaker.org

Fantasy by Anonymous Coward · 2004-10-10 04:23 · Score: 1, Interesting

Some wild assumptions flying around here. Even if the LOC could get funding for the project, and even if the publishers did not tie it up in the courts for decades, there is still the questionable assumption everyone seems to be making -- that LOC would make the digitized texts available free. There is no reason why they should be expected to do that. In fact, they would most likely have to charge for use of copyrighted materials. The fee would necessarily include some sort of negotiated reimbursement to the copyright owner/publisher. Otherwise, publishers would just stop contributing their books to the LOC.

But, there's an even bigger fantasy involved. Does anyone really think that the right-wing protectors of our morality who are running our government would stand by and allow a government agency like the LOC to spend tax dollars scanning gothic novels, adult literature, subversive tracts, revolutionary polemics, treatises on abortion rights, non-christian religious texts, pagan and satanic epistles, books critical of the administration, etc, and making them available to the country at large? The Shrub would burst into a Burning Bush instantaneously at the idea.

The only possibility is digitization of "suitable" and "defensible" public domain items, which is already under way in piecemeal fashion.

LOC has more than English-Stability. by Anonymous Coward · 2004-10-10 04:25 · Score: 0

What most people don't know is that other countries send their material here for safekeeping.

Say what one will, the US is relatively stable compared to some other places.

... Franklin/Jefferson/Madison hype... by ankhank · 2004-10-10 04:38 · Score: 1

> If this is such a wonderful idea why doesn't he
> get a bunch of artists, musicians and writers to
> donate their own work to this project and actually > prove the concept works?

It's been done, and the idea worked -- the United States convinced a bunch of artists, musicians and writers to donate their own works to the public domain AFTER A PERIOD OF TIME, THE COPY-RIGHT period.

Well, perhaps not convinced -- but promised and protected the copy-right, so artists, musicians and writers were encouraged to work and publish in the US.

Of course during the same first century or so, the US completely ignored the copyrights of other countries -- arguing it's the right of a poor developing nation to take whatever it needs of the intellectual source material to assure its survival. Only the large, established nations honor copy-rights, historically speaking.

MOD PARENT UP by Anonymous Coward · 2004-10-10 04:47 · Score: 0

*sigh* I posted before I saw this...

the Physical level by tepples · 2004-10-10 05:17 · Score: 1

You'd transcribe the data into plain old ASCII, perhaps UTF-8 if you wanted to preserve the original characters. Maybe make a version available that's marked up in XML if you want computers to parse/reason about the data within. Cryptographically sign the data, so that people can verify that their copy they hasn't been modified by some prankster, and make it available for download!

No matter how self-descriptive you make the bitstream, how would you suggest to preserve it against a catastrophe at the Physical level?

[0] That's another really annoying thing. The word "blue" has an E!! Damned marketing departments...

The thinking is that misspellings create more distinctive trademarks, and governments deem more distinctive trademarks worthy of a larger scope of exclusive rights.

Re:the Physical level by cortana · 2004-10-10 05:52 · Score: 1

Multiple backups, all over the place. I should have been more explicit about this.

If the digital domesday project files were plonked on an ftp somewhere, lots of people would mirror it--researchers and historians at universities, random curious members of the public who just want to take a look, etc.

Finally, it's not too difficult to copy the files to a variety of storage media, and (this is the important part) charge someone with the task of transferring the data to new media, as they are invented and move into common use. Although I think it's unlikely that we'll lose the ability to read data CDs in the short or medium term, relying on them working for all of time is foolish. :)

Government-subsidized pseudo-businesses by tepples · 2004-10-10 05:44 · Score: 1

Barnes and Noble has a right to make money without having to compete with government-subsidized pseudo-businesses.

Copyright itself is a government subsidy. Doesn't that make The Walt Disney Company into what you call a pseudo-business?

If you want to OWN the book, go to a bookstore.

And if neither Barnes & Noble nor BN.com carries the title I want, then what?

So ... space wise, by magefile · 2004-10-10 05:46 · Score: 1

So how much space would that be, in LOCs?

"Through laws"? Try corruption by tepples · 2004-10-10 05:47 · Score: 1

Settling scores with Disney this way does evil to most other copyright holders... two wrongs don't make a right.

Even if suddenly putting things into the public domain isn't acceptable, wouldn't phasing out the scope of exclusive rights on a sliding scale over the term of a copyright limit the damage?

The Disney problem should be addressed separately, and through laws.

Except who both makes laws and is not part of the problem?

Re:That's a lot of developers .. are they off-shor by anthony_dipierro · 2004-10-10 05:55 · Score: 1

More specifically, this is probably developers who have signed up for their (EULA-laden) API.

This license does not include any resale or commercial use of this site or its contents; any collection and use of any product listings, descriptions, or prices; any derivative use of this site or its contents; any downloading or copying of account information for the benefit of another merchant; or any use of data mining, robots, or similar data gathering and extraction tools. This site or any portion of this site may not be reproduced, duplicated, copied, sold, resold, visited, or otherwise exploited for any commercial purpose without express written consent of Amazon.com.

Not very much incentive for us developers to "plunder information on [Amazon's site for our] own ends".

Creating Jobs AND Creating benefit by the_REAL_sam · 2004-10-10 06:50 · Score: 1

You've seen Them boast about creating JOBS when they're really just creating WORK. This project creates jobs and REDUCES work. Very nice.

A remarkably good proposal. =)

--
"Forgive us our trespasses, as we forgive those who trespass against us." -Jesus Christ The Lord's Prayer

And then when Europe goes to life plus 90 in the 2010s, then what will happen? You get a leapfrogging effect.

Register before legal action by tepples · 2004-10-10 07:34 · Score: 1

In order to sue an alleged copyright infringer in the United States, you have to register the copyright in the work in question first, as part of the procedure for establishing evidence of copyright ownership. Thus, the LoC has a copy of every work whose copyright has been enforced.

www.loc.gov by pNutz · 2004-10-10 07:35 · Score: 4, Informative

Of course this instantly deteriorates into a discussion about the shameful state of IP and copyright laws, the need to pool all human knowledge, and how crappy the US budget deficit is.

If you go to the LOC's site, you'll notice American Memory on the front page.

American Memory is where you can get a good portion of the public domain stuff (books, letters from immigrants to their families back home, photos of civil war enlistees, audio, Edison-era short movies) for free in a low-quality format. Archival quality copies and custom scans/recordings are available for $$$. Almost any work in the LOC can be scanned on request (3 week waiting time or so); this is how they manage to continue adding scans to their collection without requiring public or private funding. It's underfunded as it is and needs more bandwidth.

This idiot in the article's proposal is completely unrealistic. Books can contain 100,000 to 5,000,000 characters. That's 100k-5Mb per book, times 26,000,000 books. That's not including the images and illustrations in some of these works. Many of the texts have value beyond the words they contain. We may be talking about image scanning the pages to preserve the look of the type, paper, and images. Archival TIFFs, since that's what the LOC uses.

The article also mentions $60 thousand to 'store' this data (per month?, per year?, just once???, what about access?, searching?, redundant backups?). Another unrealistic number, even working off of the 1TB estimate.

--
Death and danger are my various breads and various butters.

Distributed Proofreaders by tepples · 2004-10-10 07:36 · Score: 1

I'm not convinced that OCR quality is good enough today to store the books as ASCII text. You're going to be doing a lot of work making the scan

A lot of work by Distributed Proofreaders?

Re:Distributed Proofreaders by anthony_dipierro · 2004-10-10 07:46 · Score: 1

They've finished what, 5000 books in 4 years? How long will it take them to finish the Library of Congress? We'll have good OCR software that can do it automatically long before they'd ever get done.

The Gutenberg project hasn't even managed to get Encyclopedia Britannica 1911 done yet. Distributed proofreading is not the solution. The time wasted on these efforts should be spent toward making natural language processing software which could do this work automatically.

Could the addiction in fact prove healthy? by tepples · 2004-10-10 07:39 · Score: 1

Yeah, just what we need to keep the spiral of information addiction we all have going.

We're all addicted to air, water, and food, but nobody complains that those addictions are always unhealthy.

(Frankly, I'm more concerned about my 18 month old cousin's addiction to Winnie the Pooh videos.)

Way Inflated by Galahad2 · 2004-10-10 08:10 · Score: 1

The 3 petabyte one is way inflated. 2 petabytes of that is 3.5M sound recordings. Doing a little math, they're assuming that each of those 3.5 million recordings takes up 600MB. I guess they consider each sound recording to be a full CD. First, most audio CDs don't fill up the entire 600 MB. Second, most of those recordings probably aren't entire CDs. Third, you can compress audio very well. Even losslessly, you can compress AIFF files at least 2x.

I'd say the upper bound on the Library of Congress is about 1 petabyte.

Monkies! by Fubar420 · 2004-10-10 08:12 · Score: 1

Do this the cheap way.

Give infinite monkeys access to the internet, and allow them to type in documents.

Eventually, you'll have every item in the library of congress at your disposal, and searchable via pigeonrank.

Hazzah!

--
-- (appended to the end of comments you post, 120 chars)

Re:the Physical level, media+hardware+training by UnapprovedThought · 2004-10-10 08:53 · Score: 1

Multiple backups, all over the place. I should have been more explicit about this.

To break it down more explicitly (I am agreeing with you but am warning that the devil may be in the details) you would need:

Multiple backups of the media, which may become unreadable even on the same drives that recorded them due to bad media quality, and
Multiple backups of the hardware, because the quality of the drives themselves may be too low to last more than the 1 year warranty.
An unbroken line of training to assure that there always exists the engineering expertise to make head or tail of old media, old hardware, old software.

With enough (masssive) redundancy maybe a future Alexandria-style fire event can be avoided. It may be cheaper in the long run just to produce quality products with the redundancy built in to the technology, but the current distribution infrastructure seems to favor only the survival of commodity vendors.

Either way, it would be a great great boon to research to put so much wisdom at everyone's fingertips (now if only if we can get Congress itself to use it...)

Only those who speak english? by HexRei · 2004-10-10 09:20 · Score: 1

Wouldn't one of the chief advantages of scanning it be OCR'ing it, and then being able to translate it using software assistance?
It sounds like anyone could benefit from this.

Where do I send my dollar? by Corwyn+ap · 2004-10-10 09:28 · Score: 2, Interesting

$260 million is $1 per US citizen. A bargain if ever there was one. I suspect that this estimate is extremely low.

The hard part is, of course, proofreading. See distributed proofreading at http://www.pgdp.net/c/default.php

Let's get started on the out-of-copyright stuff NOW. Maybe b the time is online, people will see the benefit of making everything available.

Thank You Kindly.

Re:well, at least to those who can read PRINT by ForestGrump · 2004-10-10 09:46 · Score: 1

project gutenberg.
i like their audio books burn to cd and listen while i drive.

--
Is it true that more people vote for the winner of American Idol, than vote for the president? -Ali G.

Two birds with one stone by Anonymous Coward · 2004-10-10 10:05 · Score: 0

The copyright/IP issues with this are huge and significant. It is simply a bad idea to unleash all of this material without addressing this issue. That said I think I have a good idea for dealing with this problem and addressing another major issue as well, the deteriorating state of public libraries.

First, we spend whatever it takes to get all this information online. Then we make it accessible ONLY at libraries via a secure library network. You must visit a terminal at a public library (and have a library card) in order to access books over the LOC web. This way the information is made widely available but the number of people accessing is still limited. Perhaps 50k people can be using the network at any time, depending on the number of access points in local libraries throughout the country.

This IMMEDIATELY expands the value of small branch libraries throughout the country. They can all continue to operate much as they do now with physical books while simultaneously offering this great new service. It will dramatically increase support for libraries and also control the information.

It may even be possible to create a check-out policy. You select an LOC text and a disc is burned with the data for you to take home that must be returned to the library much as other things are. I know the DRM will be cracked quickly but the fact remains that THE VAST MAJORITY prefer a book to reading on a screen.

Until there is the ability to mass produce pirated versions that RESEMBLE THE ORIGINAL book, like happens with cd's and dvd's, piracy on a mass scale will not be a serious problem. Certainly not much greater than it is now.

Anyway, the idea here is to use the network of local libraries to control and distribute this information.

All of this of course requires repealing and SPECIFICALLY PROHIBITING the fbi from snooping in library records.

One advantage by Anonymous Coward · 2004-10-10 10:16 · Score: 0

Congress doesnt need more warehouse to store more books. Everything can be scan in, and store on DVD/hard drive and can be online for everyone to search. The best thing - library doesnt need expensive air conditioning and humidifier to control book age. It may cost $260 millions, but save billions in other cost.

social issues by MikeFM · 2004-10-10 10:41 · Score: 1

Ever been an artist? The fact of the matter is that most artists don't get paid, or are very poorly paid, now. It sucks but IP laws haven't made many artists rich and haven't even made very many a decent living. The system is already disfunctional before you even start considering what technology is going to do with it.

The only thing we can do to make it better is convince people that if they like an artist and want them to keep producing then they need to make an effort to support that artist. It's a social issue and not a technical issue and not something that can easily be forced by passing laws. If you're an average person then pick out those artists you like and donate money to them. If your rich then sponsor artists to create new works of art. Possibly a not-for-profit organization that takes donations for artist in general and uses that to fund young artists would be a good idea. Both consumers and well off artists could donate towards sponsoring new generations of artists.

The same issue exists for programmers, whom I consider a type of artist, in that we often are not well paid for our work. Especially those of us that give away our code for the public good could use more support from our users. If you use an opensource program you should consider donating to the developers. If you see a developer that looks promising you should consider donating to them. Pick one project a month and donate $10 to them. If even a fraction of the users of opensource sftware did this then there would be much more, and higher quality, opensource code available.

I imagine the same idea works for supporting the providers of any free service. Websites, etc.

It's the honor system. You can copy anything you want but if you continue to use it then you should make a donation. Yes, you can be cheap and not make your donation but by doing so you're hurting yourself too. Give what you can afford and convince others to do likewise.

--
At what price learning? At what cost wisdom? The price is a man's peace of mind, and the cost is his life.

Re:social issues by arminw · 2004-10-10 16:14 · Score: 1

Long before anyone ever thought of copyright, there were artists, such as Bach, Beethoven, Rembrandt, Milet etc. They were paid by persons of means who appreciated their skills. Maybe such a system as you are talking about could be worked out, such as is in place in Germany. There, every household owning a TV or Radio is taxed 18 euro each month to pay for broadcasts. Now they want to extend these taxes to PCs as well.

If all society benefits from the work of good artists, then perhaps all of society should pay in some way and all copyright is done away with. The reason for copyright was for the benefit of everybody, not just to enrich a few. If everybody benefits, perhaps everybody should pay. The biggest problem would not be in collecting the money, but figuring out an equitable way to reward the creators in such a way to give them an incentive to continue their creativity. The copyright system was supposed to do this and largely did before the digital age. Now however it is obviously broken and is not working any more. Modern digital technology has made the copyright system unworkable, just as the automobile has supplanted the horse and buggy.

The chance of the honor system working in our society are about as good as you getting all the money in your wallet back that you accidentally left on a transit bus in any big city.

--
All theory is gray

COPY RIGHT by coyotedata · 2004-10-10 11:11 · Score: 0

There r some people on the board who seem to remember what a copyright and English r!!!

IANAL, but if I recall correctly you must register copyright in the US to sue in the US. However if you have registered copyright in any other (BERN) country you can register in the US based on the date of that.

You automaticly have a copyright when you create something. If someone copies it you can sue them for damages, but you must register it with the copyright office first ($25 last I checked). If you register it before the violation, then you can sue for triple damages, even if you only registered in some other country. You still have to register the copyright in the US to sue, but having the copyright elsewhere counts as registering it before the violation.

Library of Congress by Jondo · 2004-10-10 11:38 · Score: 1

Could someone explain what exactly is in this library of congress? Is it just a big library of stuff?

Based solely on the name, I would infer that it would contain alot of US historical documents, government stuff, and what not. In which case, scanning would NOT "benefit" society as a whole. Perhaps US society. Not the rest of the world. Nobody else would really care.

Re:Library of Congress by This+Is+Ridiculous · 2004-10-10 20:02 · Score: 1

The Library of Congress is a lot more than just a collection of government documents. When you register a copyright in the United States, you have to send two copies of the work to the LOC. Many of those works are then sent to regional libraries around the country, but the LOC keeps the cream of the crop, so it has a collection of what it considers to be the best books, audio recordings, movies, and videos in the US.

Digitizing the LOC would be a boon to the English-speaking world. Unfortunately, it doesn't seem likely to happen...

--
Hey, you try to find an open nick these days!

DjVU... The format for this task... by sergio · 2004-10-10 14:15 · Score: 1

Recently someone mentioned to me that it is possible to put hunderds of books in a CD using DjVu. It looks like DjVu is the MP3 of books!

Take a look at http://www.djvuzone.org/

The curious thing is that there is great support for it under Linux and KDE in particular.

Re:No habla francais. by Binky+The+Oracle · 2004-10-10 15:01 · Score: 1

Damn those hundreds of thousands of authors all writing in English! How dare they write in the language their majority of their potential audience is most likely to understand? Can't they think of the children??? Can't we get some law passed saying that all works submitted for copyright must be made available in any language spoken by more than five people?

Sweet jebus, sometimes I wish I could use mod points on the original post...

Woo hoo - I finally got my cherry popped a modded down as a troll. I wonder if the modder recognizes the irony of branding "troll" on a post that was branding the original post a troll...

--

Slashdot comments... splitting hairs since 1997.

$260 Million Breakdown by Lord+Flipper · 2004-10-10 16:44 · Score: 1

$2500 - Scanners $89.99 - Text Bridge Pro (OCR) $259,997,410.01 - Labor (correcting OCR),

I do similar work on military tech manuals, and believe me, they've way underestimated the labor part. They'll never make it, unless the entire city of Bangalore decides to go for 17 cents an hour, then maybe.

English by Anonymous Coward · 2004-10-10 17:31 · Score: 0

Don't be silly -- LOC has hundreds of thousands of books in more than a dozen languages

Indexing - experience from a previous comparable e by RockDoctor · 2004-10-10 19:11 · Score: 1, Interesting

A couple of years ago the Harvard University Centre for Astronomy had one of it's collections of technical publications scanned in order to be put online. But to make the material actually usable they had to launch a program over the net for volunteers (predominantly amateur astronomers) to view the scanned pages and enter, by hand, the necessary bibligraphical information (authors, paper titles, etc), as well as to QC things (look for duplicated pages, missing pages, work out which of several scans of fold-out drawings is the best image, etc).

The scanning step was trivial (probably lots of bored students on minimum wages, getting brownie points from their professors); the INDEXING process has been going on for over 2 years now and is not yet finished.
NASA ADS at SAO: Historical scans currently in the ADS

--
Birds are not dinosaur descendants;birds are dinosaurs, for all useful meanings of "birds", "are" and "dinosaurs"

The Library? by This+Is+Ridiculous · 2004-10-10 19:52 · Score: 1

He uploads it to the CIC database--the Library, formerly the Library of Congress, but no one calls it that anymore. Most people are not entirely clear on what the word "congress" means. And even the word "library" is getting hazy. It used to be a place full of books, mostly old ones. Then they began to include videotapes, records, and magazines. Then all of the information got converted into machine-readable form, which is to say, ones and zeroes. And as the number of media grew, the material became more up to date, and the methods for searching the Library became more and more sophisticated, it approached the point where there was no substantive difference between the Library of Congress and the Central Intelligence Agency. Fortuitously, this happened just as the government was falling apart anyway. So they merged and kicked out a big fat stock offering.

--Neal Stephenson, Snow Crash

--
Hey, you try to find an open nick these days!

Re:Er -- public domain by testadicazzo · 2004-10-10 23:50 · Score: 2, Insightful

Well, there's a huge portion of it which is already in the public domain. So we could start with that.

While we are at it, let's scale back the copyright limits back to life of creator + 20 years (or even farther back as far as I'm concerned), and bring back more of the booty which the corporations have plundered from us, the public.

All that PRON! by cylcyl · 2004-10-11 04:22 · Score: 1

I hear that the LoC has one of the best Playboy collections in the world! This will put playboy.com right out of biz!

We can fund it ourselves by bradsears · 2004-10-11 13:20 · Score: 1

I've started a fund raising co-op at http://www.ideacradle.com/givesupport_virtual.php? currentIdea=11/ I concede this is a long shot but we will see.

--
I'm building co-operatives right now at http://www.ideacradl

Ulterior motives by Marxist+Hacker+42 · 2004-10-14 03:53 · Score: 1

Where I can appreciate some of what the Gates Foundation does- the majority of those three items (immunization, AIDS Research, and anti-poverty work) is far more about opening India and China as markets and sources of cheap labor, than it is about actual philanthropy. It's a clever thing to do with the foundation to look like Bill is helping people when he's really just building a bigger user base for Microsoft. But then again, Bill's object and purpose in life isn't to be a billionaire- and he's not going to be leaving his children with anything other than a legacy and maybe a $100,000 loan (or the equivalent in 2030 dollars to his 1970 dollars that started Microsoft) to start their own legacies- the money is beside the point for him. While I don't respect what he's done or his own technological ability- I do respect him for his REAL purpose behind Microsoft; a computer on every desk running an operating system that is as easy for the end user as a TV set. The billions? They're just what comes from realizing that dream in a totally unethical way.

--
SJW: a person who perceives an injustice, and while correcting it, commits a greater injustice.

What about the CoS? by Anonymous Coward · 2004-10-16 05:10 · Score: 0

Does the Church of Scientology register theirs?

well.. by Gregory-Eric · 2004-10-17 17:08 · Score: 1

Considering all things, I think the dollar amount is to low...way to low.
I do like the idea, just can get a very good visual of it running 2.5-3 times above the projected budget.

Slashdot Mirror

Proposal: Put Library of Congress' Contents Online

394 comments