Release of 33GiB of Scientific Publications
An anonymous reader writes "A Wikipedian, Greg Maxwell, has released 33GiB of scientific publications [note: torrent] from the Philosophical Transactions of the Royal Society in response to the arrest of Aaron Swartz for, effectively, downloading too many articles from JSTOR. The release consists of 18,592 scientific articles previously released at $8-$19 each and all published prior to 1923 and so public domain."
in response to the arrest of Aaron Swartz for, effectively, downloading too many articles from JSTOR
No, that jackass got arrested for not only hacking MIT systems but physically breaking into equipment cabinets and messing with the hardware. He did this so that he could "effectively download too many articles from JSTOR", but that isn't what he got arrested for.
I am all for civil disobedience when it is merited, but don't go crying when you get arrested and charged for that disobedience. Especially don't go trying to distort the reasons for your arrest to try and trick people into supporting you. There are so many causes I support where I wouldn't be caught dead with the "activists" pushing them because they do bullshit like this.
There's a connection here to rational ignorance. Rational ignorance is when you don't bother to understand and complain about the sugar quota, because it's only costing you a few dollars per year, whereas informing yourself and complaining about it would cost a lot more.
From George Will on America, Politics, and Baseball
From An Interview with Milton Friedman
The problem with this is that the sugar families note that hardly anyone is actively complaining and use this to argue "well, no one complains, so it's all right".
In the courts, the flow of money is tangible, whereas pervasive resentment masked by rational ignorance is not. JSTOR will attempt to use this to their advantage. The only way to drive a wedge into this equation is to make both sides tangible.
These old papers weren't published directly on the internet in 1923. Someone had to transfer all of them from physical form to digital form, page by page. That is a huge amount of work. Should we all be entitled to enjoy them free of charge? So who's paying the workers?
it is somehow immoral to charge $8 to fetch an out-of-copyright article
No, it's immoral to charge anything for any bit of information.
The benefit of giving someone the copyright is that they can distribute copies to everybody (for a cost, of course, if they so desire) without fear that the work might be copied. Whereas if someone has the only copy but cannot get a copyright, they will prevent anyone from obtaining the document. So, even if something is in the public domain, you don't automatically get a right to have access to the work in order to copy it.
Protection from access can be via lock (Royal Society), or can be attempted by trying to give you access only if you sign some agreement (JSTOR).
I also think that under some conditions the original work is in the public domain, but a derivative (say a retranslation, or a new photo, or some such) gets a new copyright.
So, it isn't just important that a work is in the public domain, but that many people actually have copies of it.
I thought the reason for librarians was to make data easy for users of the library to find. Then I go to do research and am astonished by the amount of effort required to find anything in digital format. I think my mom could come up with a better system... the librarians are failing.
So basically, the answer is "it's complicated, and it's harder than you think."
Speaking as both a producer and consumer of scientific articles, there is actually a simple answer to this: granting agencies need to mandate open-access publishing as a condition of funding. This still costs money, obviously, but I think there's ample justification for the agencies taking this into account when calculating awards. There should be a limit, of course - publishing open-access in Nature costs something like $7000, compared to $1500 for most non-profit journals.
Howard Hughes Medical Institute already started requiring this several years ago, and it effectively forced the publisher Elsevier to accept its terms. The NIH eventually went even further and mandated that all future NIH-funded articles needed to be uploaded to the PubMed Central database after six months. I don't think they even agreed to compensate the journal publishers; NIH-funded research makes up such a huge fraction of biomedical publications that they can do whatever they want. Since virtually everyone, including biotech and pharma companies, despises the scientific publishers, there is considerable political support for further moves in this direction.
Of all the academic paper databases to pick on, why JSTOR? It's a non-profit institution that tries to make as much as possible available at low cost. Zero cost? No. They have bills to pay too. And while I understand the principle that if it is out of copyright, it would be ideal if it were freely available to everyone, scanning thousands of documents takes time (i.e. money if you are hiring people to do it) and hosting thousands of documents for access by thousands of people costs money too.
If people want this to change, they have two legitimate options:
1) scan in the old, out-of-copyright papers themselves and make them available for free in a volunteer, "open" effort (paying for the website hosting is left as an exercise for the reader)
2) donate enough to JSTOR that they don't have to charge for access to public domain works anymore (if you have boatloads of money, make it a condition)
Ripping off JSTOR's hard work scanning in documents isn't a solution even if we are talking about public domain materials. "Public domain" doesn't mean "free access" as in $$$, it means "free to copy", as in if you can get an original copy, copyright does not prohibit you from copying it and doing whatever you want with it. If you've gotten access illegally in the first place, that's a whole other equation. You might not be brought up on copyright charges, but you could get charged with a different crime.
I love open access, and I have a couple of things published that way (just got invited to do another book chapter, which I'll squeeze in if at all possible), but the publication costs can really hit hard. The PLOS journals are $1350US on the cheap end (PLOS One - not sure why it's cheapest) to $2250US or $2900US for the others. It's hard to explain to a granting agency (or your supervisor, or a grant administrator) why you want to spend $3000 on a publication in a low impact factor journal instead of nothing on a pub in a well recognized journal.
I think the solution to this problem is going to have to involve the libraries. A little support from them for reputable open journals (there are quite a few not so reputable ones) would go a long way.
Transformative work such as translation or scanning onto a new medium, whilst laudable, should not gain a full new copyright but one in proportion to the effort. Translation is hard and often requires creative thought to translate cultural idioms, so maybe 20-30 years of benefit to reflect that there was effort, making it worthwhile but reflecting that it is not an original work. Format shifting, on the other hand, isn't amazingly hard nor creative and can often be automated, therefore the gain should be made from distribution and it shouldn't gain much if any additional protection (perhaps 5 years if it is from a medium classified by a national archivist as endangered).