Slashdot Mirror


FBI Investigates Liberator of Court Records

eldavojohn writes "Federal court documents aren't free to the public, they cost $0.08/page through a system called PACER. During a period when the US Government Printing Office was trying out free access at a number of courthouses around the US, a 22-year-old programmer named Aaron Swartz installed a small PERL script at the 7th US Circuit Court of Appeals library in Chicago — a script that uploaded a public document every three seconds to Amazon's EC2 cloud computing service. Swartz then donated over 19 million documents to public.resource.org. That's when the FBI took interest in the programmer responsible for this effort and ran his name through government databases. How did he discover this? His FOIA was approved, of course, and he received the FBI's partially redacted report on himself. The public.resource.org database was later merged with that of the RECAP Firefox extension, which we discussed a couple of months back." Update: 10/06 18:22 GMT by KD: Timothy Lee pointed out that the summary as originally posted garbled the Swartz / RECAP connection. Improved now.

10 of 445 comments (clear)

  1. 19,856,160 records at 3 seconds per record by circletimessquare · · Score: 4, Informative

    ((((19,856,160 x 3 sec)/60 sec)/60 min)/24 hours)/365 days = 1.9 years

    entirely doable

    --
    intellectual property law is philosophically incoherent. it is your moral duty to ignore it or sabotage it
  2. Re:Wow , at 8 cents a page for a PACER document... by dwillden · · Score: 5, Informative

    Yes it all costs money, and we the TAX payers have paid that money. Thus the works are public domain.

    --
    I'm too lazy to compose a creative sig.
  3. Re:So, it took 1.9 years? by kent_eh · · Score: 4, Informative

    > What am I missing?

    Document != page

    19,856,160 pages at 3 seconds per court document.
    I expect many (most?) of those court documents are multi-page documents.

    --

    ---
    "I can't complain, but sometimes still do..." Joe Walsh
  4. Re:Wow , at 8 cents a page for a PACER document... by BigRedFed · · Score: 4, Informative

    When something is in the public domain, you can still charge for reproducing or hosting it. You just can not prevent someone from copying and distributing it in the manner that they want as well. Public domain does not preclude paid access. Also, since people pay money to file court documents, IE filing fees, etc. There are instance where you paid NOTHING for court documents to be produced at all. Courts do not run wholly on tax dollars alone.

  5. Pacer charges even more than it says by Anonymous Coward · · Score: 4, Informative

    Pacer is worse than presented. Itâ(TM)s not just 8 cents a page for downloaded, itâ(TM)s 8 cents a page for any page you pull into your browser. They consider any Web page you surf on their site in search of the legal document to be a âoedownloadedâ document.
    I work at a newspaper and one of my reporters ran up a $250 bill with Pacer checking many times a day to see if an important local opinion were issued. When it was, it was just 4 pages long; I expected to pay 32 cents. Instead they said we owed over $250. We never paid it and consequently no longer use Pacer.

  6. Re:And where did he get the password? by PhilHibbs · · Score: 4, Informative

    RTFFBI report, they say that he ran the script from a location outside of the library using the library password. Either the FBI are wrong, or the article summary is.

  7. Re:Doesn't the FBI have better things to do? by natehoy · · Score: 4, Informative

    Well, it all depends. The information is "freely" available as in "free as in speech". You can go on PACER any time you want and download anything you want.

    However, PACER itself is not (or at least not fully) tax-funded, so it's not "free as in beer". There is a user fee involved if you want to download the originals off the PACER system, which funds the system and makes the documents accessible. Once you have a copy of a document, you are free to do anything you want with it including share the document with anyone you want, which is why groups like RECAP can re-share any documents they've paid for or had donated to them.

    This one is an interesting case, because the library access was initially set up so people could do free searches for small numbers of records, expecting a small number of hits. When the number of hits started skyrocketing, the government got suspicious as to who was collecting all of the documents and why. The FBI started an investigation, and it sounds like they discovered that nothing illegal was going on after all and dropped it. I'd say the number of hits on the system was enough to raise suspicion and justify a further look into what was going on, but that's one man's opinion.

    While I applaud Aaron's efforts on behalf of RECAP, the net result was the publication of a few million files (good) and the shutting down of the free access to PACER at libraries due to what PACER obviously thought of as abuse (not so good).

    If everyone expects/gets access to all PACER documents for free, then there won't be any money going into PACER to pay to scan the documents, organize them, and make them available. Then PACER will either cease to exist, or require additional taxpayer funding to continue since they won't be making any funds from user fees.

    I'm not saying that complete taxpayer funding is a BAD idea, only that it is not how PACER is funded at this time. RECAP's initial approach was to collect donations to get "first copies" of a bunch of records from PACER then make them freely available to all (or to ask people to donate "first copies" they'd already purchased). So PACER was making revenue, and everyone was happy.

    --
    "This post contains words, known to the State of California to cause thought. Wash brain thoroughly after reading."
  8. Re:Money by clintp · · Score: 5, Informative

    Pournelle's Iron Law of Bureaucracy also applies:

    "In any bureaucracy, the people devoted to the benefit of the bureaucracy itself always get in control and those dedicated to the goals the bureaucracy is supposed to accomplish have less and less influence, and sometimes are eliminated entirely."

    --
    Get off my lawn.
  9. Re:What's wrong with this picture? by NotBornYesterday · · Score: 5, Informative

    He created an executable file containing computer instructions written in a programmming language. How is that not a program? He received no authorization to install it, and therefore it was "unauthorized". The description "installed an unauthorized program" sounds right on to me.

    --
    I prefer rogues to imbeciles because they sometimes take a rest.
  10. Re:You know what pisses me off about stuff like th by amplt1337 · · Score: 4, Informative

    bleeding hearts are responsible for the national debt

    Actually, you're wrong.

    "social services" really ought to be handled by private organizations like they used to.

    You might want to do some research on the 1880s, and how effectively social services were handled by private organizations back then. Protip: they weren't handled at all. People died in the streets in massive numbers.

    Most of the cries of "ooh big government! big government!" that people love to wave around come from an ignorance of how important government programs are to maintaining social order and a modicum of well-being for poor people. Well, that and a gross misconception of how much of the federal purse is spent on social programs, versus the things that the libertarians actually think are worthwhile. (We could just as easily cut almost all of our defense spending, since it's pretty much worthless).

    --
    Freedom isn't free; its price is the well-being of others.