Slashdot Mirror


More PDF Blackout Follies

georgewilliamherbert writes "The latest installment of "As the PDF Blackouts Turn" hit today, with a U.S. government apparently releasing a redacted version of their court filing in the Balco grand jury leak case which merely stuck a black line over the text, which remains available in the document. As with prior documents, entering text cut/paste mode in a normal PDF browser such as Acrobat allows a reader to access the concealed text. Previous incidents include an AT&T filing in the NSA case." This works with Xpdf and KPDF, too; for KPDF, use the selection tool (under the Tools menu) around the redacted section, copy to clipboard, then paste into the text-manipulator of your choice.

103 of 309 comments (clear)

  1. Maybe by GmAz · · Score: 4, Funny

    Perhaps the people making these "blacked out documents" should be taught a little about Vector Graphics and that a black box is not the same as a sharpie. One word for them 'n00b'!!

    --
    Click Click Bloody Click PANCAKES!
    1. Re:Maybe by gEvil+(beta) · · Score: 3, Insightful

      You don't even need to go into vector graphics with these people. All you need to do is attempt to convince them that white text is still text, or that black text on a black background is still text. Either way, the text is still there. The only way to ensure that it's gone is to ACTUALLY GET RID OF THE TEXT.

      --
      This guy's the limit!
    2. Re:Maybe by Mirlas · · Score: 5, Funny

      Maybe we need to go back to good-old fashioned text files.
      It was good enough back in the days of wood-burning computers;
      it should be good enough now.

    3. Re:Maybe by Nutria · · Score: 3, Insightful
      these fellows at the NSA

      NSA? Since when does the NSA redact subpoenas for the District Attorney?

      --
      "I don't know, therefore Aliens" Wafflebox1
    4. Re:Maybe by HumanisticJones · · Score: 5, Funny

      Indeed it does seem I have allowed my mind to wander while writing a post. Anyone want to try out my new book, "Not Hitting Preview First: Making an Ass of Yourself in a Public Forum"?

    5. Re:Maybe by Doctor+Faustus · · Score: 3, Informative

      All you need to do is attempt to convince them that white text is still text, or that black text on a black background is still text. Either way, the text is still there.

      This is a confusion over the way the Adobe Imaging Model works, not white-on-white or black-on-black. In Adobe's model, you start with a blank page, and you essentially paint on it; newly drawn things cover previously drawn things. Basically, despite what the previous commenter said, it really is like a Sharpie.

      When you physically draw over something with a black marker, the previous text may be impossible to see, but it's still there. In the PDF, you'd only have to skip the instruction drawing the box to get the text out. Even if Acrobat didn't let you get at the text by cutting-and-pasting, someone familiar with the PDF format could still get to it with some work.

    6. Re:Maybe by morgan_greywolf · · Score: 2, Funny

      Maybe we need to go back to good-old fashioned text files.
      It was good enough back in the days of wood-burning computers;
      it should be good enough now.


      Definitely! Then we can redact things with fancy ANSI terminal codes ^[[30;40mlike this super secret hidden message[[m!

      w00t! No one will EVER figure how to defeat that!

    7. Re:Maybe by indifferent+children · · Score: 3, Insightful

      Just because you're releasing the 12th printing of the 4th edition, does not make this a 'new book'.

      --
      Censorship is telling a man he can't have a steak just because a baby can't chew it. --Mark Twain
    8. Re:Maybe by massysett · · Score: 4, Interesting

      "may be impossible to see" are the operative words there. Ever used a Sharpie to black out the routing number on the bottom of a check? You can still make out the numbers. One way I've found to really black them out is to Sharpie the numbers, and then Xerox that check. Even Sharpies don't work as they might at first appear to.

      Real redactors use razors. You hold up one of those redacted documents and it looks like a punch card.

    9. Re:Maybe by OldManAndTheC++ · · Score: 5, Funny

      It was good enough back in the days of wood-burning computers

      Oh man, that brings back some memories! Late nights cranking out code on my Bunyan 2500 - that puppy went through three cords of oak a week, and it kept the place warm to boot. And we didn't need any of that fancy book learnin' to make it work either; if you were a good hand at whittling, you could be a programmer. Never had a lick of trouble with the Bunyan, except for the occasional splinter. Oh sure, you had to keep some kindling around to get her started, but once she got goin' she could do anything - add, multiply, and of course, branch.

      Internet? Pfft. We modulated the smoke exhaust by opening and closing the flue - you could see it for miles, unless it was raining, or windy. Hell, we had peer to peer networks back before most of you guys were even a swimmer in your dad's testicals.

      There's still a few Bunyans around, if you know where to look. Auditors like them, since they're so good at logging, and keeping a paper trail. I think the Vatican still has one, though they only fire it up when they elect a new Pope. Ah, the good old days...

      --
      Soylent Green is peoplicious!
    10. Re:Maybe by IAmTheDave · · Score: 4, Funny

      Reminds me of one of my favorite Onion articles: CIA Realizes It's Been Using Black Highlighters All These Years

      --
      Excuse my speling.
      Making The Bar Project
    11. Re:Maybe by statusbar · · Score: 2, Funny
      No. no, all you need to do is to sell them 'magic-cyber markers' for $1000.00 that they can apply directly to their screen!

      --jeffk++

      (... a fool and his money are best parted... )

      --
      ipv6 is my vpn
    12. Re:Maybe by recursiv · · Score: 2, Funny

      I can't remember the last time I saw rot13 mentioned without without someone cleverly pointing out their habit of double or triple rot13. That would be a novel thing.

      --
      I used to bulls-eye womp-rats in my pants
    13. Re:Maybe by scribblej · · Score: 4, Funny

      I had one of them wooden computers once. Wooden keyboard, wooden monitor, wooden CPU... only one problem: it wooden work. :(

    14. Re:Maybe by Uncle+Kadigan · · Score: 3, Funny

      Yours could multiply? Ours could only add. In order to multiply, we needed to build a log table.

    15. Re:Maybe by frdmfghtr · · Score: 4, Insightful
      Perhaps the people making these "blacked out documents" should be taught a little about Vector Graphics and that a black box is not the same as a sharpie. One word for them 'n00b'!!


      Sometimes I wonder if these incidents are really "accidents" or somebody's way of feigning ignorance of technology to get the facts out to the public.
      --
      Government's idea of a balanced budget: take money from the right pocket to balance...oh who am I kidding?
    16. Re:Maybe by Gilmoure · · Score: 2, Informative
      --
      I drank what? -- Socrates
    17. Re:Maybe by TubeSteak · · Score: 2, Informative

      I un-redacted the PDF file, as an example for others. Instead of stripping out the black mask, I turned it red & lowered the opacity. It now highlights the 'redacted' portions quite nicely (and any underlining they used).

      Skip to pages 6-16 of the PDF for the not-so-hidden goods
      http://www.easy-sharing.com/528126/BALCO_quash_sub poena_sfchronicle_unredacted.pdf.html

      P.S. I did it in FoxIt PDF Editor Pro, which I wouldn't really recommend to anyone

      --
      [Fuck Beta]
      o0t!
  2. This is why... by dubmun · · Score: 2, Funny

    we should all just write everything down in pencil. Boo to technology.

    Or... they could just find a better technological solution. Seems like a no brainer to me.

    --
    (end of post)
    1. Re:This is why... by Analog+Squirrel · · Score: 3, Funny

      Are you saying "boo" or "Boo urns"?

      </oblig>
      --
      I'd rather be flying
  3. History repeats itself by alshithead · · Score: 5, Funny

    Perhaps after another dozen or so incidents they'll decide a little training is appropriate for the folks who are doing the redacting.

    --
    I reserve the right to think for myself. Others' opinions are optional. Puppy on lap = typos...not illiteracy.
    1. Re:History repeats itself by cavtroop · · Score: 5, Insightful

      No, more than likely they will just pass a new law, stating that "Copying and pasting of blacked out (redacted) lines is a felony" or somesuch...

    2. Re:History repeats itself by richg74 · · Score: 5, Insightful
      This is in principle a good idea. However, the implementation may suffer from a fundamental problem.

      My grandfather used to say that there is one irreducible requirement for training a dog: you have to be smarter than the dog.

    3. Re:History repeats itself by Anonymous Coward · · Score: 5, Insightful

      I think that's called the DMCA

    4. Re:History repeats itself by squiggleslash · · Score: 5, Insightful

      Alternatively, perhaps the technology is at fault. If the same mistake is made over, and over, and over again, many user interface experts would start investigating whether it's the UI, not the user that's at fault. The argument is that the mistake is being made because the correct solution is not intuitively obvious.

      I'd be curious to know what tool the users are using to black out the text. Are they just exporting from Word but, before exporting, "blocking it out" in Word? If so, how? Are they putting black blocks over text, or setting attributes of the relevent text? If these are the wrong techniques, what can be done to make the right techniques obvious (and the wrongness of these techniques equally obvious)?

      I've designed enough crappy UIs in the past and justified them with "It's user error! All they have to do is hit the OK or CANCEL buttons, of course it's not going to work if they close the window instead!" and other such stuff that, with hindsight, was utterly wrong and elitist of me, to know that technically skilled people are not the best judge of intuitiveness. The fact is, I'm a programmer. You're probably technically minded too. The average user isn't. We can't avoid making assumptions about what the user thinks works that are, on occasion, completely, 180 degrees, wrong. What we can do is own up to them and try to determine how to steer the user in the right direction.

      --
      You are not alone. This is not normal. None of this is normal.
    5. Re:History repeats itself by alshithead · · Score: 2, Interesting

      Good point but I think you're looking deeper than need be. The users are probably just not fully understanding what they are doing. The full version of Adobe and Word are both great examples of applications that have so many options, tools, settings, and functions that the average user of these applications probably never even begins to understand 50% of what is available to them.

      --
      I reserve the right to think for myself. Others' opinions are optional. Puppy on lap = typos...not illiteracy.
    6. Re:History repeats itself by gEvil+(beta) · · Score: 3, Insightful

      What happens when I actually want to print white text on a black background? Will I have to go through some convoluted process because setting the background as black doesn't actually change the background to black, but rather also eliminates any text contained within it?

      --
      This guy's the limit!
    7. Re:History repeats itself by squiggleslash · · Score: 3, Interesting

      If the user interface is designed well, you'll know exactly what to do, just as you'll know intuitively how to really redact text.

      If you're asking me to tell you how such a properly designed UI will work, you're asking the wrong person. It'd be interesting to get someone like Bruce Tognazzini to give their take on it. Right now, all we can be fairly sure of is that the UI isn't working because people are constantly choosing the wrong tool for the job.

      --
      You are not alone. This is not normal. None of this is normal.
    8. Re:History repeats itself by Eadwacer · · Score: 4, Funny

      Sometimes history doesn't repeat itself. Sometimes it picks up a big club and says "Weren't you listening the first time?" - Terry Pratchett.

    9. Re:History repeats itself by DarkVader · · Score: 3, Insightful

      You know, considering the state our government is in, I would much prefer that someone would build into all software going to the government an "unredact" feature to make it even easier to recover government coverups.

      Barring that, PLEASE don't educate them, or make it easier for them to really redact anything.

    10. Re:History repeats itself by DarkSarin · · Score: 4, Insightful

      Fortunately this does not apply to humans--not directly.

      I can easily train people that are smarter than myself, if the conditions are right. For instance, I know a fair bit about statistics and data analysis, and would be perfectly comfortable training certain folks in the field, as long as they didn't know more than I do. Even then, it perfectly possible for me to come up with a unique idea that someone smarter than myself hasn't (note that I didn't say couldn't) considered.

      In the public schools there are frequent cases of a teacher training a student more intelligent than themself. It is unavoidable, although it could be reduced by making sure only the smartest teachers were highered.

      Smarter? Not a requirement. More experienced? Having unique knowledge? Yes, that is required, but maybe not irreducibly.

      HAND

      --
      "We don't know what we are doing, but we are doing it very carefully,..." Wherry, R.J. Personnel Psychology (1995)
    11. Re:History repeats itself by geobeck · · Score: 2, Funny

      Perhaps after another dozen or so incidents they'll decide a little training is appropriate for the folks who are doing the redacting.

      No, more than likely they will just pass a new law, stating that "Copying and pasting of blacked out (redacted) lines is a felony" or somesuch...

      Train them to use the blackout method, but to replace the redacted text with "If you can read this, you're under arrest!"

      --
      Find environmentally and socially responsible products on http://buy-right.net
    12. Re:History repeats itself by indifferent+children · · Score: 3, Interesting

      Maybe Adobe Acrobat needs a new menu item: Edit->Redact Then you only have to train people to use that feature rather than the backgound-color feature.

      --
      Censorship is telling a man he can't have a steak just because a baby can't chew it. --Mark Twain
    13. Re:History repeats itself by GWTPict · · Score: 2, Informative

      Nah, he just can't spell.

  4. People...learn...? by Elros · · Score: 3, Interesting

    You would think that people would have learned after the first time around. Apparently not.

    --
    "And the geek shall inherit the earth."

    1. Re:People...learn...? by Billosaur · · Score: 4, Insightful

      You would think that people would have learned after the first time around. Apparently not.

      You're giving people too much credit; as has been noted in this forum many times, the average computer user is not exactly bright and doesn't read Slashdot, so they would have no idea that this is a problem. People just assume that if something appears to work a certain way, it in fact works that way.

      --
      GetOuttaMySpace - The Anti-Social Network
    2. Re:People...learn...? by jimktrains · · Score: 5, Insightful

      "Human beings, who are almost unique in having the ability to learn from the experience of others, are also remarkable for their apparent disinclination to do so." - Douglas Adams

      --
      "You will do foolish things, but do them with enthusiasm." - S. G. Colette
    3. Re:People...learn...? by gstoddart · · Score: 4, Insightful
      You're giving people too much credit; as has been noted in this forum many times, the average computer user is not exactly bright and doesn't read Slashdot

      You're giving people too little credit. Most people who use computers are probably fairly bright -- they're lawyers, doctors, accountants, and all sorts of things most people on Slashdot can't do. Reading Slashdot doesn't make you bright (in fact, given much of hte drivel, just the opposite.)

      But, they expect computers to work like a friggin' toaster, and to them, if the text it blanked out, it's not readable. They're not going to realize the 'black' is a representation of a rectangle in a different document layer, and that the actual internal tree of the PDF still contains the actual text. Really, how could they?

      They understand computers by metaphor and analog to the real world. They don't know or care about the actual internal stuff. Since the paradigms have been done to look like the real-world, these people assume that the rest of the things also apply.

      Many people use computers who don't have a full grasp on all of their intricacies. However, I haven't looked inside of a TV in 20+ years, but I'm comfortable using one.

      Cheers
      --
      Lost at C:>. Found at C.
    4. Re:People...learn...? by Nef · · Score: 3, Interesting

      Here's my problem with this scenario though. You could probably describe most of the inner workings of said TV, be it SD or HD, LCD /CRT/Plasma, without much difficulty. I'll grant you, you probably couldn't recreate the thing (unless you were an EE or something similar) but you have a basic understanding of how things work.

      This represents a fundamental difference between how geeks/nerds think, and how the population at large thinks. Those technically inclined, whether they're gear-heads, pencil-pushers or computer geeks all take pride in knowing the HOW and WHY of the inner workings of almost everything around them. In fact, of the 3 examples I listed, the only real difference is their own proclivities. Gear-heads are gear-heads because they LOVE cars, computer geeks are computer geeks because they LOVE computers, and pencil-pushers (aka bean-counters, or Analysts in modern corporate-speak) love the truth in numbers!

      This raises the question, from my geek perspective, "Why do some people not care to educate themselves on how the things they use in their life work?" I mean, aside from the obvious benefit of saving a metric butt-ton on services most people pay an arm and a leg for, you can work on just about anything once you get bitten by the knowledge bug.

      Also, IMHO, probably the biggest advantage to being the geeky type is the personal pride one feels when accomplishing something difficult (such as fixing their PC, figuring out how to properly redact text in a particular file format, or rebuilding your engine)!! While I realize that pride can be a bad thing, when it's the kind of pride that makes you happy to be who you are, capable of the things you are, thats a HUGE confidence boost and spills over into so many other areas in life, you'd be silly not to try and take advantage of it!

      Oh, and one last thing. I wouldn't be so quick to assume those reading slashdot can't do some of the things you listed. In fact, knowing a few of our fellow /.'ers personally, there are at least 2 lawyers, 1 doctor and 5 accountants that I know. And they chose those professions because they wanted the money/prestige, or because they truly love what they do. No, I'd be more inclined to say that /.'ers may choose not to do those things on par with the average in society at large, but especially fields requiring intense study and years of education, most definitely pull at the heart-strings of a true geek.

  5. which? by Anonymous Coward · · Score: 5, Funny

    with a U.S. government apparently releasing a redacted version of their court filing

    Which U.S. government?

    1. Re:which? by TwilightSentry · · Score: 3, Funny

      The RIAA

      --
      How to enable garbage collection on a system without protected memory: #define malloc() ((void *) rand())
  6. Works in Safari directly by Deep+Fried+Geekboy · · Score: 4, Informative

    You can open them directly in Safari and cut/paste into TextEdit too.

    --

    I'm not wrong. You haven't thought about it hard enough.

  7. works in older acroread too by FudRucker · · Score: 4, Interesting

    i keep an older version of adobe's acrobat reader for Linux version 5.0 and copy & paste in to a text editor works in it too...

    i hate the new acrobat reader. some claim it calls home to the mothership(Adobe) which i dont approve of either (spyware)...

    --
    Politics is Treachery, Religion is Brainwashing
    1. Re:works in older acroread too by Sir+Codelot · · Score: 2, Informative

      i hate the new acrobat reader. some claim it calls home to the mothership(Adobe) which i dont approve of either (spyware)...

      Then you should try Foxit Reader. Apart from being free, light-weight and best for everyday use, it also has got a 'Fox' in its name. :)

      --
      I have a truly marvelous proof of the Riemann hypothesis which this sig is too short to contain...
  8. Even more shocking by $RANDOMLUSER · · Score: 5, Funny

    What's this in TFA about Barry Bonds and steroids? I had no idea.

    --
    No folly is more costly than the folly of intolerant idealism. - Winston Churchill
  9. Redacting right is HARD by nweaver · · Score: 4, Informative

    Redacting electronic documents right is HARD. See, for example, The NSA's guide to redacting word documents as PDF.

    --
    Test your net with Netalyzr
    1. Re:Redacting right is HARD by fistfullast33l · · Score: 4, Funny

      From the document:

      This page intentionally left blank.

      I was going to say, those guys are goooood.

    2. Re:Redacting right is HARD by Rob+T+Firefly · · Score: 3, Funny

      Turning leaked information back into a secret, that's the HARD bit.

    3. Re:Redacting right is HARD by quokkapox · · Score: 2, Interesting

      Two things: 1) Why not have a handy context menu option, "Redact selection", available with a right click on the selected object? 2) Awwww, the NSA uses the little kitty cat assistant instead of Clippy. Just like my mom. Until I gave her openoffice.

      --
      it's a blue bright blue Saturday hey hey
    4. Re:Redacting right is HARD by gEvil+(beta) · · Score: 2, Insightful

      Why not have a handy context menu option, "Redact selection"

      Because management and clueless users will demand that there be an "unredact selection" menu option, also. I'll let you sort out the implications of that. Either that or original copies of documents everywhere will have text permanently blocked out by the above-mentioned clueless users and management types.

      --
      This guy's the limit!
    5. Re:Redacting right is HARD by More+Trouble · · Score: 4, Funny
      Looks like you're redacting that document. NSA Office Kitty can help! First, tell us what you're trying to hide:

      • gov'ment impropriety
      • financial cheating
      • illicit sex
      • other
    6. Re:Redacting right is HARD by RobertB-DC · · Score: 2, Informative
      Redacting electronic documents right is HARD. See, for example, The NSA's guide to redacting word documents as PDF

      At least it's obvious that the folks who know what they're doing, know that MS products aren't the best solution. From the doc:
      Microsoft Word XP/2003: Microsoft has attempted to remedy certain issues with Metadata in Office XP and up by including a menu option to remove personal information (metadata). There
      is also a tool available for free from MS, Remove Hidden Data 1.0 (for XP) and 1.1 (for Office
      2003), hereafter referred to as RHD, that allows batch removal information from Word
      documents. None of these will remove sensitive information from the main document; neither
      will they remove all metadata of possible concern. And RHD 1.0 suffered from stability issues.
      Reliance of these tools may give a false sense of security.

      The fact that MS tools are in use at all in these situations -- as opposed to free, open-source solutions that can be customized for high security applications -- may show the ineptitude of whatever management keeps signing off on their purchase.
      --
      Stressed? Me? Of course not. Stress is what a rubber band feels before it breaks, silly.
    7. Re:Redacting right is HARD by Pendersempai · · Score: 2, Informative

      It's not hard; people just have to manually delete (not obscure) data they want redacted. Then all outgoing Word files should be scrubbed of metadata. There are commercial packages, included in many groupware suites, that do this automatically. At the law firm where I work, every single Word file that gets emailed to an address outside the firm is automatically scrubbed of metadata by the server. If you try to save a document with Track Changes enabled, a dialog box warns you. If you try to email a document with Track Changes enabled, several layers of dialog boxes confirm that this is actually what you wanted to do.

      The procedure you link to has people scrubbing the metadata by copying all the content of the document and pasting it into a new document. This puts too much trust in the user and does not clear some types of metadata anyway.

    8. Re:Redacting right is HARD by MrCopilot · · Score: 4, Informative
      17 Pages. Note to NSA.

      There is a much Simpler Solution.

      1.)Print Document.
      2.)Locate and uncap Sharpie.
      3.) Blackout Text.
      4.) Scan to DocRedacted.pdf
      Wow less than the average government paragraph. Seems like the way they have been doing it for years why change now?

      --
      OSGGFG - Open Source Gamers Guide to Free Games
  10. PDF Redaction by Fedallah · · Score: 4, Informative

    This is pretty ridiculous. Products have existed for years to take care of this sort of thing, such as http://www.appligent.com/products/product_families /redaction.php.

    How does this keep happening?

  11. I wonder how long it'll be... by TWX · · Score: 2, Insightful

    ...before they are told to just take a print-screen of the document, page by page, then use a graphics program to install the black boxes over words, then import each image as a page into their PDF creator...

    --
    Do not look into laser with remaining eye.
    1. Re:I wonder how long it'll be... by nixnutz · · Score: 2, Insightful

      If you use Group 4 tiff encoding, which is standard in the legal industry, there should be no problem with file size. Clean text like a court filing should be no more than 20-30k per page. This is probably how I would do it; print to tiff (no screenshots of course), import to IPro or whatever and redact (any litigation database software should support redaction), then export PDF again. The problem is that if you need searchable pdf you need to OCR the tiffs at some point after you've redacted them and the quality of the OCR is not as good as extracted text from the original doc.

      I'm sure that in this case whoever redacted the pdf didn't have access to the original file, and while it's easy to draw boxes in Acrobat, there's no easy way to delete the underlying text. Whoever was responsible for this should have had access to the tools to do this correctly, or if not they should have hired a vendor, I think I'd charge about $35 for this job.

      Also, why is this hosted on the SFGate site? Where was it originally?

  12. Nice and secure. Riiiiggght... by blcamp · · Score: 3, Interesting


    Really nice to know that these folks has taken an apparent cue on safe and secure documents from the folks in Redmond.

    On a serious note... this is seriously scary. Imagine if the NSA and other agencies are redacting all of their documents this way an passing them around the world to field offices, embassies and elsewhere.

    Imagine the implications during legal proceedings here in the States. Yikes.

    --
    The problem with socialism is that they always run out of other people's money. - Margaret Thatcher
    1. Re:Nice and secure. Riiiiggght... by DarkVader · · Score: 2, Insightful

      This is NOT scary. This is refreshing.

      I would much prefer my government be unable to successfully keep secrets from me.

  13. blonde joke by k4_pacific · · Score: 4, Funny
    Q: How can you tell when a blonde NSA agent has been redacting PDFs?

    A: There is magic marker ink all over the screen!

    --
    Unknown host pong.
  14. The New Way for Gov't Transparency by thatguywhoiam · · Score: 4, Interesting
    I love this idea.

    Leave PDF the way it is. In fact, make it really hard to actually redact something, but put a tool front-and-center that looks like its redacting something.

    Then - remove any delete capability from Outlook. Trash is fine, but not delete.

    Then - configure all Windows machines to be inherently wide open, so that we may all peer into gov't computers. Oh wait, this is already true.

    Sometimes I think those in positions of high gov't power should forfeit practically all privacy for the duration of their term. Put a webcam on these fuckers 24/7. Does that sound... draconian? Unreasonable? Maybe. But after losing billions of dollars in things like Iraq military contract debacles, I don't trust any of these people. They certainly don't trust us.

    --
    If Jesus wants me it knows where to find me.
  15. Someone missed the memo by Tozog · · Score: 4, Interesting

    Here's how the NSA recommends redacting files:

    http://www.nsa.gov/snac/vtechrep/I333-TR-015R-2005 .PDF

    1. Re:Someone missed the memo by Neil+Watson · · Score: 2, Funny

      The animated moggie Word assitant really adds a professional touch ;)

  16. Pretension by GonzoTech · · Score: 2, Funny
    http://www.sfgate.com/cgi-bin/object/article?f=/c/ a/2006/06/21/MNGUAJI4B85.DTL&o=0 The two reporters "are the only individuals, other than the leaker himself, who would have personal knowledge of the leaker's identity," Hershman and Raphael said.

    Is it just me or do they look a little pretensious?

    --
    "Snatching defeat from the mouth of victory on a daily basis."
  17. That's the problem with these powerful formats by hey! · · Score: 2, Informative

    Like .doc, .pdf, and AFAIK the opendoc format.

    It's the same old story as with operating systems or anything else: features are usually either a plus or a "don't matter", except when serious security issues are involved, in which case you can't always predict what is benign, whether in and of itself or in combination with other features. Adobe tried to position PDF for all kinds of other things like portable forms and collaboration, but obviously their users are running into the same problems ad MS Word users have with leaking sensitive information.

    What there should be is a standard document format for outside release of legal or sensitive documents, that doesn't have any features that could be inadvertantly used. Maybe it is RFT or a stripped down PDF; but something where you can tell the intern to release this press release, and not count on him being smart enough to check for hidden comments and workflow information. It sould be WYSIAYG -- what you see is ALL you get -- and any additional features, other than possibly a small and well defined set of metadata, should parse as an error.

    --
    Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
  18. Adobe can come out of this smelling like a rose! by Laura_DilDio · · Score: 3, Funny

    Add a "redact" tool to the existing toolbar!

  19. Hush! Hush! by Anonymous Coward · · Score: 5, Funny

    Why are we publicizing this flaw? We have a US Government in power that increasingly wants to peer into the lives of innocent citizens, while becoming less transparent itself in order to cover up deceit, fraud, abuse, and just plain bumbling incompetence. If these Keystone Kops want to believe that they are criminal masterminds, let them, but don't help them actually cover stuff up!

  20. This frightens me!!!!! by waif69 · · Score: 5, Interesting

    Having worked for the gov't and knowing that some documents that I have signed and worked on should be redacted, this scares the crap out of me. It's not that I did anything that was illegal or "evil" as google would put it, I just don't want the "bad guys" (terrorists, etc.) knowing my name is attached to anything that resulted in their cohorts arrested or killed on the battlefield (also includes CONUS since 9/11).

    Normal average government workers should NOT be redacting, the people who redact should be those who know that if they screw-up, they may be screwing themselves or good friends in the process. Have people do it(redact) who have something to lose.

    Just my 2 cents.

    1. Re:This frightens me!!!!! by dinsdale3 · · Score: 3, Funny
      I just don't want the "bad guys" (terrorists, etc.) knowing my name is attached to anything that resulted in their cohorts arrested or killed on the battlefield
      Its a good thing you haven't told anyone, then.
  21. That's just what's called by alewar · · Score: 4, Insightful

    "Security by obscurity" :)

  22. We have to act! by Waffle+Iron · · Score: 5, Funny
    Clearly, these information leaks are a major security threat that is aided and abetted by these renegade PDF viewers. I'm encouraging my representatives in Congress to introduce a "Digital Millennium Redaction Act" that will prohibit the manufacture, sale, discussion or hyperlinks to any PDF viewers which enable the illicit extraction of redacted data from PDF documents. Such viewers are little more than the preferred tools for information thieves, hiding in the guise of "productivity applications". It's despicable.

    This law would instruct the FCC to create a program to certify approved PDF viewers; such viewers must make it impossible for users to steal the redacted data in a file, along with technical measures to prevent tampering with the viewers by hackers. Certified viewers will be made available to the public by software companies on a list of government-approved PDF vendors. After it becomes illegal to own a non-certified pirate PDF viewer, these dangerous information leaks will thankfully become a thing of the past.

  23. Re:A redacted document? Say it ain't so! by The+Only+Druid · · Score: 5, Informative
    "Redacted" is a legal term of art (i.e. it has a special meaning in the legal context).

    For lawyers/courts/etc., redacted (Per Black's Legal Dictionary) means:
    n), n. 1. The careful editing of a document, esp. to remove confidential references or offensive material. (Cases: Criminal Law 663; Federal Civil Procedure 2011; Trial 39. C.J.S. Criminal Law 1210-1211; Trial 148-153.) 2. A revised or edited document. -- redactional, adj. -- redact, vb.>


    The lesson here is this: if you see a word used in a legal context (or any professional context) and it sounds entirely wrong...ask yourself first whether it might have a special meaning before complaining.
    --
    "Stumble before you crawl"
  24. They're correct. by Kadin2048 · · Score: 4, Informative

    Their use of redact is completely correct.

    If I am releasing a document for publication and decide to remove information from it, this is redaction. It's editing for publication, which can include the removal of information. It could also include the addition of new information, but that's not what typically happens. Redaction can be a form of self-censorship, but it's not always the same.

    Censorship is when a third party, generally a person in authority, suppresses information which is considered objectionable. The 'authority' can be the same as the author (e.g. 'self-censorship'), or the suppression can be indirect -- it need not be editing per se.

    It's my understanding that "redact" is used only in reference to written documents that are being edited, while 'censor' is more general and can refer to anything. The terms are closely related, especially in their typical use, but they're not exactly the same. "Redact" is actually a more specific and precise word for what's going on in this instance. We can argue about whether censorship is also going on, but redaction definitely is.

    Anyway, arguing about definitions by citing dictionaries is always a bit pedantic, since dictionaries are not authoritative except as a historical reference: they can tell you what a word meant at the time the dictionary was written, but not what it means right now, since a word's definition is determined by its usage. All language is inherently arbitrary: they're just sounds we make or things we write down in order to convey ideas, and the relationship between the sounds/characters and ideas is not fixed, but infinitely variable. If everyone were to decide tomorrow that 'redaction' meant the same thing as 'censorship,' that's what it would mean, and next year's dictionaries would have to be updated to reflect that.

    --
    "Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
  25. Seems to be a common occurrence by milgr · · Score: 3, Insightful

    I googled for redacted doctuments, chose some pdfs at random, and found that the text is behind the black bars.

    When I started searching, I googled for redact. There were two ads for products that remove the text from the pdf as well as create the black bar. One made it clear that the text would be inaccessible from hackers.

    So, why aren't these types of tools being used for all redactions?

    --
    Where law ends, tyranny begins -- William Pitt
  26. Congratulaitons. by sammy+baby · · Score: 4, Informative

    Congratulations, Slashdot! The FBI will be along shortly to raid your offices on suspicion of violating the DMCA, the Patriot Act, and probably some other bullshit piece of legislation we don't even know about.

    Oh, yeah - it's a no-knock warrant, so put your pants on now.

    1. Re:Congratulaitons. by botlrokit · · Score: 2, Informative
      The FBI will be along shortly to raid your offices on suspicion of violating the DMCA, the Patriot Act, and probably some other bullshit piece of legislation we don't even know about.

      /. doesn't host with AT&T, so no worries.

    2. Re:Congratulaitons. by BandwidthHog · · Score: 3, Insightful
      /. doesn't host with AT&T, so no worries.
      Doesn’t necessarily matter. Just because I don’t purchase services directly from NSAT&T, that doesn’t mean that my data isn’t flowing through their network at some point on its journey. So while I am immune (for now?) from NSAT&T’s content ownership bullshit, I can’t count on not having them dump my packets into Cheney’s inbox.

      --

      Quantum materiae materietur marmota monax si marmota monax materiam possit materiari?
  27. Re:This proves it: by Svartalf · · Score: 3, Insightful

    Excuse me, any electronic format, unless it is a bitmap format, will have this problem unless
    all the viewers 100% honor the redaction as it's intended. In the case of a bitmap format,
    you can burn a black or white rectangle into the original image and then add an annotation
    a la TIFF's annotations that contains the original portion of the image that was redacted
    in an encrypted format so that it's difficult to expose the redaction- IF you need to have
    the redaction exposed. If not, you hand across the redacted image as-is without annotations.

    This has NOTHING to do with PDF or ODF at all- trying to make this a connection to these
    is bogus to say the least. In this case, I believe that the people doing it used the MS Office
    redaction capabilities and then exported the redacted content to PDF, which the export
    carried the same sort of redactions across to the other format. What happened is because
    someone didn't understand the tools they were using, not because of PDF or ODF.

    --
    I am not merely a "consumer" or a "taxpayer". I am a Citizen of the State of Texas
  28. Re:A redacted document? Say it ain't so! by giafly · · Score: 2, Funny

    If redacting is the "the careful editing of a document", obviously this wasn't a redacted PDF.

    --
    Reduce, reuse, cycle
  29. Acceptance of Risk by Kadin2048 · · Score: 4, Insightful

    While you make a good point, the people who have to use computers to accomplish their jobs, but do not make an attempt to understand how they work (and just treat them like "black boxes") are taking an enormous risk. They are hitching the metaphorical wagon of their livelihood to a team of horses that they don't know shit about.

    If you were somebody who made your living in television, but didn't understand anything about it, you would likewise be taking a great risk. You might, for instance, look like a big idiot when you show up to work at your anchor desk wearing a horizontally pinstriped shirt (which looks like ass on TV because of the Moire effect between the lines on the shirt and the TV scanlines). If you had understood the technology a little better, you might not have done that. That's a trivial example -- undoubtedly if you were a TV anchor, you'd learn or be told at some point not to wear a shirt like that without having to learn about scanlines -- but I hope you see my point.

    Whenever you use a technology without learning about it, you accept a certain amount of risk. Sometimes, you gamble and win: you just use the technology, get your job done, and nobody's the wiser. You're faster, more efficient, more competitive, you look like a hero to your boss, whatever. But if the technology doesn't work, then you're SOL -- but that's the price you pay for not understanding it. That's the risk you accepted when you said to yourself "eh, I don't really care what goes on inside there."

    In the case of PDF, we have a lot of people using a certain technology without knowing anything about how it works, and thus -- like the TV anchor in his pinstriped shirt (or a weatherman wearing chroma-key blue or green) -- you get these gaffes.

    I'm not saying that everybody needs to learn about how everything they use all day works, down to the bare metal. Virtually nobody needs to know that, except perhaps people who are doing things that are so dangerous that they can't afford to fuck up. However, people should be aware of the tradeoff they're making and the risk they're accepting when they forgo figuring out the internal details of a system and simply accept it as a whole, on faith that it will always work a certain way. As long as people are aware of that decision, and make it consiously, and accept the results, you can't ask for more.

    Generally speaking: faith is a fine thing, as long as you know when you're relying on it. It's when you thought you were relying on something else, and find out that you had nothing but faith, that a problem has occured.

    --
    "Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
    1. Re:Acceptance of Risk by SydShamino · · Score: 2, Insightful

      They are hitching the metaphorical wagon of their livelihood to a team of horses that they don't know shit about.

      Millions of Americans hitch the physical "wagon" (or SUV, or sedan, or minivan) of their livelihood to a bundle of "horsepower" that they don't know shit about every single day, and then they drive that wagon at 75 MPH.*

      In the case of their cars, the consequences for misuse are serious injury or death. In comparison, the consequences for learning next to nothing about their computers seem slight.

      * It seems to me that knowing how to redact text in Acrobat is like knowing why you are supposed to turn on your headlights around dusk. Yes, you think you can still see just fine - the headlights are for others to see you. And no, I can't see your dim low-set parking lights if you turn those on alone.

      --
      It doesn't hurt to be nice.
    2. Re:Acceptance of Risk by gstoddart · · Score: 2, Insightful
      I'm commuting a lot these days post Katrina...and it seems very few people understand the left lane is the passing lane...get the fsck out of the way of a driver coming up behind you faster than you're travelling. If possible, you should do the majority of your driving on the rt. lane (US).

      And, I'm equally amazed at how many people are too damned ignorant and intent at driving at Max 0.6 to realize I'm in the middle of fscking passing this guy (as evidenced by the fact that I'm going faster than him), and that fact that you want to go 2x speedlimit vs my 1.2x speedlimit doesn't mean I'm suddenly going to accelerate to your speed to complete my pass, or abandon my pass so you can fly by at insane speeds.

      When I finish passing the guy, I will get out of the passing lane, I've already factored that in. It doesn't mean I'm gonna relinquish the lane to you or speed even more to keep you happy.

      The passing lane isn't a free pass to drive like an asshole at the highest rate of speed you can manage. You need to cut the rest of us some slack when we're actually passing too. I've seen far too many people who, even though I'm in the middle of actually passing cars, expect I just scrape and grovel and get completely out of their fscking way -- those people might see my brake lights rather unexpectedly!

      People are bad drivers on both ends of that spectrum -- both the people who never move, and the people who expect you to move immediately as if they're the friggin emperor or something.
      --
      Lost at C:>. Found at C.
  30. Disability guidelines prohibit rasterized docs. by Kadin2048 · · Score: 3, Insightful

    I am pretty sure that rasterized PDF documents violate government disability-access guidelines, since they can't be read with screenreaders, braille terminals, or basically anything other than a set of human eyes (or a good OCR program).

    They would be a lot better off going through the document in Word (or Notepad/Textedit/vi/EMACS/whatever) and just selecting the regions of text that they want to remove, and replacing it with [-- TEXT REMOVED --] or even [REDACTED]. If they were really slick, I'm sure somebody could write a little macro to replace the text with an equivalent number of characters of whitespace or random text or dashes, to preserve formatting. (Okay, so to really preserve the formatting it would have to be replaced with characters that have the same amount width as the deleted characters; maybe there's a font-set containing various widths of whitespace characters that they could use? In TeX it would be trivial.)

    The results would be ugly (but really, were black bars ever very beautiful?) but at least it would actually remove the information, and wouldn't result in an inaccessible, rasterized document.

    --
    "Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
  31. Clear as Mud by Doc+Ruby · · Score: 4, Interesting

    Why doesn't Adobe upgrade their PDF generators to include a "Real Redact" button that actually deletes the redacted data? They could sell it to governments at the usual 1000x government markup rate, and the government would probably still save money vs the fallout from these illusory blackout follies. Neither the government nor Adobe is in the "freedom of information" camp. Maybe the government just refuses to buy an upgrade because that would save money overall.

    --

    --
    make install -not war

  32. Re:And these are... by drc500free · · Score: 4, Insightful

    "If you want my opinion (or even if you don't...:-p) this is the achelle's(sp) heel of our society today, most people are lazy bastards that just want to get done with somethign without learning anything about it."

    "Another thing that pisses me off is incopetence."

    Oh, the irony.

  33. Command Line Programs; evince by Noksagt · · Score: 2, Informative

    CLI programs are REALLY useful to look at "hidden" content.

    'pdftotext' comes with xpdf & is even available natively on windows.

    Similarly, for MS Word documents, you may use 'antiword', 'catdoc', and 'wv'.

    These programs are quite nice in that they can easily batch-process a lot of documents & then you can go grepping through them for interesting tidbits.

    (On the GUI front, evince deserves a plug. It uses the same poppler backend as xpdf and kpdf. I used to use tiny & fast xpdf for most of my pdf viewing, but evince has a few nice features which xpdf lacks & has become my personal favorite pdf viewer.)

  34. Common problem with today's UIs by Namlak · · Score: 5, Informative

    The industry at large (Microsoft being a big offender) has been trying to get us to a this magical place where everything is system and location independent and this is where we end up:

    1) FTP sites in Windows Explorer look like regular Windows folders. People expect them to work like regular folders. I had a field sales force try to "share" an Excel spreadsheet expecting the others to get a "Read Only" copy just like would happen on a local network share. Overwriting madness ensued. You can't blame them, there was no indication that it would work differently. Asking them to understand FTP is like accounting expecting me to fully understand the accounting rules behind my IT purchases.

    2) A manager where I used to work had an Excel spreadsheet with payroll data for the entire company. He wanted to send each department their subset of the data. So he filtered his spreadsheet and sent the filtered lists to each department not knowing that he was sending each department the whole list under teh covers. Luckily, the file was 30MB and choked in the mail server and I was able to bail him out of that huge mistake. But you really can't blame him - he saw something on the screen and sent "it". There should be an indication of underlying data. BTW, doing a cut and paste special made each file about 25k or so.

    Same thing with this PDF error. If your file shows certain information, it should contain that information only or indicate (or warn) otherwise.

    By "simplifying" everything, nobody knows what's really going on. A couple times per week I have to explain some type of issue to some user about how "It's really more complicated than that, see Windows (or an app) hides this from you." User roll eyes as their simple task has become obscurely complicated - all in the name of making things "easier" to understand, ironically.

    If something works different, it should be displayed different - that at least gives the user a chance to question what they are doing.

    1. Re:Common problem with today's UIs by BandwidthHog · · Score: 2, Interesting
      So he filtered his spreadsheet and sent the filtered lists to each department not knowing that he was sending each department the whole list under teh covers.

      I use that feature quite often and it was only yesterday that I noticed that the little triangle turns from black to dark blue when you’re viewing a filtered set. All this time I was thinking there really ought to be some sort of visual indication (other than the wonky row numbers).

      --

      Quantum materiae materietur marmota monax si marmota monax materiam possit materiari?
  35. Anyone think this may have not been an acident? by Mister+Whirly · · Score: 2, Interesting

    Does anyone think that this may not have been an oversight, and that someone knew the geeks would figure it out (like we did before) and wanted it to be leaked??? I know it's giving someone a lot of credit, but stranger things have happened...

    --
    "But this one goes to 11!"
  36. Circumvention by Mateo_LeFou · · Score: 2, Insightful

    If black squares count as a "technical measure" protecting access to a work... ? Someone actually should go ahead and launch this suit, to draw attention to the DMCA's shittiness.

    --
    My turnips listen for the soft cry of your love
    1. Re:Circumvention by jZnat · · Score: 2, Insightful

      I don't think that court documents like these are copyrighted, so you can't even apply the DMCA to it. The leading source of public domain material these days seems to be the government itself...

      --
      'Yes, firefox is indeed greater than women. Can women block pops up for you? No. Can Firefox show you naked women? Yes.'
  37. NSA procedure sucks! by ukemike · · Score: 2, Insightful

    They say you should open the original document in Word and EDIT the document by replacing the redacted text with a bunch of X's then print it to a PDF. That's a fundamentally different process than redacting. It's editing, and the temptation to ALTER the document would be huge. Also what would you do if you don't have the original Word document?

    Doing it right isn't so hard. You want to end up with a graphical only PDF of the document that has been redacted. (I can't believe I'm about to give the NSA good advise on how to keep secrets!)

    Use acrobat to mark out all of the evidence of your wrongdoing (oops I meant mark out anything classified...) Save it. Open it in a third party pdf program like FoxitPDF reader. Print it to a new pdf file using you PDFwriter of PDFdistiller print driver. You should now have a completely graphical pdf with no embedded text in the file. This is just as good as printing it, redacting it, then scanning it (which would be another good procedure.)

    It may look all blocky and pixelated but redacted documents from the government always look like crap.

    --
    -- QED
    1. Re:NSA procedure sucks! by wiredlogic · · Score: 2, Insightful

      FWIW you could very easily write up some VBA code that converts highlighted text (maybe just one specific color like red) into Xs. Then you would just have to highlight the redacted sections and run the macro when you're finished. The highlighting could be optionally kept in place to make it more visible in the PDF. It also would be useful for an actively changing document to make the author more aware of where the sensitive bits are.

      --
      I am becoming gerund, destroyer of verbs.
  38. MS Word Redaction Tool by blackstripe · · Score: 5, Informative

    Assuming the original document was in Word format, I'm surprised they didn't use Microsoft's freely available redaction add-in.

  39. Re:And these are... by Anonymous Coward · · Score: 2, Insightful

    The irony here is that you're complaining about people being "so damn lazy that they can't do a little research" when you haven't taken the (very small) amount of time researching how to correctly spell Achilles.

  40. Re:This proves it: by ThePelt · · Score: 2, Funny

    )

    that is all

  41. On Purpose? by Pahroza · · Score: 2, Interesting

    I wonder whether it's possible some of the people doing this really want the truth to come out? That someone "accidentally" screwed this up?

    Oh, wait, we're talking about the government?

    Nevermind.

  42. Evince goes one better by Bill+Hayden · · Score: 2, Informative

    Using Evince, GNOME's document viewer, you don't even have to copy to another document. Merely selecting the "redacted" text shows the actual text.

    --
    Protect your browser with the Force Safe Search add-on
  43. Proper way to conceal information... by InvisibleSoul · · Score: 2, Funny

    Those ignoramuses!

    The proper way to conceal the information is to apply white out on the monitor!

  44. "The Simpsons" did it! by LanceUppercut · · Score: 2, Funny

    This reminds me of the Homer Simpson's "Mister X" Web site :)

  45. Re:And these are... by TheBrakShow · · Score: 2, Interesting

    I have never understood why people are so damn lazy that they can't do a little research into what they are doing. People juat want results, not knowlege about what they are doing.

    You mean like when people are too lazy to spell check their posts on Slashdot? Look, most people can usually excuse spelling and grammar mistakes but your argument would be much stronger without the brazen hypocrisy.

  46. Er... pdftotext...? by digital+photo · · Score: 2, Informative

    Okay... this is what is considered secured??

    Using a STANDARD pdf handling tool:
    % pdftotext BALCO_quash_subpoena_sfchronicle.pdf

    From the PDF->TXT file:

    [snipped to first line before the "blacked out section"]

      C. Movants' Efforts to Obtain the Secret Grand Jury Transcripts

    [beginning of first blacked out section]

    Prior to the return of the Balco indictments, the lead defendant, Victor Conte ("Conte"), began to correspond via e-mail with Movants. (See Ex. 1 to Donnelan Aff.). Neither Movants nor Conte attempted to keep their relationship confidential, as the e-mail correspondence routinely was reported by Movants.2 (Exs. 1, 2, 3, and 11 to Donnelan 1

    [... snipped for berevity ...]

    On June 23, 2004, Fainaru-Wada sent an e-mail to Conte indicating that he (Fainaru-Wada) was busy working on some stories that may be "up on the web soon. Hope you like t
    hem." (Ex. T to Hershman Decl.). Conte responded that he was looking forward to seeing the article and that his lawyer would be available for comment. (Id.).

    [end of first blacked out section]

    D. Disclosure of the Montgomery Grand Jury Transcript On June 24, 2004

    [more, but why post it when you can read it yourself!?]

    Okay... WTF!? Doesn't ANYONE check this stuff before it goes out the door!?

    OMG! Wonder if this is how our private documents are "made safe"....