DARPA: Reconstruct Shredded Docs, Win $50K USD
ematic writes with a link to an interesting competition from DARPA: "The ability to reconstruct shredded documents will potentially yield information that may save lives or offer critical information about an adversary's plans. Currently, this process is much too slow and too labor-intensive, particularly if the documents are handwritten. We are looking to the Shredder Challenge to generate some leap-ahead thinking in this area. The Shredder Challenge is composed of five separate problems. The overall prize awarded depends on the number and difficulty of problems solved."
Someone with a unique way of reconstructing shredded documents can probably earn more than that in one afternoon of dumpster diving.
There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
I don't know, I've been hitting the shredded documents with a wrench for the last 10 minutes, it doesn't seem to be working.
Any adversary that shreds rather than incinerates critical information they don't want recovered isn't much of an adversary.
Beauty is in the eye of the beerholder.
You gotta love when someone offers a $50,000 prize for an improvement that would save them millions of dollars in labor, not to mention the value of files reconstructed that might have been ignored before it became so much easier to do.
A million dollars for improving the movie recommendations on Netflix, and $50,000 for a massive intelligence breakthrough?
Way to go, Pentagon. Way to prove that even with a defense budget of $649 billion dollars you can still be a total cheapass.
I get this all the time. You're probably using imperial; try switching to metric.
This actually looks like a ton of fun. After looking at the basic documents they tried to put other indirection in the images like color levels that really need to be sorted before the actual shredding issue is resolved. There is a mix of up/down and useless data on the page, but the ligatures seem consistent on the images - brute force on the first page is probably the most cost effective solution - the others seem to be order of magnitude problems. The reality of this being "shredded" solution is probably a real-life problem in disguise like a transmitted scrambled image problem or connecting/stitching problem.
So they might have a head start to winning the prize.
"The likes of Facebook and WhatsApp are free to those whose privacy is of zero value."
I think you'd be better off, if you were successful, to simply commercialize it. $50,000? That's like the first year's support contract on the software you'll sell them for $300,000 per seat. And since it's "enterprise" software, it doesn't even have to actually work particularly well. That's why you sell the support contracts.
Off the top of my head, this seems very close to the techniques used for shotgun sequencing of genomic data. Lots of little strands you want to line up. Just in multiple dimensions.
I Browse at +4 Flamebait
Open Source Sysadmin
This has nothing to do with scanning the fragments. They give you a tiff, with an alpha channel, and each scrapped already pressed out and scanned into the image.
The thought being, in the field, you can get the grunts to take back the bag of shreds, lay them out in blocks, scan them, and submit the blocks to some back-end program that will do some jigsaw algo to put together pieces within the block. You'll just have to make sure each shred is surrounded by a space.
Honestly, I'm surprised some archaeological PHD hasn't already invented some system similar to this, for putting back together s broken Egyptian hieroglyph style wall writing or something.
-Malakai
A Dragon Lives in my Garage
shredderchallenge seems to be Slashdotted, so apologies if this is a dup.
During the Iran Hostage Crisis teams of carpet weavers were recruited to piece together shredded documents. They were then published in 1982 in 54 volumes under the title "Documents From the U.S. Espionage Den".
Even when I first got into the Navy (which was like 25 years ago... damn I'm old), we were using cross-cut shredders to destroy classified paperwork. These things practically turned the paper to dust - the individual pieces were like maybe 3/8" long by, I don't know, 1/32" wide? There's no freaking way you could put these back together.
And if that wasn't good enough, one ship I was on had a paper mulcher. You threw in the paper you wanted destroyed, and it ground it up with water into a sodden, pulpy gray mass. There was nothing TO put back together after this process.
For piecing together shredded East Germany Secret Police (Stasi) documents: http://www.time.com/time/business/article/0,8599,1983287,00.html
Maybe DARPA needs to take a trip to Germany . . .
Schroedinger's Brexit: The UK is both in and out of the EU at the same time!
Yeah well there's a difference between theory and practice.
Actually many of the great successes of AI (and even then some would debate how great they've been) are simple-sounding in principle but tough to get right. Things like route planning (just start a directed random walk from the start and finish and explore the graph until they connect to each other), web search (just weight results by popularity/links), document search (just show anything with a partial match), OCR (just threshold the image and match pixels to a database of font characters), voice recognition (just break it up into phonemes and look it up in a pronunciation dictionary), voice synthesis (just pre-record some phonemes and stitch them together), image recognition (just tag a bunch of images and train a neural net), and so on.
They all sound simple enough. But for an actual implementation to be successful, there are tons of pitfalls and gotchas and real-world ambiguities that need to be figured out. There's then whole other layer of tweaking to get a reasonable idea to run in a reasonable amount of time: many problems can be brute-forced but people typically don't want to wait forever for the answer, so ingenious algorithms for pruning the search tree or efficiently exploring the parameter space have to be designed.
Point being, don't assume this is as easy as it sounds. If it were, then we wouldn't even be discussing it (and no one would bother using shredders).
FYI: A.C's user id is 666
One person's "nefarious" is another's "good".
Ask not what your country can do for you, ask how you can de-shred documents for your country.
I know you are a 7 digit...
Would someone with a 5 digit UID please show up and tell this guy he's fucking stupid? By his own logic he'd have to agree.
I'll have to see if I can find someone young enough to have a 5-digit ...
but in any case, yeah, UID does not equate to much!
"Flame away, I wear asbestos underwear"
I think unshred is the next menu selection after uncrop.