256GB Geometrically Encoded Paper Storage Device

Robustness & Feasibility by eldavojohn · 2006-11-26 02:29 · Score: 5, Interesting

The Rainbow technology is feasible because printed text, readable by the human eye is a very wasteful use of the potential capacity of paper to store data.

And I'm sure this "Rainbow Technology" is also very wasteful if you would devise a way to encode data on electrons & lay them on the sheet of paper and then read them. The obvious problem being that just exposing the paper to the natural elements would probably render some of the data useless. Now I know that compact disc drives in computers use a form of error correcting codes (I can't recall if it's cyclic redundancy checks or some other form of parity) and I assume that the scheme of this paper technology uses the same (most likely at the cost of a fraction of space). Judging by the word 'rainbow' I'm guessing it uses colorized shapes to encode the data which is a novel idea but what quality must the paper & ink be? Can the paper in my printer be used to encode this data?

My question would be how much wear & tear can a sample of this medium stand before it is rendered unreadable? I would highly doubt one would be able to fold it--however it would be interesting to see whether creating a diagonal read/write scheme would protect from vertical & horizontal folds with the proper ECC. I think the plastic sheets could potentially be as robust as discs but would you be able to bend them? I doubt it though if they allowed it, it'd probably end up being more expensive than a disc.

Interesting technology but I'd sure like to hear a lot more of the details of how it works & how it performs before I make a solid judgment on its feasibility.

--
My work here is dung.

Re:Robustness & Feasibility by Zadaz · 2006-11-26 02:39 · Score: 3, Informative

QR Codes are not as sophisticated, but can reconstruct the original data when 30% image is missing or distorted. Since these guys are obviously pretty clever, I can't imagine this feature would be overlooked.
Re:Robustness & Feasibility by stonedcat · 2006-11-26 03:58 · Score: 1, Funny

*crumble crumble crumble*

Ha Ha I just killed your 5 year archive of porn!

--
You can't take the sky from me.
Re:Robustness & Feasibility by Anonymous Coward · 2006-11-26 04:02 · Score: 1, Insightful

Based on the number of scratched and ruined disks I've got (including music CDs that got a mysterious fogginess to them and are now unreadable), as well as various forms of CD destroying bacteria, I'd have to say that optical disk technology is nowhere near as "robust" as you seem to imply. Conversely, we are still digging up readable paper from thousands of years ago. Something tells me that archaeologists in the year 3006 will not find a single readable CD from this decade.

As a genealogy hobbyist, I keep paper backups of everything, because I know the paper form is likely to last a hell of a lot longer than the digital versions.
Re:Robustness & Feasibility by Kazuma-san · 2006-11-26 04:11 · Score: 1

You must differentiate between the different form of paper. The paper you can buy in a common office shop is produced with emphasis on low price for the purpose of writing on it. Eventually it will rott away, or fall in parts in no time, compared to the millenia papyrus has endured. Actually, I heared that a papyrus archive was created recently for archaeologist to be readable in a thousand years.
Re:Robustness & Feasibility by LindseyJ · 2006-11-26 04:18 · Score: 3, Funny

People have been using paper to store their porn long before the web was here.
Re:Robustness & Feasibility by JazzLad · 2006-11-26 04:19 · Score: 3, Funny

You think a /. reader's 5 year collection of porn would fit on 256GB?

You must be new here.

--
"If you have nothing to hide, you have nothing to fear." - Every fascist, ever
Re:Robustness & Feasibility by Yvanhoe · 2006-11-26 04:55 · Score: 1

We can imagine to protect the paper in some way. After all, hard drives have to be stored in a metallic case to work correctly. That brings me to another question : Considering a feasible protection, how long do the data stay on the medium ? How many read errors per Gb ? how many write errors ? That is the real problem with data storage.

--
The Wise adapts himself to the world. The Fool adapts the world to himself. Therefore, all progress depends on the Fool.
Re:Robustness & Feasibility by Gerzel · 2006-11-26 05:21 · Score: 1

Then wouldn't paper be the ideal medium? After all we know a LOT about keeping paper around for ages and ages. Magnetic and optical media have a much shorter lifespan than most printed documents.
Re:Robustness & Feasibility by Raptoer · 2006-11-26 06:21 · Score: 1

to have it just on a unshielded piece of paper is a bad idea, if you folded it the data along the crease may be lost, if you leave it out in the sun, a portion of the data may be lost too. More likely is that it would be in a case, think of a floppy disk, unless you dissect a floppy disk you wont see the actual disk inside. If you crease the floppy, flatten it out, put it back in the case and run it, its unlikely that it would run properly.
Re:Robustness & Feasibility by iocat · 2006-11-26 06:35 · Score: 1

The issue isn't paper... it's toner. Even the best paper will lose laser printer toner if you bend it. And ink jet is less accurate.

--
Dude, I think I can see my house from here.
Re:Robustness & Feasibility by stonedcat · 2006-11-26 07:03 · Score: 0, Troll

I've been through alot of hard drives in my time... i almost always lose most of it :P

I no longer use fat32 or ntfs for anything so I'll actually end up having to delete things from time to time since nature won't be taking it's course. :)

--
You can't take the sky from me.
Re:Robustness & Feasibility by Anonymous Coward · 2006-11-26 07:21 · Score: 0

Who are you to "make a solid judgment on its feasibility" anyway?
How about "A computer scientist who thinks he knows everything"?

Which would put me just above a piece of shit Indian scammer.
Re:Robustness & Feasibility by Eternauta3k · 2006-11-26 08:06 · Score: 1

If you crease the floppy, flatten it out, put it back in the case and run it, its unlikely that it would run properly.
Actually, I once took the disk out, folded it, played frisbee with it, etc. and when I put it back in, it read most of the files correctly. Yet the little bitches lose data when you take perfectly good care of them.

--
Yeah. Would you choose a neurosurgeon who pokes around people's brains in his spare time? I wouldn't.
Re:Robustness & Feasibility by synthespian · 2006-11-26 08:08 · Score: 1, Insightful

Your claim that paper last less because it gets exposed to elements has no data to support it.

Paper lasts 2000 years, as archaelogists will tell you. If you have error-correcting code, it sounds achievable.
I believe NIST has made a study about this, IIRC, some years ago. Digital media is bad media. I mean, for starters, large business have trouble with 1995 .doc files...Imagine something you want to last 100 years. Maybe LaTeX files can, if you print the code.

Regarding the use of paper, one begins to imagine the use of robotics to handle huge "Rainbow Formats" paper archives. I'm not so sure how you can have very large databases using this format. I can't envision this being feasible. Maybe it is, time will tell.

By the way, this begs the question: how useful is such a technology without open source software? Much more interesting it would be if the specification was open, and if the technology could be made to work with any old scanner and any lousy home printer. *Then* you'd have a revolution. Right now, it seems he wants a monopoly. We all know how Great Proprietary Ideas finnish sometimes: nobody cares about it; it dies.

Also, I hope a patent for "saving data on paper" is not filed, we'd loose our right to use pencils. However, he did let the cat out of the box. I expect to see copycats, just like there are many HD manufacturers.

Anyways, this is one of the most ground-breaking things I've read in years. Sainul Abideen is a genius.

--
Main difference between the BSD license and the GPL license: one is from California and the other is from Massachusetts
Re:Robustness & Feasibility by hairpinblue · 2006-11-26 09:52 · Score: 1

Over the years I noticed floppy disk quality diminish rapidly--especially around the time that hard drives reached capacities greater than 400 mb for around $200. Maybe it was coincidental with the expiration of patented technology with respect to the manufacture and quality control of non-fixed magnetic media. At one time, SS/DD disks could be reliably turned into DS/DD disks with a hole punch (or a "disk doubler"--a customized hole punch which would easily put another square notch at precisely the correct height on a 5 1/4" floppy disk. I tried to find a pic on Google but didn't have much luck) and preserve data without error for five years. Around '92, though, with 3 1/2" disks not being able to be turned upside-down and fixed disk drives (hard drives) coming on strong, I noticed that new 5 1/4" disks would begin developing random errors after a few months and, by '96, 3 1/2" disks were no better.

I heard many such stories of people abusing their 5 1/4" or 3 1/2" floppy disks severely outside of the casing and then continuing to use the disk without (or with minimal) errors. I never bothered stress testing them outside of normal use myself.

--
Hustlers exist solely through charity. I see their scams, lies, and deceit: I'm too charitable to outright shoot them.
Re:Robustness & Feasibility by Mozk · 2006-11-26 10:07 · Score: 1

I've never heard the phrase "little bitches" in reference to floppy disks before. :D

--
No existe.
Re:Robustness & Feasibility by DeadChobi · 2006-11-26 12:05 · Score: 1

Monks used to store their paper in bound stacks with leather at either end. This could be feasible for this technology, considering that the cover of a book can get damaged but the inside remains relatively intact. Book covers also keep books out of sunlight, direct rain contact, and impact damage. I'd be more worried about keeping this data stored in a computer long enough to use it. It'd be more feasible to keep your backups on a bookshelf than to read your operating system in off of a book.

--
SRSLY.
Re:Robustness & Feasibility by Anonymous Coward · 2006-11-26 15:13 · Score: 0

And we'll be waiting for your august judgement.
Re:Robustness & Feasibility by gfody · 2006-11-26 15:27 · Score: 1

Instead of placing the sheet in your scanner maybe you could just hold it up to a camera. It would take a pretty high res camera to decode 256gb in 300dpi circles and squares but maybe 256gb is overkill anyways.

Maybe a good compromise would be something like black and white vertical bars of varying widths - then you could use a simple laser to read it!

--

bite my glorious golden ass.
Re:Robustness & Feasibility by Anonymous Coward · 2006-11-26 16:37 · Score: 0

Bend this!

you ziddiot, why would you want to benf it?

it can be behind a "glass" it can be multylayered, it can be written trhough laser,
it can be cased.

2*2 inches = 5GB data, its small, no need to bend it.

zidiot.
Re:Robustness & Feasibility by Yvanhoe · 2006-11-26 20:34 · Score: 1

We don't know much about the conservation of laser-color-printed paper, espiecially when you have to keep microscopic informations on it...
It is however a very interesting idea.

--
The Wise adapts himself to the world. The Fool adapts the world to himself. Therefore, all progress depends on the Fool.
Re:Robustness & Feasibility by Anonymous Coward · 2006-11-27 00:55 · Score: 0

I can't recall if it's cyclic redundancy checks or some other form of parity

It's neither. CRCs and parity are error-detecting codes. Instead they use an error-correcting code. IIRC it's a Reed-Solomon code.
Re:Robustness & Feasibility by Anonymous Coward · 2006-11-27 02:13 · Score: 0

So much for the paperless office

Cool... by tinrobot · 2006-11-26 02:34 · Score: 4, Funny

Now it's possible to fold up 256MB worth of data and fly it across the room.

Re:Cool... by Anonymous Coward · 2006-11-26 02:38 · Score: 3, Funny

This could be the dawn of a new era in wireless networking...
Re:Cool... by ScrewMaster · 2006-11-26 02:55 · Score: 2, Funny

Nah ... just an improvement in the paperairplanenet.

--
The higher the technology, the sharper that two-edged sword.
Re:Cool... by Bibz · 2006-11-26 02:56 · Score: 1

Actually it's 256GB

It's a lot safer and funnier than throwing hard drives at each other.

--
I didn't found something funny to put here.
Re:Cool... by h4rm0ny · 2006-11-26 03:05 · Score: 1

Safer, yes. Funnier... depends on your colleagues, I guess. ;)

--

Aide-toi, le Ciel t'aidera - Jeanne D'Arc.
Re:Cool... by stunt_penguin · 2006-11-26 03:37 · Score: 1

More of a seekernet than a sneakernet, then.

--
When the posters fear their moderators, there is tyranny; when the moderators fears the posters, there is liberty.
Re:Cool... by Anonymous Coward · 2006-11-26 04:03 · Score: 0

Not quite, but here's the kicker from the article:

Sainul says a CD or DVD consumes 16 grams of polycarbonate, a petroleum by-product. While a CD costs Rs.15 (SR1.25), his paper or plastic-made RVD will cost just about Rs.1.50 and has 131 times more storage capacity. ...
Named "Rainbow Technology", the new technique is the brainchild of Sainul Abideen, who has just finished his MCA at Muslim Educational Society Engineering College in Kuttipuram in Kerala's Malappuram district.

talk about one bad apple giving the rest a bed reputation...

Captcha: particle
Re:Cool... by Anonymous Coward · 2006-11-26 05:25 · Score: 0

This brings to mind an interesting scenario. Imagine backing up the data in your data center and sending it via courier to another office. This would completely change the nature of couriers since it would possibly be faster to encode and transmit the data this way then to send it over the wire (fiber, whatever---Think Johhny Mnemonic) Second, nothing is said about the time that it takes to decode the data(or encode it for that matter). Having outlined these two scenarios, now consider which is faster...Sending the data over courier and decoding it, or sending it over the wire in the first place.
Re:Cool... by SeaFox · 2006-11-26 12:07 · Score: 1

More of a seekernet than a sneakernet, then.

Yes, but the chances of your data being intercepted in transmission are much higher.
Re:Cool... by Phantom+Coward · 2006-11-26 13:41 · Score: 1

Aghhh think of the papercuts!

Must be a very good scanner. by Utopia · 2006-11-26 02:37 · Score: 2, Interesting

I would love to know which scanner has the ability to scan with such high fidelity.

Re:Must be a very good scanner. by Anonymous Coward · 2006-11-26 02:40 · Score: 0

I am more interested in the speed at which this data can be read, does it come close to the DVD read/write speeds?
Re:Must be a very good scanner. by aplusjimages · 2006-11-26 03:14 · Score: 1

It's the $10,000 scanner not available to consumers yet.

--
Can I bum a sig?
Re:Must be a very good scanner. by sbaker · 2006-11-26 05:48 · Score: 1

...and a very good printer too.

If you could print and recognise all 2^24 colours (very unlikely) and print and scan reliably at 10,000 dots per inch (no way!) - then this would be possible. However, if this guy knows how to do that - he should be in the printing and scanning business because there isn't a machine built that can come close to doing that.

But you wouldn't use circles and squares and such.

This is bogus - it's a complete scam.

--
www.sjbaker.org
Re:Must be a very good scanner. by JohnWasser · 2006-11-26 06:11 · Score: 1

Perhaps it is just an error in translation and he meant 256KB and not 256GB. :-\
Re:Must be a very good scanner. by FLEB · 2006-11-26 06:28 · Score: 1

Even if it didn't, it could still work in a tape-backup sense.

--
Information wants to be free.
Entertainment wants to be paid.
You just want to be cheap.
Re:Must be a very good scanner. by spence2680 · 2006-11-26 06:54 · Score: 1

Hopefully the scanner he uses is of higher quality than the one he used to scan these photos [sainul.abideen.dr.ag] on his blog. (click on "photos" after the jump).
Re:Must be a very good scanner. by sbaker · 2006-11-26 07:04 · Score: 1

Nope - follow the link to the article - then follow the link from there to the ORIGINAL article in "Arab News" - the reporter claims to have been shown a working paper scanner that scanned in a movie - then played it on-screen. 256Kbytes isn't enough to store a movie!

Also, the guy is going off on claims to replace DVD's with PVD (paper video disks)...and all manner of crap like that.

No - this isn't a mistake - it's definitely a scam.

--
www.sjbaker.org
Re:Must be a very good scanner. by Anonymous Coward · 2006-11-26 07:32 · Score: 0

on his blog.

Are you fucking kidding me? Are you really that stupid? It's a website you moron. If there's one thing more idiotic than "blogs", it's fucking morons calling regular websites "blogs". Get a clue, dumbass. I can't wait till the whole "blog" bullshit passes like Geocities websites.
Re:Must be a very good scanner. by Stellian · 2006-11-26 11:44 · Score: 1

If you could print and recognise all 2^24 colours (very unlikely) and print and scan reliably at 10,000 dots per inch (no way!) - then this would be possible.
No, it would not. That's still just 300 MB / square inch. He would need 30,000 dpi to reach 2.7GB / inch^2. As for 16 million unique colors per dot... just forget about it.
Re:Must be a very good scanner. by CalSolt · 2006-11-26 12:03 · Score: 1

I think you are stupid. You don't understand the specifics of the technology, so you're assuming it's a scam.

Obviously the data storage scheme is far more complex than anything you have experience with. How stupid do you have to be to think the color of the dot on paper directly determines an individual bit? This isn't about storing bits directly on paper- there are only 2 possibilities in binary, why would you need 256++ colors to store binary bits? The LEAST complex algorithm I can think of (assuming no shapes, just dots of various colors) is that each color represents a certain binary sequence, say of length 8 (2^8 = 256). That means that with 256 colors (256 unique states) you can store 8 bits per dot. Assuming a 300x300 dot per inch printer, we've got 90,000 dots per square inch, each being 1 of 256 uniquely identifiable 8 bit sequences. 90,000 x 8 = 720,000 bits per square inch, or 90KB per square inch.

Try a 1200x1200 dpi printer: 1,440,000 dots per square inch = 1.44 MB/in^2

Of course, this is assuming 8-bit color. In reality, a photograph in 256 colors is, well, not a photograph. Say we have a 24 bit printer and scanner (24 bit is regarded as life-like, and the standard for printers and scanners), thus 2^24 = (approximately) 16.8 million uniquely identifiable colors. This means we can encode unique sequences of length 24 bits per dot. At 1200x1200 dpi, we're talking 1,440,000 x 24 = 34,560,000 bits per square inch or 4.32 MB/in^2.

Using only colored dots. Throw in printers with even higher resolutions like these from HP at 1200x4800 dpi, the even higher density of data storage available from the use of shapes (adding even more uniquely identifiable states through shape, size, order, orientation, special characteristics, and relative positioning), and some clever compression and encoding algorithms... and you can easily build pretty high density storage.
Re:Must be a very good scanner. by lnjasdpppun · 2006-11-26 16:24 · Score: 1

Lets assume 1200x4800 dpi, 24bit colour can be printed and recognised by a printer/scanner with 0 errors. That gives 1200x4800x24x8.5x11 = 12,925,440,000 bits per page = 1,615,680,000 bytes per page = 1540.83 MB per page.

To get to the claimed 256GB per page he needs to have found some way to compress 256GB into 1.5GB (a ratio of 170:1 !) of data into 1 bit (on average across the whole page). Forgive me if I find that a little hard to believe.... But if it's true my 1.5 mbit DSL line could carry more data than an OC-3 line.
Re:Must be a very good scanner. by sbaker · 2006-11-26 16:52 · Score: 1

I didn't assume 1 dot is 1 bit. I was actually rather generous and assumed that one dot was 24 bits (6 each of Cyan, Magenta, Yellow and Black...which is VERY generous!!)

Even using YOUR VERY OWN most generous estimation of 4.32MB/in^2, finish that thought...there are 8.5x11 inches on a sheet of paper - so YOU came up with an upper limit (assuming 24 bit inks and magical scanners) of just 403MB for a sheet of paper. Recall that this guy is claiming 250 GIGABYTES. That's 620 times more than your best possible estimate.

It doesn't matter how complex the scheme is. You can't fit more data into some fixed number of storage locations than there are locations. Whether he uses shapes, teeny-tiny letters, bar codes or anything else, there are only so many possible ways to cover a sheet of paper with ink. The X resolution times the Y resolution times the width of the paper times the height of the paper is the MAXIMUM number of 'storage locations' on the paper. That's just like the number of bytes in the RAM of your computer except instead of bytes, we have coloured dots. That's an upper limit - fundamental information theory doesn't permit more than that no matter what shapes you draw with those dots or what colours you use.

In fact, using shapes and stuff would dramatically lower your information content because it would remove some combinations of dot colours from the set that he can print.

So no, he can't possibly under any mechanism be getting what he's claimed. We could maybe avoid calling him a complete fraud if he'd simply been quoting theoretical numbers for an untried technique - we could allow that he'd dropped a couple of decimal places. But he has also "demonstrated" his claim - by scanning a piece of 'rainbow-encoded' paper and playing a full length movie off of it - which we know is quite utterly impossible for really fundamental reasons. Some we know he couldn't conceivably have done this 'for real' - he must have cheated and put up a 'rigged' demo - which means he's not just claiming the impossible - he KNOWS it's impossible. That's a scam.

The guy is a liar and a cheat - and please don't call me an idiot for proving that using the numbers that you provided.

Feel free to apologise at any time!

--
www.sjbaker.org
Re:Must be a very good scanner. by CalSolt · 2006-11-26 17:40 · Score: 1

Yes, but that's assuming the data is only encoded by the color of the dot. And this is also assuming you have no encoding schemes- you are directly encoding bits on the page, no layers of abstraction. When you throw in shapes, you further increase the amount of data that you can hold. I'm no encoding algorithm specialist. As a matter of fact I have no idea how compression algorithms work, having never seen any code or read anything on the subject. But I can guess at what it might look like.

Say a shape acts like a key. A red dot in a circle means something different than a red dot in a triangle. Say the encompassing shape denotes a specific key that should be used to decode the dot colors. Let's use the 3 basic shapes: triangle, circle, square. Each shape points to a unique set of 2^24 combinations. This way, a single dot, instead of only representing 1 out of a possible 2^24 sequences (24 bits each), represents 1 out of a possible 3*2^24 sequences (not necessarily 24 bits long, because by now we can expand into 25 bit sequences or even higher). And of course there are more than 3 possible shapes. Circles of different radii, triangles and rectangles with different side ratios. Using ellipses (2 radii) you add 2 variables to circles. Rectangles have another 3 variables (ratio of side lengths, radius to corner, rotation), triangles have another 4 (2 angles, mean radius, orientation with respect to vertical axis). Using my scheme you could have thousands of keys, each pointing to a different set of 2^24 bit sequences.

There are tremendous possibilities to store data when you are no longer bound by the 2 bit world. My scheme of course has huge limitations, like needing to encode across keys (what if two consecutive sequences (dots) need to be in different shapes?). Maybe you could get around that by encoding a unique formula or a table of contents somewhere in the document. The point is, I only spent 5 minutes to come up with a fairly high density geometric and color encoding scheme that could work. If the raw number of encodable bits for a given algorithm was high enough, you could even have space for error correction and redundancy. Imagine if someone thought about this problem for 1 or 2 years. It's very possible.
Re:Must be a very good scanner. by CalSolt · 2006-11-26 18:05 · Score: 1

Well at least you backed up your claims this time. I'm not apologizing for calling bullshit when someone makes a claim with no explanation of the logic behind it.

Read my response to the guy who responded to my original comment, where I go into greater detail, and tell me what you think about it in regard to your "fundamental limits."

If you'd like an example contrary to what you just claimed- that the upper limit of data you can store on a physical medium directly corresponds to the number of available physical sites- try this: an Nx1 matrix multiplied by a 1xN matrix gives you an NxN matrix. Through matrix multiplication you only consume 2N spaces (maybe 2N+2 to denote orientation of each matrix) but you have N^2 data points after decoding. If N is 1,000,000, ratio of spaces used to number of data points actually represented equals 500,000.

True my example only works if the data conforms to a particular pattern in the first place, but it disproves your claim about fundamental limits. With a few layers of abstraction, good processing algorithms, and the added geometric variables you can easily hit the 250GB range, and maybe even higher.
Re:Must be a very good scanner. by SnowZero · 2006-11-26 19:54 · Score: 1

Sorry, you are wrong. I recommend looking up "Information Theory" or "Encoding Theory", in particular stuff from Shannon.

Your matrix example is basically saying you can make a small amount of information into a larger, less efficient encoding. Of course you can. but that's not the direction that is interesting. I can repeat a bit a thousand times, but that doesn't mean I can take an arbitrary 1000-bit sequence, and store it in a lossless form with one bit. Thus the question is if you can always go both ways - take an arbitrary NxN matrix of values, store it as two 1xN vectors (encoding), and then recreate the original matrix (decoding). That's compression.

Another way to look at the fundamental limit is the following: You cannot compress all possible values of a data block. If you could, you could keep re-running the compression on the result, and eventually you will compress all possible data streams to a single bit - obviously that makes no sense, as you can't recover much from a single bit. Real compression works by making "common" sequences, with a lot of repetition, into shorter sequences, while uncommon ones get slightly longer. There will not be some "layer of abstraction" that can universally compress data.

Spend some time at Wikipedia and this will make a little more sense. It's actually pretty interesting stuff, as there really are hard limits you hit when sending or encoding information, and they can be calculated. The early work on this topic formed the basis for the modern digital world.
Re:Must be a very good scanner. by SnowZero · 2006-11-26 20:00 · Score: 1

That's awesome. A supposed "genius" of data encoding and compression, and he uses GIFs for all of his photographs. Yep, definitely an image processing and compression expert...
Re:Must be a very good scanner. by Ihlosi · 2006-11-27 00:21 · Score: 1

When you throw in shapes, you further increase the amount of data that you can hold.

No. No. No.

As soon as you enforce some sort of format, you decrease the amount of data, by eliminating possible combinations of dots from the total pool.

Think about it: If you're trying to come up with a password, and introduce the rule "Capital letters must be adjacent to at least one more capital letter." - have you increased or decreased the number of possible passwords ?
Re:Must be a very good scanner. by Ihlosi · 2006-11-27 00:26 · Score: 1

Through matrix multiplication you only consume 2N spaces (maybe 2N+2 to denote orientation of each matrix) but you have N^2 data points after decoding.
And those N^2 dots are not independent from each other. You cannot re-create every N^2 matrix by multiplying 2 N vectors. Your hypothetical compression would be lossy.
Re:Must be a very good scanner. by Fastolfe · 2006-11-27 09:59 · Score: 1

It takes space to draw geometry on paper. Your "encompassing shape" has to be drawn somehow. You have to use up some of your dots doing that.

If this encoding scheme were indeed as simple as this, it could be implemented on top of bitmapped/raster data structure (graphic) instead of paper with theoretically the same storage capacity. A 1200dpi printer printing 7.5"x10" of paper at 24-bit color would give you the equivalent of 309MB of raw, uncompressed storage space. This is the *maximum* amount of space needed to completely reconstruct all of the data present on any sheet of paper produced by any 24-bit 1200dpi printer (assuming it were possible to do that reliably). If this encoding scheme amounts to nothing more than clever use of this space, so as to put 256GB onto something that can be represented in 309MB, that's extraordinary (and as they say, demanding of extraordinary proof).

Three possibilities: this is a scam, something got lost in translation, or there is far more to the story than this.

Scam... by Anonymous Coward · 2006-11-26 02:37 · Score: 5, Informative

according to this Indian blogger.

Re:Scam... by An+Onerous+Coward · 2006-11-26 05:52 · Score: 1, Interesting

I think it's possibly a scam, but the counterclaims in the scam article seem to misrepresent the nature of the initial claims.

Some people where suggesting by using different colours one can squeeze in more data, but what about error tolerance then? This guy is questioning the fundamental reason why digital / binary technology became popular, its because its either 0 or 1 , so its mostly fool proof, we could have used different voltages and instead of binary, use 0 1 2 3 4, but then it will not be fool proof.
That claim doesn't make a whole lot of sense, because there are still bits of computer hardware that have and make use of the ability to distinguish between more than two levels of voltage (dial-up modems, for example). In transistors, going with the on/off motif greatly simplifies hardware design. But as long as you can reliably distinguish between n levels, you should be good to go.

Barcode companies did their maximum when they tried to develop 2 D barcodes and the maximum they could get was around 2000 bytes of data!!
2D barcodes aren't a good metric for the theoretical maximum for storing data on paper. Reasons:

- since it is assumed that some portion of the barcode may be damaged, you have to devote a lot of space to error correction.
- it has to be scannable from any direction.
- it has to be scannable almost instantly by the sort cheap, rugged handheld device you'd find at millions of point-of-sale stations.
- it has to present a big enough target for the person doing the scanning to easily find it.

But I'm still really suspicious of the initial claims. The thing that really bothers me is the idea of encoding using "shapes". I can see using pixels, and I can see using various colors (with some limitations on the number of different colors the scanner can reliably distinguish). But using shapes only seems to make sense if you have both the ability to squirt multi-shaped pixels from your inkjet (I've never heard of such a thing), and also the ability to scan at a significantly higher resolution than you can print. If it takes multiple pixels to create a shape, then I find it odd that those pixels aren't being devoted to "more dots".

Also, I think we need to distinguish between two claims. First, there is the claim that this student is able to reliably write 256GB of data to a sheet of paper and read it back. Then there is the claim that this feat can be bootstrapped into a production mass storage system without extreme difficulty. I consider the former much more likely than the latter.

BTW, my back of the envelope calculation: 8.5x11in * 1800dpi * 1800dpi * 256colors = 77 billion bits of information, or about 9GB. I think 1800dpi is the output you get with a high quality laser printer. I doubt that "theoretical maximum" is achievable, because even the best printers are designed only to impress the human eye.

--
You want the truthiness? You can't handle the truthiness!
Re:Scam... by An+Onerous+Coward · 2006-11-26 06:26 · Score: 4, Informative

Okay, time to throw out my calculation. As someone else pointed out:

Each dot is going to be either cyan, magenta, yellow, or black. Laser and injket printers produce multicolour output by dithering, not by mixing inks, and the "dpi" rating of the printer refers to the dots used when dithering, not to the equivalent of screen pixels.
So instead of multiplying by 256, you have to multiply by 4. Result: about 140MB.

Another approach to analyzing the claim: For a given dpi resolution, how many variations of a single dot must your system be able to produce and distinguish? I get 256 GB / 302940000 dots, or 907 gradiations. Instead, we have four available.

I'm split between "scam" and "incompetent." But believing he may have actually done what he claimed is no longer an option for me.

--
You want the truthiness? You can't handle the truthiness!
Re:Scam... by Sique · 2006-11-26 06:33 · Score: 1

I guess, the original claim just messed up GByte and MByte, because with 1200x1200 dpi in 24 bit color you get about 420 MByte per A4 sheet, throw in some error correction (e.g. 8 in 14 encoding), and you are very close to 256 MByte.

--
.sig: Sique *sigh*
Re:Scam... by mrchaotica · 2006-11-26 06:41 · Score: 1

The thing that really bothers me is the idea of encoding using "shapes". I can see using pixels, and I can see using various colors (with some limitations on the number of different colors the scanner can reliably distinguish). But using shapes only seems to make sense if you have both the ability to squirt multi-shaped pixels from your inkjet (I've never heard of such a thing), and also the ability to scan at a significantly higher resolution than you can print. If it takes multiple pixels to create a shape, then I find it odd that those pixels aren't being devoted to "more dots".

Now, I don't know anything about this other than what I just read in the article, but I can see how this "shape" idea could work: patterns at multiple levels of scale. Think of one of those pictures where each "pixel" is made out of a photo, so that if you look close you can see a whole bunch of individual photos, but if you look from farther away you just see the average color in each, and those average colors form a big image. Now, if they could figure out how to do that with multiple levels of detail, I can see how you could start getting quite a lot of data on the page. Imagine that you could make one "higher order" data point out of, say, 32 "lower order" ones. Then, the storage capacity of the sheet, instead of N for only the lowest layer, becomes N + N/32 + (N/32)/32 + ... + 1.

--
"[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz
Re:Scam... by Anonymous Coward · 2006-11-26 06:45 · Score: 0

Arabs are scammers, plain and simple; it's not the people, it's their culture.

They just work on a level different than other types of scammers-- they do it at the academic level.

And they're holding back real progress with their bullshit.
Re:Scam... by presentt · 2006-11-26 07:56 · Score: 1

Yes, but you're still "pixels within pixels," and there is ultimately some smallest-sized pixel that could be used as a bit for data itself. So whether you have a 10x10 grid composed of 10x10 "smallpixel" pixels or not, you still have 10,000 bits.

--
I decided to stop stealing cynical quotes to use as a signature line.
Re:Scam... by kimvette · 2006-11-26 07:59 · Score: 1

With a four-color laser printer you do not get 24-bit color. You get FIVE distinct colors:

Medium (usually white) + CMYK

THat's white, cyan, magenta, yellow, and black.

Oh, so you have a photo inkjet printer? Okay, so you have two additional shades (usually light cyan and a fuscia), but now you have imprecise dot placement. You have improved capacity, but also introduced many more errors due to the difficulty in spraying ink precisely, which will likely use MORE than your added capacity in error correction.

Dye sublimation might get you there but that's going to be an expensive medium compared to Blu-Ray.

--
The Christian Right is Neither (Christian nor right). See: Matthew 23, Matthew 25, Ezekiel 16:48-50
Re:Scam... by Chmarr · 2006-11-26 08:09 · Score: 1, Informative

Okay... however, if you have a PHASE CHANGE printer, you can lay down a pixel in any of 16 million colours, because it doesn't use half-tone screens to get colour variations. These printers tend to have a low stated resolution - 300lpi for example - but they're much better because again they can use all of that reslution for image detail, and not halftones.

So, on the (unbelievable) assumption that a scanner could pick up ALL the 16 million colour variations. We have:

8.5*300*8.5*300*256*256*256 = 141180272640000 bits

or 17647534080000 bytes. That's 1.7 trillion bytes!

Now, I don't believe for a second that the printer nor scanner are accurate enough to pick up all the variations. So... lets take something more realistic. Lets assume 75dpi, and 512 (8*8*8) colour variations:

8.5*75*8.5*75*512 = 208080000 bits or 26010000 bytes, or 26 Megabytes. That seems a bit more "believable" but also demonstrates how quickly the storage degrades by taking off a couple binary factors from the X, Y and C dimensions.
Re:Scam... by Chmarr · 2006-11-26 08:11 · Score: 1

Oh, I missed up, I used another 8.5 instead of 11 :)
Re:Scam... by Jott42 · 2006-11-26 08:59 · Score: 2, Informative

You mess up in more than one way: 16 million colours is by 24 bits, or 3*8, not 256^3. The same mistake goes for a lot of the calculations in the replies on this page. If we have a paper of size 8.5 x 11 inch, a resolution of 300 dpi, and a colour depth of 24 bts (giving 16.8 million colours, we get a total information content of: 8.5 x 11 x 300 x 300 x 24 = 20196Mbit. Nothing more, nothing less. Shapes etc. is forms of redundancy, or error correcing codes, and reduces the amount of information carried, as measured in bits, and not the other way around.
Re:Scam... by drerwk · 2006-11-26 10:34 · Score: 1

I think he meant 8*8*8 being 9 bits of color. I don't think my scanner is actually good to 24 bits of color, but I might believe 9.
Re:Scam... by Anonymous Coward · 2006-11-26 13:19 · Score: 0

according to this Indian blogger [blogspot.com].
And, hey, if you can't trust a random Indian blogger, who can you trust?
Re:Scam... by An+Onerous+Coward · 2006-11-26 14:12 · Score: 1

Unfortunately, I don't think that works. The only way to achieve higher-order patterns is by bringing some sort of regularity to the lower-order patterns, which decreases the overall entropy (and hence the maximum amount of information that can be conveyed).

--
You want the truthiness? You can't handle the truthiness!
Re:Scam... by j-beda · 2006-11-26 14:36 · Score: 1

Each dot is going to be either cyan, magenta, yellow, or black. Laser and injket printers produce multicolour output by dithering, not by mixing inks, and the "dpi" rating of the printer refers to the dots used when dithering, not to the equivalent of screen pixels.

So instead of multiplying by 256, you have to multiply by 4. Result: about 140MB.

I think that in addition to CMYK when the pairs of inks overlap you also get Red, Green, and Blue (the CMY colour inks act as filters of the primary colours, pairs of inks then give you those primary colours - Cyan over yellow results in green) all without dithering. Thus rather than just four possiblities, you actually have: white (with no ink), cyan, magenta, yellow, black, red, green, or blue, giving a total of eight possibilities for the smallest picture element. I'm not too good with combinitorics - does this only raise the 140MB up to 280MB?
Re:Scam... by Anonymous Coward · 2006-11-26 17:47 · Score: 0

So instead of multiplying by 256, you have to multiply by 4. Result: about 140MB.

Wouldn't you have to multiply by the base 2 logarithm of 4? (or base 256 to go straight for bytes)

If you give them the benefit of leaving a dot blank for 5 colors:

8.5*1800 * 11*1800 * log_256 5 = 87,925,612 bytes
Re:Scam... by drgonjo · 2006-11-26 18:43 · Score: 1

I actually think that you could get more information out of the paper than the matrix of colored blocks. You guys seem to be thinking in terms of the colors being used as numbering system (much like binary is a base 2 numbering system, DNA is base 4 and we count in base 10). Your calculations are based on how much info you could cram on a piece of paper using each dot essentially as a bit that could instead have 4 or 5 possible states. I'm thinking that significantly more information could be gleaned from that piece of paper if the algorithm encoding and decoding it is sophisticated enough to find patterns and relations between the colors and the shapes. Maybe the first chunk of data is based upon the shapes and their relations to each other, the next is based on the colors used, another is based on the color densities, another is based on which colors are in which shapes, etc... A system like that would be very complex for sure, but maybe possible and could explain why you could get more info than a dot per dot "read" of the piece of paper.
Re:Scam... by Wrath0fb0b · 2006-11-27 02:36 · Score: 1

BTW, my back of the envelope calculation: 8.5x11in * 1800dpi * 1800dpi * 256colors = 77 billion bits of information, or about 9GB. I think 1800dpi is the output you get with a high quality laser printer. I doubt that "theoretical maximum" is achievable, because even the best printers are designed only to impress the human eye.

A4 paper is not 8.5x11. RTFA or at least RTF title of the article!
Re:Scam... by An+Onerous+Coward · 2006-11-27 05:21 · Score: 1

Oh, I'm sorry. Not 8.5 x 11 inches, but 8.3 x 11.7 inches. Yeah, that's going to change my calculations *vastly*. ::end sarcasm::

--
You want the truthiness? You can't handle the truthiness!
Re:Scam... by An+Onerous+Coward · 2006-11-27 05:35 · Score: 1

Unfortunately, it doesn't work. What you're proposing is some sort of lossless compression scheme. If true, it would be way bigger news than some guy applying ink to a fibrous substrate. It would also violate about eighteen basic principles of information theory.

As I've said elsewhere, the whole "shapes" thing is bogus. In the end, all you really have is the dots, and shapes just end up bringing regularity to the patterns of dots. In information theory, regularity is the opposite of entropy, and the entropy of a pattern is identical to its ability to carry information. Quick version: 11111111111111111111111111111111111111111111111... has much less entropy than 10001011110100011101011110100001011010101000011110 0001... and the latter carries much more information (because the latter message cannot be compressed as small as the former message).

In the case of shapes, knowing that your dots form either "circle", "triangle" or "square" means that if you read a given pixel, it is possible to say something about neighboring pixels. In order to convey the most information possible, knowing the contents of pixel A shouldn't tell you anything about any other pixel on the page.

--
You want the truthiness? You can't handle the truthiness!
Re:Scam... by Anonymous Coward · 2006-11-27 10:31 · Score: 0

Obviously the claims are off by several degrees of magnitude, but you might be able to get a little closer by using more inks. The only reason we think in terms of RGB/CMYK is because the human visual system is only sensitive to these three frequencies. We can simulate the appearance of any color by combining three specific colors in different proportions -- but we forget that we are not actually reproducing the colors themselves, just the perception of them. Some animals have more than three color receptors, so they can see many more colors than we can.

Thus, you may be able to cram more information on a page by printing in inks that block different frequencies. Sure, a human might not be able to perceive the difference, but you'll get more information on the page. You might even be able to adapt existing printers/scanners, for example, by running the page through multiple inkjets, each with a customized cartridge, or using scanners with different colored filters. There's probably a limit to how many inks you can print to the page and still have it be readable, but that limit is surely greater than three.

Scam? by anandpur · 2006-11-26 02:38 · Score: 5, Informative

http://itsoup.blogspot.com/2006/11/scam-of-indian- student-developing.html
http://www.digg.com/tech_news/Scam_of_Indian_stude nt_developing_technology_to_store_450_GB_on_paper

Re:Scam? by brxndxn · 2006-11-26 03:48 · Score: 0, Offtopic

PWNT!!!

--
--- We need more Ron Paul!
Re:Scam? by AtomicBomb · 2006-11-26 05:14 · Score: 1

The Register people seems to be a bit more clue up than the techworld people.

http://www.theregister.co.uk/2006/11/23/rvd_system /

So one picture says more by Anonymous Coward · 2006-11-26 02:39 · Score: 2, Funny

than 6800000000 dwords?

Sounds like vaporware by lunchlady55 · 2006-11-26 02:39 · Score: 1

Copied from digg.com, where there is already a post indicating this is from a very unreliable source. Besides, we can't even get B&W OCR right.

Re:Sounds like vaporware by jonbryce · 2006-11-26 02:55 · Score: 1

We can, if it is originally printed with OCR in mind. Bar codes are pretty reliable, and the bank's computers can read the numbers at the bottom of a cheque pretty reliably.
Re:Sounds like vaporware by Anonymous Coward · 2006-11-26 03:13 · Score: 2, Informative

Bank's computers don't use OCR, they use MICR
Re:Sounds like vaporware by Skater · 2006-11-26 03:28 · Score: 1

That's the point, though - they're printed with the reading mechanism in mind, so they're very reliable. This paper storage device uses a similar concept.
Re:Sounds like vaporware by cmdrbuzz · 2006-11-26 10:35 · Score: 1

In the UK the pre-printed Credits and Cheques do use MICR, but the items printed by the tellers all use OCR.
If you pay a cheque into an account without a pre-printed slip, the cashier will print one which will be read by OCR.

Why? by Cartzca · 2006-11-26 02:39 · Score: 1

Shouldn't work, really should it? I mean it's all very well saying 'ooh we can get an infinite number of different curves in this finite area', but at the end of the day, you're going to be limited by the resolution of the scanner or printer. But for a given resolution, you can store the most data by drawing arrays of coloured dots (the curves surely have redundant data in them). Which isn't a very novel idea, at all. Maybe I'm wrong. Besides, think of the trees.

Re:Why? by Anonymous Coward · 2006-11-26 07:45 · Score: 0

The quoted capacities are bunk, and it's possible that it was a mistake by the article writer. A 45 second video doesn't require anywhere near hundreds of gigabytes of storage, even uncompressed and in high definition.

That said, there's a good reason for using shapes instead of just an array of dots. For one, you can't reliably detect individual dots of a print-out for various reasons. So let's say you use 10x10 dots. You can pack in more data if you use shapes instead of just a bunch of square 10x10 dots.

The basic idea is that you can't reliably discern 1x1 dots, but maybe you can reliably discern patterns in bigger squares, even if you couldn't reliably detect the individual dot patterns. (Which would resemble noise.)

steganography by Anonymous Coward · 2006-11-26 02:40 · Score: 0

could you store your data to look like a simple picture, or just a word document made out of billions of circles and triangles arranged into letters and sentences?

Maybe hand in your 20 page english essay, 1 page that says you suck :)

I wonder if you made photocopies of it would they be DRM'ed to prevent the distribution of 50,000+ songs per sheet

This is brilliant by TubeSteak · 2006-11-26 02:41 · Score: 1

The ultimate backup solution. With acid free paper & some stable color inks, you can back up your entire hard drive on a regular basis.

I wonder how... low the data density can go in terms of DPI & resolution and how that would compare to 2D barcodes.

BTW - TFA is really just a summary of this article
http://www.arabnews.com/?page=4&section=0&article= 88962&d=18&m=11&y=2006

--
[Fuck Beta]
o0t!

Re:This is brilliant by An+Onerous+Coward · 2006-11-26 06:10 · Score: 4, Funny

How much do you love the story's title? "Data can now be stored on paper!"

We truly live in the golden age of technology.

--
You want the truthiness? You can't handle the truthiness!
Re:This is brilliant by kimvette · 2006-11-26 08:21 · Score: 1

I can store data on a more indelible medium.

I have a chisel, and I can scrounge up some slate in the back yard.

Beat THAT for reliable data storage!

--
The Christian Right is Neither (Christian nor right). See: Matthew 23, Matthew 25, Ezekiel 16:48-50
Re:This is brilliant by moro_666 · 2006-11-26 09:14 · Score: 1

I'f mod you funny, but the funny slashdot decided to leave me without modpoints today :-/

Anyway, indeed a new landmark in science. Finally we have a way to store something on that paper that otherwise lays around uselessly, shocking!

--

I'd tell you the chances of this story being a dupe, but you wouldn't like it.

C'mon Slashdot by jokell82 · 2006-11-26 02:41 · Score: 5, Informative

I expect to see a story like this on Digg, but I thought Slashdot was better than this.

It's a scam.

--
I dunno who it is
but it prolly is fhqwhgads.

Re:C'mon Slashdot by Kuscheltier · 2006-11-26 03:08 · Score: 2, Funny

quote:I expect to see a story like this on Digg, but I thought Slashdot was better than this.

You must be new here!
Re:C'mon Slashdot by AK+Marc · 2006-11-26 07:06 · Score: 1

It's a scam.

Let's take a look at some of the "proof" that it's a scam:

This guy is questioning the fundamental reason why digital / binary technology became popular, its because its either 0 or 1 , so its mostly fool proof, we could have used different voltages and instead of binary, use 0 1 2 3 4, but then it will not be fool proof.

Well, if you have ever used, say, digital TV, satellite data or TV, or some really obscure technology like 802.11 wireless, then you've used technology that uses different levels other than just 0 or 1. And here this guy is "proving" that this is a scam because anything other than 0 and 1 would be stupid. I'm not saying that it is or isn't true, but I'm saying that especially this one blogger is a loser who doesn't understand digital data funtamentals and is speaking from ignorance.

--
Learn to love Alaska
Re:C'mon Slashdot by osu-neko · 2006-11-26 09:02 · Score: 1

I remember when modems got beyond 300 baud, people started complaining about people who continued to use the term incorrectly. For example, people would call a 2400 bps modem a "2400 baud modem", but that was incorrect, since baud is signals per second, not bits per second, and faster modems got higher speeds by doing more than just using mode signals, but also by using more frequencies to get more bits per signal. IIRC, a 2400 bps modem was actually 600 baud, but 16 different frequency levels, so 4 bits per signal.

The guy who wrote the blog entry clearly is too clueless to judge the veracity of this claim. Not that the claim is believable, either, just some of this guy's objections aren't.

--
"Convictions are more dangerous enemies of truth than lies."
Re:C'mon Slashdot by Anonymous Coward · 2006-11-26 17:42 · Score: 0

I thought Slashdot was better than this.
You must be new here.

I tried this... by Anonymous Coward · 2006-11-26 02:42 · Score: 5, Funny

I wiped my ass on a blank sheet, scanned it in and was greeted with the Windows Vista login screen.

Re:I tried this... by sillybilly · 2006-11-26 04:56 · Score: 1

I was just looking for the toilet paper reference! One use is an awesome random seed generator for security applications! Everytime you wipe, first scan and read contents to update the seed, before you toss!
Re:I tried this... by MisterSquiddy · 2006-11-26 06:36 · Score: 0

That's funny. I printed out your post, scanned it in and got a picture of a turd.
Re:I tried this... by Anonymous Coward · 2006-11-26 12:10 · Score: 0

Don't try to fool us! We know it will take WAAAAAAY more than 256GB to get to the Vista login screen.
Re:I tried this... by master_p · 2006-11-26 22:52 · Score: 1

So you were able to stomach Windows? I thought no sane person could do that...

That's not new by El+Lobo · 2006-11-26 02:42 · Score: 0

That's not new or revolutionary in any way. You can encode and store data anywhere. In my university we are making experiments storing data in microcrystals. This works at atomic level the same way a hologramma works more or less. In one microcrystal of 1x1 mm you can store more than 500 GB data at this moment. The only problem is: the technology is so damn expansive than you can but 500 computers with the same money :-) The same could be applied to this paper format, I guess the scaner that reads-decodes this data cannot be bough in the next-door store.

--
It's time to realise that Abble's products are the biggest abomination these days. Just say NO to the dumb iAbble way!!

This looks like a lie by OeLeWaPpErKe · 2006-11-26 02:45 · Score: 3, Insightful

What does this bring that normal scanners can't ?

Let's see A4 - 256Gig. Let's say n different colors.

He'd need to store 256*1024*1024*1024*8 = 2199023255552 bits
on A4 = 210 mm x 297 mm = 62370 mm^2 = 2456 inch

That makes 895 367 775 bits per inch. To encode that you'd need 895 367 775 / log2(n) dots. Increasing the number of colors can buy you some leeway, but not that much.

The surface area of such a dot would be 1/30 000 000 th of a millimeter.

Where will you find paper that has surface flaws significantly smaller than that ? No matter what the encoding, you're still going to need it. So this is a scam, plain and simple.

Re:This looks like a lie by Anonymous Coward · 2006-11-26 03:03 · Score: 0

You needed to do that much math to figure out that it is a scam? I saw the word Indian and got that much figured out in five seconds...
Re:This looks like a lie by LiquidCoooled · 2006-11-26 03:08 · Score: 1

A4 paper in India obviously means something different to a4 paper in the rest of the world.
One A4 sheet can cover an entire town.

--
liqbase :: faster than paper
Re:This looks like a lie by Anonymous Coward · 2006-11-26 03:09 · Score: 0

Nah, it's not because he's an Indian. Look at his educational institution: "Muslim Educational Society Engineering College". Yeah, like they're going to come up with anything useful.
Re:This looks like a lie by owlstead · 2006-11-26 03:09 · Score: 1

You guys don't get it, this storage facility is write-only, so there is no need for a scanner.
Re:This looks like a lie by BigBuckHunter · 2006-11-26 03:09 · Score: 0

The way I understand it, a triangle can store 4 bits (pointing up, down, left, right). Then add colors (ROYGBIVK) to the equation, that's 8 bits. Now the triangle can hold a total of 32 bits (4x8)of info. What if the triangle is hollow/solid? That's now 32x2 =64 bits!. Squares are 2x8x2, circles are 1x8x2, line holds 32 (vert, horiz, slash, bkslash). See where this is going. This isn't something new. I look forward to seeing if they can make it practical.

BBH
Re:This looks like a lie by BeerCat · 2006-11-26 03:11 · Score: 1

I think you are mostly there.

He'd need to store 256*1024*1024*1024*8 = 2199023255552 bits

A4 is, as you say 210mm x 297mm, which is 96.67 inch^2 (you forgot to divide by another 25.4)

So, 22,747,732,032 bits per inch^2 are needed. Now, even a lowly 300 dpi scanner would only need to differentiate 252,752 colours, which is achievable with 6 bits each for R,G and B.

Alternatively, a 600dpi scanner could do it with 63188 colours, which is less than 2^16.

In other words, I think it is physically possible, but the scanner resolution would need to be well chosen to counteract the effects of fading of the colours or creases in the paper.

--
"She's furniture with a pulse"
Re:This looks like a lie by JoostSchuttelaar · 2006-11-26 03:29 · Score: 1

Up, down, left, right -> 4 possiblities, 2^2=4, so 2 bits, not 4. Also, hollow or solid, that's one bit extra, not twice as many bits. Dito for the color info. Nevertheless, this is a big scam. It all boils down to printer resolution/medium quality/optical scanner resolution. Nowhere near enough for these dense data densities.
Re:This looks like a lie by kubalaa · 2006-11-26 03:42 · Score: 5, Insightful

Your scheme doesn't work because triangles are not atomic; they are made out of lines, which are described in terms of endpoints, which have a finite resolution. For example, an inkjet printer would make a triangle by printing dots at some fixed resolution. A drafting machine or laser printer might be able to draw the triangle without resorting to dots, but the elements of the triangle can still only be positioned with finite accuracy.

Let's say that we're drawing very tiny triangles as close to our resolution limit as possible (which we must do if we want to fit a lot of them). Such a triangle might be, say, 3 x 3 resolution units, so a hollow, up-triangle might look like this:
010 101 111
But look: there are 2^9 (or 512) possible shapes that can be made in this grid -- so by using only 64 different triangles, we are actually underutilising our medium. It doesn't matter what technology you use, any shape other than a "dot" is itself made out of smaller units like "dots", so restricting our vocabulary to certain shapes (rather than arbitrary sequences of dots) will waste space.

--
"If you look 'round the table and can't tell who the sucker is, it's you." -- Quiz Show
Re:This looks like a lie by Dun+Malg · 2006-11-26 03:43 · Score: 1

The way I understand it, a triangle can store 4 bits (pointing up, down, left, right). Then add colors (ROYGBIVK) to the equation, that's 8 bits. Now the triangle can hold a total of 32 bits (4x8)of info. What if the triangle is hollow/solid? That's now 32x2 =64 bits!. Squares are 2x8x2, circles are 1x8x2, line holds 32 (vert, horiz, slash, bkslash). See where this is going. This isn't something new. I look forward to seeing if they can make it practical.
No, it's a scam. You're missing the greater point. Even with all this fuddly array of shape arrangements, there is not nearly as much bandwidth on a standard sheet of paper as they claim. Sure shape encoding is an interesting notion, but not nearly as practical as they profess.

--
If a job's not worth doing, it's not worth doing right.
Re:This looks like a lie by scribblej · 2006-11-26 04:08 · Score: 2

Oh, to have modpoints. As I was reading the gp post I was wondering what words I would use to explain his error. You've done a far better job than I would have - though I probably would I have begun along the lines of, "Listen, fool!" which usually just hurts my case. But is fun.

Okay, I'm probably past 20 seconds now. Someone with mod points, save the parent poster from obscurity, please!
Re:This looks like a lie by bergeron76 · 2006-11-26 05:03 · Score: 0

You're forgetting the geometric patterns. Using 10 geometric shapes, plus 65,535 colors (to be readable on the other end), this may be feasible.

Doubtful though.

--
Don't think that a small group of dedicated individuals can't change the world. It's the only thing that ever has.
Re:This looks like a lie by Anonymous Coward · 2006-11-26 05:14 · Score: 0

Consider a three dimensional analogue to OFDM and it's easy to see that the spatial data density he claims is a piece of cake.

Effing /.ers will buy anything for a dollar.

Oh, and enough of the friggin subliminal messages in the friggin image challenge.
Re:This looks like a lie by RealGrouchy · 2006-11-26 05:59 · Score: 1

Not if you use the highly-efficient LenPEG compression algorithm!

- RG>

--
Hey pal, this isn't a pleasantforest, so don't waste my time with pleasantries!
Re:This looks like a lie by hey! · 2006-11-26 06:09 · Score: 1

This is a theoretically interesting assumption in your argument. Since there are only 2 ^(n + c - 2) images that can be printed (where n is the number of pixels on the page and c is the number of colors the pixel can take), this forms the upper limit for the number of messages that could be represented by a printer on a page.

This can be shown by disproof. Let L be the number of distinct dot arrays a printer can create. Suppose we try to print a sequence of M distinct messages, where M > L; the L + 1st message must be identical to one of the previous L messages.

This doesn't mean, however, it is not possible to encode a message longer than L on the page, it just can't contain more L bits of information. That is to say you can compress data, but you can't get more information out. If you are only interested in a small minority of the possible messages, it may be possible to compress much longer data streams, which could make an impressive demonstration.

It may also be able to get the hardware in the printer to encode more possible dot patterns on the page than their rated resolution, I think. You could probably encode more than two bits of data in two pixels by playing with their alignments. This wouldn't be many orders of magnitude more data, but some.

--
Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
Re:This looks like a lie by erotic+piebald · 2006-11-26 06:12 · Score: 1

Hmmm, maybe the 'something new' that he's doing is 'printing' different shaped dots. Atomically different shapes, not 'dots'.
Re:This looks like a lie by ozbird · 2006-11-26 06:15 · Score: 1

You're forgetting data compression...
Re:This looks like a lie by solafide · 2006-11-26 06:23 · Score: 1

Suppose we try to print a sequence of M distinct messages, where M > L; the L + 1st message must be identical to one of the previous L messages.
Not quite right, but a trivial flaw. By the Pidgeonhole Principle in a set of L+1 messages with L possible values for each message one value of a message must appear twice, but that doesn't guarantee that the L+1st is equal to one of the previous ones; take L=2 -> M-M-N has L+1 messages, but the L+1st is not equal to any previous message.
Re:This looks like a lie by Anonymous Coward · 2006-11-26 06:23 · Score: 0

The way I understand it, a triangle can store 4 bits (pointing up, down, left, right). Then add colors (ROYGBIVK) to the equation, that's 8 bits. Now the triangle can hold a total of 32 bits (4x8)of info. What if the triangle is hollow/solid? That's now 32x2 =64 bits!. Squares are 2x8x2, circles are 1x8x2, line holds 32 (vert, horiz, slash, bkslash). See where this is going. This isn't something new. I look forward to seeing if they can make it practical.
No, it's a scam. You're missing the greater point. Even with all this fuddly array of shape arrangements, there is not nearly as much bandwidth on a standard sheet of paper as they claim. Sure shape encoding is an interesting notion, but not nearly as practical as they profess.Bandwidth refers to transmission, not storage. Perhaps you're thinking of paper aeroplanes... but they're inferior to the HD-DVD frisbee :)
Re:This looks like a lie by An+Onerous+Coward · 2006-11-26 06:34 · Score: 1

I'd considered that option as well. Not entirely outside the realm of possibility, but consider this:

1) it would require specialized printing hardware. Consider how likely it is that this guy built a specialized printing head that can match the resolution of current laser printers, and produce distinguishable shapes as well. It seems far more likely that he's working with off-the-shelf hardware.

2) With inkjet, at least, whatever shape you're trying to squirt onto the paper, the shape will be blurred as the ink is soaked into the paper.

--
You want the truthiness? You can't handle the truthiness!
Re:This looks like a lie by Anonymous Coward · 2006-11-26 09:06 · Score: 0

I'm a bit concerned about your math jumping from 62370 mm^2 to 2456 inch. How did you translate area to linear inches? Are you already assuming that dots are 1mm by 1mm? As far as I know 62370mm^2 should be around 96.67 sq-inch.

Assuming that a dot could be of 1/10 th of mm by side, an A4 sheet will give you 2100x2970 dots. That means that you'll need 352000 levels of color.

What worries me the most, is that slashdot published this. Isn't it like saying "A guy discovered a way to save 74mins of high resolution audio signals on a 12-inch vinyl device that spins at 33 1/3 rmp". I mean, paper is not a lossless media, and 300k levels might not easy to difference when you attempt to read it. Wasn't that the reason why CDs outperformed vinyls?
Re:This looks like a lie by osu-neko · 2006-11-26 09:30 · Score: 1

Did you read the article? It doesn't imply but oughtright says that it requires a specially developed scanner to read, so we know he's not using off-the-shelf hardware on that end. It seems unlikely he'd require a specially developed scanner to read the output of an off-the-shelf printer.

It's actually pretty obvious from the description that what he's trying to do would require specialized printing hardware. Now, whether he's actually done that, or just hidden a disk inside his special scanner, I don't know. The fact that he won't say how it's done stinks of scam. But all these numbers people are throwing around in this thread are just way off-base.

--
"Convictions are more dangerous enemies of truth than lies."
Re:This looks like a lie by dangitman · 2006-11-26 09:50 · Score: 1

But wouldn't the triangles help with things like error-correction, or tolerance for a degraded image? If you use dots only, and some of those dots get damage, the data is gone. But the shape of a triangle can be inferred from the angles, even if part of the image is missing. Maybe it's not "wasted" as such, but used to make the storage medium more robust?

--
... and then they built the supercollider.
Re:This looks like a lie by Petronius.Scribe · 2006-11-26 14:33 · Score: 1

Storage is merely transmission (forward) through time. Bandwidth applies just as much as to any other form of transmission.
Re:This looks like a lie by madbawa · 2006-11-26 15:57 · Score: 0

Why don't you people STOP THINKING IN BITS!!!! 3 co-ordinates in a 2-D space present a theoretically infinite number of possibilities. Data can be stored in the angle, distance, location and relative position of the triangle. This has nothing to do with bits and bytes, but is a different scheme of encoding. So stop with those inane calculations already!
Re:This looks like a lie by ThePhilips · 2006-11-26 21:32 · Score: 2

No matter what the encoding, you're still going to need it. So this is a scam, plain and simple.

You forgot the one important component which embedded in the name of the RTFA's method: colors. The method uses colors and thus called "Rainbow format". Also it specifically used geometrical shapes - which introduce another angle to data representations. It's not about dots anymore - it's about shapes and colors.
Besides, somehow encodings used to reach megabyte wireless speeds are not surprising to you. (Note, w/o frying its user.)
Math of past mid century - applied algebra - allows one to customize encoding to errors particular to an application.
Read on about the modulations(*) and how they can deal with different kind of errors. Or more precisely, the modulations are capable of detecting errors immediately with high probability of correction. This is further development of the same theory which covers Hamming codes. Often the modulations are used together with error correcting codes - they complement each other - to further improve line characteristics. (Hamming works on bit sequences (e.g. bytes), while modulations work directly on bits, converting sequences of numbers as seen on line into sequence of bits.)
(*) If I'm not mistaken, the "modulation" isn't original "modulation" as used by modems, but has to do with modulo operations, which is heavily used by the methods e.g. in CDMA (Code Division is in fact modulo, not normal division) and also as I heard in Wi-Fi.

--
All hope abandon ye who enter here.
Re:This looks like a lie by SQL+Error · 2006-11-26 22:13 · Score: 1

Doesn't work. Even if you had a magic printer that could do that (not physically impossible, just utterly impractical), that just increases the resolution required of the scanner. In fact, it increases the required resolution beyond what is possible with visible light.

What you're talking about is doing (approximately) 90nm lithography on paper, and then reading it back. You can do 90nm lithography, of course (but on pure silicon wafers, not on paper), and you can read it back using an electron microscope. But it's hardly a cost-effective method for backing up your pr0n.
Re:This looks like a lie by Anonymous Coward · 2006-11-29 05:29 · Score: 0

He'd need to store 256*1024*1024*1024*8 = 2199023255552 bits
on A4 = 210 mm x 297 mm = 62370 mm^2 = 2456 inch
You're confusing inches and square inches there. 62370 mm^2 = 62370 / ( 25.4 mm/inch ) / (25.4 mm/inch ) inch^2 = 96.67 inch^2. This is at least in the ballpark of a 8x11 paper with 88 inch^2.

2199023255552 bits / 96.67inch^2 = 22747732032 bits/inch^2

sqrt( 22747732032 bits/inch^2 ) = 150823/inch

That would be 150823 dpi to encode those bits. I have no idea where your log2 comes from - one bit is equivalent to one dot.

Durability by HammerHead2000 · 2006-11-26 02:47 · Score: 1

While I'm sure this is very pretty and sounds good, there have to be durability issues with storing all your data on a piece of paper. At least if I spill my coffee over a CD I know it will work again once I've dried it off - doing this with a bit of paper is a little more problematic.

Archive format by BeerCat · 2006-11-26 02:50 · Score: 1

If they can pull it off, it might be a good "Medium term" archive format (in other words, about 100 - 500 years), as there are many many books of those ages.

Given that the BBC's Domesday project (data gathered in 1986) needed to be "rescued" by 2002 (http://news.bbc.co.uk/1/hi/technology/2534391.stm ), then there are currently no reliable digital archive systems for long term storage.

On the other hand, the Rosetta Project look like they could get it licked for really long term storage (Example http://www.rosettaproject.org/about-us/disk/concep t)

--
"She's furniture with a pulse"

Re:Archive format by Dunbal · 2006-11-26 02:53 · Score: 1

If they can pull it off, it might be a good "Medium term" archive format (in other words, about 100 - 500 years), as there are many many books of those ages.

Plus you could always take some LSD (if you're into that sort of thing) and stare at your data for a few hours. Like, wow - man. Try doing THAT with a hard drive...

--
Seven puppies were harmed during the making of this post.
Re:Archive format by Znork · 2006-11-26 03:09 · Score: 1

"then there are currently no reliable digital archive systems for long term storage."

Actually, there are reliable digital archive systems, you just need to get rid of the conceptual fallacy that 'archive' means some 'special' form of storage.

Consider the data you want 'archived' the same as stored live data and dont have that particular problem anymore. The problem for long term live data storage in that case merely becomes issues of migration, redundancy and maintenance (the issues one tries to ignore or drastically reduce when thinking 'archive', with expected results). And, of course, you need to store your data in an accessible format; any proprietary and/or drm'ed data you might as well just run through the shredder today and save the bother. You likely wont be able to read it in a decade, never mind half a century, anyway.
Re:Archive format by BeerCat · 2006-11-26 03:32 · Score: 1

OK, my bad on using a bit too much verbal shorthand.

The data medium matters for "store and forget about it", whereas "install new and copy old" means that every time the hardware is upgraded, the archived data is re-copied (which is presumably why there is still the oldest web page accessible, created back in 1996)

Spot on with the DRM comment, though. I wonder how long it will take people to realise that it turns data into random junk.

--
"She's furniture with a pulse"
Re:Archive format by Rallion · 2006-11-26 04:04 · Score: 1

If they can pull it off, it might be a good "Medium term" archive format (in other words, about 100 - 500 years), as there are many many books of those ages.

Except that in a book, if the color fades a little, we can still read it. If the paper gets folded, we can still read it. And so on.
Re:Archive format by Anonymous Coward · 2006-11-26 06:47 · Score: 0

Plus you could always take some LSD (if you're into that sort of thing) and stare at your data for a few hours. Like, wow - man. Try doing THAT with a hard drive...

Dude, you can stare at anything for a few hours after doping up a little, and LSD is much stronger shit.

So in our digital age... by n1hilist · 2006-11-26 02:51 · Score: 2, Funny

Can we now say again, "Sorry, my dog ate my homework!" ?

Re:So in our digital age... by dohzer · 2006-11-26 05:31 · Score: 1

Really?
Where is the article about them printing data on a bone?

Bullshit, complete bullshit by terrencefw · 2006-11-26 02:52 · Score: 4, Informative

2.7GB per square inch would would require a linear data density of 152292 dpi. Neither my scanner nor my printer come within a hundredth of this. The main problem with the printer at such resolutions is bleeding of the inks into the paper. To form the different shapes several dots would be necessary, which would further decrease the effective resolution by an order of magnitude. For example, suppose a 3x3 grid was used to form each character, the article states that there are four different shapes used, yet that 3x3 grid could encode 512 different patterns. Realistically, at 600dpi (giving 360000 dots per square inch), with 3 ink colours (yielding 8 different colours) you would get 360000 bytes per square inch, or 33MB per A4 page - somewhat short of the 256GB promised. You'd also need to dedicate around 25% of the capacity for error correction. This is complete and utter bollocks.

--
Like tinyurl, but one letter less! http://qurl.co.uk/

Re:Bullshit, complete bullshit by CODiNE · 2006-11-26 05:53 · Score: 1

The main problem with the printer at such resolutions is bleeding of the inks into the paper.

The trick is to actually render this page as a vector image, save the 256GB file on a network shared drive, and THEN open it on the other computer. Tada! 256GB on a single page IN a paperless office, how neat is that?

Of course the actual advantage of this is when the feds start looking through your files and find the image you tell them "Oh, I made a program to draw random shapes, but forgot to tell it when to stop."

--
Cwm, fjord-bank glyphs vext quiz
Re:Bullshit, complete bullshit by Anonymous Coward · 2006-11-26 08:01 · Score: 0

While I agree with your basic conclusion, I think printer manufacturers would be quite surprised to hear that three inks only produce eight different colors. Your estimate is true if you take the colors to be on/off quantities, but you can also vary the intensity of each component color by how much ink you use. A little ink will produce a lighter shade than more ink, and combinations with other colors produce even more variations.

High-end consumer printers also often use more than 3 inks these days, to get better color reproduction. 8 inks (7 if you don't count black) isn't uncommon anymore. And high-end inkjets are often capable of much better than 600 dpi these days. So you can do better than 33 MB on a single A4 sheet, if you can really get 600 dpi resolution, and every dot is a distinguishable pixel (which I doubt). Not all that much better, though.
Re:Bullshit, complete bullshit by nametaken · 2006-11-26 08:20 · Score: 1

Ok, so I get that it's a scam or involves some undisclosed and brilliant technique.

But what is interesting to me is that you're saying I could realistically store about 50MB of data on the two sides of a sheet of paper and it would include error correction. That actually sounds useful to me if I had an ADF and a decent scanner. I suppose my biggest problem would be the cost of the ink, otherwise it could be a fun (if impractical) way to store data.

As far as you know, has someone made software to encode and decode this sort of thing as a proof of concept, if even just to play with it?
Re:Bullshit, complete bullshit by Ruby+Wednesday · 2006-11-26 09:54 · Score: 1

While I agree with your basic conclusion, I think printer manufacturers would be quite surprised to hear that three inks only produce eight different colors. Your estimate is true if you take the colors to be on/off quantities, but you can also vary the intensity of each component color by how much ink you use. A little ink will produce a lighter shade than more ink, and combinations with other colors produce even more variations.

That only applies with dye sublimation printers. Most printers can only output shades of colours by dithering or halftoning
Re:Bullshit, complete bullshit by dangitman · 2006-11-26 09:54 · Score: 1

Of course the actual advantage of this is when the feds start looking through your files and find the image you tell them "Oh, I made a program to draw random shapes, but forgot to tell it when to stop."
I'm pretty sure that drawing random shapes is illegal under the PATRIOT act. The only thing you are allowed to draw is Bald Eagles and American flags with the President standing in front of it. Or any combination thereof. As long as the eagle is not crushing the President's skull or pooping on him.

--
... and then they built the supercollider.
Re:Bullshit, complete bullshit by mikiN · 2006-11-26 23:41 · Score: 1

I wonder what information would result from decoding the American Flag using his algorithm...

--
The Hacker's Guide To The Kernel: Don't panic()!

Do The Numbers by SQL+Error · 2006-11-26 02:53 · Score: 4, Insightful

2.7GB per square inch, eh?

Alright, that's 21.6 gigabits per square inch.

For the sake of argument, let's say that the printer and scanner can reliably print and scan colour at 24-bit fidelity (which is nonsense, but makes the numbers nice and tidy): 900 million pixels per square inch.

That's 30,000 dpi.

That means you'd have to print and scan pixels less than a micron across. In full colour.

I don't think so.

Size by Knutsi · 2006-11-26 02:56 · Score: 1

And I thought those huge floppy disks we used back at school were large....

Sound interesting tho', but those 256GBs seem a bit too much, unless the computer has a massive translation table for those shapes and colors that account for a staggering amount of possible meanings. This is like the mother of all barcodes ;)

hmmm... by leehwtsohg · 2006-11-26 02:57 · Score: 4, Interesting

600dpi times 8*11 inches makes 32M dots. To get 26GB you need 6500 bits per dot. This gives either an amazing resolution in color separation (as opposed to, say, 32 bits on a screen - maybe 700 different frequencies, each with 10bit separation), or much higher dot density - something like 50000dpi!

Re:hmmm... by 3aPo · 2006-11-26 04:14 · Score: 0

He could have used some sort of source encoding also. The article doesnt say if he put up the uncompressed data on the paper, if he did use any simplistic source encoding he could actually reduce the data rate by 50%. But I am highly skeptical if he did that. Post Script: DVD and CD dont use any source encoding, so the capacity of the paper and DVD might not be comparable here.
Re:hmmm... by Jeff+DeMaagd · 2006-11-26 05:34 · Score: 1

(as opposed to, say, 32 bits on a screen - maybe 700 different frequencies)

How do you get 700 light frequencies in a pixel on a screen? The colors are simulated by just mixing three different frequencies. It might appear simulate more frequencies, but it's just three.
Re:hmmm... by dangitman · 2006-11-26 09:57 · Score: 1

No, it's three different frequencies, multiplied by intensity. Each pixel can do more than just be on or off. So, in combination, you do get more actual values.

--
... and then they built the supercollider.
Re:hmmm... by leehwtsohg · 2006-11-27 00:44 · Score: 1

Sorry, I was saying that to get 6500 bits in a single pixel, you'd need something like 700 different frequencies with 10 bit separation each - which you obviously can not do on a regular screen, and with current technology is pretty much impossible, I think. (especially the printing part)

Full costs by Captain+Perspicuous · 2006-11-26 02:58 · Score: 1

From the linked article in the article:

Sainul says a CD or DVD consumes 16 grams of polycarbonate, a petroleum by-product. While a CD costs Rs.15 (SR1.25), his paper or plastic-made RVD will cost just about Rs.1.50 and has 131 times more storage capacity.

So, the paper is cheap - but how exactly do you print on it? Using dyes? Which costs how much? And are created from what?

Exactly.

My whole video collection in a Trapper Keeper! by Anonymous Coward · 2006-11-26 03:04 · Score: 0

OK, that would be cool. All my videos in one narrow loose-leaf binder.

Re:My whole video collection in a Trapper Keeper! by ThatsNotFunny · 2006-11-26 03:12 · Score: 1

If these "videos" are the kind that I'm thinking of, be careful of the pages sticking together!

--
"Was it a millionaire who said 'Imagine No Posessions?'" -- Elvis Costello

dots per inch,, color resolution by gd23ka · 2006-11-26 03:05 · Score: 1

Let's see, a piece of "letter" paper is 8.5 x 11 inches. Let's see at 600 dpis inch that are 8.5 x 600 = 5100 dots
per line or 5100 x 11 = 56100 dots per page. The raw capacity of a common laser printer is therefore a little less
than 64 Kilobytes per page.

The next thing we could do to increase the amount of data is to add the color dimension. Let's say our printer is capable
of printing 256 * 65536 different shades of red, blue and green by putting different quantities of cyan, magenta and yellow
and black ink/toner on the paper _and_ our scanner has the color resolution to discern 65535 different shades on it, each of
the 56100 dots on that piece of paper could be in 256 * 65536 = 16777216 states meaning each of the 56100 dot can store a
distinct 3 byte value. So now we get 56100 * 3 = ca. a little less than 192 Kilobytes of raw data.

I guess however that in practice we would be hard pressed to come up with more than 64K per piece of paper (128K if you're
using double sided paper :-) ) because of real-life equipment limitations in color and dot resolution and of course the
certainly required error correction scheme(s).

I still like the idea, though of refining the 2d barcode.

Re:dots per inch,, color resolution by fatphil · 2006-11-26 03:17 · Score: 1

times another 600. (You've assumed 1dpi in one dimension).

It's so obviously a scam that there's no need to discuss it further.

The best reaction is silence.

--
Also FatPhil on SoylentNews, id 863
Re:dots per inch,, color resolution by gutnor · 2006-11-26 03:19 · Score: 1

A 600 dpi printer is actually a 600 dpi vertical and 600 dpi horizontal
So it is (8.5 * 600) * ( 11 * 600) = 33 660 000.

So that's 30 Mb.

However to be able to store 250.000 Mb, it still need to be able to encode (and decode) something like 8000 variations on a single dot. A bit optimist I suppose.
Re:dots per inch,, color resolution by BeerCat · 2006-11-26 03:19 · Score: 1

Let's see, a piece of "letter" paper is 8.5 x 11 inches. Let's see at 600 dpis inch that are 8.5 x 600 = 5100 dots
per line or 5100 x 11 = 56100 dots per page.

5100 dots per line is 5100 x 11 x 600 dots per page, which is 33,660,000 dots, or about 32Mb when done monochrome, or 8.2Gb when done in only 8 bit (256 shade) colour.

--
"She's furniture with a pulse"
Re:dots per inch,, color resolution by ip_fired · 2006-11-26 03:24 · Score: 1

Your reasoning is not correct by a few orders of magnitude. 600 x 8.5 = 5100 600 x 11 = 6600 6600 x 5100 = 33660000 dots per page

--
Don't count your messages before they ACK.
Re:dots per inch,, color resolution by Tim+Browse · 2006-11-26 03:30 · Score: 1

5100 dots per line is 5100 x 11 x 600 dots per page, which is 33,660,000 dots, or about 32Mb when done monochrome, or 8.2Gb when done in only 8 bit (256 shade) colour.
Er...
Re:dots per inch,, color resolution by Haeleth · 2006-11-26 03:37 · Score: 1

each of the 56100 dots on that piece of paper could be in 256 * 65536 = 16777216 states

Um, no. Each dot is going to be either cyan, magenta, yellow, or black. Laser and injket printers produce multicolour output by dithering, not by mixing inks, and the "dpi" rating of the printer refers to the dots used when dithering, not to the equivalent of screen pixels.
Re:dots per inch,, color resolution by Anonymous Coward · 2006-11-26 04:03 · Score: 0

250.000? Why not just say 250, and leave the decimal out? Unless, of course, you mean 250,000.
Re:dots per inch,, color resolution by Metasquares · 2006-11-26 04:18 · Score: 1

"." is used as a thousands-separator in parts of Europe (confusingly, "," is also used as a decimal point!)
Re:dots per inch,, color resolution by imroy · 2006-11-26 06:52 · Score: 2, Interesting

AFAIK, a CMYK printer can still squirt multiple colours onto one point. Since it's pointless to mix with black, and mixing all three gives you a dark brown that's pretty close to black, that leaves eight reliable colours: white (no ink), cyan, magenta, yellow, red, green, blue, black. So that's basically three bits per dot. For a piece of 8.5x11" paper at 1440dpi, I figure only about 69MiB of storage. Not looking very plausible...

Here's my guess: by KokorHekkus · 2006-11-26 03:05 · Score: 1

I think they've introduced some kind of compression into their encoding.

Enter the usual mode of operations for people coming up with new "fantastic" compression algorithms. First the present what they've done. Then they're asked to do a demonstration on actual data supplied by someone else. After that they either disappear straight away or say it has a few glitches to iron out and they will be back soon with the sharp version and disappear.

Re:Here's my guess: by Zeinfeld · 2006-11-26 03:13 · Score: 1

There is an old saw in cryptography that anyone can create an encryption algorithm that they themselves cannot break. The point being that the value lies in creating an encryption algorithm that somebody else cannot break. The same excuse does not exist for compression algorithms. Just choose a random sample of wikipedia and see how well the scheme works.

--
Looking for an Information Security student project suggestion?
Try http://dotcrimeManifesto.com/

Re:RTFA by Lord+Bitman · 2006-11-26 03:09 · Score: 0, Troll

Assuming he encoded it using a bunch of dots, which he never claimed to do. This guy is full of shit for entirely different reasons. Here is the conversation so far, let's see if you can detect where you became a jackass:
[Idiot] I have come up with a novel technique of encoding data which nobody has thought of before, which will get a much higher capacity than the normal and obvious way.
[Jackass] If you used the normal and obvious way of encoding data there's no way you could store that much, so you must be lying.

Oh look, it was the first thing out of your mouth.

He is an idiot because: He didnt even hint at anything he could be doing differently and yet expects people to buy his story just because he said "It's different, so it works. Seriously, I'm Indian so you know I'm smart, I don't need to explain myself at all to be credible."
You are an idiot because: You ignored the one and only thing he /did/ say, which was that he was doing something differently.

--
-- 'The' Lord and Master Bitman On High, Master Of All

I once heard by cucucu · 2006-11-26 03:11 · Score: 1

I once heard that Linus has a the complete source code of the 2.6.0 kernel codified in a tattoo in his left buttock.
To distribute a copy he sits on a copying machine and presses the green button. Otherwise this would violate the GPL (or perhaps no, since he is the copyright owner?).

Stop posting stupid stories!

Re:I once heard by Anonymous Coward · 2006-11-26 07:17 · Score: 0

Who runs the tatoo parlor, SCO?
Re:I once heard by mikiN · 2006-11-27 00:02 · Score: 1

...and he always does a 'make clean' when going to the bathroom...

--
The Hacker's Guide To The Kernel: Don't panic()!

Give me a piece of paper large enough... by Bastard+of+Subhumani · 2006-11-26 03:12 · Score: 0

I think if Archimedes were alive today, he could easily do it - he'd just need a very large sheet of paper.

--
Only three things are certain; death, taxes, and apocryphal quotations - Ben Franklin.

maybe not scam? by goombah99 · 2006-11-26 03:14 · Score: 1

The blogs proclaiming this to be a scam seem pretty feeble reasoning. But it's not too hard to see if the numbers add up themselves. First let's suppose we have a very fine color printer and a very fine color scanner that can print at say 4096 DPI in RGB with 24 bits of color. And we'll consider an 8x11 sheet of paper:
4096*4096*256*256*256*8*11

this is 24 Million Giga Bits, or 3 Million Gigabytes.
Now he's going to have to redundantly encode this in some error correcting way since paper and color resolution is imperfect. So assume he makes it 10,000 fold redundant with his circles and triangles. that's 300GB just like he claimes.

--
Some drink at the fountain of knowledge. Others just gargle.

Re:maybe not scam? by SQL+Error · 2006-11-26 03:23 · Score: 1, Informative

Sorry, but you have combinatorial math in there where it doesn't apply. Yes, 24 bits gives you 256*256*256 colours, but it's still 24 bits.

The appropriate calculation is 4096*4096*24*8*11, so you over-estimated the capacity by a factor of 700,000 or so.
Re:maybe not scam? by Anonymous Coward · 2006-11-26 03:28 · Score: 2, Informative

let's suppose we have a very fine color printer and a very fine color scanner that can print at say 4096 DPI in RGB with 24 bits of color. And we'll consider an 8x11 sheet of paper:
4096*4096*256*256*256*8*11

Please, at least try to understand the technology involved. 4096 dpi means 4096 individual dots. Each of those dots is a single ink colour (typically one of cyan, magenta, yellow, or black), and it is the combination of those dots in dithering patterns which produces multicoloured output.

Your "4096 * 4096 * 256 * 256 * 256" is way off the mark - you are overestimating the capabilities of printers by several orders of magnitude.

FYI, the average "ZOMG 1440 dpi!!!" consumer printer is lucky to reach the equivalent of 100 pixels per inch. Even the commercial machines used in printing glossy magazines are more like 300 pixels per inch -- nowhere near 4096.
Re:maybe not scam? by gutnor · 2006-11-26 03:34 · Score: 1

He claims a density of 2.7 GB per square inch.

Professional printer can realiabily print with 300 dpi resolution. The extra dot in 4096 dpi are used only because more than one dot is required to create 1 visual dot : example you need at least 2 dots to create a pure red dot. Also there is diffusion and splatter, ...

Now with 300 "realiable" dpi, it still needs to encode (2700*1024*1024*8)/(300*300) values ( ie colors ) = 250 000 colors.
I suppose without using ultra calibrated hardware on highly consistent set of paper in a color managed lab, you would have terrible difficulties to distinguish reliabily more than 100 color variation per dot.
Re:maybe not scam? by SQL+Error · 2006-11-26 03:48 · Score: 1

Sorry, you've fallen into the same trap. Not 250,000 colours (which sounds vaguely feasible), but 250,000 bits of colour resolution, which is of course impossible.

As you say, the true colour resolution of good printers is around 300dpi, so the caculation is really off by a factor of 100,000,000 or so.
Re:maybe not scam? by goombah99 · 2006-11-26 03:50 · Score: 1

oopsie you're right! but that's still multi-gigabyte capacity. As for the other comments about it taking more than one dot to make an image pixel, they don't understand that I'm computing the maximum data capacity not the image reproduction capacity. And the calculation was supposed to be the ideal one not one considering imperfections.

Now if we were to assume that he could get an extra factor of 256 somewhere, like for example he had 48 Bit color depth, used 6 color inks, and could get 4 times as many pixels per inch using the best possible printers, then we're closing in on the right value.

And as you know newspapers get things wrong. Maybe that was for a two sided sheet of paper. Maybe he meant bits and they wrote bytes.

The point is the claim is not patently absurd like the blogs claim.

--
Some drink at the fountain of knowledge. Others just gargle.
Re:maybe not scam? by Bender_ · 2006-11-26 04:12 · Score: 1

Yeah, and now lets assume realistic numbers.

Lets assume paper resolution is limited to 600dpi. The color fidelity that can be recognized accurately is probably less than 18bit.

So, somehow we end up with 600*600*18*8*11=570240000bits which is 68 megabytes. Subtracting 50% for error correction (probably still way too conservative) we end up with 35 megabytes of usable data.

Not impressive anymore? Well, add to that that an extremely expensive scanner is required to read the data.
Suddenly USB sticks look like a much cheaper medium...
Re:maybe not scam? by Yartrebo · 2006-11-26 04:33 · Score: 3, Insightful

Can you really print 4,096 dots per linear inch on paper and still be able to read each individual dot? My guess is that beyond 300 dpi or so bleeding becomes a major issue and somewhere beyond that the grain size of paper becomes an issue.

Also, can you really have 256 distinguishable color levels on a piece of paper - especially considering that paper is not a uniform color on the micro-scale (it's made up of short strands of cellulose)?

Even if all these problems can be overcome, there is the limiting factor of diffraction, which will limit any optical system (paper or otherwise) to a data density of about 1/wavelength^2, which is roughly the density of a DVD.
Re:maybe not scam? by gutnor · 2006-11-26 06:30 · Score: 1

LOL indeed, I was too quick to jump on this one ( and I did it 2 times in this thread ... )
Re:maybe not scam? by reg106 · 2006-11-26 10:52 · Score: 1

You're still misunderstanding printing technologies. Most of the available technologies (ink jet and related, xerography, thermal transfer, offset lithography) actually print binary pixels. Either a pixel is printed or it is not printed. So you only have as many different color pixels as you have inks (or toners). Digital halftoning (look it up, it's cool) is used to group many pixels together into a much larger dot and color them appropriately to simulate a desired color tone. The limited resolution of the human eye filters out the variation at the pixel level to leave you with the simulated tone. The resolution of the dots is often around 300dpi, even though the pixel resolution (which is usually used in advertising) may be an order of magnitude higher. So your 48 bit color depth with 6 inks really only gives you 6 bits per printed pixel.
Re:maybe not scam? by goombah99 · 2006-11-26 15:38 · Score: 1

I'd like to get myself straight on this. But it's not clear that halftoning is correct. The three color print heads are distinct heads. I don't see any reason why the dots can't be superimposed. Can you please elaborate.

--
Some drink at the fountain of knowledge. Others just gargle.
Re:maybe not scam? by reg106 · 2006-11-27 04:26 · Score: 1

Perhaps it is best to think of printing a monochrome image. You do not have "continuous" (i.e. a large number of discrete levels) control over the pixel intensity, unlike a computer monitor where you can directly control intensity of a pixel. Each pixel is binary: either pigment is put down or it is not put down. In order to simulate continuous tones you use digital halftoning, as illustrated. You can think of a 256 level halftone pattern as an 8x8 block of pixels that is filled in algorithmically to give the appropriate level. Halftoning is further complicated by the fact that the pixel size is usually much larger than the addressibility. (A pixel from a 1000dpi printer will generally be much larger than .001 inches. i.e. the pixels overlap considerably.)

You can indeed overlap the three colors, but even if your registration is perfect, you will have at most three bits per pixel when printing with three colors (each color is either present or not). 24 bit color can only be simulated through half-toning, which is essentially overlaying three monochrome half-toned images.

this could be the digital archiving solution by fikx · 2006-11-26 03:17 · Score: 1

Since most forms of digital media are trumped by paper in terms of how long it lasts, this seems like a great way to archive digital data for the future. This would solve the problem of all our digital information disaapearing from one generation to the next. It's still amazing that paper records will outlast digital ones today....so I'm not sure if this is ironic or just practical...

--
AB HOC POSSUM VIDERE DOMUM TUUM

come on ! by thripper · 2006-11-26 03:19 · Score: 0

FUD

Ultimate compression? by Flain · 2006-11-26 03:19 · Score: 5, Insightful

This story is a hoax.

Lets just imagine for one second that its true.

Instead of printing this data onto paper, why not just store it loslessly in a bitmap file? After all, printers only have a certain DPI and a certain amount of colours. If you could take this bitmap file and somehow extract 256GB of data from it, that sure would be some cool magic.

Re:Ultimate compression? by Loconut1389 · 2006-11-26 03:27 · Score: 1

bingo- there's the real proof that this is a scam. most of the other arguments don't take into account a sort of compression that could be possible by using the shapes and whatnot to encode the data- so think of it on a bitmap- he claims you can store 256gb of data in a bitmap of only a few mb?

I wanted to be the first to post this, but you beat me to it, and I wanted to make sure someone saw your post which was at zero for some reason.
Re:Ultimate compression? by Anonymous Coward · 2006-11-26 04:53 · Score: 0

Simple and elegant proof! Please tell me you are a Mathematician. Otherwise it is a waste of your talent. I had just finished working out a proof in my head for why shifting to a high base won't increase information density and then I saw your proof. Brilliant!

Awww by McNihil · 2006-11-26 03:20 · Score: 1

I was SO hoping that fax flier senders would start to fax me some pr0n movies instead.

Simpler way to debunk the claim by redblue · 2006-11-26 03:21 · Score: 1

I read some elaborate explanations for why this is not possible. There is a simpler way of showing that this is not possible.
Assumptions:
1 Printing is done using a 24-bit continuous color tone reproduction process
2 Resolution is high. Let's say 2400 dpi.
3 Paper is 12"x24"

This means there are 12x24x2400x2400 = 1658880000 ~ 1.7E9 pixels. Printed at 24-bit resolution. That's about 5GB. In other word, this person is claiming a way to compress arbitrary 250GB into 5GB, losslessly.
That is impossible theoretically. So the claim is bogus.

Re:Simpler way to debunk the claim by Anonymous Coward · 2006-11-26 03:33 · Score: 0

First of all, I'm not saying this _isn't_ bogus (I think it is).

But no one has been thinking about this the one way it _could_ be possible. What if each pixel doesn't just represent a single piece of information? I mean the whole point is that a single pixel might represent one piece of information (by being part of a curve) and _simultaneously_ another piece of information by being part of a different triangle.

Who's to say he isn't compressing complex patterns of data using a small figure? There is nothing say a single pixel can't represent a large number of pixels (see huffman encoding for example).

Again, I'm not saying it's real, but come on people, if 1 bit input === 1 bit output we wouldn't ZIP, GZ, BZ2, etc.
Re:Simpler way to debunk the claim by jafiwam · 2006-11-26 03:40 · Score: 2, Interesting

Unless he is doing something really novel, like relying on computational power elsewhere to do the compression.

I have seen articles about algorithms that allow you to calculate the value of any decimal place in Pi.

Plus, additional ones that allow you to prove (in a mathematical way) that any given string of characters you care to wish for are present in some location somewhere in the Pi decimal string.

SO the first point allows you to decompress knowing the beginning and ending values, and the second point allows you to pick some pretty long strings.

That would not really be compression, but more of a lookup table or code that allows anybody to have the table (or recompute it as necessary).

I agree, completely bogus. For one, you would need to pay lots of attention to the quality of the paper and resolution of the printer to the point it would BE the point of this method of data storage. Paper, is a piss poor medium for really precise images. Plastic or foil would be much better. But this guy mentions that only in passing.

I recall lots of talk of holographic storage and how a cube of some size stores X. So far real practical data storage has not come of that either.

Send your investment checks elsewhere, in other words.
Re:Simpler way to debunk the claim by SQL+Error · 2006-11-26 04:01 · Score: 1

Unless he is doing something really novel, like relying on computational power elsewhere to do the compression.

I have seen articles about algorithms that allow you to calculate the value of any decimal place in Pi.

Plus, additional ones that allow you to prove (in a mathematical way) that any given string of characters you care to wish for are present in some location somewhere in the Pi decimal string.

SO the first point allows you to decompress knowing the beginning and ending values, and the second point allows you to pick some pretty long strings.

This works really well, so long as the data you want to store happens to be the value of Pi.

Unfortunately, if you want to store anything else, you get no saving; the number of bits required to store the starting and ending positions in the sequence of Pi works out to be at least as much as the number of bits required to store your original data.

Try it. Let's say you want to store the digit 0. Now, let's find the position of the first instance of 0 in the decimal expansion of Pi... it's the 37th digit after the decimal point. So instead of just storing 0, we have to store 37.

The adage about If it sounds too good to be true... holds double for compression algorithms.
Re:Simpler way to debunk the claim by Anonymous Coward · 2006-11-26 06:05 · Score: 0

If this is some kind of geometrical compression thing, then why do we need the paper support? Just rasterize the geometric shapes inside the computer memory and store the result as a file. Then you have a stand-alone compressor, ready to compete with zip, gzip, bzip2, rar, 7zip etc.

BTW when you say things like "I'm not saying this _isn't_ bogus" or "Again, I'm not saying it's real" you may think that you sound like a nice open-minded person, but this is only true with somewhat believable claims. When it's such an obviously bogus claim you sound really stupid for saying that.
Re:Simpler way to debunk the claim by sbaker · 2006-11-26 06:32 · Score: 1

No - you can't win by arguing for compression.

Suppose there was a scheme (however complex) that could reliably take absolutely any arbitary data and compress it without loss by just one bit - and reconstitute it reliably. So you'd throw a million bits of arbitary data at it - and it would give you back 999,999 bits. This sounds (on the face of it) entirely reasonable - but it's not.

If you had such a program, you could take the compressed data stream out of it and put it back into the compression program and shave another bit off of it. Take your million bits of data - and shove it through the program 999,998 times and you have one bit of (very, very, compressed) data! The trouble is that how can the decompressor possibly know which of the countless amounts of possible inputs to make from a single 1 or 0?

So no - there is no concievable way that you can RELIABLY compress arbitary data losslessly. You can reliably compress data with losses (eg JPEG) or you can come up with schemes (eg the kinds of algorithms that ZIP and gzip use) that compress most kinds of data losslessly - but occasionally make the data a little bit bigger. You can also compress certain kinds of data (eg English text) reliably and losslessly - but only because it's not ARBITARY data.

So no - you can't invoke a magical compression algorithm to let this guy off the hook.

--
www.sjbaker.org
Re:Simpler way to debunk the claim by AK+Marc · 2006-11-26 07:14 · Score: 2, Interesting

In other word, this person is claiming a way to compress arbitrary 250GB into 5GB, losslessly. That is impossible theoretically.

No, the person made no such claims. You have created a strawman. In a computer, when you read a "1" what is the chance that the following bit will be a "1"? I'm not sure what the real probablility is, but I'm guessing somewhere around 50%. Why? Because the bits are not dependent on the previous bits. However, this guy is using shapes and patterns. Perhaps there is some manner of differential information stored in there. The pixel below is blue and the pixel above is red? Then add green to the blue pixel. I'm not saying that I think he is correct. I'm saying let him prove it before all the people here say it is impossible because they can't think of how it works.

--
Learn to love Alaska
Re:Simpler way to debunk the claim by Anonymous Coward · 2006-11-26 09:01 · Score: 0

That is impossible theoretically. So the claim is bogus.

Can you show me a proof that says that it's impossible to get such levels of compression, losslessly? No, you can't, because there aren't any.

There are interesting, but intensely computationally reliant, ways which might be used to obtain obscene compression levels. Fractal compression and suchlike. But who wants to wait three days to decompress a few gigs?
Re:Simpler way to debunk the claim by E++99 · 2006-11-26 15:40 · Score: 1

I'm saying let him prove it before all the people here say it is impossible because they can't think of how it works.

The arguements people are giving are provably refutations of what is claimed. If he's using colored shapes for storage, fine, but the MAXIMAL amount of information that could be gotten from the paper is if each individual dot could be independently written and read. That's not possible, but if it were, that would not be anywhere close to 256GB. This guy's technique can store necessarily significantly less information that that hypothetical. The former technique is a small subset of the latter.

Somewhere a GB was probably swapped for a MB, or he did some bad math, or something like that.
Re:Simpler way to debunk the claim by SnowZero · 2006-11-26 22:20 · Score: 1

Yes, you can prove it. This comment explains it quite well. Universal compression cannot exist.

Neat idea... by Anonymous Coward · 2006-11-26 03:23 · Score: 0

Even if this particular "implementation" is a scam, so what? It's a neat idea. Let's say we could feasibly get, oh, I don't know... 10MB on a standard 8.5x11 sheet of paper. That's plenty for an MP3 of a new "hot" single. I know everyone is looking at this as a way to store a bunch of information cheaply, why not use it to distribute a little bit of information cheaply? Flipping through a magazine and come across a half-page ad for the pop-star-du-jour with a half-page "scan-this-to-hear-the-single"? Could be an interesting alternative distribution method... or "scan-this-to-see-the-latest-game-trailer"...

Just sayin' is all...

Re:Neat idea... by sbaker · 2006-11-26 06:13 · Score: 1

There are 2D barcode technologies out there that can store data on paper like that - including a few that use colour.

Picking one at random from the Wikipedia article on 'Barcode': A company called 'MarkAny' (for example) has a 440bytes/cm2 black and white barcode that's actually in common use. That would get you around 200Kbytes on a sheet of A4...it's not 10Mbytes - but enough for an MP3 track.

10Mbytes would be tough - that's 117Kbytes per square inch - which would need (say) 256 unique colours and 350 dots per inch - reliably through both printer and scanner. I wouldn't say it was impossible - but it's pushing the envelope. You could use error-correction techniques to solve some of the ikky problems you'd end up with - but that'll eat into your data capacity.

--
www.sjbaker.org

Re:RTFA by fatphil · 2006-11-26 03:28 · Score: 3, Insightful

The entropy rate of arbitrary pixel values is higher than the entropy rate of related pixel values (such as shapes).

Therefore the obvious way gives a better information density.
Therefore comparing against the obvious way is *not* necessarily the behaviour of a jackass, but quite possibly the behaviour of someone who has a grasp of Information Theory.

Time for everyone to borrow Cover and Thomas from their local library, methinks.

FatPhil

--
Also FatPhil on SoylentNews, id 863

Not Dots by Sinbios · 2006-11-26 03:30 · Score: 1, Insightful

All the "proofs" in the comments that show this is a scam so far calculates how many dots can be printed/read from a piece of paper, and then corresponds each dot to a bit of data. Well, guess what. The whole point of this thing is he's NOT USING DOTS. This may very well be bullshit, but the "proofs" against it are meaningless.

--
Anyone can "stand up for what they believe", but it takes a very brave individual to change what they believe. - Loundry

Re:Not Dots by joshtimmons · 2006-11-26 03:57 · Score: 1

The proofs are not meaningless. Remember that all of geometric shapes that they are using must be rendered to dots on the printer because that is what the printer prints and what the scanner scans. Calculating the number of dots available is a straightforward way to approach the problem because the number of dots that can be printed directly corresponds to the number of distinct states that the paper can have. It's a basic fact that you can't store more information in a medium than can be represented by the number of distinct states that the medium has.
Re:Not Dots by Cambo67 · 2006-11-26 04:06 · Score: 1

The point everyone is making is that a dot is the smallest possible 'shape' that a printer can print. Any other shape is going to have to be made of 2 or more dots. Unless they have developed some way of creating 1-dimensional shapes to stand upright on the paper, which can be printed onto the paper _and_ read by a scanner, there is no way of getting data more dense than dots.
Re:Not Dots by Dun+Malg · 2006-11-26 04:29 · Score: 3, Informative

All the "proofs" in the comments that show this is a scam so far calculates how many dots can be printed/read from a piece of paper, and then corresponds each dot to a bit of data. Well, guess what. The whole point of this thing is he's NOT USING DOTS. This may very well be bullshit, but the "proofs" against it are meaningless.No, you simply don't understand very basic information theory. Printers print with dots. Any shapes you make on the paper are made up of dots. A 3x3 grid of dots (9 bits) can be marked in 512 different combinations, only 10 of which make a squares(2), triangles(4), or lines(4) that can be easily differentiated. Using shapes does not increase the resolution, it limits it. You cannot represent 8 arbitrarily chosen bits of information if you've budgeted 7 bits of storage. At 1440dpi, 8 bit color, even assuming perfect readability, you cannot record more than 1.4GB of information, no matter what "shapes" you arrange for the dots to make.

--
If a job's not worth doing, it's not worth doing right.
Re:Not Dots by Anonymous Coward · 2006-11-26 04:38 · Score: 0

The whole point of this thing is he's NOT USING DOTS.

Excellent point! He is using shapes. And although shapes are made up of dots (as others have noted), they contain more information than the dots by themselves.

You can arrange three dots in a pattern, and then you get not only the information of those three dots, but also the arrangement of the dots, which can convey information.

The trick is the "rainbow" pattern. As we all know, there is a pot of gold at the end of the rainbow. By leveraging these rainbow patterns, the inventor has, in effect, created several billion pots of gold. This gold can be hammered into sheets, and then be die cut to wrap delicious chocolate bars.

So, in the end, right, not dots, but chocolate bars. Yum!

If you like chocolate, please invest in his invention by donating to my PayPal account!
Re:Not Dots by jesdynf · 2006-11-26 04:41 · Score: 1

You're right, to a point -- but if he's not using pixels, then what we're actually looking at is a Novel Compression Scheme.

This Novel Compression Scheme can apparently compress data equivalent to (I'm taking an answer from above, this isn't my math) 30000 dpi and render it in the space of 300 dpi. Were this true, then they could take a TIFF of that same image and still enjoy luxurious 100x compression without ever bothering to print something out.

Another way to say "Novel Compression Scheme" is "scam", and unless your name is Diffie or Helman it's generally gonna stick. (There are advances in the field, but far as I know they're incremental. Someone who knows more than I could better address this, though.)

--
Yahoo! Pipes are awesome. How awesome? http://pipes.yahoo.com/jesdynf/slashdot
Re:Not Dots by Anonymous Coward · 2006-11-26 05:30 · Score: 0

You're an idiot and have no idea how information theory works.
Re:Not Dots by An+Onerous+Coward · 2006-11-26 06:46 · Score: 1

If he is using shapes, then either the shapes themselves are single-pixel units (unlikely, requires specialized hardware), or the shapes are made of multiple dots. In the latter case, this means a loss of entropy (since the shapes add regularities to the patterns of dots), which is guaranteed to cause a corresponding loss of information-carrying capacity.

I see nothing wrong with the approach these proofs are taking.

--
You want the truthiness? You can't handle the truthiness!
Re:Not Dots by joto · 2006-11-26 08:41 · Score: 1

At 1440dpi, 8 bit color, even assuming perfect readability, you cannot record more than 1.4GB of information, no matter what "shapes" you arrange for the dots to make.

How about if I had a really large piece of paper?
(And seriously, your math is more than a little way off. With 1440dpi (odd number by the way, haven't heard about a printer with that resolution yet), and 8 bit color, assuming perfect readability), you could store 0.0019 GB per inch. A normal "letter"-sized sheet of paper is about 93 square inches, giving you a storage capacity of 0.18 GB. Surprisingly, there is a standard papersize that gives you about 1.4 GB storage, it's A1, which is as large as 8 A4 pages. On the other hand, if you intended to use two-sided printing, you could do with the A2 format).
Re:Not Dots by j00r0m4nc3r · 2006-11-26 11:23 · Score: 1

Not all printers print with dots.
Re:Not Dots by Grail · 2006-11-26 13:22 · Score: 1

Show me how to print a triangular dot from one microscopic droplet of ink, and you'll have found a way to print a smaller round dot.

Dots are one colour - unless you have a printer that can change the colour of the ink it's printing with on the fly, all the "colours" you see in matrix printing come from a dithered pattern of various sizes of dots of 3-7 different coloured inks.

why the paper? by alexx · 2006-11-26 03:30 · Score: 1

OK, so let's assume the printer prints dots of different color. If there are x colors each dot would contain log2(x) bits of information.
Now, the clever encoding of information into shapes would mean that a shape represents Y bits of information using Z dots.
If Y/Z/log2(x) > 1, we would magically be able to have more information on the sheet of paper than we would arrive at doing the DPI*paper area calculations.
On the other hand, wouldn't we also be able use the same magical trick also without the paper? I.e. digitally code all information not as bits but as the digital representation of the shapes? (i.e. achiving the Y/(Z*log2(x)) compression ratio without needing new printer/scanner devices)

Re:why the paper? by Dcnjoe60 · 2006-11-26 03:43 · Score: 1

On the other hand, wouldn't we also be able use the same magical trick also without the paper? I.e. digitally code all information not as bits but as the digital representation of the shapes? (i.e. achiving the Y/(Z*log2(x)) compression ratio without needing new printer/scanner devices)

Except that once you go digital, all of that information needs to be transfered into binary 0s and 1s. Now, if they every come up with computers that use something other than binary, it might work.
Re:why the paper? by alexx · 2006-11-26 04:05 · Score: 1

What I meant was: If a shape can contain more information bits than bits necessary to represent it then we could of course use the same trick digitally by coding Y bits of information but as Z number of log2(x)-tuples of 0s and 1s. That is, instead of having Y 0s and 1s we have Z*log2(x) 0s and 1s.
Re:why the paper? by Dcnjoe60 · 2006-11-26 07:03 · Score: 1

But the storage, in the article, comes from overlaying the shapes and colors, thus creating new shapes and colors. Seems pretty hard to do in RAM or on magnetic media which only has two states.

Bullshit ... here's why by Dr.+Zowie · 2006-11-26 03:30 · Score: 1

I don't know the exact dimensions of A4, but it's similar to 8.5"x11" (at . Suppose each spot of color stores 24 bits of information -- this is really a stretch, for reasons I'll get to below. Normal Xerox paper supports something like 300 dpi of resolution (that is the scale of the fibers of the paper). Then you can store at most 3*8.5*11*300*300 bytes of information on a piece of paper with color coding. That is just 0.025 GB.

Now, one might argue that nice photo paper could do better -- but it couldn't do better than about 1200x1200 -- the resolution of inkjet photopaper. That would yield a factor-of-16 increase, for 0.4 GB.

In practice, you can't do that well at all, because each location on the paper doesn't store 24 bits, it generally only stores one bit for each type of ink you put down. With an RGB ink system you get only 3 bits per location, even with ideal placement. That yields 0.003 GB on normal paper or 0.05 GB on photo paper.

The only way to do better is to increase the number of symbols on the paper (writing smaller) or to increase the information content of each symbol (using more subtle gradations of color). If it were easy, we'd all be doing it already.

Re:Bullshit ... here's why by Anonymous Coward · 2006-11-26 07:43 · Score: 0

The article claims he did a 45s video clip, not 450gb. Not sure why the media keeps getting this wrong.

Re:RTFA by SQL+Error · 2006-11-26 03:31 · Score: 4, Informative

You are an idiot because: You ignored the one and only thing he /did/ say, which was that he was doing something differently.

Bzzt.

Encoding data using dots is the most efficient method possible. He has to print the image somehow, and scan it back in again. No combination of triangles and circles can circumvent the resolution limit, which is what is being calculated here.

By showing that the claim exceeds all practical limits of optical resolution (and probably the absolute physical limits), we show that what we have is just another magical compression scam.

He says that he's "doing something differently"; we've proved that what he claims to be doing is impossible. End of story.

shurely.. by Anonymous Coward · 2006-11-26 03:31 · Score: 0

the claim is that with many overlapping gemoetrical objects of different colors a single pixel can contribute to a number of different bit-streams. so all this talk of dpi is meaningless in the context of the claim - which is that an individual pixel can have a value of more than one bit if it participates in enough objects through layering of differently coloured shapes.

Re:RTFA by thelamecamel · 2006-11-26 03:31 · Score: 1

Ultimately, the triangles on the paper are really just printed as a bunch of dots. Printing purely a bunch of dots would give the highest data capacity, but would also be highly unreliable - printing triangles give a kind of error correction. Printing triangles may be a novel technique, and may be more useful than all previous techniques, but its capacity is bounded by the capacity of the page when printed with dots.

Good flame, but bad argument. by MrBoombasticfantasti · 2006-11-26 03:32 · Score: 1

Your parent poster made clear that the *theoretical* limits on pixel density prevent *any* encoding from storing this much data on a piece of paper. To me, that is a sound rebuttal of any claims of "new encoding" and such.

No matter *how* he does things differently, it is just not possible to get the required density.

But hey, you got to show your ignorance, that's not bad, eh?

--
!ERR: Signature not found.

Re:This is a lie by Panaqqa · 2006-11-26 03:35 · Score: 4, Informative

Okay, let's look at some math. First, calculate the number of bits that must be stored to reliably archive 256GB:

256*1024*1024*1024*8*(10/8) = 2.749 * 10^12 [allowing for 25% extra - error detection/correction]

Now, the area of a sheet of paper in mm^2:

210 mm * 297 mm = 6.237 * 10^4

Let's make an assumption: it would be tough for a scanner to correctly identify more than 256 colors (blues especially are problematic). So, going by a pixel based method, we can store 8 bits per pixel, so the number of pixels needed is:

2.749 * 10^12 / 8 = 3.436 * 10^11

Pixels per mm^2 will therefore be:

3.436 * 10^11 / 6.237 * 10^4 = 5.509 * 10^6

Taking the square root of this figure and inverting will give us the size of one side of a pixel in mm, so:

1 / (5.509 * 10^6)^.5 = 4.260 * 10^-4 mm = .426 micro meters = 426 nm

This is smaller than the wavelengths of some frequencies of visible light, therefore a large portion of the spectrum is gone in terms of colors that can be used. Eliminate these colors and you increase density yet again, requiring you eliminate more colours. By the time you get to monochromatic (black white), which you will, the size is smaller than the wavelength of ANY visible light.

So, for this storage density, either you are scanning in ultraviolet light (and printing using an appropriate ink) to get a small enough wavelength, or you have thrown out light all together and you are using an electron microscope as your scanner. (Note - ever see electron microscope images in color? Can't exist unless colorized).

Fairly clever hoax though - if they had stuck with, say, 16GB then it would not have edged into the impossible.

Scam? by raeldc · 2006-11-26 03:36 · Score: 0

http://itsoup.blogspot.com/2006/11/scam-of-indian- student-developing.html

Sounds like snake oil by Anonymous Coward · 2006-11-26 03:38 · Score: 0

I'm sorry but that would mean he'd broken Shannon's law which I just can't believe. The article reads like a press release: all sizzle and no steak.

HA! by Anonymous Coward · 2006-11-26 03:42 · Score: 0

They all laughed at me when I said paper tape would make a comeback... WELL WHO'S LAUGHING NOW!?

Isn't this a step backwards? by gcranston · 2006-11-26 03:44 · Score: 1

So much for the paperless office.

Arab News? by Anonymous Coward · 2006-11-26 03:47 · Score: 0

Now that sounds like an unbiased reliable source of IT information. Gee, I wonder what they think about Israeli's?

Extraordinary claims by NorbrookC · 2006-11-26 03:49 · Score: 1

Extraordinary claims require extraordinary proof. Carl Sagan

Someone should send this quote to all the people who bought into this claim, and reported it. Someone makes a claim, does a "demonstration", and it gets reported as an advance. Didn't anyone in the media (including Techworld), bother to do some background checks? Standard things like checking the credentials of the person making the claim, checking with experts in the field? Apparently not. That would be standard journalism practice, and we know how discredited that is these days!

As has been pointed out here and on other forums, it's a scam. Is it possible to store 256 GB, etc. on paper? Sure it is. There was a time when computer data was stored on paper - punch cards and paper tape systems. Is it likely to be done on the size of paper mentioned with anything resembling today's best technology? No.

An extraordinary claim has been made - and there is no ordinary proof, let alone extraordinary proof. But, I do have a bridge to sell, and I think I see some potential buyers!

while unpractical, theoretically possible by e**(i+pi)-1 · 2006-11-26 03:52 · Score: 0

By adding some good error correcting codes and using the best technology available, it should be possible to store and retrieve a DVD on paper by printing and scanning.

Upper bound (there were some computations posted below, which are flawed): Assuming A4 which is 210x297 mmm = 8.3 x 11.7 and having 600 DPI, we have 8.3*11.7*600*600 34 Mega pixels Assuming each pixel can store a RGB color from 255*255*255 colors, there are 8.3*11.7*600^2*255^3/10^12 = 579 Gigabites.

If we assume a color printer can print in 600 DPI precisely and a color scanner can scan 600 DPI reliably and reconstruct each of the RGB color values up to 1/256 accuracy then 579 Gigs are possible.

It is upper bound on the information a paper can carry assuming the DPI and color resolution. It is not possible to add more information by additional geometric encoding.

As others have pointed out, printing at a 600 DPI resolution with a precise color value at each dot is a main obstacle to make this work. The ink is assumed not to diffuse and influence the color dots nearby. It would need a really good printer. The scanner needs to scan at 600 DPI and get the color value precicely. This would not work for the junk scanners I have but this should be technically possible with professional scanners.

Re:while unpractical, theoretically possible by e**(i+pi)-1 · 2006-11-26 03:59 · Score: 1

P.S. The calculation has to be divided by 8 to get Bytes from Bits.
So, the upper bound is 72.5 Gig not 579 Gig
Re:while unpractical, theoretically possible by SQL+Error · 2006-11-26 04:09 · Score: 1

Sorry, like the other poster, you have used combinatorial maths where it doesn't apply.

Replace the 255^3 with 24 to get the correct number of bits. That reduces the result by a factor of 700,000.

Oh, and 10^12 is tera, not giga.

So no, not theoretically possible.

You can't scan smaller than light's wavelength by Panaqqa · 2006-11-26 03:54 · Score: 1

Which is what this would have to do - scan markings on the paper that are smaller than the wavelength of visible light. Why do you think the max an optical microscope can magnify is 2,000x, and beyond that you need electron microscopes?

Re:You can't scan smaller than light's wavelength by Anonymous Coward · 2006-11-26 07:22 · Score: 0

Apparently you can.

Re:This is a lie by Anonymous Coward · 2006-11-26 03:55 · Score: 0

A good scanner can do 600 dpi.

For a typical 8.5x11 sheet of paper, in black and white, that's 600^2*8.5*11 = 4207500 bits (4.01 MB)

If you used three colors, you could get 12MB.

That is nowhere close to 256GB. 256GB would require something close to the density of hd-dvd or blue-ray, for that you need a very smooth surface, paper won't work. You can do that with metal or plastic, but not paper.

I am confused by OricAtmos48K · 2006-11-26 03:55 · Score: 1

So how many Libraries of Congress amount of information would the Library of Congress contain if each and every one of the pages of books in the library were like this ?

Re:I am confused by shazbotus · 2006-11-26 04:16 · Score: 1

One...The Library of Congress will always hold One Library of Congress amount of information.

Flaw in your math by benhocking · 2006-11-26 04:01 · Score: 1

You seem to have confused numbers with the bits required to store them.

So, 22,747,732,032 bits per inch^2 are needed. Now, even a lowly 300 dpi scanner would only need to differentiate 252,752 colours, which is achievable with 6 bits each for R,G and B.

A 300 dpi scanner with 6 bits each for R,G,B can only encode 300*300*(6+6+6)=1,620,000 bits per inch. At 600 dpi, you get 4 times as many bits. Also, keep in mind that (a) not all paper is created equally "white" which will diminish the required accuracy for storing data, and (b) even if you store this paper as carefully as we in the US have stored the Declaration of Independence, the color quality of the paper will degrade slightly. Sure, acid free and all that, but there will be some color degradation, so I think it's optimistic to even think you'll be able to store 252,752 colors for any significant amount of time. Imagine this: how many bits get altered by a single fingerprint?

--
Ben Hocking
Need a professional organizer?

Re:Flaw in your math by Anonymous Coward · 2006-11-26 05:39 · Score: 0

The amount of people on Slashdot who think they can do the math (since they post) but get the math wrong lead me to believe that instead of a scam this is probably what happened to this indian student: he came up with some idea and his back-of-the-envelope calculation was wrong -- off by many orders of magnitude -- but he genuinely believed he did it right.
Re:Flaw in your math by sbaker · 2006-11-26 05:57 · Score: 1

Whilst it would be nice to assume this guy made an honest mistake - that is definitely not the case. The original article in the "Arab News" web site says that the journalist watched him scan in an play a feature length movie from a single sheet of A4.

Since we know that's impossible - even in theory and with perfect equipment, we know for 100% sure that this isn't what happened.

Either the journalist is lying - or (MUCH more likely IMHO), he scanned the paper into one file and replayed the movie from a different file. That would be childishly easy to do.

But it's not something he could have done by accident. This guy is a scammer - pure and simple.

--
www.sjbaker.org

Maybe, just maybe, it is possible? by The+Mysterious+X · 2006-11-26 04:01 · Score: 0

so lets see. If he uses 3 shapes, and 5 colours, that's 13 possibilities per "dot". For example, Circle, Square, Triangle, Red, Blue, Green, Black, White. All he has to do, s find another 3 permutations, and he has 16 options per pixel. For those of you out there that didn't notice, white can't have a shape, as the paper is white. If the data to be stored were to be translated into hexidecimal, you can store 1 hexidecimal digit per "dot". 1 Hex digit is equivalent to a nibble, so for every 2 dots you have encoded one byte. 256 gigabytes is 274877906944 bytes. Now, most printers can easily do 1200 dpi. This is linear DPI though, so they can actually do 1440000 dots per square inch. Now, if we assume that we would need at least 9 dots to do all three shapes: (see the full comment on my blog for the shapes, I had to take them out to avoid the lameness filter, address is http://www.omega.org.uk/) As shown above, that reduces the shape density to 360000 shapes per square inch, or 180000 bits per square inch. A4 paper which is almost foolscap has an area of 96.6763 square inches, so we can store, using my methods, 17401734 bytes, so 16 megabytes, much higher than people here so far have been claiming, and using very, very conservative colour choices and resolutions. While this is an order of magnitude away from the stated values, this could easily be much higher. I have assumed a very low resolution (laser printers can easily get up to 2000 DPI these days), no compression, and a very restricted subset of values. I should think it would be easily possible to use 8 bit colour, with no risk of data loss. Add to this 8 to 14 conversion, or parity values, to ensure data integrity, and I think that what this guy is claiming is within the realms of possibility. FYI, using 8 bit colour which yields 256 possibilities 3x256-3 = 765 765 is almost 3 bytes per dot using 2000 DPI, 2000x2000 is 4000000 divided by 9 is 444444 shapes per square inch. 444444x3 makes 1333332 bytes per square inch. so, 128901604 bytes per sheet of A4 which is 128 megabytes per sheet of A4. So as you can see, it's not a case of "is it possible to fit that much data", it's just a case of howdetailed it has to be; add another shape, and the desity per dot goes up massively, (see the full comment on my blog for the shapes, I had to take them out to avoid the lameness filter, address is http://www.omega.org.uk/) maybe? Is it possible to fit 256 gigs of data on a sheet of A4 with ink?: Yes. Is it possible to retrieve it?: Possibly, depends how small you go. If you want precendent, think how small the pits are on a Blu Ray disk are; if we can retrieve a single bit from something that small, can we can surely retrieve something a bit bigger and a bit more detailed. My maths isn't to strong, so if I've made a mistake, feel free to correct me. Ah feck, lameness filter encountered. Looks like I'm going to have to type a bit more to balance it out. Sorry to ramble on, but most of the junk characters aren't junk characters. You'd think that once you get up to good karma that the lameness filter would give you a little leway, but apparently not. I wonder if this is enough yet? Apparently not, well CBA to type any more, better try and remove a few characters. Taken out most mathmatical symbols, does that do the trick?

Re:Maybe, just maybe, it is possible? by sbaker · 2006-11-26 06:04 · Score: 1

so 16 megabytes, much higher than people here so far have been claiming, and using very, very conservative colour choices and resolutions. While this is an order of magnitude away from the stated values

I have no problem in believing 16Mbytes - but that's not "an order of magnitude away" - he's claiming 250Gbytes. That's FIVE orders of magnitude.

You end up needing shapes that are of the order of 1/10000'th of an inch across (much smaller than the strands of material that make up the paper) and colour precision of around 24 bits - which is vastly better than any existing printer can manage.

Then you need a small scanner that can scan at similar resolutions...also VASTLY beyond any reasonable technology.

It's a scam - the end.

--
www.sjbaker.org

Yup by Anonymous Coward · 2006-11-26 04:02 · Score: 0

Shannon's entropy goes hand-in-hand with it too.

Yeah, I should have used preview, so sue me :) by The+Mysterious+X · 2006-11-26 04:03 · Score: 1

Could have sworn I set my default posting style to plaintext, but meh. And in my defense I was battling the lameness filter to get this posted in the first place.

so lets see.

If he uses 3 shapes, and 5 colours, that's 13 possibilities per "dot". For example, Circle, Square, Triangle, Red, Blue, Green, Black, White. All he has to do, s find another 3 permutations, and he has 16 options per pixel. For those of you out there that didn't notice, white can't have a shape, as the paper is white.

If the data to be stored were to be translated into hexidecimal, you can store 1 hexidecimal digit per "dot".

1 Hex digit is equivalent to a nibble, so for every 2 dots you have encoded one byte.

256 gigabytes is 274877906944 bytes. Now, most printers can easily do 1200 dpi. This is linear DPI though, so they can actually do 1440000 dots per square inch. Now, if we assume that we would need at least 9 dots to do all three shapes:

(see the full comment on my blog for the shapes, I had to take them out to avoid the lameness filter, address is http://www.omega.org.uk/)

As shown above, that reduces the shape density to 360000 shapes per square inch, or 180000 bits per square inch.

A4 paper which is almost foolscap has an area of 96.6763 square inches, so we can store, using my methods, 17401734 bytes, so 16 megabytes, much higher than people here so far have been claiming, and using very, very conservative colour choices and resolutions.

While this is an order of magnitude away from the stated values, this could easily be much higher.

I have assumed a very low resolution (laser printers can easily get up to 2000 DPI these days), no compression, and a very restricted subset of values. I should think it would be easily possible to use 8 bit colour, with no risk of data loss.

Add to this 8 to 14 conversion, or parity values, to ensure data integrity, and I think that what this guy is claiming is within the realms of possibility.

FYI, using 8 bit colour which yields 256 possibilities

3x256-3 = 765
765 is almost 3 bytes per dot
using 2000 DPI,
2000x2000 is 4000000 divided by 9 is 444444 shapes per square inch.

444444x3 makes 1333332 bytes per square inch.
so, 128901604 bytes per sheet of A4
which is 128 megabytes per sheet of A4.

So as you can see, it's not a case of "is it possible to fit that much data", it's just a case of howdetailed it has to be; add another shape, and the desity per dot goes up massively,

(see the full comment on my blog for the shapes, I had to take them out to avoid the lameness filter, address is http://www.omega.org.uk/)

maybe?

Is it possible to fit 256 gigs of data on a sheet of A4 with ink?: Yes.
Is it possible to retrieve it?: Possibly, depends how small you go.

If you want precendent, think how small the pits are on a Blu Ray disk are; if we can retrieve a single bit from something that small, can we can surely retrieve something a bit bigger and a bit more detailed.

My maths isn't too strong, so if I've made a mistake, feel free to correct me.

Do The Numbers Again by meza · 2006-11-26 04:06 · Score: 1

From what I can tell you're using the color information wrong. You divided the number of bits (21.6 gigabit) by 24 assuming a total of 24 colors. But you said to assume 24 bits of colors yielding a total of 2^24 different colors in one dot, right? Doing the same calculations on that numbers gives a 30dpi which is quite possible.

Calculations:
2.7GB per square inch = 21.6gigabit per square inch.
Each dot carry 2^24 bits of information due to color so we only need 21.6*1024^3 / 2^24 = 1382.4 dots per square inch
Square root of that equals about 37 dpi.

I'm not saying that the article might be bogus. It all seems to come down to what color depth you can get on a paper (and doing the correct calculations, so please correct me if I got something wrong). From the posts it seems like the ones that claim this is bogus counts with a very low color depth, like 3 colors or 8 colors. While the ones who believe this might be true counts with a very high color depth, 2^24 or something. I think 24 bits seems a bit high, although I must admit that I don't know anything about that.

Re:Do The Numbers Again by SQL+Error · 2006-11-26 04:14 · Score: 2, Informative

No.

I divided it by 24, because the entire calculation is in terms of bits. We have 24 bits per pixel. 2^24 possible colours, encoded as 24 bits. 24 colours would encode less than 5 bits.

What your calculation assumes is that we are storing two megabytes per pixel. I think you can see why this is impractical.

Your mistake is... by Dr.+Zowie · 2006-11-26 04:07 · Score: 1

... in confusing number of colors with number of bits. Each of your pixels doesn't store 255*255*255 bits, only one of 255*255*255 values (about 24 bits). You have overestimated the capacity by a small factor of 700,000.

Re:Your mistake is... by An+Onerous+Coward · 2006-11-26 07:04 · Score: 1

I find it a little scary that the GP's home page is at www.math.harvard.edu. But we've all had our oops moments on Slashdot, so we should be forgiving.

Another thing that has been pointed out repeatedly throughout this thread is that a dot on a sheet of paper isn't one of 256 or 256^2 or 256^3 colors. It should be one of four colors (cyan, yellow, magenta, black). Other colors are created by interleaving dots of those four colors. So even if we're generous and say that you can use any combination of the four on a single pixel, and that all the combinations are distinguishable from each other, you can only get four bits of information into a pixel.

--
You want the truthiness? You can't handle the truthiness!
Re:Your mistake is... by Anonymous Coward · 2006-11-26 07:12 · Score: 0

I posted this before but I was trying to do the math in my head instead of on paper... got it wrong, natch.

Might still be wrong but I'm liking it better :-)

QAM gives us 96 symbols per 16 bits.
It works spatial domain as well as frequency.
Nice thing about paper is the dots are pretty easy to identify as on or off as opposed to a mess of sinc functions with phase and amplitude noise.

Add the third dimension of 16 bits of color depth and you get 96 times the number of symbols.
It's a bit tricky visualizing a 3-D spatial analogue of QAM but this is what you get.

A 300x300x16bit PEL/inch print and scan gives you 90,000*96*96 bits.829,440,000 bits per square inch.

On A4 you have 90 square inches giving 74.6 gigabits

If you increase your dot density by 4 (600x600) and your color depth by 4 you've got an additional factor of 96 for 7,166 gigabits. Practical print and scan at 600DPI with 20 bit color depth is getting noisy so one would want to look at the noise level FEC tradeoffs...But, we're talking almost 900 gigabytes here or 256 gigabytes allowing for a 7-8db signal to noise ratio.

-Duck
Re:Your mistake is... by Dr.+Zowie · 2006-11-26 13:42 · Score: 1

No, no, no, QAM gives you 96 bits for 16 *cycles*. It's just a way of cramming more information into each symbol in the channel, just like anything else. Once you've converted all the way down to bits you can't do anything fancy like phase modulation -- that is something you do to an analog symbol, not to a digital bit. The bit count already takes into account all the phase/shape/amplitude information you can cram into the channel.
Re:Your mistake is... by Jack+Schitt · 2006-11-26 16:02 · Score: 1

Additionally, I think you'd need a decent CMYK scanner to scan it back in.

The trick, I think, is to use shades of a specific color channel instead of the channel either being on or off. Each channel gets 256 possible values (8 bits per channel, 32 bits per pixel (CMYK, not RGB)). The problem is printers typically have no mechanism for shading the inks used and instead dither them. The solution would be to use larger pixels (i.e. lower resolution) than the smallest the printer is capable of.

Lower resolution = higher color quality (per pixel) = fewer data pixels
Higher resolution = lower color quality (per pixel) = more data pixels

Each page could provide a color calibration bar that allows the scanner to determine the saturated and unsaturated points of the paper and ink.

If you really want to complicate things, some of the more expensive photo printers out there (newer HP Photosmart and Canon Pixma lines) use MORE than just CMYK. The HPs use six inks (CMYK, Photo Cyan, Photo Magenta) and the Canon use up to nine inks (CMYK, Photo Black, Photo Cyan, Photo Magenta, Red, and Green)

And to further complicate things, one of the HP lines also has a gray scale tri-color cartridge available. (I work at a major retailer that happens to sell printers and inks. You learn a lot about printers and ink this way.)

For the issue of bleeding across pixels, I'd use a color laser jet instead of ink jet. That takes it back down to the simple CMYK instead of the absolutely ridiculous CMYKPBPCPMRG that Canon uses.

--
This message brought to you by Jack Schitt's Previously Shat Shit

Re:This is a lie by DJCacophony · 2006-11-26 04:09 · Score: 0, Flamebait

Why are you using metric measurements? Resolution is measured in DPI, an imperial measurement. There is no such thing as dpmm (dots per millimeter).

--
Slow Down, Cowboy! It's been 60 minutes since you last successfully posted a comment.

Re:RTFA by Dun+Malg · 2006-11-26 04:10 · Score: 4, Informative

You are an idiot because: You ignored the one and only thing he /did/ say, which was that he was doing something differently.

Bzzt.

Encoding data using dots is the most efficient method possible. He has to print the image somehow, and scan it back in again. No combination of triangles and circles can circumvent the resolution limit, which is what is being calculated here.

By showing that the claim exceeds all practical limits of optical resolution (and probably the absolute physical limits), we show that what we have is just another magical compression scam.

He says that he's "doing something differently"; we've proved that what he claims to be doing is impossible. End of story.Indeed. For those not inclined to simple mathematics, here it is in a nutshell-
Assumptions (none of them unreasonable, all of them quite generous even):
1440dpi
8 bit color
8" x 10.5" printing area

Even assuming perfect readability, this resolution yields only 1.4GB per page. Talk of "shapes" is smoke and mirrors to obfuscate one of the cold hard facts of information theory: you cannot accurately represent all permutations of 8 bits of information if you've budgeted less than 8 bits. Compression schemes allow you to compress repetitive patterns is you know they're going to be there beforehand (e.g. an almost arbitrarily large number of only 1's or only 0's can be represented with run length encoding), but X bits of random data requires X bits of allocated space.

--
If a job's not worth doing, it's not worth doing right.

Re:Do The Numbers Again and agiang and again by meza · 2006-11-26 04:12 · Score: 1

Ooops. Please ignore this post.

24-bit color of course means 24 bits of information in one dot (being enough to represent 2^24 colors but thats not of any importance). Very stupid of me. So yet again we learn the importance of making correct calculations. Now I to call bogus on this.

Re:Do The Numbers --- But do them right by RhettLivingston · 2006-11-26 04:13 · Score: 1

24-bit fidelity would allow over 16 million colors. In fact, it appears to me that they are allowing for much less fidelity than that. Assuming a 1200dpi print capability (most modern dot matrix can do better than that, especially on the horizontal) and a 3x3 matrix for the shapes we get the following:

(1200 * 1200)/(3 * 3) = 160,000

160,000 * 4 shapes = 640,000

640,000 * (32 shades (6 bits) ** 3) = 20,971,520,000

Using some of the other print resolutions commonly available such as 1440 DPI or 2880 DPI could easily reduce the color scanner accuracy requirements to less than the 18 bits I assumed here. Note that much of the accuracy requirement could be met without "absolute" accuracy from the color scanner. It merely has to have "relative" accuracy if a small calibration area of known colors is provided. Also, I didn't account for the possibility of having multiple color regions per shape which could further increase density.

Re:Do The Numbers Again and agiang and again by SQL+Error · 2006-11-26 04:16 · Score: 1

Please ignore my correction too. Though it does seem to be a common confusion.

Good idea (for a scam) by gone.fishing · 2006-11-26 04:19 · Score: 1

While I agree that 256GB on a sheet of paper is probably a scam, I find parts of the story to be an interesting idea. The barcode is ubiquitious but it stores a limited amount of information. Some specialty codes can store a great deal more information in a much smaller area in black and white (like the UPS scatter code). Adding color and using geometric shapes seems like it could increase density by a factor of three or better (just considering color).

This kind of encoding and scanning probably isn't for everyone. A barcode can convey information that can be stored in a database - so a small amount of data (in essence a serial number) can convey a great deal of information. For inhouse applications, like storing the location of a particular pallet in a warehouse for instance this is more than adequate. But for applications where you want to share information with others outside of your network, a great deal more data may need to be contained - like when providing a lot of steel from a foundary to a manufacturing plant. In this case, you may need to provide a large amount of data to an outside company that you do not want to have access to your network data. Info that goes beyon simple dimension data, stuff like formulation data, lot number, and manufacturing date.

Paper is much cheaper than RFID and has stood the test of time. It is in many ways the ideal medium for data exchange between two entities. It has served this exact purpose for centuries.

Re:Good idea (for a scam) by Anonymous Coward · 2006-11-26 08:26 · Score: 0

Actually, your example is bogus because it's highly unlikely that one company is going to accept steel from another company of such varying quality. They're going to have a pre-existing business relationship, with standardized grades of steel, probably established by some industry consortium or professional engineering society, but at the very least stipulated in their contracts. And that'll have a code number.

EDI/EDA is all the rage these days, too, or rather was, so there's not really an issue in connecting up databases there. I'm not sure how it panned out.

While I'm sure that at the very least the original article is worthless (and the densities claimed bogus), I do like the suggestion they made that a potential application could be printing data in a magazine, instead of distributing CDs. It'd be a lot more convenient, and probably less likely to get separated from the magazine at a later date. Just scan and decode a few tens of megabytes worth of information from a glossy page, or even have multiple pages.

Mod parent up by Skrynkelberg · 2006-11-26 04:19 · Score: 1

He's got the ultimate proof this story is a hoax, hands down. And he did it without juggling a lot of numbers.

Not a big deal by ImaLamer · 2006-11-26 04:23 · Score: 1

It's just Hollerith cards all over again.

--
Get your Unix fortune now!

General Approach to Archiving? by MrHatken · 2006-11-26 04:24 · Score: 1

Ok, I agree this is most likely a scam or a fake.

But what about the general principle of archiving to dots on paper (be they black and white, probably better lasting, or colour for greater density) as a way of getting around the problem of constantly having to translate our archives into the latest technology (less we be left without a reader or, at best, only one in a museum some where).

It seems to me we'll probably always have scanners and the software to extract the information from an image would probably be relatively easy to write (for different image formats). It seems (at least superficially to me) that scanning is sort of independent of the underlying technology in a way that reading from a tape isn't.

I realise there have been (and are now) paper tape storage systems, I am more interested in getting over the problem of technology going out of date.

Cheers,
Ashley.

Re:Do The Numbers --- But do them right by SQL+Error · 2006-11-26 04:26 · Score: 1

No. You've made the same mistake as everyone else. Well, and a couple of other minor ones.

If you have 6 bits per colour (RGB), that's 18 bits per pixel. Not, as you calculated, 32,768. You would have us encode 4Kbytes per pixel.

YOU CANNOT COUNT THE COLOURS. YOU CAN ONLY COUNT THE BITS USED ENCODED THEREIN.

Re:Do The Numbers --- But do them right by SQL+Error · 2006-11-26 04:28 · Score: 1

YOU CAN ONLY COUNT THE BITS USED ENCODED THEREIN.

Note: Some bits may be redundant...

(Darn that lameness filter!)

Re:Do The Numbers --- But do them right by RhettLivingston · 2006-11-26 04:31 · Score: 1

6 bits of color control per color allows for 32 shades of that color to be created. A 24 bit color printer produces 256 shades for each of 3 colors giving over 16 million colors.

Back of the mental envelope... by Anonymous Coward · 2006-11-26 04:32 · Score: 0

For an 8.5 X 11 inch sheet of paper...

8.5 x 11 is about 90 sq inch leaving a small margin.
300DPI is 90,000DPSI is 8,100,000 dots per page (16.2 million if you use both sides)
OK, lets sat our scanner can resolve 16 bits of color. That's 16 gigabits or 2 gigabytes.
This leaves us with 8bits/bit for redundancy.

We have yet to allow for compression and 8 bits/bit for completely random data is analagous to a respectable 12db signal strength.

Re:Back of the mental envelope... by Anonymous Coward · 2006-11-26 04:48 · Score: 0

My point, in case you missed it, is that it's pretty easy to get a decent amount of storage on paper with el-cheapo COTs stuff and a quick unoptimized hack. Our Indian friend is only talking another 16 bits. You'll get this with a 600 DPI print and scan and 20 bits of color resolution.
Re:Back of the mental envelope... by watercanhydrate · 2006-11-26 07:06 · Score: 1

First, don't most color scanners scan in true 32-bit color? Second, he's not just using a single dot of color to represent the 32-bits, he's using the color in shapes, which will take many dots (and is probably less efficient than using just a single dot), so I would imagine he's using an even higher DPI printer (my really really cheap printer at home is 1200x1200 dpi). Perhaps the purpose of the shapes is to make the data readable after some damage has been done (imagine a square being recognizable after part of it has been destroyed, a single dot wouldn't be recoverable).
Re:Back of the mental envelope... by kimvette · 2006-11-26 07:20 · Score: 2, Interesting

Resolving 16 bits of color for a typical four-color (CMYK) laser printer or even a digital press won't improve things, because you always have exactly four colors plus the medium color (most typically white). Increasing color depth can improve with dye sublimation and similar technologies, but the ink and paper quality (especially absorption rate) need to be strictly controlled.

--
The Christian Right is Neither (Christian nor right). See: Matthew 23, Matthew 25, Ezekiel 16:48-50
Re:Back of the mental envelope... by Lotharus · 2006-11-29 02:09 · Score: 1

I'm sure someone will correct me if I'm wrong, but I've been of the impression that "32-bit" color is the same bit-depth as 24-bit color, plus an alpha (transparency) channel. Transparency is irrelevant in capturing an image.. You still end up with 8 bits per color channel. So in other words, there's no such thing as "true 32-bit color" from a scanner.

*sits back and waits to be corrected*

--
http://undecidedgames.blogspot.com

Re:This is a lie by Panaqqa · 2006-11-26 04:32 · Score: 1

I am using metric because this is a question of physics and the wavelength of light. Physics uses metric exclusively. Also, outside of the United States, the world overwhelmingly uses metric measure - and I am not in the United States.

DPI or dpmm, the basic facts have not been altered, nor has the conclusion: a scanner using visible light can't pick up objects this small. Work this out yourself, using your treasured "imperial" measure if you would like - but if you are going to measure the speed of light in "furlongs per fortnight", try to be accurate. :P

Re:Do The Numbers --- But do them right by rmstar · 2006-11-26 04:36 · Score: 1

Yes, I wouldn't discount the idea in principle, even though I am quite confident that much of this story is a fake. Playing a movie from a self-built prototype implies quite a bit of bandwidth.

The next step :: 3D Scanners! by itz2000 · 2006-11-26 04:36 · Score: 1

256GB per paper? The next step will be a 3d scanner which scans multiple "papers" attached, and will contain few layers.
Do the math, n layers * 256 while n might be from 2..100 for example!
lets say 10 layers (which shouldn't be hard once you get to do the multiple layers attached) = 2560 GB! per one unit!

The next step is a 3d scanner, definitely.
What do you think of that?

Not the first time by Guillermito2 · 2006-11-26 04:43 · Score: 1

It's not the first time that we hear some kind of hoax like that. Remember the wonderful 100:1 losless video that "superseded Claude Shannon's work", right here on Slashdot in 2002 (or enter "zeosync" on Google for a bit of geek fun) : http://science.slashdot.org/article.pl?sid=02/01/0 8/137246&mode=thread

Related prior art by msobkow · 2006-11-26 04:44 · Score: 2, Informative

I believe it was "Dr. Dobb's Journal" that used to publish code that could be scanned, sort of a variant on barcodes.

Printed at the higher resolutions available to printers and scanners 15-20 years later, how much data could you store using that encoding format on paper? We've gone from about 100dpi to 600-1200, which actually means at least 36 times the data storage per square inch.

I fail to see how a binary pixel can fail to take less space than a printed geometric shape. You can squirt an ink dot a lot smaller than you can a recognizable microscopic shape.

Colour filtering to provide "layers" of data is a form of bonding or multi-frequency processing. Even CMY alone triples the storage potential; the colour discrimination of the scanner is the only limit to the potential bandwidth multiplier (plus data loss due to fading.)

In other words, it sounds a lot more original than it actually is. Odds are the creator is too young to have even heard of Dr. Dobbs, much less have played with the code scanners to save typing time.

--
I do not fail; I succeed at finding out what does not work.

Re:Related prior art by Walt+Dismal · 2006-11-26 05:12 · Score: 1

I seem to remember it was Byte magazine, and the technology was called PaperBytes or something like that. Circa 1977. I'd go downstairs and check my stacks of old magazines for it, but I'm not too fond of rats.
Re:Related prior art by iocat · 2006-11-26 06:28 · Score: 2, Insightful

You're thinking of the Cauzin Softstrip. It was basically just 2D barcodes. It totally worked though; my computer teacher in middle school had one and it worked well.
If you assume an 8.5 x 10 inch sheet of paper (85 square inches), 300 x 300 dpi x 256 colors, you end up with 1.95 billion bits of info you can put on a page. Divided by 8 (to get bytes), you end up with something like 244GB of potential info. But you'll need to have some good error correction and registration. if you look at the original link (which is a link from tfa), it basically looks like a colorful, 2D bar code. I guess the color could make it a 3D barcode.
So despite the "fake" and "scam" tags on this article, there's no reason IMHO to doubt the theory, although I don't know if the application would be super practical.

--
Dude, I think I can see my house from here.
Re:Related prior art by pe1chl · 2006-11-26 06:35 · Score: 1

If you assume an 8.5 x 10 inch sheet of paper (85 square inches), 300 x 300 dpi x 256 colors, you end up with 1.95 billion bits of info you can put on a page.

Please show a detailed calculation because I cannot work out how you arrive at this result. My best result is about 7.6MB.
Re:Related prior art by iocat · 2006-11-26 06:46 · Score: 1

300 x 300 x 256 x 85 = 1,958,400,000.
300 dpi x 300 dpi is 90,000 dots, which can each be one of 256 colors, so that's 23 million possibilities per square inch, times 85 inches.
This math could be all wrong, because I'm not especially smart, but I think it's ok.

--
Dude, I think I can see my house from here.
Re:Related prior art by anagama · 2006-11-26 06:53 · Score: 1

300 dot/in * 300 dot/in * 256 colors/dot * 85 in/page = 1,958,400,000 dot colors/page

Imagine each colored dot as a bit. 244,800,000 bytes.

--
What changed under Obama? Nothing Good
Re:Related prior art by watercanhydrate · 2006-11-26 07:39 · Score: 3, Informative

Rather than multiplying by 256 colors/dot, I think you should be multiplying by 8 bits/dot (same as 256 colors). Also, 1,800,000 bytes is a MB, not a GB. So it really should look something like this: 300 dot/in * 300 dot/in * 8 bits/dot * 8.5 in * 11 in = 67,320,000 bits per page = 8,415,000 MB
Re:Related prior art by iocat · 2006-11-26 07:57 · Score: 0

No, no, I just picked 256 colors as an arbitrary number. My math has 256 bits per dot (256 colors). I don't need to worry about how you display the colors onscreen (where 256 colors = 8 bit color), since we're dealing with pigment on paper, not pixels.

--
Dude, I think I can see my house from here.
Re:Related prior art by synthespian · 2006-11-26 08:12 · Score: 1

I fail to see how a binary pixel can fail to take less space than a printed geometric shape. You can squirt an ink dot a lot smaller than you can a recognizable microscopic shape.

If you ask the wrong questions, you get the wrong answers.
Which conveys more information: ASCII character "9" or a pixel?

--
Main difference between the BSD license and the GPL license: one is from California and the other is from Massachusetts
Re:Related prior art by pe1chl · 2006-11-26 08:30 · Score: 1

correct calculation: 300x300x85 dots on the page. each dot can be 256 colors, or 1 byte can be encoded in 1 dot.
total capacity: 300x300x85 = 7650000 bytes or about 7.6 MByte.

there is no way you are going to store 23 million different colors of a single pixel reliably on paper, but even when you did yould only encode 3 bytes (24 bits) per dot, or about 22 Mbyte total.
Re:Related prior art by pe1chl · 2006-11-26 08:33 · Score: 2, Informative

You cannot encode 256 bits in a single dot, and then reliably read back the result from the paper.
You would need 2^256 different colors, reliably detectable. This is impossible.
Re:Related prior art by x2A · 2006-11-26 09:32 · Score: 1

"If you ask the wrong questions, you get the wrong answers.
Which conveys more information: ASCII character "9" or a pixel?"

Let's ask the Wiki:
http://en.wikipedia.org/wiki/Pixel
vs
http://en.wikipedia.org/wiki/ASCIIcharacter9

I think it's clear who the winner is.

--
The revolution will not be televised... but it will have a page on Wikipedia
Re:Related prior art by DahGhostfacedFiddlah · 2006-11-26 09:39 · Score: 1

Impossible?

Are there not 256 different dyes out there that
a) Reflect a narrow frequency range
b) Pass through most of the rest?

(This is a serious question - I have no idea if that many dyes could be layered/mixed such that the first one laid down would still be readable - and it would probably depend on the size of the "dot")

If so, there's no reason one could not layer 256 different colors on top of each other, and turning the colors on or off gives you your 2^256.

--
Last post!
Re:Related prior art by Bastard+of+Subhumani · 2006-11-26 09:50 · Score: 2, Funny

Are there not 256 different dyes out there that a) Reflect a narrow frequency range b) Pass through most of the rest?
Even if there were, the magenta would always run out before the others.

--
Only three things are certain; death, taxes, and apocryphal quotations - Ben Franklin.
Re:Related prior art by Hognoxious · 2006-11-26 10:02 · Score: 1

Wrong. The answer isn't clear at all.

What is clear is that the answer depends just a liiiiiittle bit on the color depth of the pixel, which synthespian (563437) didn't state. But thanks for playing anyway.

--
Confucius say, "Find worm in apple - bad. Find half a worm - worse."
Re:Related prior art by mysticgoat · 2006-11-26 10:36 · Score: 2, Interesting
The problem is, all you guys who posted above me aren't thinking inside the box.
1. Rather than looking at individual dots, group them together into 3x3 matrices. On the paper you are talking about, at 300dpi resolution, there are 85,000 such boxes.
2. Each matrix can hold an image painted in one of 256 colors on a background painted in one of 255 remaining colors. We've now got 55,488,000,000 bits of storage, or 6,615 MB.
3. We'll sacrifice the first 256 boxes as a registration area that displays the 256 unique colors as they appear on that particular paper. We'll sacrifice the rest of the top two rows for similar registrations showing matrix boundaries, etc, all done in triple redundancy (and repeated a couple of times, too). I won't bother adjusting the calculations that follow though: as far as the math is concerned, this is an insignificant overhead.
4. Now consider the shape of the image. If we use an "L" shape of the left column and bottom row of the matrix, we can rotate that into 4 distinct positions, increasing our storage to 26,460 MB.
5. By dropping the top cell from our original "L", we've got a new figure that can be taken through the same transforms, and we've doubled our storage again. By adding the center dot to all the figures we have made so far, we double the storage yet again. We've now got more than 103 GB of data on that sheet of paper.
6. And we don't have to be confined to "L" based images, either.
I expect something a bit more sophisticated than this is being done. A bigger matrix would allow more distinctive shapes and probably be more robust against dust motes, etc. I have no doubt that the technique can be made to work, and it is really appealing! It sounds like the software being developed will work with many of the printers and scanners that are already in common use. And we've got centuries of experience in handling paper. A four drawer filing cabinet could be tomorrow's petabyte archival storage.
Re:Related prior art by x2A · 2006-11-26 13:41 · Score: 1

*woosh*

--
The revolution will not be televised... but it will have a page on Wikipedia
Re:Related prior art by ConceptJunkie · 2006-11-26 14:04 · Score: 1

Your post reminds me of that old proof that the average person only works one day a year.

Your step 2 makes no sense. It looks like you are pulling numbers out of your nether regions. How can a square of 9 pixels hold more than 9 * number of colors values?

--
You are in a maze of twisty little passages, all alike.
Re:Related prior art by kasperd · 2006-11-26 14:34 · Score: 1

You calculations simply doesn't make any sense. Now let's compute an upper bound based on the first few numbers you presented. 85000 blocks of 3*3 dots with 256 possible colours. That is a total of 765000 dots and the 256 possible colours can simply be mapped to the 256 possible values of a byte. In ohter words 765KB is an absolute upper bound on the information you can store on that piece of paper. In practice you might be able to store much less information.

--

Do you care about the security of your wireless mouse?
Re:Related prior art by ZorbaTHut · 2006-11-26 14:46 · Score: 1

You do not seem to understand how mathematics works.

Your first step, for example, is hideously flawed. Take step 2. Each matrix can hold an image painted in one of 256 colors, on a background painted in one of 255 remaining colors. Okay, fine. However, this doesn't give us 85000*256*255 bits of data. In fact, this gives us 85000*log2(256)*log2(255) bits of data, or approximately 1.36 million bits.

Again, using an L shape, and rotating it, does let us have 4 distinct positions. However, that doesn't quadruple our data capacity. That quadruples the *number of possible states* the image is in . . . giving us a whopping two bits of extra storage. Adding another L gives us another two bits. Adding a center dot, or not, gives us another two bits. So now we've got approximately 1.360006 million bits. Congratulations.

If you disagree with this, here's a proof. Take a single byte. Obviously, we can store one byte of data in it. Now add a single bit. This can be in one of two positions, thus *doubling our storage*! Do this 32 times. Now, with a single byte, and 32 bits, we can store 4 gigabytes of data! Oddly, though, this doesn't work.

--
Breaking Into the Industry - A development log about starting a game studio.
Re:Related prior art by kennygraham · 2006-11-26 15:12 · Score: 1

Which conveys more information: ASCII character "9" or a pixel?

If the pixel can be one of 256 colors, they convey the exact same amount of information, as the ASCII character "9" is one character of 256 ASCII characters (including control characters and null). And you can fit many such pixels in the space it would take to display an ASCII "9".
Re:Related prior art by msobkow · 2006-11-26 15:16 · Score: 1

Presume 300x300 dpi with only an 8-color system providing an additional 3 bits. For sake of argument, presume the 3 bits added by colour are used for error correction (11 bits/byte vs. the 10 bits/byte that is the more common average for simple CRCs.) 30,000 bits per square inch, or 3,750 bytes.
Presuming an 8x10 print area on 8.5x11 inch stock (which allows for "barrier" regions that provide "alignment" for the scan), that's 800 x 3750 or 3,000,000 raw bytes of data per sheet.
Adding in compression might boost that a bit, but not by too much, and it's an unfair measure of a storage medium to presume compression.
If your ideal of 256 colours is involved, that's 8 bits of discrimination instead of 3, or 5 times as much potential storage for 15,000,000 bytes or a little over 14MB.

--
I do not fail; I succeed at finding out what does not work.
Re:Related prior art by msobkow · 2006-11-26 15:28 · Score: 1

Oops. Typo at the start. 90,000 bpi, not 30,000. So 11,250 bytes per square inch in 8 colour, 46,250 bytes per square inch in 256 colour. That works out to about 8MB per page 8-colour, 36MB at 256-colour.
600x600 would quadruple that to a peak of 144MB, and 1200x1200 would boost it to 578MB.
Improving colour discrimination to 4 bits per colour jet (presume 3), and you have a total of 12 colour adding another 50% over the 256 colour scheme.
The problem is the smaller the pixels and finer the colour discrimination, the greater the chance for ink bleed, alignment errors, colour fading, and other issues. You'd end up having to put more and more data into error detection and correction, drastically reducing the effectiveness of such techniques.

--
I do not fail; I succeed at finding out what does not work.
Re:Related prior art by mysticgoat · 2006-11-26 16:48 · Score: 1

How can a square of 9 pixels hold more than 9 * number of colors values?
It would appear that someone has yet to discover the miracle growth of combinations...
This has nothing to do with the size of the matrix. It has to do with the number of unique selections of two colors that can be made from a set of 256 colors.
Go straight to Google and type in "256 choose 2". Then type in "combination math tutorial" and start exploring a new world.
Enjoy!
Re:Related prior art by mysticgoat · 2006-11-26 17:00 · Score: 1

You calculations simply doesn't make any sense. Now let's compute an upper bound...
No let's not, since before that can be done there has to be an understanding of the underlying mathematics.
Of course my calculations don't make any sense if you haven't yet studied the applicable math. Tell you what, let's look at a couple of easier problems that are very similar. How many unique games of checkers can be played out on a 8 x 8 checkerboard? How many unique games of chess? These ARE similar problems, involving combinations and patterns.
I've talked elsewhere about this on this thread: use "parent" to get back to my first post then re-read the subsequent ones.
Re:Related prior art by kasperd · 2006-11-26 17:36 · Score: 1

No let's not, since before that can be done there has to be an understanding of the underlying mathematics.
I hope can help you a little bit in gaining that understanding. We want to know how much information we can store on this piece of paper. None of us know the exact answer, but we really don't care that much as long as we can compute an upper and a lower bound.

The mistake you made was to do some complicated computations on a lower bound, which turned out to be too complicated. What I did was to do some simple computations and achieve an upper bound. Since that upper bound is smaller than your lower bound, one of them has to be wrong. And that is exactly why I aimed for an calculation as simple as possible, not a bound as close to the exact answer as possible. The simple calculation is not as prone to errors.

A bit of common sense also helps here. Knowing how much space a bitmap with the resolution of an ordinary printer and size of a piece of paper will take, common sense will tell you, that you cannot store more than that. Basically if you could do that, there wouldn't be any reason to print it out, since the encoding would already be way smaller than the original, so you might as well store the bitmap on an ordinary floppy disk, that also saves you the trouble of producing a reliable scanner.

--

Do you care about the security of your wireless mouse?
Re:Related prior art by eno2001 · 2006-11-26 17:50 · Score: 1

Mods. The humorous parent comment is not properly moderated. Please fix. Thanks.

--
-"...bad old ideas look confusingly fresh when they are packaged as technology" - Jaron Lanier (Digital Maoism on Edge.o
Re:Related prior art by mysticgoat · 2006-11-26 18:38 · Score: 1

You do not seem to understand how mathematics works.
How insightful of parent post for stating the obvious! IANAM, nor have I ever claimed to be one.
OTOH, many of the contributors to this thread do not seem to understand how to work the mathematics of this problem. Nor do they seem to recognize that Sainul Abideen could not have gained the recognition he now enjoys if it were so easy to show that his claims were 5 orders of magnitude greater than what they think is the upper limit of the possible. It seems much more likely that slashdot's best mathematicians were screwing it up somehow.
It seemed to me that combinatorial processes had to be involved since the articles about Sainul's work happened to mention (several times) that the mechanism was based on combinations of colors and shapes. While IANAM, I am someone who knows how to comprehend what he is reading.
Since I haven't worked with combinatorial math since my student days, I put forward an estimate that I was sure was low, but which was quite a bit higher than those being bandied about, and I sort of hoped that someone would pick up on this and come through with a description of the correct technique.
Someone did, more or less, but it was not the parent post. Parent post contains some mathematical jokes that whooshed by over my head. Or, well... there is another possibility concerning where parent post's author's head is at, but I won't go there, no sir. That would be insulting.
Re:Related prior art by ZorbaTHut · 2006-11-26 18:48 · Score: 1

You overestimate the quality of average reporting and, again, fail to actually understand the mathematics involved. Please, follow the simple thought experiment I outlined in the last paragraph, then spend some time thinking about how that applies to your example.

An 8.5" by 11" sheet, at 300dpi with 24-bit color, can store at most 25,245,000 bytes of data. This assumes zero redundancy, the ability to detect 24-bit color, and no alignment needed. This is the maximum. "Geometrically encoded data" doesn't gain you anything on top of that - a raw scan of that sheet with those parameters will be that many bytes in size, and there's simply no way to synthesize more data magically out of the scan.

The article is bunk. That's all there is to it.

--
Breaking Into the Industry - A development log about starting a game studio.
Re:Related prior art by ZedmanAuk · 2006-11-26 18:49 · Score: 1

Your calculations don't hold. 256 colors is 8 bits, not 256 bits. If it were 256 bits, you could have 2 colors simultaneously in the same dot, not possible.

--
-ZA
Re:Related prior art by ZedmanAuk · 2006-11-26 19:03 · Score: 1

Incidentally, on an 8.5x11 piece of paper I can represent every single possible combination using 8.5x11x300x300x8 bits = 67.3 mbit = 8.4 megabytes of information. If it were really the case that you can store 256 GB of info in that same space, I would have the ultimate lossless compression technique ever! :-)

--
-ZA
Re:Related prior art by Anonymous Coward · 2006-11-26 19:07 · Score: 0

256 colors on top of 255 background colors doesn't give you 65280 bits, it gives you log2(65280) bits, i.e. 15 bits. If we go a tiny step further and let the background be the same color as the foreground, we get log2(65526) = 16 bits. With 85000 boxes, this is only 1,360,000 bits, around 166 KB.
Re:Related prior art by mysticgoat · 2006-11-26 19:14 · Score: 1

I hope can help you a little bit in gaining that understanding.
There seems to be a continuing failure to recognize that this is not a simple mapping problem.
Within a 3x3 matrix, there is a background color and a foreground color. Two distinct colors chosen without replacement from a palette of 256 colors (limited by what can be easily distinguished by the scanner). There are more than 32,000 ways of selecting these two colors (Google is your friend-- see elsewhere on this thread). Each matrix can take on 32,000+ distinct states. You could think of this as each matrix representing a digit in base 32,000+.
You could take a typical business document and treat it as a long binary number and convert it to base 32,000+ and store it on paper in 2 or 3 digits. The process is easily reversible: read a couple of dozen digits from the paper, convert to binary and split up into bytes then feed the resulting spreadsheet to Excel.
And in all the above we were dealing with just one shape, say a circle of one color on top of a background of a different color. When you start combining in other easily distinguished shapes like squares, stars, etc, then you increase the base of the number system even further.
So how many unique states can you represent in a handful of digits? It all depends on the base of the number system you are using. It seems like Sainul has found an interesting application of an overlooked aspect of number theory.
Re:Related prior art by PaladinAlpha · 2006-11-26 20:11 · Score: 2, Insightful

If what you're saying was true, wouldn't it be easier to just encode two dots, rather than in a 3x3 matrix? Dot 1 for color, dot 2 for color? Same result. All that other rotation hullabaloo is less efficient than just putting one more dot, since we're talking about upper bounds. If you're using 256 colors, that's the same as one byte. (1 byte = 0-255, 256 states.) So you're talking about a byte, and then another byte. Two bytes, or a 16 bit integer. EVERYTHING you've said about this 256 select 2 garbage applies to 16 bit integers, without exception.

Let's for sake of simplicity consider just one 3x3 matrix, and surely you can agree the rest of the concepts for the full sheet will follow.

We say: 3x3x256, a byte is 256, so 3x3 bytes, so 9 bytes (using 256 color). You're saying 3,329,280 combinations per 3x3. As a little experiment, let's see how many 'combinations' 9 bytes can hold, or the highest number you can count to with nine bytes, which is 9x8 or 72 bits: 2^72 is 4,722,366,482,869,645,213,696. So with our nine bytes we could hold every single combination that you postulated per matrix FOUR QUADRILLION TIMES just by assigning it a unique number. Your 'innovative' technique does nothing but waste space in a ratio of 4,000,000,000,000 to 1.

Now, the mistake you're making: bits don't store 'combinations', they store states. In order to increase the number of states you have to DOUBLE the number of combinations. A single bit has two combinations: 0 and 1. Two bits has four combinations: 00, 01, 10, 11. Three bits has eight combinations: 000, 001, 010, 011, 100, 101, 110, 111. And so on. That's why people have been repeatingly telling you to take the log and not just use the number. Combinations do not equal bits and do not equal storage space.
Re:Related prior art by mysticgoat · 2006-11-26 20:38 · Score: 1

You really do need to climb into the box. You will find it much roomier on the inside than it appears to be from the outside.
Visualize a 3x3 matrix. Consider that it can have two states: solid black or solid white. It can represent a single binary digit. There would be 85,000 of these on the "standard page" defined earlier in this thread.
Now stay with the black and white, but allow the center of the matrix to change independently of the 8 bits that surround it. This provides a foreground and background. The same 3x3 matrix still represents a single digit, but now it is in base 4. It can be all black, all white, black foreground with white background, white foreground with black background.
Increase the number of colors possible from 2 to 16. There are 120 different ways to paint the inside of this box. It still represents a single digit, but that is now a digit in base 120.[1]
When the number of possible colors increases to 256, the number system of this digit becomes base 32,640. There are that many different ways to choose 2 colors from a set of 256 colors.
Take a text file of 10 kilobytes, treat its bitstream as a longish binary number and convert it to base 32,640. It can be represented on paper in 2 (two!) digits. A 125 MB chunk of data can also be represented as 2 digits base 32,640.[2]
So, here we have 2 3x3 matrices for a total of 18 pixels that is capable of storing more than 125 MB of data, and this without even considering the combinations of placing the dot somewhere other than the center, or using more than one dot. 125 MB of storage in 18 pixels seems fair data compression. Even if we make the pixels pretty big so we can use cheaper scanners.
So where do you think the fallacy in establishing limits on paper storage lies? I believe it has something to do with the way that introducing a foreground - background relationship brings combinatorial considerations into the picture. Start using clubs, spades, diamonds and hearts as well as color, and the data compression begins to get pretty good.
I suspect that Sainul's technique might be computation bound: I think the kinds of data compression involved in converting between binary and these high base number systems must be pretty CPU intensive. It certainly doesn't look like its paper bound. [1]Maybe 240 different ways, and base 240? I'm not sure. But at least 120. [2]If I can believe my calculator, the 3rd digit would kick in at just over 127 MB: 32,640**2 = 1,065,369,600 bits = 130,050 KB = 127.002 MB
Re:Related prior art by ZorbaTHut · 2006-11-26 20:49 · Score: 1

Your math is just as flawed now as it was before.

Here, let's do this thing. Start with a binary number: 10110101 10011010 10010100 01101111 11010100. You'll notice this is 40 bits - far, far, far less than 10 kilobytes. Convert it into a base 32,640 number. Right now. Do it. You can even try converting it into two if you want. It's 40 bits, so that should be easy.

I'll give you a hint, before you start: You can't. You could convert it into two base 65536 digits, plus a base 256 digit. Or five base 256 digits. Or two base 32768 digits and a base 1024 digit. But you're confusing "bits" with "base". A 16-bit "digit" is the same as a base 65536 digit. A 32-bit "digit" is the same as a base 4294967296 digit. A 40-bit number - what we have here - would require a base 1099511627776 digit. In general, N bits takes a 2^N base digit to store, if you want to cram it into a single digit.

1000 bits cannot be stored in a base-1000 number. Not even close. 1000 bits wouldn't quite fit in a base-10^301 digit, in fact.

--
Breaking Into the Industry - A development log about starting a game studio.
Re:Related prior art by mikiN · 2006-11-26 20:57 · Score: 1

Better yet, you would be able to scan and digitize the printout, then 'compress' it again. Lather, rinse, repeat, and you would end up with one pixel.

Shannon entropy rules!

--
The Hacker's Guide To The Kernel: Don't panic()!
Re:Related prior art by pe1chl · 2006-11-26 21:17 · Score: 1

You really do need to climb into the box. You will find it much roomier on the inside than it appears to be from the outside.

Once I read statements like that I know the poster does not understand the problem. There is no magic room inside the box.
We all know about the articles about infinite compression. It simply isn't possible. Live with it.

Your basic misunderstanding is that when you have something that has "240 different ways" and you add another "240 different ways" you have "doubled the storage".
This is not true. You have added 1 bit.
Re:Related prior art by PaladinAlpha · 2006-11-26 21:28 · Score: 1

All right, patience out. Listen, idiot. A 10k text file would be a 'longish' binary number of 10,000 bytes lined end to end, or a number in BASE 2^10,000 = BASE 199(three thousand zeros). Think about it like this.

You're saying a 10k file can be represented as two numbers in base 32,640. Two 32640's means 32640*32640 = 1,065,369,600, about one billion. Ok, a lot of 10k files. 10k is about ten pages of text, give or take. (That helped inspire the original measure.) If ANY 10k can be stored in that one billion, that means there are ONLY one billion POSSIBLE ten pages, otherwise it'd overlap and you wouldn't know which one you wanted, right? So you're saying there are only one billion possible ten pages of book. How many books do you think there are in the world? Even if there were only a billion and they all had only ten pages, you'd still lose, because you'd have to account for different combinations (and this word is glad to be used properly for once around here) of those ten pages. Impossible. This means, you are incorrect.

But let's get more detailed since you feel compelled to argue. Besides, that last example feels kind of stiff.
Increase the number of colors possible from 2 to 16. There are 120 different ways to paint the inside of this box. It still represents a single digit, but that is now a digit in base 120.[1] When the number of possible colors increases to 256, the number system of this digit becomes base 32,640. There are that many different ways to choose 2 colors from a set of 256 colors.
Right, guy, and -- here's where you're not paying attention -- a single 8-bit byte is a base-256 number, because it can represent any number from 0 to 255. A 16 bit is base 65536, because you can store any number from 0 to 65535 in 16 bits. The guy above me hit on this, and he's right, too, I just can't leave this alone, as we've all provided a perfectly rational response which you've thrown to the wind with the classical approach of someone who doesn't understand what they're talking about, gets called on it, and sticks their fingers in their ears.

Think about this, also: no matter WHAT you do with a 3x3 matrix, PERIOD, I can represent it as 9 pixels, and have room left over, unless you've used every possible combination of each individual pixel, in which case they are equal. It doesn't matter which pixel is foreground vs background, or how many color patterns there are, or what shapes you draw or if you take a tarot reading, you can't represent more than the things used to make the image represent. You can. not. do. it. It's called information theory, and believe it or not it's had a lot of work done on it by people who actually did know what they were doing, and they've had the same little 'observations' as you, discredited them, and left them behind, years and years ago.

Summary: You CAN NOT EVER store 10k as '2 digits base 32640.' Let me hit this one last time just in case you don't get it. A 15-bit number can store up to 32768 values, so it is base 32768, close to your number. You're saying you can store 10k in less than two fifteen bit numbers -- or saying you can put TEN THOUSAND bytes in less than TWO bytes. Gosh, everyone else just missed it all these years, huh? Silly scientists. When will they ever figure it out? People like you give science a bad name by not understanding anything about the work that goes into what we have.
Re:Related prior art by Ed_1024 · 2006-11-26 21:54 · Score: 1

I'm rather dubious about the maths but I'll leave that to the experts.

From a simplistic point-of-view, if you could print a sheet of paper using an ordinary inkjet with that amount of information encoded on it, then logically that information has also been held in the printer RAM. Certainly my printer does not contain 29GB or whatever...
Re:Related prior art by mysticgoat · 2006-11-26 22:06 · Score: 1

Give me a set of glyphs for the 32,640 numerals needed and I'll convert that number for you. It will be the 181st glyph in the ordered set.
If you are having trouble wrapping your mind around the concept that numbers exist independently of their representation in a computer, please keep working at it. The numbers themselves are truly independent of any method of representation. Numbers seem to be what inspired Plato to talk about Ideals, and numbers do exist, in a way, independently of any physical reality.
Getting back to the case at hand: so long as there is an agreed on convention of glyphs to express them, numbers can be represented in any arbitrary base. Numerically, there is no difference between a bitstream 125 MB long and its representation as two digits in some number system with a large base. It doesn't matter whether a computer can work directly with a particular number base or not-- that doesn't invalidate that method of representing numbers.
Re:Related prior art by Ed_1024 · 2006-11-26 22:07 · Score: 1

...And if I scan this printed document in at 300 * 300 * 10 * 8 @256 colours, I can contain all that information in 7.2MiB. Wow! I've compressed GIGABYTES down to a few megs, with NO LOSS!
Re:Related prior art by ZorbaTHut · 2006-11-26 22:22 · Score: 1

Why will it be the 181st glyph?

I agree that everything else you've said there is true. I simply disagree with your beliefs on how large the bases can be. A bitstream 125MB long would require two 524288000-bit numbers, or base 2^524288000. (Which is a hell of a number.)

--
Breaking Into the Industry - A development log about starting a game studio.
Re:Related prior art by mysticgoat · 2006-11-26 22:39 · Score: 1

Nope. You just don't get it.
The number is not its representation in the computer. The number is separate from any particular representation. When it is on paper, it doesn't matter how many bits or bytes it might take to represent it in the computer-- it isn't in the computer. It is on paper. The 3 digit base 32,640 number that is one more than the largest 2 digit number in that system is the same number as when it is expressed as a 127 MB long series of bytes in a computer system.
Is it so hard to understand that non-digital media can sometimes express concepts like numbers in a way that is fundamentally different than the expression within a computer? And that sometimes there could be terrific benefits to using these non-digital expressions?
Re:Related prior art by mysticgoat · 2006-11-26 22:51 · Score: 1

Why will it be the 181st glyph?
Actually I was wrong; it will be the 182nd glyph. I did a one-off error by not counting zero as the first glyph. (the number you gave me converted to 181 decimal when I copy / pasted it into a conversion utility that I found with Google.)
I'll stipulate that your math is correct, and I agree that this is the way the number would need to be represented in a binary computer. But the number itself is independent of the way it is represented. And when we represent it on paper, we are not confined to the same conventions as we use in the computer. So long as we agree on a convention of glyphs for the numerals, we can use any arbitrarily large number base for the expression. It will still be the same number, and anyone who wanted to convert it back to into binary would get the same bitstream.
Now its long past my bedtime.
Re:Related prior art by ZorbaTHut · 2006-11-26 23:00 · Score: 1

Aha - you pasted "10110101 10011010 10010100 01101111 11010100" into a binary-to-decimal converter, am I right?

Your binary to decimal converter doesn't strip whitespace, and terminates when it hits the first space. "10110101" is equal to 181. "1011010110011010100101000110111111010100" is equal to 779982499796. I'm pretty sure that doesn't fit into a set of 32,640 glyphs.

Again, you may want to recheck your math.

--
Breaking Into the Industry - A development log about starting a game studio.
Re:Related prior art by mysticgoat · 2006-11-26 23:04 · Score: 1

From a simplistic point-of-view, if you could print a sheet of paper using an ordinary inkjet with that amount of information encoded on it, then logically that information has also been held in the printer RAM. Certainly my printer does not contain 29GB or whatever...
I think you are approaching the truly interesting part of what Sainul has claimed he has done. Obviously the data is being compressed before it goes to the printer buffer, and expanded after it comes back from the scanner. Converting from binary to images of numerals in some number system with a large base and back again has got to be computationally intensive. I think his breakthrough is not so much in stuffing that much data onto a page as it is in doing these conversions in a reasonable amount of time, without melting the CPU.
That has not been addressed in any of the articles I've seen so far.
Re:Related prior art by mysticgoat · 2006-11-26 23:33 · Score: 1

That makes sense. I thought I had stripped out the whitespace, but clearly I missed the first one. I would like to hope that if it wasn't so late at night, I would have realised that 181 didn't make much sense.
779982499796 / 32640 = 23896522.7
0.7 * 32640 = 22848, so the last digit will be the 22,849th in the set
23 896 522 / 32 640 = 732.123836
0.123836 * 32 640 = 4042.00704 the digit to the left of that will be the 4,043rd in the set
the first digit will be the 733rd in the set
So it becomes a 3 digit number in the large base. Once you decide what glyphs you want to use, you can write the number out using the above directions. Just note that the conversion wasn't clean due to the limitations of my 32 bit computer, so the last digit might be off.
The more I think about this, the more I think that Sainul must either have a bunch of question marks in the steps where the conversions between binary and image need to take place, or he's cooked up some remarkably fast and efficient conversion routines. This process has got to be more limited by CPU activity than by the limits of the paper storage.
It is waay past my bedtime and I don't do all nighters very well any more.
Re:Related prior art by ZorbaTHut · 2006-11-26 23:45 · Score: 1

So now you've actually just proven my point. I said that you'd need two 32768 base digits and a 1024 base digit. (The numbers are slightly different when you're working with 32640 bit digits, obviously, but they're quite close.) Which is roughly what you ended up with. So you need three digits for a 40-bit number.

See why I'm a little skeptical of your claim that you can cram a 10kilobyte file - eighty thousand bits - into two 32,640 base digits when you can't even squeeze 40 bits into two?

See why I utterly disbelieve you when you claim that a 125 MB file - a whopping billion bits - can be crammed into those same two 32,640 base digits?

If you scan an 8.5" by 11" piece of paper at 300dpi, with 24 bits per pixel, you get slightly more than 24 megabytes. No matter what calculations you do, no matter how much CPU power you spend, you will never be able to retrieve more information than that from the piece of paper. That's simple mathematics and computer science.

(This assumes your data is already compressed - you can of course compress terabytes of 0's into a very small amount of space, and easily store that on the piece of paper. But then you're still not storing very much information.)

--
Breaking Into the Industry - A development log about starting a game studio.
Re:Related prior art by mysticgoat · 2006-11-26 23:50 · Score: 1

I'm just going to take a moment here for a couple of points.
First, I think that you are right in that the math would be the same if two dots placed side by side were used. But I'm not sure of that, and also I felt it was easier to describe one foreground dot in the center of a background than it would have been to talk about it any other way.
Second, you state that bits store states and not combinations, and I won't argue with that one way or the other (though I think it would be sort of fun to see how you would explain that).
The thing is, that's irrelevant. We aren't talking about storing the number in the computer. We're talking about painting it on a piece of paper, where bits and bytes and such don't apply. On paper, with color ink, we've got storage bins that can be in many more states than just binary. Part of the reason is because we can assign meaning to combinations of different colors, and there are a lot of possible combinations. When we bring the number back into the computer we'll want to convert it back into a binary expression, but the constraints of binary expressions don't apply to that number while it is sitting out there on the paper. It can be colorful and concise Out There, even though it becomes colorless and quite long in the computer.
Re:Related prior art by Captain+Zep · 2006-11-27 00:18 · Score: 1

Parent is right - and I can't see why this isn't the obvious way of looking at it.
> On paper, with color ink, we've got storage bins that can be in many more states than just binary.
The parent already considered that. 256 distinguishable colours are being assumed in this thread so, as he said, each dot has 256 possible values, i.e. it encodes 8 bits.
> Part of the reason is because we can assign meaning to combinations of different colors, and there are a lot of possible combinations.
Yes there are. For example let's combine two dots as a 'combination'. Each dot can have 256 values, so there are 256*256 = 65536 unique combinations in total. Which is 16 bits worth. I.e. 8+8, 8 bits per dot.
So why does looking at it as a combination help?
Now when it comes to adding error-correction, you need to start restricting the permissible patterns, but that's nothing to do with the theoretical capacity.
Z.
Re:Related prior art by ComaVN · 2006-11-27 00:58 · Score: 1

I suggest you find some venture capitalists to explain this radical new math to, try to build this magical compression algorithm with loads of his cash, and bail out to live on a desert island somewhere.

Or you could just accept that your math is flawed, you confused number of permutations with data size, and go on with your life.

You're starting to remind me of that Timecube guy.

--
Be wary of any facts that confirm your opinion.
Re:Related prior art by sevenofnine · 2006-11-27 01:21 · Score: 1

Remember to turn the paper and print on the other side aswell :p
Re:Related prior art by Captain+Zep · 2006-11-27 01:39 · Score: 1

I meant grand-parent is right.
Which is now the grand-grand parent...
Z.
Re:Related prior art by Beowulff · 2006-11-27 01:48 · Score: 1

I think I know where the miscommunication is happening. First of all, you are right: the number doesn't change when you change bases or representations. Unfortunately, you are wrong in thinking that it does reduce the amount of information encoded in the number: changing base or representation also doesn't change the amount of information in the number. Let me explain.

Let's start with your 127 MB file example, but let's make it 128 MB for the nice roundness. I imagine that we can agree that 128 MB = 2^7*2^10*2^10*2^3 = 2^30 bits. I think we can also agree that with this much bits you can describe 2^(2^30) (note the nested exponent) distinct, unique numbers (that's a huge amount!), and the largest number that can be expressed would be 2^(2^30) - 1. Of course, you don't need to represent the number as a binary number. If you want to represent the same number of unique distinct numbers as, say, a four-digit number (just because it's easier math than your 3-digit example), this would mean that you need 4 digits of (2^30)/4 = 2^28 bits each, each of which can represent one of 2^(2^28) distinct, unique numbers. You could say that you have created a number represented by four base-2^28 = base-268435456 digits. You now only need to write out just 4 (be it complicated) glyphs to paper. OK sofar. Now you suggest that by writing only four digits, instead of the 2^30, you can reduce the data volume needed to store such large numbers. Unfortunatly, that won't work.

Here's the catch: there are 2^(2^28) distinct glyphs necessary, one for each of the 2^(2^28) possible numbers. Each of these glyphs must be completely unique, and completely distinguishable from all the others. So how are you going to distinguish your glyphs? You do that by giving glyphs characteristics when you write them, and check for those characteristics when you read them. These may include color, shape, rotation, etc. Now, suppose you have a device that prints colored dots, and another device that can read dots and determine their color. With these you can read and write glyphs that consist of colored dots. The dot is the smallest unit we can print or scan, so the most efficient we can go is to build glyphs out of dots, right? I doesn't even matter whether we arrange the dots in lines, or squares, or something esle, we just need to define some group of dots as a glyph.

Suppose we can distinguish C distinct colors (for instance C=2^48, for 48 bit scanners, assuming they are perfectly accurate). After reading the first dot, we still don't know what glyph we are reading, but we have narrowed it down to all glyphs that start with a dot of that color, which reduces the remaining search space by 1/2^48. Easy so far. How many more dots do we need to read to narrow down our search to a single glyph? After having read N dots, we've narrowed the search space down to S(N) = (2^(2^26))/((2^48)^N). Solving S(N) = 1 gives N = 2^26/48. That is how many dots you need to print one single glyph (quite a lot of dots). To print four glyphs, you'll therefore need 4*N = 4*2^26/48 = 2^28/48 dots. Agreed sofar?

Then here's the punch line: that is exactly the same number of dots you would need to directly encode our original 2^28 bits binary file: 48 bits per dot (2^48) gives 2^28/48 dots. So you have not reduced your data by converting to a different base.

To summarize: indeed, you are right, a number stays a number, no matter in what base you represent it. However, the amount of information a number can encode does not change either. Using a higher base means that you save in the number of digits you need, but the complexity of your glyphs will go up accordingly, and you'll end up with the same total amount of information. That's also why binary works and is used : if it doesn't matter what base you have, go with the simplest: base 2.

Hope this helps relieve the argument somewhat :)
Re:Related prior art by ConceptJunkie · 2006-11-27 02:57 · Score: 1

I misspoke. It was late and I was being too lazy to show my work, which would have allowed me to spot my boneheaded mistake.

I meant to say (number of colors) ^ 9. But the principle remains. I have no idea what you mean by unique selections of 2 colors from 256 colors (which would be 256 * 255, unless using the same color twice were allowed). If I have 9 pixels, each pixel can only represent 256 colors, or in terms of pure data, 1 standard, government-issue M1A1 _byte_. That's 9 bytes. There's no other way to do it. 72 bits.

If I have a piece of paper w x h inches in size and can print at a certain DPI (call it r) with z different colors, there is no way to encode more than

w * h * r^2 * ( log (base 2) z) bits, and that number is most certainly orders of magnitude less than 250GB.

In fact, let's do some numbers to see:

There appears to be some confusion as to what is meant by A4, so let's assume 9 x 12. Even if A4 is double that, the result will only be off by a factor of 4.

9 * 12 * 1200 * 1200 = 155,520,000 pixels on a 9 x 12 sheet of paper, assuming you can perfectly print and perfectly read at 1200 DPI, which would be a real feat for anything less than the kinds of presses they use for books and magazines, and a _really_ good scanner, not your $100 Best Buy bargain. Note that doesn't allow for any error correction.

Now, assuming 8-bit color you now have 155,520,000 * 8 bits or 155,520,000 bytes. That's not even a CD-R's worth of data and 3-and-change orders of magnitude less than this silly claim. ANd that's not taking into account the enormity of being able to flawlessly print and scan 1200DPI, 8 bit color, which would be unattainable with any consumer-grade equipment.

Ah, but what about compression? Who cares? It was BS when all the tape drive companies doubled their capacity claims based on the spurious claim that data you back up compresses to half its original size. Back when I used a tape drive, most of the stuff I was backing up was already compressed. Second, there is no compression algorithm that can compress arbitrary data by a factor of 1000. It's simply impossible.

The claim is complete and utter bull-plop, and your more detailed explanation makes no sense whatsoever. Check the story tags, and you'll see what the majority of people realize. To quote Futurama, "This is pure, weapons-grade Bolonium".

--
You are in a maze of twisty little passages, all alike.
Re:Related prior art by fbjon · 2006-11-27 03:09 · Score: 1

So you get a number of permutations per box, times a lot of boxes, gives a large number of bits. But all those permutations of dots in boxes correspond to permutations of dots looking at the whole sheet of paper. I don't see how you win anything, it's still just a number of dots==bits on the paper regardless of 3x3 boxes?

--
True confidence comes not from realising you are as good as your peers, but that your peers are as bad as you are.
Re:Related prior art by mysticgoat · 2006-11-27 03:34 · Score: 1

What can I say but that in the clear light of early morning I now see the error of my ways? I have learned something here and my thanks go to you and others.
I was confounding the possible values a bit stream can store with its length. Oh my.
I might respond on this thread again in a couple of days.
Re:Related prior art by pentalive · 2006-11-27 05:25 · Score: 1

If you are going to be concerned about the shape used inside the box (rather than just the difference between the foreground and the background) you need to throw out all the combinations of the same color on foreground and background.

I also think you will need to reduce your bit depth some, Printers and scanners have different gamuts
http://en.wikipedia.org/wiki/Gamut when you print color 128, the scanner sees color 112, so maybe only
64 or 128 colors.
Re:Related prior art by Anonymous Coward · 2006-11-27 08:31 · Score: 0

The familiar sound of bullshit. Music to my ears.
Re:Related prior art by ZorbaTHut · 2006-11-27 10:08 · Score: 1

*grins* No problem. 'Least you figured it out and admitted it :)

I know I've done dumb things pretty often too.

--
Breaking Into the Industry - A development log about starting a game studio.
Re:Related prior art by kasperd · 2006-11-27 17:22 · Score: 1

Google is your friend
If all my friends payed me as much money as Google, I'd be a wealthy man.

You could take a typical business document and treat it as a long binary number and convert it to base 32,000+ and store it on paper in 2 or 3 digits.
Must be an unusual busines you are in when no document is ever more than 6 characters.

--

Do you care about the security of your wireless mouse?
Re:Related prior art by Gospodin · 2006-11-28 04:39 · Score: 1

Right - 32,000 states, which is only 15 bits of information. Get it? You're confusing states with bits. There are always vastly more states than bits (since #states = 2^(#bits)). Think about this a little bit before posting further, please.

--
...following the principles of Heisenburger's Uncertain Cat...
Re:Related prior art by Anonymous Coward · 2006-12-02 07:21 · Score: 0

Think about this a little bit before posting further, please.
Or said in other words: It is better to remain silent and be thought a fool, than to speak aloud and remove all doubt. That is something lunaticgoat should think a little bit more about.

Softstrip by Threni · 2006-11-26 04:50 · Score: 1

Sounds similar to a system called Softstrip from 20 or so years ago.

http://rich12345.tripod.com/museum2/softstrip.html

I find the comments amusing. by Khyber · 2006-11-26 05:02 · Score: 4, Funny

Hey guys, remember back in the day when we stored data on paper using HOLES?

I wouldn't be so quick to say this is a scam.

--
Still waiting on Serviscope_minor to wake up to fucking reality and realize that Jessica Price isn't going to fuck him.

Re:I find the comments amusing. by Anonymous Coward · 2006-11-26 20:15 · Score: 0

Sure, I remember storing data on paper with holes.

And I happen to have a bag full of assorted paper tapes sitting on my living room couch. Also I have a ruler.

It's 80 bits per square inch.
Re:I find the comments amusing. by mikiN · 2006-11-27 00:23 · Score: 1

Eureka! You've found the solution! It's not about storing data on the paper, it's about storing data by removing paper!
Less data == Fewer holes. More data == More holes.
More holes == Less paper.
So, More data == Less paper!

I wonder what would happen if you just cut away all the paper to make one big hole.

<chuckle>

--
The Hacker's Guide To The Kernel: Don't panic()!

Who else here remembers... by solitas · 2006-11-26 05:09 · Score: 2, Interesting

...the old 'softstrip'(?) barcodes from BYTE Magazine? And wasn't there something in NIBBLE too? Did anyone else here ever use them, and what stories do you have about them?

I remember _finally_ finding an HP barcode wand at a show and hooking it up to my Apple][+. I wrote-up a reader and had a lot of fun doing it and used it to load their programs (ASCII BASIC) - I was surprised at how efficient it was (for the times). It beat the daylights out of cassette storage (couldn't afford a disk drive at that time) and it led me to writing an output application so I could barcode all my work to the printer for storage. Most of those pages are still legible after all these years.

Good times.

--
"It's time to take life by the cans." ~ Bender ("Bendin' in the Wind", ep. 3-13)

It's trivial by fph+il+quozientatore · 2006-11-26 05:12 · Score: 1

dd if=/dev/zero of=foo bs=1G count=500 gzip foo uuencode foo lpr foo et voila, 500G of data printed on a single sheet.

--
My first program:

Hell Segmentation fault

Re:This is a lie by Anonymous Coward · 2006-11-26 05:16 · Score: 0

There is no such thing as dpmm (dots per millimeter).

Umm.... Yes there is. You take the units "dots" and divide by units "millimeters" and magically get the units "dots per millimeter".

You should really take a physics or chemistry course so you could have a clue about what you're talking about.

An upper bound by TerranFury · 2006-11-26 05:22 · Score: 5, Informative

Here's an upper bound as a check on your numbers (not restricting ourselves to a small dictionary of shapes). I'll give away the punchline: My numbers agree with yours, but 256 GB is not possible using printers and paper.

Assumptions: I use your printer linear resolution of 1200 dpi, and assume that adjacent pixels can be resolved at this resolution. I also assume that 256 different colors can be distinguished, as you do, and that the paper we are using has an area of 96.6763 inches^2, also as you do.

Calculation: With a linear resolution of 1200 dpi, one can fit 1440000 dots per square inch (Check!), and so 139213872 dots on a sheet of A4. With 256 colors we can store a number as large as (number_of_colors)^(number_of_dots). So:

256^139213872 = 2^N (where N is the equivalent number of bits)
(2^8)^(139213872) = 2^N (recognizing that 256 = 2^8)
2^(8*139213872) = 2^N
N = 8*139213872 (bits)
(and if we just divide by 8 again to get bytes...) => 139213872 bytes = 139 MB

Discussion: This theoretical upper bound is three orders of magnitude smaller than what is being claimed by the article: It is not possible to store 256 GB on a sheet of A4. My result does however agree with your result in that the inequality (my_result)>(your_result) holds, as it should. Ad it's really not too shabby: Accounting for 8-to-14 conversion for some error correction, we can store slightly under 80 MB in this way.

Different assumptions: If I instead use your 2000 dpi laser printer figure, then I can fit 4000000 dots per square inch, and so 386705200 dots on a sheet of A4 and so almost 386 MB. (Including error correction, one might store 220 MB.) Pretty impressive!

The Absurd: Right now, many modern semiconductor fabs have working 90 nm photolithographic processes. That means that the smallest feature size is 3.54330709×10^(-6) in, and the linear resolution is about 282222 dpi. If all we print is the first metal layer, then a dot can either be "with metal" or "without metal" -- that is, one bit. And on a silicon wafer with an area the same as that of a sheet of A4 paper, we can then fit 7700207603555 dots, or 962 GB. Hard drives are about halfway there!

Re:An upper bound by Lothar · 2006-11-26 06:30 · Score: 1

You obviously missed the point of the article. Here is my take:

The data is divided into cells. Each cell is 3 X 3 Pixels.
There are therefore 256 combinations of 3 X 3 cells assuming the center pixel is color fixed:

0

1

2 ...

256

512 colors should be reliably printable and readable:

256 shapes X 512 color depths = 131072 bits per 'cell'

at 1200 dpi, 400 *400 cells could be available per inch
400 X 400 = 160000 cells per square inch

160000 cells * 131072 bits per 'cell' = 20971520000 bits per square inch
20971520000 / 8 = 2,621,440,000 bytes per inch

8.5 * 11 paper = 93.5 square inches
93.5 * 2,621,440,000 = 245104640000 bytes per page or 245 GB per page

Therefore, 256 GB of storage per sheet of paper should be reasonable with current technologies considering that photo paper has a very fine grain and current scanners and printers can easily surpass 1200 (even 2400) dpi.
Re:An upper bound by Lothar · 2006-11-26 06:34 · Score: 1

The 3x3 figures in post above should be: --- -x- --- 0 x-- -x- --- 1 -x- -x- --- 2 ... xxx xxx xxx 256 There you go. XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
Re:An upper bound by w3woody · 2006-11-26 06:43 · Score: 1

256 shapes X 512 color depths = 131072 bits per 'cell'

Bits don't multiply that way.

256 shapes = 8 bits, 512 color depths = 9 bits, since 8 bits and 9 bits encodes 2 ** 8 states and 2 ** 9 states, respectively. 256 shapes x 512 color depths = 131072 'states' or possible combinations, which represents 17 bits (8 + 9 bits) of information.

Using your assumption about the number of cells per square inch, you get 160,000 x 17 = 2,720,000 bits per square inch, not 20,971,520,000 bits per square inch as you state. On an 8.5 x 11 inch piece of paper, that means you would get 254 megabytes, not 245 gigabytes per page.
Re:An upper bound by TheThiefMaster · 2006-11-26 06:45 · Score: 1

You are assuming 512 colour depths = 512 bits, which isn't true. It's actually 512 VALUES, or 9 bits.
Ditto with shapes, 256 shapes is 256 values, not 256 bits. That's 8 bits.
So it's actually 9x8 = 72 bits/cell.
72 bits x 160,000 cells/sq in x 93.5 sq in = 1,077,120,000 bits, 134,640,000 bytes, or 128MB. Better?
Re:An upper bound by TheThiefMaster · 2006-11-26 06:55 · Score: 1

You forgot to divide by 8 for bits->bytes, giving a whopping 32MB to the page.
And, if the shapes are one single colour then you're right about it being 17 bits/cell, if each pixel of a cell can be a different colour then it become 8x9 instead of 8+9, giving 72 bits/cell, and around 128MB to the page. Still very short of 245GB
Re:An upper bound by Lothar · 2006-11-26 08:41 · Score: 1

I concede. Shows what happens when you cut and paste stuff without reading....

http://www.digg.com/tech_news/Scam_of_Indian_stude nt_developing_technology_to_store_450_GB_on_paper
Read comment by "earl507"

Scam ? or ... Why voluntary imprecise ? by gibboris · 2006-11-26 05:24 · Score: 1

http://www.hindu.com/2006/10/08/stories/2006100800 021100.htm ("last modified" seems correct)
"I have achieved storage densities of about 2.7 gigabytes per square inch," Mr. Abideen told The Hindu over phone from Kottakkal in Kerala.

And sooner, in September :
http://www.deccanherald.com/deccanherald/sep62006/ cyberspace163748200695.asp (but "last modified" is november 26...)
(But has been linked by some blogs the 19 September)
http://www.stingygaming.com/forum/viewtopic.php?p= 165
( the photo was on imageshack ... http://img319.imageshack.us/my.php?image=rainbowte chzz4.jpg )

I never think to this technology but the better way to discredit it is this kind of re-publication of an imprecise article.
Concepts stay very exiting even if storage place is in the order of the hundred of MB.

About the dot precision, laser printers (Xer.. ?) do some miracles :
http://www.eff.org/Privacy/printers/
http://www.eff.org/Privacy/printers/docucolor/inde x.php

Could you believe that ?! :)

250gig? by HermMunster · 2006-11-26 05:26 · Score: 1

The first time I read a related article several days ago they were professing something like 450-475gigs not 250. Wonder who's got their story wrong? Some comments regarding those were that it was not possible or even more possible with color coded binary.

--
You can lead a man with reason but you can't make him think.

Impossible. by sbaker · 2006-11-26 05:31 · Score: 1

Suppose for a moment we have 10" square paper (8.5" x 11") 256Gbits translates to a data density of 2.5 Gbits per square inch. If you have a really nice colour printer you might get 500 dots per inch - so you need to encode 2.5Gbits using 250 thousand dots. So each coloured dot has to store 10,000 bits of information. That's a pretty clever trick for a colour that's only described using a 24 bit RGB value.

So the best you could POSSIBLY do with a 500 dpi printer would be 24bits x 500 x 500 x 8.5" x 11" = 561 Mbits.

But that assumes that the colour reproduction of both printer and scanner are 100% perfect - which they most certainly aren't!

You'd need a 100% perfect 10,000 dpi full colour printer AND an equally perfect scanner.

All the crap about storing different shapes is irrelevent - even using every possible colour of every possible shape/pattern/whatever only gets you to a half a gigabyte in theory and maybe one megabyte in practice.

This is incredibly bogus...total BS.

--
www.sjbaker.org

The problem there by Moraelin · 2006-11-26 05:41 · Score: 1

The problem there is that books:

A) are very low resolution, so a speck of dust here and there, or even a hole/stain/discolouration here and there, isn't all that much. When you "encode" the letter "A" as a big hand-written letter, there is a lot of space it's spread over. Even without any additional hints (see next points) you could destroy a quarter of that area and still take an informed guess that it looks like an "A". By comparison, the same surface destroyed in this guy's encoding scheme (if it weren't a scam, which it is), you've lost a few megabytes worth of data.

B) have a much higher tolerance to stains, discolouration, etc. If you were to print this message and spill coffee over it, chances are that you could still tell there's an "A" here, because you only have to deal with two colours. If it looks like a darker "A" on brown, it's still the same as an "A" on white. But when you have thousands of superimposed shapes of all colours, a brown triangle on yellow may well be a whole different thing than a cyan triangle on white meant before the coffee spill.

C) writing and human languages have a great deal of redundancy in them, so you can still use the context to guess when bits are missing. If you have a hole in the page and you read something like "ye olde m_nks" you can take an informed guess that it was "monks". You also have the historical context there. If you were to read somewhere something like "N__ths__re Abbey", you could guess that it probably meant "Northshire Abbey", because we don't know any other that fits those letters.

Binary data doesn't have that luxury. By comparison, when you miss about an 1000x1000 bits area (and that may well be less than one square millimeter at this density -- again, if it weren't a scam anyway), even if you encoded each sector with a generous ECC code and stored checksums on each row _and_ column, you're still basically boned. It's not going to repair that much missing data.

D) conversely, writing and human languages have a certain "locality" and contexts only span so far. If the half the page about Northshire Abbey was missing from the Domesday Book, you could still read the next entry just as well. Even if half the book went missing, we could still extract _some_ info from the other half. In binary data, you don't always have that luxury. An exe where half of it is missing, is pretty much just screwed. A Domesday database where one of the normalized tables is gone, may well become nearly useless, depending on which table was lost.

Now none of those are necessarily the end of the world (or not as much as the fact that this is just as scam), because you can store the information repeatedly all over the place, and build baroque structures on ECC on top of other ECC, to the point of achieving the same effect. But by that time, you've lost most of the advantage anyway. And you've certainly lost the advantage compared to using a more robust medium in the first place, and not needing as much redundancy in the first place.

--
A polar bear is a cartesian bear after a coordinate transform.

Re:The problem there by BeerCat · 2006-11-26 08:56 · Score: 1

It's a fair cop, guv. I'll come quietly. My "back of a fag packet" calculations were off by a couple of orders of magnitude.

On the other hand, I think that pretty much summarises why this wouldn't work well, (even if, as you say, it wasn't a scam, which it seems to be).

Take a gold star.

--
"She's furniture with a pulse"

and image on paper? by Anonymous Coward · 2006-11-26 05:53 · Score: 0

wow, who would have thought that you could store an image on a piece of paper?

incredible

I've always defended Slashdot, but.. by Peter+Cooper · 2006-11-26 05:57 · Score: 5, Insightful

In this Digg generation, I've still kept reading Slashdot. The community here feels a lot nicer (surprising, I know!) and a lot more clued up. It's just a shame, then, that idiotic stories like this get posted. Usually I wouldn't whine about a bad story, but it was an hour or two before this story hit that I read the whole "why it's a scam" story on Digg.. so I read how stupid something is on Digg, only for it to be posted seriously here at Slashdot.

It's time for some sort of shakeup with editorial at Slashdot. Digg is imperfect and a lot of the users are idiots (I'd certainly say the average Slashdotter is significantly more intelligent and clued-up) but Slashdot is slow and has a poor editorial process. Could we, perhaps, strive to produce something with the perfect mix of the two? Fast news, the ability to vote, etc, but coupled with the superb Slashdot audience? It's all false hope, I'm sure, but I have more hope in people than technology.. so Slashdot is just the place to bring this up IMHO.

Re:I've always defended Slashdot, but.. by Lord_Dweomer · 2006-11-26 06:59 · Score: 2, Insightful

While I agree that Digg's quicker updates make it more "relevant", and that Slashdot indeed needs a shakeup with its editors there is a reason Slashdot is still superior. In the Digg comments for that story, most of the people quite obviously have no science to back up their responses. On Slashdot you will find some of the most thorough scientific debunking on the net. You see, not only do the intelligent people on Slashdot hate to see crap like this posted, but they also hate for people to read it and go away believing it. So they do their due diligence and make a post explaining why, scientifically, something is a load of crap.
And yes, there are plenty of pseudo scientists who post their crap along with the real ones, but the good ones tend to be validated by others and filter to the top. So while I read Digg to stay up to date on the latest greatest 5 minutes of distraction, I come to Slashdot to read the discussions and learn the varying viewpoints on things.

--
Buy Steampunk Clothing Online!
Re:I've always defended Slashdot, but.. by ecuador_gr · 2006-11-26 16:48 · Score: 1

Agreed, this is one of the articles that prooves there is something wrong with the editorial system of slashdot. We do have all this fancy meta-moderation stuff, but how about some meta-editing? You can't have an army of editors, but how about an army of willing meta-editors that can vote off dupes and lame articles before they go public?
Re:I've always defended Slashdot, but.. by Concerned+Onlooker · 2006-11-26 18:56 · Score: 1

Maybe, but I say it's been really interesting and fun to read the posts with all the math explaining why it can't be done. It reminds me of the old days when people here argued about tech stuff rather than making personal attacks.

--
http://www.rootstrikers.org/
Re:I've always defended Slashdot, but.. by Anonymous Coward · 2006-11-27 01:57 · Score: 0

I'd love to agree with you, but a huge proportion of people posting here seem to actually believe this nonsense.
Re:I've always defended Slashdot, but.. by zufar · 2006-11-27 04:33 · Score: 1

that would be reddit.com perhaps :)

Name for this invention... by Anonymous Coward · 2006-11-26 06:01 · Score: 0

Times New Roman 0.0002pt.

Too much TV by camba · 2006-11-26 06:04 · Score: 1

This guys watched too many episodes of Prison Break. Next season the tatoo will hold the Library of Congress!

Please correct me if I'm wrong. PS I Need a Job by Anonymous Coward · 2006-11-26 06:08 · Score: 0

16QAM modulation gives 96 bits per symbol utilising only 16 binary dots per symbol.
Adding another 16 bit dimension (color depth) gives 96 times the number of bits.

So. out of 32 dots on the paper(remember, it's orthogonal in 3 dimensions) you get 8716 bits. Lets call it 1kbit.

300DPI is 90k dots/square inch by 90 square inches on a A4 sheet.
You have now 8,100,000 dots
Divide this by 16 dots for our QAM scheme and you have approx 500k symbol sets.
multiply this by the 8kbytes/symbol set(don't forget the color depth is in this 8k) and

4000 megabytes assuming every dot and color is 'demodulated'. Of course this isn't the case, but it's pretty clear that there is plenty of space for redendancy and this 'paper and ink' stuff is a lot less noisy than radio.

Now I have not thought 'long and hard' about this, maybe 10 minutes in addition to the 2 finger typing time. But, it's pretty obvious to me that even if the dude is lying, it can be done.

PS. If anyone wants to hire me, I need a job...

-Duck

impossible by Anonymous Coward · 2006-11-26 06:31 · Score: 0

if it was possible. then scan say 10 a4 size papers containing few gb data in 30 M pixel in digital form. then store it in pc. then incode the again digital data in pc.... and repeat the process...

infinity storage huh!?

Close, but not quite. by jfisherwa · 2006-11-26 06:34 · Score: 1

Close, but not quite analogous. A lossless bitmap file needs 24 bits per pixel to store color information. Actually printing on paper would theoretically allow you to map 24 bits against a single colored dot, providing a form of compression.

Re:Close, but not quite. by kocsonya · 2006-11-26 13:23 · Score: 1

> Close, but not quite analogous. A lossless bitmap file needs 24 bits per pixel to store color
> information. Actually printing on paper would theoretically allow you to map 24 bits against
> a single colored dot, providing a form of compression.

Assuming that you have a printer that actually uses 16M colour shades for each pixel. Which they don't, at least the commodity printers (laser or inkjet) do not. You can instruct an inkjet printer to deposit ink at a location or not. Most inkjets use 4 inks but the black is used only when otherwise you would deposit the other three simultaneously. Thus, you really have only 8 colours to paint a single dot to. When the printer (driver) renders a 24-bit image it uses dithering or error diffusion to cheat - it trades in spatial resolution for colour. The printer's resolution is much finer than the feature size of your image so it can use multiple dots to average the actual colour you want.

So yes, the printer represents multiple bits per pixel, but its 3 bits per pixel rather than 24. Some use more than 4 inks, but they are still nowhere near to 24 bits per pixel (more like 5).
Re:Close, but not quite. by jfisherwa · 2006-11-26 17:31 · Score: 1

Hence "theoretically."

A good spatial printing scheme will use a higher dpi than represented at the printer level to obtain closer to 24-bits per dot at the selected output dpi, so while you will have 3 bits per "dot," it will seem closer (maybe half?) to 24-bits at a dpi resolution beneath the printer's maximum output. Sort of how a 1280x720 LCD actually has 3-4 times as many 'pixels' as represented, but can still reproduce 24-bits at normal viewing distances.

Of course this is true. by JMZero · 2006-11-26 06:50 · Score: 1

In other news, Sony sure employs a lot of idiots to make its BlueRay drives. They have to use blue lasers, a special medium, and lots of expensive parts to get the same storage density they could have got by having a spinning disc made out of paper and a cheap CCD.

--
Let's not stir that bag of worms...

Color fades *very* quickly. by Maxo-Texas · 2006-11-26 06:51 · Score: 1

Any exposure to light and you are going to start losing data.
You must use light to scan it.

This might be good for storage for under 2 years- can't see it for archival storage unless you greatly reduce the number of colors so fade won't matter.

--
She was like chocolate when she drank... semi-sweet at first and then increasingly bitter.

Re:Color fades *very* quickly. by ebers · 2006-11-26 08:18 · Score: 2, Informative

Depends on the pigments in the ink. Organic pigments get fried by UV, or even just trace ozone in the air, very quickly. But metal ion based pigments (lead, cadmium, iron...) can last almost forever. Too bad the used media would then be toxic waste.

Bullshit by BarnabyWilde · 2006-11-26 07:03 · Score: 1

I call "Bullshit". Makes no sense. If you encoded every single fiber visible on a page, you'd not have the claimed storage.

PaperBytes by Kadin2048 · 2006-11-26 07:07 · Score: 1

I thought I had posted this before, but it seems as though Slashdot ate my post...

Anyway, information about PaperBytes is available here. They also made a more sophisticated, 2D version of it, PaperDisk (circa 2001). It had a data density of around 140 bytes per square centimeter, or around 84,461 bytes on an 8.5x11 inch page.

--
"Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."

Re:PaperBytes by Bastard+of+Subhumani · 2006-11-26 09:56 · Score: 0

or around 84,461 bytes on an 8.5x11 inch page.
That ought to 28% more than enough for anyone!

--
Only three things are certain; death, taxes, and apocryphal quotations - Ben Franklin.
Re:PaperBytes by metroplex · 2006-11-26 11:32 · Score: 1

haha, wish I had my mod points!

--
"Words of wisdom: drop that zero and get with the hero" -- Vanilla Ice

Cauzin Softstrip format by steveha · 2006-11-26 07:08 · Score: 1

I believe it was "Dr. Dobb's Journal" that used to publish code that could be scanned, sort of a variant on barcodes.

Several magazines did this. They used Cauzin Softstrip format. Wikipedia didn't have an article on Cauzin Softstrips so I just added one: http://en.wikipedia.org/wiki/Cauzin_Softstrip

I fail to see how a binary pixel can fail to take less space than a printed geometric shape. You can squirt an ink dot a lot smaller than you can a recognizable microscopic shape.

You also need to take into account the real-world issues, such as fibers in the paper wicking the ink and making your precise geometric shapes into fuzzy blobs. Careful bit patterns, with error-correcting bits, ought to be a lot more efficient than geometric shapes.

steveha

--
lf(1): it's like ls(1) but sorts filenames by extension, tersely

Humm. by BlahSnarto · 2006-11-26 07:13 · Score: 1

An A4 Sheet?

I could use that scanner to scan all my
vinyl LP's rather then scan in half's
and stitch them together..

Sounds like Arrrhhh! by Anonymous Coward · 2006-11-26 07:13 · Score: 0

"Besides, we can't even get B&W OCR right."

9 out of 10 Usenet pirates disagree with you.

The tiff? by DeadboltX · 2006-11-26 07:23 · Score: 1

How big in MB is the TIFF file?
Instead of printing it out and scanning it, why not transfer it on a USB stick?

Re:The tiff? by DirtyRhodes · 2006-11-27 09:05 · Score: 1

My thoughts are close to this, what about PDF or Postscript format? If this deal is real, we are talking about mindboggling compression.

What is the largest size a single A4 sized PDF file can be?

--
"A keyboard?! How Quaint!"

Almost infinite data compression... Cool. by musther · 2006-11-26 07:41 · Score: 1

While I doubt this can be true, imagine if it were. Imagine taking a DVD movie, storing it on two square inches of image, well rather than print the image, save it as a compressed, lossless PNG. 500dpi is a good res for printed material, so a 1x2" image at 500dpi in 16bit colour should be about 300kb, that's great, I've just got a DVD onto a floppy. If that's not small enough, just make another rainbow representing the PNG image, then make a PNG of that, it should be under 1kb. Do this as many times as you need to, even if my resolution of 500dpi is off, we can just repeat the stages a few more times. It is now possible to fit any data onto any storage medium I for one am very grateful to this guy and his team, as soon as youtube start using this compression technology, they will have abolished the problems of net bandwidth too.... YAY!!!

Data i circle by Anonymous Coward · 2006-11-26 07:54 · Score: 0

first thing: put 250 gb on A4, scan A4, PNG A4 to file, put 250 GB of PNG to A4... REPEAT
Second: but data in shapes is somethig far diffrent: take small simple circle, calculate PI, its infinit and has no repeating parts. Put PI in binary...

Am I wrong? by damarusama · 2006-11-26 08:18 · Score: 1

Or if this is true then dowloading a 240gig file will be as fast as downloading a a4 page high def graphic ? I mean if you can store that ammount on a a4 page, then you can scan it, and send it to someone by e-mail and then the person can print it out and get the data. But again, why scan and print, can the computer direcly compute the data from a digital file ? Why is this article about using paper instead of CD?? Shouldn't it be about a new way of storing data in a visual format, who cares what you do with it after ?? Printing and scanning the image containing 240gig of data seems to be so away from the fact that if we can store 240 on a graphic the size of a a4 page, then digital transfer is drastically modifyed... Is't this like the best encryption and compressing technology ever ???? well I might be just wrong also or I just don't get it

Re:Do The Numbers --- But do them right by mypalmike · 2006-11-26 08:34 · Score: 1

Units, units, units!

> (1200 pixels/in * 1200 pixels/in) / (3 * 3 pixels^2 / shape space) = 160,000 [ shape spaces / in^2 ]

> 160,000 shape spaces / in^2* 4 shapes / shape space = 640,000 [ shapes / in^2 ]

> 640,000 shapes / in^2 * (32 shades (6 bits) ** 3) = 20,971,520,000 [ shapes * shades * bits^3 / in^2 ??? ]

My calculation:
640,000 shapes / in^2 * (36 bits / shape[see note]) = 23,040,000 bits / in^2

[4 shapes encodes 2 bits. With 18 bit color, each colored shape represents 18 color bits * 2 shape bits = 36 bits]

--
There are 0x40000000 types of people: those who understand 32-bit IEEE 754 floating point, and those who don't.

152292 assumption by way2trivial · 2006-11-26 08:45 · Score: 1

binary..

now divide that by the number of colors you can discern later

let's say you have 152,292 colors

now only need one dot.

--
every day http://en.wikipedia.org/wiki/Special:Random

Re:152292 assumption by SnowZero · 2006-11-26 21:45 · Score: 1

You are off by a factor of 8845.

Hint: log2(N) != N

Scam. by PureRED · 2006-11-26 08:50 · Score: 1

This has been proven to be a scam. It needs to be removed before people begin believing that this is even plausible. http://itsoup.blogspot.com/2006/11/scam-of-indian- student-developing.html

Re:RTFA by Compuser · 2006-11-26 08:52 · Score: 2, Informative

Strictly speaking he could use many colors. The resonable width of color spectrum is around
1000 nm and good optical filters will give you a window of 2 nm bandpass, so assuming he used
500 wavelengths/colors he could store 700 Gb per page. Also, I am aware of prototype, in the lab
printers (by Canon) which do 9600 dpi (Google it), so pushing technology to its limits and
cost notwithstanding you could write 31 Tb on an A4 sheet. And I am pretty sure one can make
this work for not much more than a yearly budget of one National Lab in the US :).

Senior BCE Project by PhoenixHack · 2006-11-26 08:55 · Score: 1

I worked with a classmate to develop a paper disk system more on the order of what has been discussed in this board than what the article describes - dots, not shapes, and theoretical maximum capacity on the order of hundreds of megabytes, not gigabytes. It was essentially a larger version of Datamatrix.

Practically speaking, we were shooting for tens of megabytes: 600 dpi, 8-bit color resolution, 1-inch margins on 8.5x11 -> 600 * 600 dots / sq in * 1 byte / dot * 75 sq in printable area / sheet of paper = about 26 MB, not including error correction.

My classmate put in the error correction which added some 25-33% to the data. We stayed with black and white and never had time to go any farther, but we did get it to work - maybe a few hundred kilobytes worth of some random file.

The unanticipated problem we ran into was skew - progressively increased rotation of the scan lines, a warped scan. To overcome it, we stumbled on the same solution the creators of Datamatrix used - a row and a column of alternating black and white dots used to establish independently-angled scan lines.

We never did do much with testing how rigorous it was; once we got the basic idea working we wrote up the paper and moved on. I believe we intended to keep working on it, mostly to spite the EE professor who arrogantly opposed the idea of paper storing anything near what CDs can store. But the intentions went nowhere as we both entered the military and got assigned to different continents.

I believe instead of sniping what is very likely a hoax, we should focus on the practical uses of paper as a digital storage device. We wondered whether companies might create posters at the front of the store - customers snap the photo with a PDA and use conversion software to extract a store catalog, special coupons, hyperlinks to advertisements. Billboards along the highway packed with audio clips. Letters to relatives with home videos included.

Some of these uses may seem more practical than others, but I'd like to hear more creativity along what can be done with tens of megabytes than proof after proof of why paper can't pack 256GB.

It's not the printing, it's the color by eyepeepackets · 2006-11-26 09:00 · Score: 1

Everyone seems to be focusing on the density of the printing onto the paper, but the key is probably the colors. The encoding algorithm probably uses color as symbolic representation of data.

x,y,z(color depth)

Assign color to represent data in the color range provided along with a coordinate system and viola, you have a practical, two dimensional representation of a three dimensional coordinate system with data depth provided on the z axis.

--
Everything in the Universe sucks: It's the law!

Comment removed by account_deleted · 2006-11-26 09:11 · Score: 1

Comment removed based on user account deletion

It works! by tcdk · 2006-11-26 09:26 · Score: 1

I just mime encoded a couple of dvd's and printed them with a point size of 0.00000001.

The important thing is to use an OCR font...

--
TC - My Photos..

Re:RTFA by maeka · 2006-11-26 09:32 · Score: 1

That is dots per inch, not pixels per inch. Each dot only being one color, the full color spectrum only created by halftones. So the maximum data to be stored on an 8.5"x11" sheet of paper using your 9600dpi printer is:
(8.5*9600)*(11*9600) dots with each dot being one of N+1 values
Where N is the number of colour tanks/toners your printer has. The +1 is for the medium (paper).
I'll be real nice and assume a 7 colour printer.
N+1=8 8 possible values = 3 bits
000=1
001=2
010=3
011=4
100=5
101=6
110=7
111=8
3 bits
So the math is 8.5*9600*11*9600*3 = storage in bits
3.23136 TiB is the theoretical maximum on paper with this 9600dpi printer.
Of course you would need a 19,200 dpi scanner...but that is child's play.

Thanks. by Brightest+Light · 2006-11-26 09:34 · Score: 1

Thank you for reminding me why I like the comments section of SlashDot better than digg.com in its entirety.

There's a flaw somewhere. by camperdave · 2006-11-26 09:57 · Score: 1

No. The article says a 45 second video clip. It does not say the size of the video clip. The 432 pages of foolscap (single sided, in 10pt courier with 1 inch margins) could fit onto a four square inch piece of paper at just over 2200 pixels per inch.

A 2400 PPI printer on letter size paper with 1/4" margins could put 8*10.5*2400*2400=483840000 bits, or 57 megabytes onto the page (twice that if you use both sides of the page). That's enough for a 45 second video clip, especially if you don't use a full screen for the video.

--
When our name is on the back of your car, we're behind you all the way!

Re:RTFA by osu-neko · 2006-11-26 09:58 · Score: 1

Ultimately, the triangles on the paper are really just printed as a bunch of dots.

Given that an ordinary printers can't print triangles, that is indeed how we normally fake it. Now, the question is, would his claims be credible if he actually was printing these shapes, not approximating them with a bunch of dots?

It seems that every "disproof" posted here starts by discarding half his claims, then proving it can't be done assuming he's only doing half of what he says he is doing.

This is still probably a scam, or he wouldn't be so tight-lipped about how he's doing it, he'd patent the process, simultaneously (a) publishing the details and (b) ensuring he makes a lot of money.

--
"Convictions are more dangerous enemies of truth than lies."

Re:This is a lie by camperdave · 2006-11-26 10:16 · Score: 1

"Imperial" units have been metric for decades. An inch is *DEFINED* as being 25.4 mm long, a pound is *DEFINED* as being 0.45359237 kg exactly. If you went to a country that used 37 and 59.5 cent coins, you'd think they were crazy. Well, the rest of the world is looking at the US's measurement "coinage" and is shaking their heads in disbelief. How do you people function over there?

--
When our name is on the back of your car, we're behind you all the way!

This is news to me. by Anonymous Coward · 2006-11-26 10:22 · Score: 0

I never knew Slashdot had an editorial process.

Well, to be Fair... by camperdave · 2006-11-26 10:33 · Score: 1

Well, to be fair, he could be printing on both sides of the page.

--
When our name is on the back of your car, we're behind you all the way!

No way.... by gweihir · 2006-11-26 10:39 · Score: 1

Even assuming 1200 dpi and full color range per DPI (i.e., 24 bit/pixel), you get

1200 x 1200 x 12 x 8 x 3 Byte = 414MB

That is a far cry from the claimed 256GB. An take into account that most scanners cannot resolve 24 bit color (because of noise) and have real resolutions more in the 300-500 dpi range, then this is just one giant unfounded claim.

Yes, true the best scanners available may actually resolve 4 bytes/pixel and 2400 dpi. But that would be drum scanners that costs more than a new car. And even with that you get only

2400 x 2400 x 12 x 8 x 4 Byte = 2.2GB

Another problem is printing. Current high-quality printing can get something like 1500...2500 dpi real resolution orn really good paper, but they have less than 16 colors per pixel and color is rasterized to give more color spectrum.

I call this a really bad calculation error on all sides involved or possibly plain fraud.

--
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.

Re:RTFA by Compuser · 2006-11-26 11:05 · Score: 1

This is fun. Let's see what we can do if money really is no object, Bronx Bomber style.
First, you can take that picoliter of ink and stuff it full of all elements of the periodic
table including isotopes. All of those have distinct spectroscopic signatures, so you can
get 200-300 8 bit markers in one dot. On top of that there are techniques which discern
bond energies (look up coherent anti-stokes resonance for instance). Once you make complex
molecules and have sensitive enough equipment to read attomolar concentrations, the playing
field widens considerably. It may take several years and billions of dollars to write an image
like this and the same to read it off but I'll bet even hundreds of terabytes are possible if
you are wealthy and tricky enough. Of course it may be easier to weave the paper out of
double stranded DNA and sequence it (around 1e17 bits for a 2 nm thick A4 sized sheet).

256GB? by Anonymous Coward · 2006-11-26 11:36 · Score: 0

That's nothing, I've got 436 petabytes here!

Your math is off by a lot by Anonymous Coward · 2006-11-26 11:46 · Score: 0

Let's pull out a sheet of 8.5x11 paper

That's 93.5 square inches. Multiply that by a reasonable DPI (1200) and you get 1.3x10^8 dots on the page.

Each of those dots can have 256^3 colors. But lets lower that by 1024 so we only need 16K of colors. 16384x1.3*10^8 is 2.1299x10^12

Divide by (8*1024*1024*1024) = 247.955GB

Which is about what he's claiming. The big compression trick is representing 16Kbit values on a single dot instead of 2 values with a single bit. That's a huge compression amount that can only be done on printed paper. It takes 14 bits of 1's and 0's to do the same as one 14 bit color value. So that's 14:1 compression.

So if you could get 16K colors that aren't easily distorted on the page his claim is reasonable. I don't know what's he talking about with shapes though.

Re:Your math is off by a lot by SnowZero · 2006-11-26 21:02 · Score: 1

You make the same error numerous people have made. 16M colors is only 24 bits, 16K colors is only 16 bits, so you are off by a factor of 1024.

(1200^2 * 8.5*11 * 16 / 8)/(1024^2) = 257 MB

You are all MISSING THE POINT!! by PinkyGigglebrain · 2006-11-26 11:53 · Score: 1

I don't usualy like to think ill of my fellow slashdoters and I know I am going to get moded a troll or flaimbait and get a lot of heat for this but it seems like everyone who has posted so far are CLUELESS!

You are all talking about how you can only store so many bits at such and such a resolution on paper and how the scanners can only get so much info from it and yada yada its all a bunch of BS.

Ever wonder why a picture is worth a thousand words? Its bacause there is VISUAL data contained in the picture that can not be described in the same number of bits as it takes to represent the information in the image.

Nobody seems to flip out when we hear about a DVD size disk that can hold a few Terrabytes of data, why?, because its holographic, it uses the same storage technology as a DVD but stores information in the depth of the bit in the media as well as its linear location in the data stream, the actual data density hasn't changed when viewed as a two dimensional surface. So what if we think of this new methode as a "Visual" storage system, not just a two dimensional array but with color and geometric angles being the third and possibly fouth layer of data?

Anyone else remember that the sine of any angle is some F'ing huge number, what about cosine and tan? Throw in radians and a few math algorthms and you could probably store some major amounts of data whith just a few lines and colors.

This may be BS, but at the same time it may be a really incredible leap of imagination on the part of the creator of this. We only show how rigid and limited our minds are when we dismiss something out of hand because we don't understand it.

Now you all get to flame me, call me an idiot and explain to me how I don't understand information theory or any of the rest of the theorys or principles everyone is talking about. I don't, that may be why I get it and you don't. Maybe it is because of your inability to see beyond those principles and theorys you hold so dear that you don't understand how this data storage system works.

Re:You are all MISSING THE POINT!! by csrster · 2006-11-26 19:38 · Score: 1

I've always believed that the more I know the less I understand. But I've never before given much thought to the converse.
Re:You are all MISSING THE POINT!! by FreemanPatrickHenry · 2006-11-26 20:28 · Score: 1

Anyone else remember that the sine of any angle is some F'ing huge number

Uhm...no, I don't actually.

--
I have discovered a truly marvelous .sig which, unfortunately, this space is too small to contain.
Re:You are all MISSING THE POINT!! by DirtyRhodes · 2006-11-27 09:50 · Score: 1

I get it, the reason people don't is that they are still thinking binary. 8 bit's require 8 physical locations to store it's data. 1 bit only has 2 states, on or off. But what if on bit had 3 states? Then we require less real-estate to store the data. What if one bit had 7 states, like red orange yellow green blue indigo violet, add black for 8 states. Now let's add the idea that not only colors but geometric shapes are used, now we have another factor let's say we use 8 shapes, circle square triangle pentagon hexagon heptagon octagon nonagon. Now our single bit holds 16 states! That's 281,474,976,710,656 Decimal for 8 bits of 16 state bits vs. 256 Decimal for 8 bits binary. But now that I think about it it's paradoxical, and that's why it's hard to grasp, it's easier to think in black and white, right and wrong, on or off. This new idea is more along the lines of philosophy then data storage and this is where people get freaked out. D. Rhodes

--
"A keyboard?! How Quaint!"

He's not using 256 colors by KalvinB · 2006-11-26 11:57 · Score: 1

He's using 2^14 colors which using your math yeilds about 250GB as advertised.

I don't know who decided that printers can only distinguish between 256 different colors. You do realize that printers can actually print more that 16K colors and scanners can scan in at 24bit resolution.

All this debunking is based on a limitation of 256 colors which only exists on Slashdot.

Whether or not the printer can write the colors with the exact value as the scanner is another debate entirely. Something that is subject to "try it."

--
Work Safe Porn

Re:He's not using 256 colors by Herak · 2006-11-26 16:46 · Score: 1

No... 2^14 colors using his math yields about 250MB, not 250GB.

Re:RTFA by Anonymous Coward · 2006-11-26 12:08 · Score: 1, Insightful

8.5*9600*11*9600*3 bits != 3.23136 TiB.

i think it actually might be possible by Anonymous Coward · 2006-11-26 12:15 · Score: 0

If we have a scanner that can distinguish between different colors, for each dot that's printed onto the paper, we can encode more than a single bit. If we can distinguish 8 different colors, that would mean each dot could encode 3 bits. Essentially it would break down to (8*dpi) * (11.5*dpi) * number of colors possible = #bits encoded in a single letter sized page. Of course, I don't know hwo good scanners are, so it's still extremely far-fetched.

all ways of colouring a 3x3 sq with two colours by weierstrass · 2006-11-26 12:34 · Score: 1

http://retropolitan.blogspot.com/2005/06/all-ways- of-colouring-3x3-square-with.html

--
my password really is 'stinkypants'

Re:all ways of colouring a 3x3 sq with two colours by mysticgoat · 2006-11-26 16:41 · Score: 2, Informative

Thanks for the link to weierstrass' Implicity blog. His diagram of the 102 unique two-color patterns is very useful in this context.
So 102 patterns using 2 colors multiplied by the number of combinations of two colors that can be drawn without replacement from a palette of 256 colors... it has been a while since I've worked combinations but I know how to use Google as a brain prosthesis:
Google this, guys: "256 choose 2" yields 32,640 unique two-color schemes.
So 32,640 * 102 patterns = 3,329,280 possible combinations within each 3x3 matrix. 85,000 such matrices on the 8.5x10 inch sheet yields 2.9*10^11 unique patterns possible on one page. That's a pretty long bit stream. Converting from bits to something understandable yields 33 GB per page.
This unsophisticated technique shows a sheet of paper can hold more than 1500 times the information that the one-bit-per-dot crowd was thinking was the max (22 MB iirc). It is still an order of magnitude lower than the reported achievement of Sainul Abideen-- but I am working as a Resource Support Assistant in a Community College and I don't profess to know much about combinatorial math or pattern recognition. I think it enlightening that Google says that "256 choose 3" gives 2,763,520 unique three-color schemes...
I do, however, know a thing or two about Google and how to use simple resources like it to make the world a little more understandable.
Re:all ways of colouring a 3x3 sq with two colours by jafuser · 2006-11-27 05:29 · Score: 1

This reminds me of a problem I've been occasionally thinking about for a few years now.

What *minimal* texture would you need to have in order to generate any arbitrary bitonal image using offsets, rotations, flipping, and scaling?

For example, a 2x2 grid has the following combinations:
00 00 10 01 00 10 11 01 00 10 01 10 11 11 01 11 00 10 00 00 01 10 00 01 11 01 10 11 10 01 11 11
All of these can be represented in a texture by properly offsetting, rotating, and/or scaling a 2x2 segment of the following:
1010 1100
In the image shown on the link of the parent poster, it shows all possible 3x3 glyphs, which is partway to solving the problem, but the hard part is combining these glyphs together into a single texture in a way that minimizes texture size -- using texture offset, rotation, flipping, and scaling techniques.

For example, the first two columns of 3x3 glyphs from the parent link:
000 100 000 000 000 000 100 010 000 000 000 000
could be combined into a single 4x3 texture:
0000 0100 0000
Assuming we start by scaling our texture so only the left 3x3 portion of it is shown (cropping off the 4th column out of view):
- The first glyph (empty) can be easily achieved by scaling any 0-pixel up until all 1-pixels are no longer in view.
- The second glyph (corner pixel) can be achieved by offsetting down and right one unit (the top row will wrap to the bottom).
- The third glyph can be achieved by offsetting right one unit.
- The fourth glyph is already shown by default with our starting scale and position.

This is just for a small subset of 3x3 glyphs. The real problem is figuring out a way to generate the minimal texture which will handle *any* 3x3 glyph. And once that is done, going up to solving any 4x4 glyph, 5x5, 6x6, etc. My only approach so far is to manually design the best minimal texture I can and keep reducing it further as I discover new ways to optimize it. Unfortunately this will not scale as the combinations get more mind-bogglingly complex. I figure there must be a much more elegant way to solve this problem for any n x n size glyphs, but it sounds like a problem better suited to someone who has majored in math. For me, it just remains an ongoing curiosity that I reflect upon occasionally. =)

--
Please consider making an automatic monthly recurring donation to the EFF
Re:all ways of colouring a 3x3 sq with two colours by weierstrass · 2006-11-27 11:36 · Score: 1

i did major in math. all i know about this problem though is that i've seen it before - either you posted about it previously with a link to an image (?) or you're not the only one to have thought about it.

off the top of my head it seems you could probably get reasonably (!) good lower estimates by examining pairs of glyphs, and determining the largest amount they could overlap by. eg the 3x3 square which has only one white square in the centre can't overlap the one with only the centre sq black at all. the second and third 3x3 grids that you reproduced above can overlap by up to 6 squares (the maximum). these are probably quite easy to work out automatically, for any n which isn't too big. if you work this out for each pair, you will be able to estimate a maximum amount of squares that you can save, compared to the trivial texture which just consists of each unique nxn square laid adjacently. then if you do something similar for sets of 3 and 4 glyphs, eg try all ways of overlapping them and see which one gives the biggest reduction in area. one glyph can overlap upto n^2-1 others, but i suspect that this rarely happens in practice once n gets a bit bigger than 2. so hopefully once you are trying combinations of 3 or 4 at a time, you will be getting estimates quite close the actual best case, without it being too computationally expensive. this might allow you to prove that the texture you've come up with 'by hand' is the best one, or very close to the best possible.
the GP is a link to my blog. i came up with the 3x3 squares from the original image by hand. my interest is in finding a neat way to generate these with a simple algorithm that doesn't involve finding all combinations and then excluding the rotations and reflections. you can find the number of unique ones quite easily using the polya-burnside formula, but i don't have a good way to generate them all. since they are relevant to your problem, i wonder if you have any ideas about this?

--
my password really is 'stinkypants'
Re:all ways of colouring a 3x3 sq with two colours by sillybilly · 2006-11-29 16:10 · Score: 1

Offtopic, but why does your http://everything2.com/index.pl?node_id=735553 everything2 page list last seen 1.4 years ago. One is inclined to think you've been "eliminated." If you're still on the web it'd be nice to put a comment there on "I've been mostly active on slashdot, etc.. not coming here much anymore." Just a suggestion.

Re:Do The Numbers --- But do them right by hung_himself · 2006-11-26 12:57 · Score: 1

Actually, no - 1200 dpi *resolution* (i.e. number of dots per inch that can be distinguished) is impossible on any dot matrix printer. That's the dot density that they print. In the old days the printer people were more honest - and computer articles were more saavy about the difference (some of the old MacUser mags from the early 90s for example).

A good laser printer might get between 300 to 600 dpi true resolution with good toner on plain paper. A dye sub might get you 300 dpi with something approaching the hues that you are talking about.

Same with scanners - while they might have the optical resolution of 1200 dpi - the number of colours that they actually distinguish is far less than the ones that they claim. Just because they keep 24 or 32 bits and their chips output that many bits doesn't mean they can *distinguish* 24 bits of colour. I believe that it's much closer to 15 to 18 bits of colour.

Color and figures... by Anonymous Coward · 2006-11-26 13:13 · Score: 0

Quoting my own words from a blog... "Besides using color for storing information, you can also use the geometrical figures... and with those figures, you can be a smart guy and draw then into some kind of order that could operate in a similar way as the layers in most modern optical disks technologies. Hum, for example: you can always inscribe the figures one on each order (the most external being the paper border itself)... you can also vary the angles in which some inscribed figures (like the triangle) are related to a preferential reference system (that could be set fixed in relation to the paper boundaries)... imho, it is possible...

mantenham suas mentes abertas... always."

Adding a little bit of ideas... you can use geometrical and color together... the boundaries of a geometric figure could code part of the information, the figure itself could code part of the information, the level at it was inscribed (related to the zeroth-level, that would be the most external figure) could code part of the information, and the space that was not used to draw figures (simply because it was to small to draw a geometrical figure on it) could be used to code information as well...

Can it be a scam? Yes. Can it be real? Yes.

The information can be coded two-dimensionally... the really new idea of this guy is probably the >>encoding scheme, that is "cheap research", not the printing/scanning technology, that are "expensive research"... this is the beauty of most of the third-world innovations! Not to flame wars, but take a look at Alberto Santos-Dumont's 14-bis (and also demoiselles) and think about airplanes... do we have catapults at our airports???

Best regards,

Alberto.

Re:RTFA by maeka · 2006-11-26 13:46 · Score: 1

yea, off by 1000, my bad.

DOT MATRIX by Anonymous Coward · 2006-11-26 14:01 · Score: 0

eveyone's calculations are based on the assumption that he is using a dot matrix printer. In reality, a paper has close to infinite capacity to hold information given that your encoding device is accurate enough and your decoding device is perceptive enough.

Shapes make sense by iamacat · 2006-11-26 15:19 · Score: 1

Otherwise how are you going to read your rainbow disk if the paper is slightly rotated, crumbled or sheared? With geometric shapes, you can detect these effects and compensate for them.

Nope by arodland · 2006-11-26 15:28 · Score: 1

because the "higher orders" are completely dependent on the finer details. There's nowhere to put any more information that doesn't destroy some of what you already wrote. If you know every detail at the finest resolution, then you also know exactly what it looks like at the next level up, so there's no uncertainty to fill with more information.

Re:This is a lie by E++99 · 2006-11-26 15:30 · Score: 1

I think what the article must have failed to mention is that it was a 256GB text file containing almost completely white space that he was storing in this manner -- after compressing it.

If you work it backwards..... by paulmac84 · 2006-11-26 15:54 · Score: 1

... 256GB per page is 256 * 1024 * 1024 * 8 = 2147483648 bits per page. A standard 8.5x11 inch page is 93.5 inches^2, so that works out at 2147483648 / 93.5 = 22967740 bits per inch^2 (approx). If a dot can either be printed or not printed then, we're talking about a binary encoding system and taking the square root, then gives you a density of approximately 4792 DPI.

Assuming that this method of encoding was initially designed to work with a standard home printer and assuming a standard printing quality of 300 DPI, that leaves us short by a factor of about 16, (4792 / 300), or 2^4. By using the standard 4 standard CYMK printing colours to encode each bit, we have our factor of 16

My calculations do not take into account the need for any colour registration or error correction areas on the page. I would think however that by increasing the number of colours, it would still be technically feasible to have your 256GB and have plenty of room to include error correction etc.

The only thing that I can't figure out is the requirement for different shapes, maybe the error correction is built in to the shapes being used, but surely there would be more efficient ways of doing it?

--
One of the universal rules of happiness is always be wary of any helpful item that weighs less than its operating manual

Re:Am I wrong? - Yes by ChronoFish · 2006-11-26 16:05 · Score: 1

Yes you're wrong.

An image that was printed "actual size" would not fill an a4 piece of paper. Sure there are various formats that can compress the image, but there is no additional compression gained when printed on paper vs. saved to disk. The amount of information that needs to be saved is the same.

Turn your comments around:

Suppose you had printed a full page (A4 - 8.27 x 11.69 inches) image such that every "dot" printed by a 9600 x 1200 dpi printer was a different color. Now you scan it in as a full-color tiff image - i.e. no data loss (images saved as jpg can compress quite a bit, but you loose data - hence the name "lossy-compression").

So you scan the image at the scanners best ("highest") resolution. Assuming 24 bits per color, the graphic image that is scanned in is now (9600*8.27)*(1200*11.69)*24 = 26,729,063,424 bits = 3,341,132,928 bytes or about 3Gig.

That is how big your file would be to "email" around. You may be able to get some additional compression (zip, gif, etc), but that would be software compression, and software compression doesn't care what the medium is that it is saving to - so the fact that it's from printed paper doesn't matter - and really only confuses the issue.

The point here is that just because you have a high-def image that looks good printed edge-to-edge, don't assume that there is a lot of data there. A 3meg jpg file looks damn good when printed with a high-quality printer - but there is still only 3meg of data on the printed page no matter how good the scanner is that reads it back in.

-CF

Bandwidth... by GLneo · 2006-11-26 16:10 · Score: 1

Never underestimate the bandwidth of the Hammermill® paper truck hurtling down the highway

Yet another math post by Arceliar · 2006-11-26 16:29 · Score: 1

Okay, lets look at the current maximum feasible specs we could use: 24 bit color depth at 2400dpi, seem reasonable?

Per square inch of space, that's:

(2^24)^(2400*2400) possible combinations of data, which equals
2^(24*2400*2400) if rewritten, or, conveniently enough (since I started in base 2)
24*2400*2400 bits of data per square inch, about 17 megs

17 megs per square inch of paper, although fairly impressive, falls vastly short of what was supposedly achieved. And no matter what mysterious system of shapes might be used, once put on paper, the data is going to be in dots. There's nothing you can do about that. It's just the physical limitations of two dimensions.

I wonder what information is stored here? by katchins · 2006-11-26 17:41 · Score: 1

I wonder what kind of information (Very Work Safe) is stored on this page?

Must be top secret info!!!

--
if (!sig) { printf("Signature Unavailable\n"); }

Wow by wolf369T · 2006-11-26 18:07 · Score: 0

This should be enough for everybody...

Hey, no offense and all by PylonHead · 2006-11-26 20:30 · Score: 1

But reading the comments on this article make me realize just how many morons we have reading SlashDot. Actually, it would be nice if they stuck to just reading it.

Happy Thankgiving everybody!

--
# (/.);;
- : float -> float -> float =

Awesome! by crhylove · 2006-11-26 20:46 · Score: 1

Maybe you can shit out some code for us over at ReactOS! Or hell with it, maybe just join any Linux distro dev team, and let's really bury redmond this time!

I'm sure Vista is over a million lines of code, so at one sheet of paper to a million lines of code, I'd guess you could code something really awesome with just 10 blank sheets of paper and a few burritos! But testing will be hard, since the bugs will be so small, but maybe we can kill them with the nano knives I read about 3 stories up.

rhY

--
I hold very few opinions. I hold information based on observation and fact. If you wish to disagree, please use facts.

Pr0n potential! by Dieppe · 2006-11-26 21:33 · Score: 1

Wow, you could use a sheet of paper to store like... thousands of pr0n images! Just think of the possibilities! Playboy could stop being so wasteful with individual images and include images of the past 50 years on a few sheets of colored print!

Re:Pr0n potential! by heroofhyr · 2006-11-26 22:04 · Score: 1

Imagine the horrors if they had had this technology in the late 80s/early 90s. The world would still be using Windows because Linus' dog ate his homework and the only copy of the draft GPL would be smouldering from the end of a roach clip in Richard Stallman's bedroom (aka The Swinging Bit Pad).

--
brandelf: invalid ELF type 'KEEBLER'

Re:Do The Numbers --- But do them right by SnowZero · 2006-11-26 21:58 · Score: 1

So the question is, with 16M colors, how much information can be stored there? 24 bits. "256 GB" is measuring bytes, and bytes are made of bits. Thus, ultimately we care about the number of bits, not the number of colors. So, he's right, you're wrong.

P.S. 2^6 = 64, NOT 32

Re:Do The Numbers --- But do them right by SnowZero · 2006-11-26 22:06 · Score: 1

What's sad is that you get the units right, but then get the exponentiation/bits wrong.

log2(2^18 * 2^2) = log2(2^18) + log2(2^2) = 18 + 2
i.e. 18 color bits + 2 shape bits = 20 bits.

Pneumail! by Anonymous Coward · 2006-11-26 22:11 · Score: 0

As internet, per Daily Show reconing, is a bunch of tubes, and pneumail is to be the new standard after the net-neutrality debate ends, i forsee a great future for this. Pneumail attachments! :-)

You first have to fold the paper 9 times... by leuk_he · 2006-11-26 22:38 · Score: 1

Wait.. that is impossible. (really try it with a A4! )

What a bad day for science by mikiN · 2006-11-26 23:21 · Score: 1

Google sez:

Sainul Abideen rainbow: 646 hits
sainul Abideen rainbow shannon: 3 hits

If anyone is wondering what (Claude) Shannon has to do with all this, look him up.

Don't promise me a rainbow, cry me a river instead.

--
The Hacker's Guide To The Kernel: Don't panic()!

I think you ARE wrong. by leehwtsohg · 2006-11-27 00:19 · Score: 1

I have to admit that the first time I heard about 16qam was in your post. But you can't make 96 bits out of 16 bits. 16 bits have 65536 possible states. You can not code more than 65536 states using 65536 states.

Re:RTFA by Anonymous Coward · 2006-11-27 01:06 · Score: 0

Encoding data using dots is the most efficient method possible.

You can't debunk something just by stating the converse. Where's your proof?

Suppose he was using such high resolution that the paper surface was significantly non-flat over the area of an encoding object.

It might well turn out that a triangle's image and colour can be recovered reliably at such scales using some geometric processing, while a dot's cannot.

Let's wait for the paper before we judge this. Unless people are breaking established physical laws, you can't just say "scam" without thinking about it.

Let's *apply* some basic information theory, then by Anonymous Coward · 2006-11-27 01:37 · Score: 0

Well I do understand basic information theory so allow me to help you to apply it properly in this situation.

Firstly, no-one is claiming that he has stored 256GB on a piece of paper today. The number is a theoretical limit. So let us assume that it requires printing and scanning resolutions which are very high.

Secondly, you can't talk about "information theory" without talking about information. You need to look at both data bandwidth and error rates because information is not the same thing as data. You say that using shapes limits the resolution but resolution is just the data rate. We want the information rate. These are not the same thing at all.

Under the proposed scheme at very high resolution, the paper surface will be significantly non-flat. Some kind of error-correction would need to be used. Your dots are not much use to you now because the errors are not the kind of "random noise" errors you know from your basic information theory which deals with time signals on wires. Here, there are geometric errors: distortion and stretching. Some of your dots are now too small to detect, and it is extremely hard to decide which coloured dot matches which output bit. The error-correcting schemes you know about no longer apply. Some form of solution is required that reflects the fact that we are in a geometric situation.

Perhaps using triangles and squares serves the purpose of both ensuring that no object is missed, and provides enough spatial information to undistort the scan and reproduce the original image. The idea certainly seems plausible.

Of course you won't find the algorithm in your basic information theory book. Geometric error correction is hardly basic.

Re:can you spell lamination by hesaigo999ca · 2006-11-27 02:35 · Score: 1

if you RTFA ....you would have seen that this technology is what we are talking about, I am sure that the paper scheme is the least important of concerns, and if you create copies and store them, lamimate them, you still use 2 sheets of paper compared to discs which are not recyclable....you just shred the paper and create new ones....backups my man backups..... plus someone wants to sell you an mp3, showing you have bought a legal copy of an mp3 will be much easier in court now that you have your bill & the mp3 on the same piece of paper Have a nice day "Now this is what I am talking about!"

Bio-degradable? by odourpreventer · 2006-11-27 02:57 · Score: 1

I got stuck at this word in TFA. Isn't the purpose of long-term storage to be as bio-UNdegradable as possible?

--
Adventure, Romance, MAD SCIENCE!

Bits, bytes, ... Blots? by Nakarti · 2006-11-27 03:17 · Score: 1

If somebody could do something similar with sealed storage devices, imagine the possibilities. It's basically compression of data with a physical algorithm.
I don't know how effective it might be without color, but I suspect there's still a lot of data that can be stored.

Too bad I didn't try to do something with my (and probably half everyone else's) idea for a more-than-binary format.

no you didn't by Anonymous Coward · 2006-11-27 04:05 · Score: 0

You didn't just say what I think you said did you? Why yes. Yes you did. "one of the cold hard facts of information theory"

That's priceless.

Never underestimate... by Mikeeee84 · 2006-11-27 05:31 · Score: 1

Never underestimate the bandwidth of a paper plane full of data hurtling across the room

Re:Do The Numbers --- But do them right by mypalmike · 2006-11-27 09:28 · Score: 1

Darn. That was indeed stupid.

--
There are 0x40000000 types of people: those who understand 32-bit IEEE 754 floating point, and those who don't.

for completeness by weierstrass · 2006-11-27 09:49 · Score: 2, Funny

and to save you from having to read the whole discussion, all estimates from below at the time of posting as to the maximum amount of information that can be stored on a sheet of either A4 or 8.5"x11" paper in this way:

2 GB, 244 GB, 7.6MB, 22MB,03GB, 765KB, 1.360006 million bits, 244,800,000 bytes, 8,415,000 MB, 15,000,000 bytes, 578MB, 1540.83 MB, 403MB, 9GB, 140MB, 280MB, 87,925,612 bytes, 256 MByte, 1.7 trillion bytes, 26 Megabytes, 20196 Mbit, 3 Million Gigabytes, 68 megabytes, 64K, 30 Mb, 32Mb, 8.2Gb, 69MiB, 1.4GB, 700 Gb, 31 Tb, 3.23136 TiB, 12MB, 247.955GB, 257 MB, 50 MB, 20,971,520,000 bits, 23,040,000 bits, 4000 megabytes, 5GB, 10MB, 200Kbytes, 1.4GB, 0.18 GB, 0.05 GB, 579 Gigabytes, 72.5 Gigabytes, 7,166 gigabits, 561 Mbits, 3,341,132,928 bytes, 26 MB, 2.2GB, 17 megabytes, 17401734 bytes, 128 megabytes, 139 MB, 254 megabytes, 245 GB, 128 MB.

--
my password really is 'stinkypants'

You made my day. by PinkyGigglebrain · 2006-11-27 17:44 · Score: 1

Thank you for poving to me that there are still other people out there who have not yet had their brains become so full of "facts" that they can't see beyond them.

When I saw "geometrically" in the title the first thing I thought was "WOW, the data is in the angles!", but now I think your closer to the truth, its all about adding states to the bits. Though using the angle of the figure relative to some reference line would add another set of states, hmmmm.

Can you imagine the fun when quantum computing hits main stream? All of the 1/0 dinosuars are going to be facing a meteor in the form of the Qbit. Thanks again, PG

Re:You made my day. by DirtyRhodes · 2006-12-01 15:01 · Score: 1

You're welcome.
The point you make on quantum computing is so true, what are they going to do then? I mean quantum storage devices will have to be built what will the prototype be? It absolutely can't be binary even ternary bits couldn't hold up. In the world of phase-differential computing (LVD for SCSI and Double Data Rate for RAM) why can't there be a leap into a 3 state bit. Once that leap is made the paradigm will shift and in no time they'll teach 3rd graders about the Qbit and in the future they will refer to this time as the binary dark ages.
D. Rhodes

--
"A keyboard?! How Quaint!"

Doable a la scotch tape by mattr · 2006-11-28 06:05 · Score: 1

He didn't say how thick the plastic sheet or "paper" was, did he?
I think scotch tape stored 100GB didn't it? You just need to get enough tape. The rest's all grunt work.

The leaders of the business in April of course announced 515GB/square inch which sounds like a lot but how long does it take to read and back it up, right?

link

Re:Do The Numbers --- But do them right by RhettLivingston · 2006-11-29 05:09 · Score: 1

Thanks. After rereading this whole exchange, I think that checking up on my drug regimen is in order :-)

Slashdot Mirror

256GB Geometrically Encoded Paper Storage Device

462 comments