Why Big Data Could Sink Europe's 'Right To Be Forgotten'
concealment tips this news from GigaOm:
"Europe's proposed 'right to be forgotten' has been the subject of intense debate, with many people arguing it's simply not practical in the age of the internet for any data to be reliably expunged from history. Well, add another voice to that mix. The European Network and Information Security Agency (ENISA) has published its assessment of the proposals (PDF), and the tone is skeptical to say the least. And, interestingly, one of the biggest problems ENISA has found has to do with big data. They say, 'Removing forgotten information from all aggregated or derived forms may present a significant technical challenge. On the other hand, not removing such information from aggregated forms is risky, because it may be possible to infer the forgotten raw information by correlating different aggregated forms.'"
If customers want their data forgotten then maybe they didn't want it stored or shared in the first place. The rule should not so much be about data retention but data gathering. The rule should be quite simple. Any organization that gathers data can't share it at all with anyone not directly connected with the reason it was gathered. So my power company needs my address to know where the lights need to be turned on and enough info to bill me. But anyone beyond billing and switching should not have my data, not management, not marketing, and definitely not a "trusted" third party.
The same with my driver's license that is needed by two small groups of people, the people who issue the license, and the police if they need to know that I am allowed to drive. It should literally be illegal for anyone else to copy anything from my license if it doesn't involve my ability to drive so say a car rental place would be OK. Many bars have taken to scanning driver's licenses as you enter the bar. Then you start getting mail and crap from the bar and anyone else they sell the data to. I met a guy who rewrote the data on the magnetic strip to cause buffer overruns and crash their little hand held units. He regularly went to every bar downtown that had the scanners as the crash wasn't a simple reboot of the unit as some remote server lost its mind requiring someone to come in.
These organizations find this data valuable but somehow think they can take that valuable thing from us without negotiation. I say you want my data you can pay me $1,000,000 per byte plus royalties on resale.
What about my right to control my server. I look at this 'right to be forgot' as the same sort of over reach which allows media companies to put DRM on my ebook reader or smartphone, then make it illegal for me to remove it. My equipment. My decision. You want to force be to keep or remove any software/data, then you get yourself a court order. I don't see why phantom Imaginary property rights seem to keep trumping rights over real property. Sheesh.
What is the problem with doing the same for people?
Facebook actually makes it hard for people to remove their content from the service, and it doesn't even say "delete", it says "remove from timeline" (but not from the whole system).
If I want my Facebook history Wiped, it is my right to do that, it is *my* data and Facebook and others shouldn't have a operating license unless they make it really simple for people to "be forgotten".
echo '[q]sa[ln0=aln80~Psnlbx]16isb572CCB9AE9DB03273snlbxq' |dc
As long as I have my right to not care. In the unlikely event I stumble upon your embarrassing "e-foible", I do not judge, and will soon forget. Unless you "protest too much", which might spark a memory...
This issue is a bit more complicated than you think.
I don't know if you remember MacOS prior to OSX, but classically it had two "forks" - the data fork which compares to the typical flat file we all know, and a properties fork which is something like the metadata in a file system (time created, ownership, permissions, etc) but with a much richer syntax.
OSX lost that separation and now uses a Unix-y model.
If we wanted data to be trustably limited in scope, then we'd have to structure *all* our data everywhere so that it contains the literal data being saved, as well as another "properties" fork which could contain information about the scope of acceptable usability.
It could be done, but it would be very, very, very expensive. I'm not sure whether it wouldn't be worth it, the right to privacy and personal rights does count for quite a bit, and the court system in the USA is also very, very expensive and equally worth it.
Note that since we're talking about data, Moore's law means that the cost is about 1 or 2 years of actual growth. 1 or 2 years of no growth at all to accommodate this idea....
I have no problem with your religion until you decide it's reason to deprive others of the truth.
not any government policy or commercial entity
they call it disruptive technology for a reason. like the printing press, or the gun, or the atom bomb, it dramatically changes the status quo
it's simple: if you don't want it to live forever, don't put it on the internet. if you put it on the internet, it lives for ever
that's about the truth of it
but i suppose many people out there are like music company executives trying to impose legal constructs from the cassette tape age on the internet: unwelcome to accept ugly reality on the subject
well i'm sorry, you need to accept this as reality, no matter your feelings
one other point: privacy is NOT dead
all you have to do is stop offering parts of your life to the internet
the insane part is feeding private parts of your life to the internet, and then whining about a lack of privacy
intellectual property law is philosophically incoherent. it is your moral duty to ignore it or sabotage it
Few ideas are more absurd. They will have to outlaw all recorded media and burn down the libraries. Make ignorance the law of the land.
"Right to be forgotten" is an odd phrase, but it doesn't mean anything like what you seem to think it means. Basically it just means you have the right to request that information which you have provided to a particular data repository be removed from that repository. IOW, no more "we own everything you post forever" policies. Seems reasonable enough.
The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
... de-identification is an area of active research, because we'd really like to be able to mine all that juicy medical record data without infringing on patients' privacy rights. The gold standard so far seems to be Vanderbilt's Synthetic Derivative, which cleverly alters individual records enough that they can't be traced back to the actual patient. If these records are then used to create aggregate data, then attempts to reconstruct patient records by "correlating different aggregated forms" won't work, because they'll just reconstruct the SD instead. It seems to me that a similar two-stage process could be applied in a number of realms, so Google or whoever could still do all the "Big Data analytics" they want without raising privacy problems.
The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
I think some would argue that there is a right to remember. The Wayback Machine, for instance, has been instrumental in proving corporate malfeasance. Do we really want to lose that?
Oliver's law of assumed responsibility: If you're seen fixing it, you will be blamed for breaking it.
Let's say you meet The President or Prime Minister in real life. They say something that impacts you so greatly, it changes your entire life.
I met the Prime Minister once, and it had no effect on my life at all. Then again, the PM in question was John Major, so not really a surprise.
The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
Let's say you meet The President or Prime Minister in real life. They say something that impacts you so greatly, it changes your entire life.
I met the Prime Minister once, and it had no effect on my life at all. Then again, the PM in question was John Major, so not really a surprise.
He had that effect. Almost all Prime Ministers I can remember a lot of bad or good things they have done. I'm damned if I can remember any policy, enactment or decision good or bad that John Major's government did.
"I was walking in the hills when I came across a man who looked as though he carried the cares of the world on his shoulders. I introduced myself and asked him what was wrong.
The man pointed to a bay in the distance and said: "look at all those ships down there. Do you know who built them?". "No", I replied. " I did", her replied. After a pause he said "but do you think they call me Dai the ship builder?.... no"
He then pointed to the city and and said "look at all those houses down there. Do you know who built them?". "Was it you?" I asked. "Yes", said the man, "but do you think they call me Dai the house builder?.... no"
He then pointed at a fine new church building , saying "See that church.... I designed that myself... but they don't call me Dai the Architect either".
With a sigh he turned to me and said: "....... but you shag one sheep"
To techies the idea seems absurd, but it's not. Sure, your server, your rules. But what you pull into them is another matter entirely, and the American view that if it's not behind closed curtains, it must be public, doesn't scale.
Compare, of all places, Japan, where it is in fact customary to "not see" things that are pretty much out in the open out of sheer necessity because too many people are living too close together. In a sense, the internet is worse than Tokyo.
There's irony here, where the techies are deriding politicians for doing boneheaded things with far too much data. Well, this is part of that, but in reverse, and if they're doing it wrong it's up to us to find ways to do it right and nudge them in the right direction.
DRM became a bad word because big media deployed it to control their customer whom had thought they'd bought something only the seller afterward pulled a legalised fast one. David losing to Goliath until dvdjon came along.
Data protection in this case wouldn't include money passing hands in the reverse direction. It's more like, well, you put DRM on your SSN when you sign up (and pay) for something that requires it, and you can more or less reliably wipe your SSN out of their databases once they no longer need it.
No longer having to trust some faceless large entity on their wooly word salad assurances and their pretty face is a nice boon for the individual. Bit of a different power balance there.
Yet the only real fix is to not store all that data in the first place. This means that a lot of data that's being gathered now must not be gathered at all or perhaps some other data needs to be gathered. Zero-knowledge proofs will likely have a big place in that, say to prove you're old enough without showing your ID card with all that extra data you're forced to give out currently. This'll need new techology, but will prove necessary to really scale out our data use without building databases of ruin.
In the UK (I don't know about the rest of the EU) an individual can send a subject access request to a company or organisation and that organisation has 40 days to send you all the information they have on you. Companies have been doing this for years now. It doesn't seem so hard to change the query from a SELECT to a DELETE.
Now the paper in the article talks about how publicly available information may be copied (via the web) without the original author/organisation knowing, e.g. you could copy this post and store/publish it else where and neither slashdot or I would know, so you can't guarantee that the data will be completely deleted. But personally I don't think this is that big of a deal. If I want company Foo to remove all the information they have on me, for whatever reason, what do I care that company Bar also has information on me?
I think, to a point, an individual should be responsible for tracking all the information that they want removed, and companies/organisations should be responsible for acting on legitimate requests to remove the information.