Google Plans Service to Store Users' Data Online
achillean wrote this morning with a link to the Wall Street Journal, announcing plans we've all seen coming for a while: an online data storage service from Google. Though the article doesn't come out and call the project 'gDrive' or anything like that, it does indicate the service could be available within the next few months. "Google's push underlines a shift in how businesses and consumers approach computing. They are increasingly using the Web to access applications and files stored in massive computer data centers operated by tech companies such as Salesforce.com Inc., Microsoft Corp. and Google. Such arrangements, made possible by high-speed Internet connections between homes, offices and data centers, aim to ease users' technology headaches and, in some cases, cut their costs."
a strategy that could accelerate a shift to Web-based computing doesn't this sound just a little bit like a dumb terminal in terms of computing?
Those who live by the sword, get shot by those who live by the gun...
Once installed, you upload your files by right clicking on them and selecting "I'm feeling lucky".
ccalam - acoustic versions of new songs.
first thing I'm going to put up there is personal information to others. Any bets for how long till they are compromised?
In an age of sealed warrants, if the government even bothers with that, why would anyone put their data out of their sight? When it comes to privacy, I cannot see how the benefits outweigh the risks.
Some people have a way with words, and some people, um, thingy.
Methinks this trend will create a network of vast underground datacenters so large that archaeologists of the distant future will be left wondering whether we were created by metallic silicon creatures.
Ofcourse, it'll be gDrive Beta.
Fudge, I had just started to write a virtual filesystem driver using IMAP as a back-end. (Not fast, but gmail's 5.2G is free.)
Oh well, I'll put it on the back-burner until I hear more.
Unless Google can lobby Rogers to get rid of its arcane practice of capping usage at 60GB / month for it's standard high speed, me, and around 50% of people in Canada with high speed internet, can not make any real use of this service.
It is pretty sad that a company will give you a nice 6 Mbps link only to cap you at 60 GB, which you could exceed in only 1 day of saturating your link.
As a recent convert to google apps, this is very interesting. I have/still have all the concerns about my privacy, but the offering was too tempting to pass up. Of course I use the Firefox Customize Google add on, but also don't really put anything sensitive up there. If they build it right, it could be very nice. I've tried all the online backup apps, and outside of Mozy, don't really like any that much. But I'm now all Linux, so Mozy is no longer an option. Anything that competes with Microsoft is a good thing!
Storin' those data
To network platta
Drive image good
As face image could
Burma Shave
Get thee glass eyes, and, like a scurvy politician, seem to see things thou dost not.--King Lear
Thin client computing is on its way back - like it or not.
Though nothing new, this is a great idea who's time has finally come - particularly for people with mobile devices connected to wifi hotspots - both of which Google has been investing in.
I already do this a bit with Amazon's S3 storage system. It is really nice being able to store files anywhere and paying all of $0.03/month for it.
But hey, I'll take free any day.
On a somewhat related note: It would be great if Google bought the LexisNexus people. Having public access to their database would be a great public service.
-nick
Google needs to incorporate encryption with keys totally held and managed by the end user in such a way that even if Google is subpoenaed or shown national security papers, Google would be technically unable to access end-user's data. Another words, at no time should Google have access to any of the user's cleartext nor the user's secret key. Decryption would all be client-side. A subpoena or national security letter would have to go directly to the end user who would then at least know they are being served.
What kind of encryption would you use for this?
The most secure would be to store a single large archive of all your files encrypted with a strong cipher, but that has the disadvantage that you have to download it all to decipher it.
Alternatively you could encrypt each file separately, which would speed up access considerably, but also leak more information about what you are storing (i.e many small files vs one big one ).
I guess if the data is sensitive enough to require the former type of encryption you shouldn't transmit it over insecure connections to begin with...
This shouldn't be a surprise to anyone. It's Google, and it's one of the oldest ideas on the Internet which they haven't yet done; before the dot-com bubble burst there were at least half a dozen sites that claimed to provide an online "drive" of sorts - X-drive and E-drive are ones that come to mind, I think they advertised on the radio. Going further back, I remember using an online storage service on CompuServe in 1995 or so.
Slashdot Burying Stories About Slashdot Media Owned
OK, so that was last part was really unnecessary, but still...!
Beware: In C++, your friends can see your privates!
If Google is intending this service for online storage of personal information, I don't think it's going to succeed... people use Facebook for that sort of thing.
Sam! If you will let me be,
I will try them.
You will see.
On the contrary.
I've long been planning to put my most personal and important data on Google's servers, using the already existing gmailfs. Using good encryption, of course, which you really should use on local storage as well, if there's even a slight chance that it might get physically stolen.
Using this would give me a very cheap (actually free) off-site backup, so I know I can still retrieve my stuff even if my house burns down, or if RIAA sends the police to get my computers...
c++;
I really hope I'm not the only one that doesn't believe it's too smart a move for anyone to store all their data in a place that they can't physically access.
I love Google, don't get me wrong, but isn't this a step back? Privacy would be another concern. I think I'll just keep sticking my data on my home Terabyte NAS so I can keep it semi private.
I suggest calling it gPorn, because you know that's what's going to be on there.
That's our life, the big wheel of shit. - The Fat Man, Blue Tango Salvage
I imagine this will be a big part of the android platform. Any android device can be a 'dumb' terminal for your data. Let's just hope it's encrypted for the sake of the people that will inevitably place sensitive data on their 'g-drive' via their phones.
any easier and safer then buying and external HD for $120 or less and sticking it into your USB port? I'd trust a corporation with my own data as much as I'd trust a crack head to hold my ball of coke while I tie my shoes.
Some say well you can access your data anywhere! What data? Your financial data? Health Data? I sure as hell wouldn't trust anyone but the party thats needs that data with keeping it private/safe. If its online its gonna get snooped on eventually.
by TheSpoom (715771) Uncaring Linux user here. I have nothing to add to this but please continue. *munches popcorn*
So anyone will be able to take the latest movie, upload it to a google account, and give the password in a forum.
...
It's like P2P but Google pays the uploading bandwidth!
You could even use the movie name for the account:
0000000000-MoviesAhoy-TheMatrix
0000000001-MoviesAhoy-Slashdot, the movie
0000000002-MoviesAhoy-Pr0n (1)
0003814661-MoviesAhoy-Pr0n (3814660)
I have/still have all the concerns about my privacy, but the offering was too tempting to pass up.
And that is why things in today's database-driven, surveillance-obsessed society are going to get very much worse before they get better.
It's quite sad that even after the big leak here in the UK last week, things have gone quiet on the political front and there isn't a sustained media attack on our underpowered privacy and data protection laws.
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
Haven't we the public shown that we don't want our data online this way? And they pursue it anyway. You hear all of these reports about Google becoming the next Microsoft. This sounds very Microsoftish.
...someone had to say it.
The RIAA wouldn't need to send the police for your computers since they can subpoena Google to get the evidence that they need. They do that for search queries now. Uploading your personal data gives law enforcement one stop shopping to your information. A "portal" to all of your personal information. How convenient...
--- The revolution will be digitized! - http://www.binrev.com/ ---
Amazon has been doing this for ages - very well I might add. What does Google bring to the party. Advertising? Big whoop. I like the privacy of my data. I don't want Google scanning my data as it scans my email. That's taking things just too far. What next? Google coming around to my house to check my desk draws.....?
Rich people are eccentric. Poor people are strange. Me, I'd be happy with odd.
... provider publishes a weekly statement indicating that no warrants (secret or otherwise) have been served on the hardware/premises, etc.
You can see it here:
http://www.rsync.net/resources/notices/canary.txt
As for offsite backups in general, I trumpet this service as loudly and as often as I can - it surprises me that the FOSS (and privacy/crypto) communities do not talk more about rsync.net - they do things the right way.
You won't ever be able to pick up the phone and talk to someone at google, and you certainly won't have a totally open, standards-based filesystem.
duplicity:
http://duplicity.nongnu.org/
is a mash-up of (lib)rsync, tar, and GPG. Plaintext, normal filesystem on your end, and a big bunch of gibberish tarfiles on the remote end.
The remote end can be anything - it just needs to be accessible via plain-old scp/sftp (or ftp).
A new version of duplicity was just released and because of a bounty and ongoing funding provided by rsync.net:
http://www.rsync.net/resources/notices/2007cb.html
there is a permanent maintainer dedicated to the project now. I don't use it for all of my data, because one of the main selling points of rsync.net was the ability to just connect and browse with any old sftp/scp based client - and that won't work if the files are all gibberish tarballs, but for my "important" data, I do use duplicity and point it at a special directory.
Just wondering if you've had success with Mozy. I have been using the OS X beta of Mozy's backup tool and have not been able to create a successful backup. I keep getting connection lost error messages when it tries to upload my 2 GB VMware virtual disks.
I'd be highly surprised if they could even decrypt triple-DES easily. (for very difficult values of easily).
DES is singly the most researched encryption algorithm, and as such it shouldn't be written off yet, but I'd say it could be trusted in the form of triple DES.
You make a good point that anything over 128 bits is uncrackable by brute force methods. There are other ways of reducing the effective number of bits, however; these are generally purely academic in nature (differential, linear cryptanalysis). And of course there's always the fact that at some point the data will be unencrypted in order to be useful, which brings about my favorite form of cryptanalysis: the "rubber hose" method. (credits to Bruce Schneier)
Torture is the easiest form of cryptanalysis by far. Thugs are cheaper than PhDs.
- MK
Google's push underlines a shift in how businesses and consumers approach computing. They are increasingly using the Web to access applications and files stored in massive computer data centers operated by tech companies such as Salesforce.com Inc., Microsoft Corp. and Google
We've been hearing this prediction for 5 years now and it annoys me when someone makes it. Yes for for small businesses and home users, using online word processors, spreadsheets etc. makes sense as you don't have to worry about backup and you can share them. For medium sized businesses upwards it makes no sense at all. If your internet connection is lost, your business is screwed. For law firms for example, this can cost millions per day or even hour. There's also the usual issues about privacy and storage with 3rd parties.
http://www.desmoinesregister.com/apps/pbcs.dll/article?AID=2007711260321
second dupe.
and it's already been done- 1GB of storage on Gmail? This is why Zonk is unchecked in my "show stories from" dialog.
+5, Truth
Which is why the grandparent mentioned strong encryption. How they get the data is irrelevant; they still need to get the keys from you, either via installing spyware or using something like RIPA to make you tell them.
I am TheRaven on Soylent News
I've spent the day implementing off-site backup with S3, and let me just say: It's freakin' awesome. If I had a reasonable upload, my MP3 collection would be on there. £15/month for 120G is easily worth it, and it's already on a RAID1 array.
I'd probably contemplate suicide if I lost it.
Data corruption/loss would be a problem though.
Less so, I think, than if you're asked to keep track of your own stuff. Businesses drop big bucks on making sure they have backups of all computer files because the average person simply cannot be relied upon to do it themselves, accurately and regularly.
[Ego]out
In the information age, perhaps people's expectations need to change? It needs to be realised what you really do in public is not local but global.
Alternatively, maybe we should introduce legal safeguards that apply to data, taking into account the much greater storage, data mining and communication facilities available today? Just because we can do something, does not mean we should, particularly where "we" means governments, businesses or other groups with disproportionate resources rather than private individuals.
In fact, I would argue that to some extent this is inevitable. Everyone does things in "public" (which apparently includes people observing things inside your home from outside, for the purposes of this debate) that might be embarrassing as a minimum, or potentially even dangerous. It needn't even be what you were actually doing or some item you actually possessed; anything you could be misconstrued as doing or appear to possess counts, too.
Have you ever left a credit card statement on a desk visible from a window? Mistyped a password in a clear text field and then immediately deleted it? Followed a link from a normally reliable web site and found the content that appeared on your screen was not what you expected? Said something in your home that might be misunderstood if taken out of context? Had a visitor who looks a bit like someone who's been in the news recently? Of course you have, many times, and so has everyone else.
If we go down the path of saying everything that ever happens that is observable even momentarily from a public location may be permanently recorded and searchable by absolutely anyone, then privacy will be dead and we will have killed it. Humanity has evolved to respect privacy for a variety of good reasons, and the consequences of breaches in privacy can be life-changing or worse. If the highly resourced Big Guy gets to do whatever he likes in terms of invading the Little Guy's life (as long as it's done from a public place, naturally) then we're basically condemning anyone to suffer arbitrary, unaccountable damage if anyone else doesn't like them, and that is not a healthy path to tread.
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
As the old saying goes, if you count on crypto to solve all your problems you don't understand crypto and you don't understand your problems.
The point that your data can and will be attacked while it's in plaintext is well taken. A networked machine running a web browser (the Sendmail of the 21st century) is a low security device, even with a good operating system. Google for "Scarfo", the mobster who was using PGP but also had an FBI keylogger on his computer.
As regards AES, though, we've got good reason to think it's resistant to cryptanalysis. The NSA is also in charge of protecting government secrets from foreign snoops and has approved AES for protecting classified data.
The low security of a workstation cuts both ways in an argument about gDrive: because your data is already at risk sitting on your hard drive, storing it encrypted on gDrive might not be any worse.
Security without threat modeling is like bricks without straw. What are we protecting data against? Loss, primarily. I trust Google's backups more than I trust mine (but I'd tell a client to look for a provider willing to sign an SLA). Unauthorized copying by crackers? AES should be an adequate control to cover that risk. Subpoenas? An attorney with two brain cells to rub together will subpoena the decryption keys, so no help from AES there. Vacuum-cleaner style mass government surveillance, looking for keywords like "Tibet" or "Falun Gong"? AES should prevent that. Government criminal investigation? You could (in the US) argue that surrendering the keys would be self-incrimination and end up paying a lawyer lots of money to argue the point for years. Expensive and undependable security, but then in a criminal investigation there's not much security difference between gDrive and your local machine anyway.
If you have security needs you should do an analysis like that last paragraph, only longer. For lots of people encrypted files on gDrive might be just fine.
Here's what we should all do. Post phony-balony (fictional) data so it can be harvested and merged with the existing data that compromises the identity theft databases.
After several months, the database will be all but USELESS because they won't know what data is valid and what data is false.
No one will want to buy identity data if it contains so much false information, that it becomes useless. It actually becomes DANGEROUS to the purchaser because there's significantly reduced payoff - while it increases the perpetrator's exposure to risk of detection and prosecution.
Every place you visit on the web should be an opportunity to "salt" it with fiction. I recommend that you forward this idea to everyone you know to rapidly make this happen.
I am starting this initiative here and now by posting my (false) data:
Robert DeScully
6733 Orion Ct. Apt-B
Ann Arbor, MI 48109
Occupation: IRS Auditor
Annual Income: $187,200
Soc. Sec. No.: 853-98-1294
To paraphrase mayor Daly, "Post soon and post often" (he actually said, "Vote soon and vote often.")
...like they do for Gmail users losing data? Google, the first step is admitting you have a problem....
Disappearing Gmail messages baffle users
That? That was a pigeon.
Ever notice how the techies almost always rail against remote storage while the normal computer users think it is a great idea ?
What is really interesting is that Google could, in theory, link duplicate files. For example if 10,000 people have the Will Farrel "SNL/Blue Oyster Cult" video in their storage area, Google could soft link to just one copy and break the link if one particular user ever edited it.
-- Anybody here remember the Atari 800?
If Google has your data, they have enough horsepower to break everything but the toughest ciphers. Aren't distributed systems what they do best?
Actually, by personal data I don't mean the heaps of movies and music that I've been downloading from the internet... ahum.. I mean linux distributions. More like pictures I've taken, source code I've written, and in general, things I can't download again from the internet.
RIAA would of course want to believe that it contains their precious Imaginary Property, but since it's encrypted, they could only guess.
c++;
Do you really think Google has enough computing power to crack 128-bit AES? To crack a symmetric cypher, on average, you need to search half of the key space. That means you'd need to search 2^127 keys. My 2GHz Core 2 Duo can (according to openssl speed aes) do about 40,000 1024 byte blocks per second. In one year, it could do 1.3x10^12. If you had a compute cluster composed entirely of machines of this speed, it would need a shade under 1.3×10^26 machines to be able to crack a single AES-encrypted message in a year (on average).
To put this in perspective, Apple sold 1.6x10^6 computers in the first quarter of this year[1]. You would need to buy every single computer Apple made for 4x10^19 years. If we assume Apple sells approximately 5% of all computers, you would need to buy every computer made (assuming constant production) for 10^18 years.
To put that even more in perspective (10^18 is still a bit big for my brain), the age of the universe is estimated to be just under 1.4x10^10 years. If, for every year that the universe has been around so far, you bought as many computers as could have been made if production had begun at the current rate at the start of the universe and continued until now, then you would have slightly more CPU power than you need to crack 128-bit AES. Oh, and trying all of the possible keys is only half the problem; you also need to recognise when you've decrypted it.
Of course, if you're really paranoid, you can use 256-bit AES (the time to crack it doubles for every extra bit of key length).
[1] I tried to find numbers for Intel and Dell, but could only find revenue and profit numbers, not sales.
I am TheRaven on Soylent News
if you're not seeing his stories, how'd you know to come here to comment?
"256-bit encryption ought to be strong enough for anybody." - Glonoinha '07
Some of you laugh because you think it is true.
Some of you laugh because you think it is false.
Some laugh because you remember BillG saying the same thing about 640k.
And very few of you laugh because you know what I know, but none of us will actually admit it.
Glonoinha the MebiByte Slayer
Assuming that brute force is the way to go. Which I think you've explained quite well that it isn't.
Maybe the key in AES256 is divulged a bit at time, in the 95101924th bit, the 814255525181th bit, etc.
Get your own free personal location tracker
I have a similar service in development. I expect porn to be a pretty popular use but I think the real area money will be made is in backing up people's photos and home movies. The average parent (and grandparent) has switched to digital photos and movies but doesn't have the know-how to properly safeguard these personal treasures. With our service all the user has to do is come to our website and log in and their photos and movies are indexed and backed up for them.
;)
The file sharing abilties will no doubt happen a lot for porn sharing but it's the people who have photos they actually care about that are most likely to pay money for extra services.
Our open API will make it easy to built third-party services on too. It'll be interesting to see if porn is the number one creative path for third-party add-ons.
At what price learning? At what cost wisdom? The price is a man's peace of mind, and the cost is his life.
A back door in AES would have been found by now. There are enough brilliant security experts hacking on the big public schemes, with full access to the algorithm (it is open source after all), that something would have been found by now. The only weakness known in AES is weak passkeys.
Locking your files with the password "12345" is about as brilliant as it has been for the past 20+ years.
Clones are people two.
... with them searching your files for advertising keywords? They already do it with the search engine every time you use it. At least if they looked at your stuff and saw you solved, say the Theory of Everything, they'd know when you searched for "Nobel Prize" to give you ads for airline tickets to, and hotels in, Stockholm, rather than ads for buying a Nobel Prize on eBay.
If you want to stuff your pr0n or plans for a dirty bomb on their servers, an obscure compression technique (say the old Apple II ShrinkIt NuFX) after having converted your file to a UUencoded text file, will probably foil their attempts. A couple more such twists, and you'd foil the NSA much less Google.
"I may be synthetic, but I'm not stupid." -- Bishop 341-B
Is there any form of encryption that you believe people like the NSA cannot crack?
That is not the correct question to ask. The one you should ask is "Is there a form of encryption that MI5/NSA/... cannot be bothered to crack given the type of info encrypted?". Encryption is like securing your house. You only have to make it secure enough that the effort to break in an steal something is greater than the value of the contents warrants.
As long as there is a means to decode the message it will almost always be possible to break the encryption scheme if by no other means that hacking your machine and taking the decryption key (the possible exception may be quantum encryption technology). However it is extremely unlikely that any government would bother to do this to discover your secret recipe for mince pies!
Damn it! Stop posting my data to the intertubes!!!
;D
"Gratuitous complexity is akin to chaos" - True Vox