Slashdot Mirror


Encrypted But Searchable Online Storage?

An anonymous reader asks "Is there a solution for online storage of encrypted data providing encrypted search and similar functions over the encrypted data? Is there an API/software/solution or even some online storage company providing this? I don't like Google understanding all my unencrypted data, but I like that Google can search them when they are unencrypted. So I would like to have both: the online storage provider does not understand my data, but he can still help me with searching in them, and doing other useful stuff. I mean: I send to the remote server encrypted data and later an encrypted query (the server cannot decipher them), and the server sends me back a chunk of my encrypted data stored there — the result of my encrypted query. Or I ask for the directory structure of my encrypted data (somehow stored in my data too — like in a tar archive), and the server sends it back, without knowing that this encrypted chunk is the directory structure. I googled for this and found some papers, however no software and no online service providing this yet." Can anyone point to an available implementation?

14 of 266 comments (clear)

  1. Re:It's not possible even in theory by TheRaven64 · · Score: 5, Interesting

    It is possible. When you upload the data, you also upload an index. When you connect again, you download the index (which is much smaller than the data) and search that on the local machine. Neither the index, nor the data, is ever unencrypted on the server.

    As for frequency analysis, I don't think any encryption algorithms published in the last 40 years have been vulnerable to this sort of attack...

    --
    I am TheRaven on Soylent News
  2. Re:It's not possible even in theory by FredFredrickson · · Score: 3, Interesting

    Mozy does this for personal/business backups. You can use a completely private key, but search your own data.

    --
    Belief? Hope? Preference?The Existential Vortex
  3. not impossible; not easy by Lord+Ender · · Score: 2, Interesting

    Keep the files on the remote server, encrypted. Keep the search index in a database, encrypted in chunks. Rsync your search database between your local machine and the server. Actual searches of the databases would be done locally.

    Result: terrible performance whenever you access your data from a new machine (must sync entire search database). Good performance the rest of the time. Remote server never sees anything but cyphertext.

    --
    A slashdotter who didn't build his own computer is like a Jedi who didn't build his own lightsaber.
  4. Re:Maybe, maybe not by The+Moof · · Score: 3, Interesting

    Maybe something like this -

    Create an index of hashes using the unencrypted data on the client.
    Encrypt the data on the client so we now have an index of hashes that apply to an encrypted file.
    Upload the hash index and the encrypted data file to the server.
    To search, hash the search criteria on the client.
    Server search the indexes for the hash value, returning a list of encrypted files with an index matching the criteria hash.

  5. querying encrypted data howto by burnin1965 · · Score: 2, Interesting

    As long as your query looks something like this...

    SELECT * FROM mydata WHERE stuff LIKE '%YToyOntzOjc6InBhY2thZ2UiO3M6MjM5OiKyKHPh9ZawDX6KyA62cMd6p+mjBybGwJyCaNfFb7S.........

    Seriously though, if I understand your objective I think it would be feasible to develop something like that, but I don't think its something you could integrate into Google's search services unless they added something on their end.

    You could pass a decryption key along with your query and the server would then decrypt records as it performed the search. It would be very resource intensive.

    As an close example, I have a web based password storage application in which I did not want to keep the encryption keys on the same server as the password database. So I generate a key with which to encrypt the records and the user keeps their key and must supply it every time they want to decrypt a record. I don't go so far as to enable searching of the encrypted data, I have a description field specifically for that purpose. The web application is called Passbox and is written in PHP.

  6. What an oxymoron! by hesaigo999ca · · Score: 2, Interesting

    Yeah, Id like my cake and eat it too!

    The only way this could work is if you has tags in the meta header of the encrypted file
    telling you that yes I am encrypted, but I have an image in me or my encrypted data is of the type accounting.

    This might work for indexing searches where you want to be able to return all the files on the pc (encrypted or not) that are images or etc...

  7. Re:A guy walks into a bar... by richie2000 · · Score: 2, Interesting

    But... That's not a valid car analogy since you're not allowed to drink and drive.

    --
    Money for nothing, pix for free
  8. It is possible to a certain extent by goombah99 · · Score: 2, Interesting

    There are encryption algorithms that allow addition. That is, the sum of two encrypted messages is an encryption of the sum. I've forgotten how these work exactly, I think they are some many to one mapping, and the addition operation is not simply adding the encrytped numerical representations.

    I came across these when looking at voting systems that allow N distributed people to vote in a way that sums the result before it is decrypted rather than decrypting to do the sum.

    Anyhow what this means is that is possible to do certain operations on a remote database, like sum a column, without the database knowing the result and without transmistting anything additional information inbound or outbound.

    You could presumably have your data stored in many forms on the database, each form suited for one type of query. Then you just query the approriate form to perform the operation of interest.

    I'm reasonably sure there is no way to perform very high order operations that one might typically do in a relational database however.

    --
    Some drink at the fountain of knowledge. Others just gargle.
  9. Re:Am I missing something? by thtrgremlin · · Score: 2, Interesting

    What it sounded like was that he wanted to keep a database with Google that was encrypted and wanted to search it remotely and securely, but without Google being able to look at the data. Even if that were possible, why are you trusting Google with that in the first place? Why not store it somewhere else? I would think keep encrypted data on a server and make a secure connection to it. You send your normal query across the encrypted channel to the secure server, it does its regular search and sends the result back across the secured channel. Add to that some secure authentication, and I thought that would have met the objective (even if the implementation is slightly different than described). If you wanted encrypted data stored in an untrusted location (why are you doing this again?) then you would think it would be necessary to hash specific queries as keys to encrypted data lacking the necessary information to decrypt the information remotely. Either way, guess I will see what other people are saying if this question seems more obvious (differently) to someone else.

    --
    Want Big Business out of government? Take away the incentive and start by getting government out of big business!
  10. I put something similiar to this together myself.. by airjrdn · · Score: 3, Interesting

    But it may not be everything you're looking for. My requirements were:
    1 - Mask the filename
    2 - Encrypt the contents
    3 - Add recovery data in case the file got damaged
    4 - Ability to view unmasked filename from web

    I put together a batch file I could drag/drop multiple files onto that used WinRAR to compress the files (individually), with encrypted filenames, a password (of course), and included archive recovery data. It then used ReNamer to encrypt the .rar filenames. After that, I simply FTP'd the files to the server.

    I had a webpage that would accept a password, and unencrypt the filenames so they were viewable in readable form on the page. Each one was a hyperlink. There was an extra step required if you wanted the downloaded filename to be unencrypted as well.

    After uploading 115G or so, my host alerted me to the fact that they didn't allow me to keep offsite backups there. :) So in the end, I'm not even using it at the moment.

    My solution didn't allow me to search within the files, but it did allow me to store files on the server that they had no way of viewing the contents of, or guessing the contents of based on filename.

  11. not hard by zogger · · Score: 2, Interesting

    Just use a book (or multiple books) code cipher for your index. You don't need to remember a thing beyond which books and what your key starting number is, the pattern. And if someone is in your house throwing all your books at cracking the remote server, you are already screwed and have much bigger problems, such as they probably already installed a keylogger on you. If you are that much of a target for someone to take that much interest....time for plan B or C then, involving plastic surgery, new ID and some nation where there is no extradition treaty 0_o

  12. Re:CONFIRMED: You are missing something. by mcrbids · · Score: 2, Interesting

    Sure the *NAME* is "Secure Sockets Layer", and perhaps that was what it was originally developed for, but it's just wrong to say that it can't be used otherwise, and/or that it only encrypts data "in transit", not on a server. Take a look at this:

    http://us2.php.net/manual/en/function.openssl-public-encrypt.php

    Here's the use of SSL functionality without (ahem) a socket. Right from the docs:

    This function can be used e.g. to encrypt message which can be then read only by owner of the private key. It can be also used to store secure data in database.

    I routinely use SSL to sign files in order to prove whodunnit. This information is stored alongside the signed document. Whether it's transported subsequently is inconsequential.

    --
    I have no problem with your religion until you decide it's reason to deprive others of the truth.
  13. Re:A guy walks into a bar... by dimeglio · · Score: 2, Interesting

    ...if I tell you a story in French and you don't understand it, you will have no idea what I told you and will not be able to answer questions about my story. However, if you are able to memorize all I told you phonetically I can ask if I said a word or not just by the sound. Yet you don't know exactly what I asked for, nor the meaning of the answer but you are able to answer that question since it doesn't imply meaning.

    So a possibility for the OP would be to store the information in a language unknown to anyone but to the poster. This language would need be compatible with the search algorithms used by Google. Not very practical but maybe someone can build on this.

    --
    Views expressed do not necessarily reflect those of the author.
  14. Re:It's not possible even in theory by raju1kabir · · Score: 2, Interesting

    I'm curious - why would you post a comment claiming that this can't even be done in theory, when the submitter included links in the summary to a paper that shows that it can?

    Because the paper doesn't propose any solution that is practical, or which even leads to a practical solution.

    In theory I can cure all forms of cancer - all I have to do is go through each cell in the victim's body and pluck out the cancerous ones.

    --
    "Patriotism is your conviction that this country is superior to all other countries because you were born in it." -- GBS