I'm not sure encryption is the issue here. Why do you think this data needs to be more encrypted than the rest of the user's home directory, which should hopefully already be protected by full-disk encryption if the user cares about that sort of thing?
Because it would be really easy to extract a lot of data out of that database, which is what Microsoft Recall is being criticized for
But if an attacker has decrypted access to a user's home directory, aren't they screwed anyway?
I guess that's right
Not necessarily, recall might contain more information than what is currently saved on disk. For example bank statements, accessible though their web application, protected by 2FA.
A recall that ran entirely locally—which is absolutely a necessary precondition for it to count as "secure"—would necessarily contain only information stored on disk because where else would it put the data it's collecting/analyzing?
In other words, if it screenshots you accessing your back via website, that screenshot would be stored locally and would be just as protected by full disk encryption as the rest of your files.
I disagree, I'm with OP. screenshots contain (previously) temporary information from all sorts, such as a private meeting between 2 parties with confidencial, eyes only, data. And for going towards extreme privacy end of spectrum, proving you know someone is already a red flag.
If someone has a trojan running with access to the disk, yes it's a big deal. But it's still worth limiting the extent of it by putting extra protection in the things such this. A hacker can have the screenshot files but won't be able to do anything with it.
Unless you're constantly running a secure overwrite of your free disk space, ram and CPU caches, no data is truly temporary. There is always a possibility for recovery by a skilled enough adversary.
Well yea that's the point: unencrypted recall database would make this sooo much easier.
My plan was to use asymmetric encryption where the secret key is again encrypted using something like AES
I think your terminology is off. AES is an example of symmetric encryption: Decryption requires the same key as encryption.
An example of asymmetric encryption would be public-key cryptography: You encrypt a message with the public key, but only a private key can decrypt the result.
AES should be fine for encrypting large blocks of data.
I believe that for systems like TLS, asymmetric encryption is only used briefly to negotiate a symmetric key between client and server.
Have in mind most image processing is done with lowered resolutions for getting more speed. So consider having a downscale parameterized as : image reduction ratio, and method (average, anti aliasing, or just use one of the pixels and discard the others)
Op I think this is an important research, if not for educational purposes, at least to provide a demo to supposedly Microsoft experts how things should be done responsibly.
If nothing else, it encourages discourse in privacy.
Cryptography
cryptography (noun). The discipline concerned with communication security (eg, confidentiality of messages, integrity of messages, sender authentication, non-repudiation of messages, and many other related issues), regardless of the used medium such as pencil and paper or computers.
This community is for links about and discussion of cryptography specifically. For privacy technology more generally, use !privacy.
This community is explicitly not about cryptocurrency; see !crypto for that.