Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The actual content is encrypted, but one can still build an index that points to individual email IDs and score the search results properly. Only when returning the top N results that one needs to decrypt those N emails with the right keys. The index would be kept in the server. Of course, the devil is in the details and things like email threading, order by by date or group by senders will make or break the user experience.


A full text index that's actually useful will allow you to largely piece back together the original content, modulo stemming and stopwords.

I guess it would be something like encrypting the index, then decrypt it on demand, just like you would decrypt individual messages on demand.


Not if the index values are encrypted (public-key) too.

hashed-word => encrypted-list-of-msg-indices

something like that.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: