Closed Bug 1667564 Opened 4 years ago Closed 2 years ago

should automatically collect incoming public keys from emails, for later usage

Categories

(MailNews Core :: Security: OpenPGP, enhancement, P1)

Tracking

(thunderbird_esr91 wontfix)

RESOLVED FIXED
100 Branch

People

(Reporter: mkmelin, Assigned: mkmelin)

References

Details

Attachments

(3 files, 4 obsolete files)

Currently to get someone's key you have to do a manual import step. After this the key is "available" but untrusted.

I think we should automate this key collection so that TOFU encryption becomes real.

IIRC there are some technical aspects that prevent this atm; if the key storage becomes large, it becomes slow. We don't necessarily need to store incoming keys in that list though. We should store at least

  • email address
  • key
  • url (mid:, or https:) of where it was obtained (for investigative purposes)
  • timestamp
Priority: -- → P2
Priority: P2 → P1

There is no inherent reason why the key store has to be slow. The current format, however, does have serious scalability problems, because it doesn't allow random access. It would be better to use a proper database. Sequoia's native public store (which is not used by the Octopus) does that and we have no problem doing random access on 100ks of certificates. I'd strongly suggest you go this route instead of introducing another cache.

Agreed, they should just be stored in a database.

This stores keys from email attachments, Autocrypt headers and keyserver lookups into an IndexedDB database, along with some metadata about when/from where it was collected.
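Roughly, each collected entry might look like this (a sketch; field names are illustrative, not necessarily the patch's exact schema):

    let record = {
      fingerprint: "0123456789ABCDEF0123456789ABCDEF01234567",
      emails: ["alice@example.com"], // all user ID addresses, for the index
      pubKey: "-----BEGIN PGP PUBLIC KEY BLOCK-----\n...",
      sources: [{
        type: "attachment", // or "autocrypt", "keyserver"
        uri: "mid:message-id@example.com",
        timestamp: Date.now(),
      }],
    };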

Assignee: nobody → mkmelin+mozilla
Status: NEW → ASSIGNED

Great that there is progress on a P1 bug, thanks.
Forgive me for asking, but I could not see test coverage for Autocrypt key collection in the bug (unlike the test for an attached key). Since the collection, as I understand it, is a new feature, I wonder whether it is tested somewhere else, if not in this commit?

Correct, it's not there yet. Opening .eml from file - which the test does - is a bit special, so the header is not directly available in that case. So I punted on that for now ;) It does work in normal operation though.

See Also: → 1749664

We need to define the desired behavior for the keys cache.

Index keys by all contained email addresses.

When storing a candidate key, we should make sure that we can lookup the candidate key quickly.
This requires an index.
If a key contains multiple email addresses, we must create multiple index entries, one for each email address, with each index entry pointing to the key.

The user may receive multiple keys for an email address.
We need to decide how to handle that.

Alice may have received an email from Bob one week ago, which still contained an old key.
Today, Alice received an email with Bob's newer key.

Alice clicks Bob's old email.
Alice replies to that old email, and decides to encrypt for the first time.

In this scenario, I request that we MUST NOT offer ONLY the old key.
In other words, a trivial implementation that always has just "one most recently seen key" per email address would fail to suggest Bob's newer key to Alice.

The implementation must use one of the following approaches:

(a)
Store multiple candidate keys (that have different fingerprints) per email address.
In the above scenario, Thunderbird would offer both candidate keys to Alice, old and new.
Is indexeddb capable of this data model?
From https://developer.mozilla.org/en-US/docs/Web/API/IndexedDB_API/Basic_Terminology
"Each record in an index can point to only one record in its referenced object store".
If indexeddb doesn't support one-index-entry-to-multiple-records, then we cannot use it for this implementation approach.

(b)
Only store one candidate key.
This is tricky and risky.
First, we'd have to come up with our own model for identifying the "best" candidate key, that is, the only one we want to store in the cache.
Using "most recently seen" isn't a good approach, as outlined in the above example, as it would cause us to offer ONLY an older key to Alice.
An approach could also look at the "created" attribute. But that alone might not be reliable.
For example, Bob could have produced one key valid from 2015 to 2025, and another key valid from 2020 to 2021. If we're in 2022, this simple strategy would incorrectly prefer the newer key that has already expired.
Also, with a "one key only" strategy, you'd have to make sure that you don't store a key that has been revoked (overriding a non-revoked key).
I think this illustrates that a "one key per email address" strategy would require a smart implementation.
Furthermore, storing just one candidate key makes it easier for an attacker to trick someone.

David regularly sends email with his key.
Carol has repeatedly received David's key, but never had a need to encrypt.
Now an attacker, Eve, sends an email to Carol with David as a fake sender address, attaching a fake key for David (private key owned by Eve, user ID claims it's David's key). The email requests that Carol reply, including sensitive information, and insists that Carol encrypt the email.

With a simple implementation that collects only one key, Thunderbird might offer only the fake key.

I think this example is a strong argument for an implementation that collects and offers multiple candidate keys.
If there are multiple keys stored, Carol can notice that there are two keys for David, and she can notice that she needs to investigate which key is the correct one.

In addition to the above, the storage should avoid storing multiple entries with the same fingerprint.
An old email might contain Bob's key that expires mid 2022, and a newer email could contain the same key that has been extended to be valid until 2025.

Reading an older email caused storage of the expires-2022 key.
Reading a newer email with the expires-2025 key at a later time should detect that the new key is an update for an existing cached key. We shouldn't store two entries.

Later, after we have already stored the expires-2025 key, and Alice clicks an older email, we should not replace it with the expires-2022 key - rather we should keep the expires-2025 key version in the cache.

Then there is the question: is "expires" a reliable attribute for deciding which version of a key (by fingerprint) should be preferred? Is it possible that a user could decide to shorten the validity of a key? I think the answer is yes. We might have to look at a different attribute: we might have to check which version was most recently modified, based on the most recent attribute that carries a self signature (by signature timestamp), and prefer that one.

Oh, and when updating a key in the cache, the list of user IDs might have changed. Which means, we'll have to update the email-address-to-key index entries.

I think we shouldn't use a simplification like "always bind a received key to the sender email address".

Alice might send email to Bob and Carol.
Alice knows that Bob hasn't talked to Carol previously, but Bob might need to reply, and Alice attaches Carol's key along with her message.

When Bob reads the email, it should be possible to cache Carol's attached key, despite the email having been sent by Alice.

I think the above explanations show the complexity related to storing OpenPGP keys, and making sure you keep only the "best" and "most up to date" key, which might not be the version that you have seen most recently (e.g. when clicking an old email).

(Relying on an email timestamp isn't a great idea either, because someone else might forward you an old version of someone's key in a current email.)

I think that it would be best if Thunderbird avoided re-inventing and coding all the logic that is required to keep the best version of a key. Obviously an OpenPGP library already has the logic for that.

But as mentioned before, the simple key file format used by RNP doesn't offer random access, and sequentially looking through a large list of randomly collected keys would be slow. Or maybe it won't? [edited: "slow"]

Ideally, I'd prefer to use a standard key ring for storing the cached keys.

Here is an idea that we could use to combine the best of both worlds.

We could have one keyring cache per fingerprint.
Each keyring cache contains exactly one key for a single fingerprint.

We could have one database that contains all the keyring caches.
Each entry in the cache is the contents of one keyring for one fingerprint.

We have an index by fingerprint that points to the related entry in the database.

We can have multiple index entries, one for each of the key's email addresses, that point to the same entry.

The following strategy would be used when updating the cache:

  • get fingerprint of newCandidateKey
  • lookup database entry for the fingerprint
  • if found, read the existing entry, and construct a temporary RNP key store (in memory)
  • use an RNP call to import the newCandidateKey into that temporary key store
  • this makes full use of all of RNP's existing logic to merge keys
  • obtain the storage bytes for that key store, and update the cache database entry

If there is no cache entry for the fingerprint yet, we still create a temporary key store for the fingerprint, import the newCandidateKey into the empty key store, and use the same approach for storing it to disk.
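A sketch of this update path, assuming a small promise-based IndexedDB wrapper (db.get/db.put) and hypothetical RNP helper names; the real Thunderbird RNP bindings may differ:

    async function updateCollectedKey(db, newCandidateKey) {
      const fpr = newCandidateKey.fingerprint;
      const existing = await db.get("keyrings", fpr); // one keyring per fingerprint
      const tempStore = RNP.newKeyStore();            // temporary in-memory key store
      if (existing) {
        RNP.importKey(tempStore, existing.keyBlock);  // load the cached version first
      }
      RNP.importKey(tempStore, newCandidateKey.keyBlock); // RNP merges by fingerprint
      const merged = RNP.exportKey(tempStore, fpr);   // serialized, merged result
      await db.put("keyrings", { fingerprint: fpr, keyBlock: merged });
      // After a merge the set of user IDs may have changed, so the
      // email-address index entries must be refreshed as well.
    }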

Some (possibly useful) input on this topic:

  • the fingerprint is a unique identifier for the key material; so even when a key's expiration time is extended, a user ID is added or changed, or a new subkey is added, the fingerprint of the primary key remains the same.
  • inside of RNP, all keys are stored in a map <fingerprint, key object>, so fingerprint key lookup is quite fast. Userid/Keyid lookup iterates over the keyring.
  • when an updated primary key is imported into the keyring, it doesn't mean 'a new key is added' but rather 'all of its userids, signatures and subkeys are appended to the existing key'.

So, in terms of indexeddb, a possible solution could be to have the userid (email) as index, and a string with the list of fingerprints of keys where this userid was noticed as value. Not good db design, I know, but it should work for a few keys per userid. When TB needs a key for some email address, it gets the list of fingerprints, and then requests the corresponding keys from RNP. Each key then must be checked for validity, for the validity of the userid, and so on.
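In code, that model might look roughly like this (store layout and helper names are illustrative assumptions):

    async function candidateKeysFor(db, email) {
      let entry = await db.get("byEmail", email); // { email, fingerprints: [...] }
      if (!entry) {
        return [];
      }
      // Resolve each fingerprint to a key, then validate it (expiry,
      // revocation, userid validity) before offering it to the user.
      let keys = await Promise.all(
        entry.fingerprints.map(f => db.get("keyrings", f))
      );
      return keys.filter(k => k && isUsableFor(k, email)); // isUsableFor: hypothetical
    }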

(In reply to Nickolay Olshevsky from comment #11)

Some (possibly useful) input on this topic:

  • the fingerprint is a unique identifier for the key material; so even when a key's expiration time is extended, a user ID is added or changed, or a new subkey is added, the fingerprint of the primary key remains the same.

Thanks for confirming, yes that's an important aspect of the design.

  • inside of RNP, all keys are stored in a map <fingerprint, key object>, so fingerprint key lookup is quite fast.

If we used a single RNP keyring for all candidate keys, we'd have to save that big keyring to disk every time we update it. That's slow. Also, we'd have to read the full keyring into memory the first time we need it. That might also be slow for a big keyring. That processing can be avoided with the suggested approach to have a database containing many small keyrings (one fingerprint per keyring).

To clarify, the idea is to keep the regular keys separate from the candidate keys. For all regular keys, we use a regular RNP key ring. Only for the candidate keys we'd use this suggested database approach.

Userid/Keyid lookup iterates over the keyring.

Yes, and because we try to find candidate keys by both fingerprint (updating) and email address (searching), we need indexes for both.

  • when an updated primary key is imported into the keyring, it doesn't mean 'a new key is added' but rather 'all of its userids, signatures and subkeys are appended to the existing key'.

Right. That's also a reason for the suggested design. If we already have a candidate key stored, and we see a key with the same fingerprint, the intention is to ask RNP to merge it.

So, in terms of indexeddb, a possible solution could be to have the userid (email) as index, and a string with the list of fingerprints of keys where this userid was noticed as value. Not good db design, I know, but it should work for a few keys per userid. When TB needs a key for some email address, it gets the list of fingerprints, and then requests the corresponding keys from RNP.

We should obtain the candidate keys from disk storage, without having all candidate keys in memory. So we cannot request the key by fingerprint from RNP.

(In reply to Nickolay Olshevsky from comment #11)

So, in terms of indexeddb, a possible solution could be to have the userid (email) as index, and a string with the list of fingerprints of keys where this userid was noticed as value. Not good db design, I know, but it should work for a few keys per userid. When TB needs a key for some email address, it gets the list of fingerprints,

Thanks, yes this could be a workaround, if we're required to use indexedDB.

But this is an example why I'd prefer to use sqlite.

(In reply to Kai Engert (:KaiE:) from comment #7)

(a)
Store multiple candidate keys (that have different fingerprints) per email address.
In the above scenario, Thunderbird would offer both candidate keys to Alice, old and new.
Is indexeddb capable of this data model?
From https://developer.mozilla.org/en-US/docs/Web/API/IndexedDB_API/Basic_Terminology
"Each record in an index can point to only one record in its referenced object store".

I might have misunderstood that. Apparently an index can have multiple records with the same value, and each one can point to a different record in the object store, I guess.

Magnus uses an index that gives me the impression that it's actually possible:

        objectStore.createIndex("emails", "emails", {
          unique: false,    // several records may list the same address
          multiEntry: true, // one index entry per element of the "emails" array
        })
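
For reference, a lookup through such a multiEntry index would return every stored record whose emails array contains the queried address (a sketch; the "keys" store name is an assumption):

    let tx = db.transaction("keys", "readonly");
    let request = tx.objectStore("keys").index("emails").getAll("bob@example.com");
    request.onsuccess = () => {
      let candidates = request.result; // every collected key listing this address
    };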

My impression is that indexedDB isn't intended to be used for this kind of data management. It's a standard that lets web sites store data in the browser, without giving the site more control than that.

Looking at the implementation of indexeddb, the files that are stored in the profile folder are actually sqlite files!

I think we should use sqlite directly, and not force us to go through an artificially limited API that was designed for a different purpose.

We should use web technology where we can, and I do think IndexedDB is actually a very suitable way to store keys in this scenario.

I think a lot of the comments above were based on a misunderstanding of what the patch did. It was storing keys with a primary key of fingerprint+type (type being keyserver, attachment, etc.). We'll index all the emails so that we can find which candidate fingerprints there are for an email address.

You do make some good points on what we should be storing, and that we should just be storing one key per fingerprint even if they have different sources - since the same fingerprint will mean the same key (essentially). I'm updating the code, more or less along the lines of comment 10, so that we will store keys by fingerprint only, and if/when we get an existing key, give new+old to RNP, let it sort out the outcome, and store the result - so that we don't have to worry about expiry or other changes.

(In reply to Magnus Melin [:mkmelin] from comment #16)

I think a lot of the comments above were based on a misunderstanding of what the patch did. It was storing keys with a primary key of fingerprint+type (type being keyserver, attachment, etc.). We'll index all the emails so that we can find which candidate fingerprints there are for an email address.

Note that many of my comments weren't directly about your patch.

I was doing general brainstorming, and made suggestions for the implementation.

Attached patch fix-auto-update.patch (obsolete) — Splinter Review
Comment on attachment 9260936 [details] [diff] [review]
fix-auto-update.patch

Review of attachment 9260936 [details] [diff] [review]:
-----------------------------------------------------------------

Thanks! Will add it to D135074

::: mail/extensions/openpgp/content/ui/enigmailMessengerOverlay.js
@@ +3314,5 @@
> +      // Charlie's revoked or extended key attached. It's useful for
> +      // me to learn that.
> +
> +      if (oldKey.keyCreated != newKey.keyCreated) {
> +        // This should never happen, let's skip.

Since it should be technically impossible, I don't think we need this check.
Attachment #9260936 - Attachment is obsolete: true
Blocks: 1752428
Attached patch ontop-colldb.patch (obsolete) — Splinter Review

Magnus, while testing and trying to understand an unexpected behavior, I suspected a transaction visibility issue, and applied the ontop-colldb.patch to your code. I guess it cannot hurt to have these explicit commit statements as soon as we're done. If you agree, could you please apply this patch in phab? Thanks

Flags: needinfo?(mkmelin+mozilla)
Comment on attachment 9261424 [details] [diff] [review]
ontop-colldb.patch

Review of attachment 9261424 [details] [diff] [review]:
-----------------------------------------------------------------

According to https://developer.mozilla.org/en-US/docs/Web/API/IDBTransaction/commit it should not be necessary to do this.
Especially for the clear(), it's maybe best not to. Though it should not make a difference.

Comment on attachment 9261424 [details] [diff] [review]
ontop-colldb.patch

ok, I agree it's probably fine, because the objects go out of scope quickly.

But by not calling commit explicitly, we rely on automatic immediate cleanup. If there isn't immediate cleanup, the visibility of new items to other code may be delayed.
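
For illustration, the explicit variant looks like this (a minimal sketch; the store name is an assumption):

    let tx = db.transaction("keys", "readwrite");
    tx.objectStore("keys").put(record);
    tx.commit(); // flush now, instead of waiting for the implicit auto-commit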

Attachment #9261424 - Attachment is obsolete: true
See Also: → 1757385

Including other people's keys in an email is reasonable if they are part of the conversation, and the sender wants to ensure everyone on the CC list has everyone's keys, to reply to an encrypted message.

On the other hand, I think it's reasonable to limit Mallory's ability to poison our lookup cache.

Mallory could send email to Alice, which includes Mallory's own key, and a fake key for Bob.
However, if Bob isn't included in the email, then Bob has no ability to notice what Mallory is doing.

Suggestion:

  • ignore all attached keys for email addresses that aren't
    included in the TO/CC list.
    (This filtering should be implemented as part of this bug;
    a minimal sketch follows below.)
    The filtering will restrict automatic key collection.
    The user will still be able to manually click the attachment and import,
    but then the user has a chance to notice what the email contains.
  • by limiting the automatic collection to email addresses that are on TO/CC,
    Mallory's attempt at easy poisoning of Alice's cache requires that Bob is
    a recipient of the email. If Bob sees the email with a false key,
    then Thunderbird should warn Bob about it. (bug 1757385)
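
A minimal sketch of the suggested filter (helper and field names are hypothetical):

    // Only auto-collect an attached key if at least one of its user ID
    // addresses appears among the message's recipients.
    function shouldCollect(key, msgRecipients) {
      // msgRecipients: lower-cased addresses from the To/Cc headers
      // (the sender's own address would presumably also be allowed)
      return key.emails.some(addr => msgRecipients.includes(addr.toLowerCase()));
    }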

I think it is important that we don't allow unlimited key collection of any arriving key data. The strategy described in comment 24 might be a sufficient way for limiting what we import.

Ideally, we should clean up entries that are no longer required.

We could remove entries from CollectedKeysDB whenever a key is permanently imported, or when we learn that a collected key is now revoked or expired.

But regardless of cleanup attempts, we still have the risk of redundant information.
Any code that considers using CollectedKeysDB as a source for keys must check for duplicate information, and if a permanently imported key is already available, the information in CollectedKeysDB should be skipped (e.g. don't offer the same key twice to the user). (Such a check is already used in the candidate patch for bug 1627956.)

Because we need code that checks for duplicates prior to using information from CollectedKeysDB, I think it's fine to postpone automatic cleanup of CollectedKeysDB.
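
A sketch of such a duplicate check (the CollectedKeysDB lookup and keyring helper names are assumptions):

    async function usableCollectedKeys(email) {
      let collected = await CollectedKeysDB.findKeysForEmail(email); // assumed API
      // Skip any collected key whose fingerprint is already permanently imported.
      return collected.filter(k => !permanentKeyring.hasFingerprint(k.fingerprint));
    }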

And we should do some sanity filtering.

If we have a secret key for an email address, we shouldn't collect an advertised public key.

If incoming data contains more than one (non-expired, non-revoked) key for an email address, that isn't helpful but misleading, and I suggest skipping all of those duplicates.
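
A sketch of these sanity filters (helper names hypothetical):

    function sanityFilter(incomingKeys, haveSecretKeyFor) {
      // Count how many incoming keys claim each address. Ideally this
      // should only count non-expired, non-revoked keys.
      let counts = new Map();
      for (let key of incomingKeys) {
        for (let addr of key.emails) {
          counts.set(addr, (counts.get(addr) || 0) + 1);
        }
      }
      return incomingKeys.filter(key =>
        key.emails.every(addr =>
          !haveSecretKeyFor(addr) && // own identity: don't collect advertised keys
          counts.get(addr) === 1     // multiple keys for one address: skip them all
        )
      );
    }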

Flags: needinfo?(mkmelin+mozilla)
Attached patch 1667564-armor.patch (obsolete) — Splinter Review

Magnus, you need this patch on top of your phab revision. I'll explain in phab.

Attached patch 1667564-armor-v2.patch (obsolete) — Splinter Review

fixed (v1 had unwanted test code)

Attachment #9265730 - Attachment is obsolete: true

Comment on attachment 9265732 [details] [diff] [review]
1667564-armor-v2.patch

patch was merged into phab, marking obsolete

Attachment #9265732 - Attachment is obsolete: true

Attachment 9266663 [details] implements my suggestions from comment 24 and comment 26.

I have one additional concern regarding efficiency.

Every time the user clicks an email, we will update the CollectedKeysDB again.
(And I realize the same applies also for the attempt to automatically update known keys.)

Can we think of a way to minimize this repeated processing?

We could have an in-memory list of recently processed emails with OpenPGP keys attached (e.g. 20 entries). Every time we go through processing of an email with OpenPGP keys attached, we could remember the message ID.

If we open an email with a message ID that's in that recently-used list, we skip all automatic processing of attached keys.
This could help when jumping between the same set of messages.
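
A minimal sketch of such a list (size and eviction policy are illustrative):

    const recentlyProcessed = [];
    const MAX_RECENT = 20;
    function alreadyProcessed(messageId) {
      if (recentlyProcessed.includes(messageId)) {
        return true; // skip automatic processing of attached keys
      }
      recentlyProcessed.push(messageId);
      if (recentlyProcessed.length > MAX_RECENT) {
        recentlyProcessed.shift(); // evict the oldest entry
      }
      return false;
    }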

Flags: needinfo?(mkmelin+mozilla)

IIRC in one of my iterations I stored a date of last update. I think I should add that back, so we can just check whether the key was collected very recently, say within the last day or so; if it was, there is no need to do further processing.
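
Roughly (the field name and the one-day window are assumptions):

    const ONE_DAY_MS = 24 * 60 * 60 * 1000;
    function needsReprocessing(record) {
      return !record.lastCollected ||
             Date.now() - record.lastCollected > ONE_DAY_MS;
    }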

Flags: needinfo?(mkmelin+mozilla)

Magnus, I see an intermittent failure in my new test, where an element newly added to CollectedKeysDB isn't visible in the test.

I think this can be caused by a missing transaction.commit() call. When calling commit(), we ensure that the element will be visible immediately to all new transactions (and not be delayed until automatic JS object cleanup triggers the implicit commit).

I cannot reproduce the intermittent failure after adding commit().

I added the commit() statements now.

Pushed by kaie@kuix.de:
https://hg.mozilla.org/comm-central/rev/d3b9394d682a
collect OpenPGP keys from email attachments, Autocrypt headers and keyserver lookups. r=kaie
https://hg.mozilla.org/comm-central/rev/6b71d6648154
Don't collect OpenPGP key candidates that don't pass sanity checks. r=mkmelin

Status: ASSIGNED → RESOLVED
Closed: 2 years ago
Resolution: --- → FIXED
Target Milestone: --- → 100 Branch

Pushed by mkmelin@iki.fi:
https://hg.mozilla.org/comm-central/rev/575874479590
Function mergePublicKeyBlocks, clarify comment and use const. r=PatrickBrunschwig

Regressions: 1763907