openpgp-notes/book/source/10-encryption.md
2023-12-08 16:43:38 +01:00

13 KiB

(encryption_chapter)=

Encryption

Encryption is one of the core facilities of OpenPGP. It provides confidentiality.

For an in-depth, packet-level view of encrypted data in OpenPGP, see {ref}zoom_enc.

Terminology

Term Description
SEIPD Packet Symmetrically Encrypted, Integrity Protected Data packet; contains the encrypted message payload
SKESK Packet Symmetric-Key-Encrypted Session-Key packet; contains or provides a passphrase-encrypted Session-Key
PKESK Packet Public-Key-Encrypted Session-Key packet; contains a session-key encrypted using an asymmetric public-key
Session-Key Symmetric encryption key, which is either used directly as - or to derive - the Message-Key
Message-Key Symmetric encryption key used to encrypt the contents of the SEIPD packet

High-Level overview of the message encryption process

Encryption in OpenPGP is performed in two distinct steps:

  1. The plaintext is encrypted based on a (secret) symmetric key, the session key. The (potentially large) ciphertext only needs to be stored once, even if it is sent to multiple recipients. All recipients get access to the same shared session key to decrypt the message.
  2. For each recipient of the message, a packet that contains the session key is generated.
    • Usually, the session key is encrypted to a public encryption component key of the recipient.
    • Alternatively - or additionally - the session key may also be encrypted using a passphrase. This is a specialized and less commonly used mode of operation that doesn't require OpenPGP certificates.
Above, "plaintext" either means a *Literal Data* packet, *Compressed Data* packet or a *signed message*.
A *signed message* on the other hand is a packet sequence that either resembles an *inline-signed message* (a *Literal Data* packet sandwhiched between one or more *One-Pass-Signature* and their respective *Signature* packets), or a *prefixed-signed* message (one or more *Signature* packets followed by a single *Literal Data* packet).

History of encryption mechanisms in OpenPGP

OpenPGP's encryption mechanisms have evolved over time. The RFC shows an overview of encryption mechanisms, and how they may be combined.

Two generations of encryption mechanisms are currently relevant in OpenPGP, and will co-exist for the foreseeable future. The main difference between these lies in the symmetric part of the encryption mechanism, represented by versions 1 and 2 of the Symmetrically Encrypted and Integrity Protected Data packets (abbreviated as "SEIPD"), as they make use of different techniques to provide non-malleability. More on these below.

Older, legacy encryption mechanisms exist in OpenPGP. However, those must not be used for encryption anymore. Messages encrypted using these legacy mechanisms may still be decrypted, although with caution. For more information see the decryption chapter.

SEIPD packets are typically used in combination with two mechanisms that provide session keys:

The typical combination of mechanisms for encryption in OpenPGP is a hybrid cryptosystem, consisting of one or more Public-Key Encrypted Session Key packets (PKESK), followed by a [Symmetrically Encrypted Integrity Protected Data* (SEIPD) packet. In this combination, an asymmetric cryptographic mechanism is used to protect a session key inside PKESK packets, which is used to protect the plaintext using symmetric-key encryption in a SEIPD packet.

Encapsulating session keys: PKESK, SKESK

"*ESK" (encrypted session-key) packets are a family of mechanisms for encapsulation of symmetric key material. There are two branches:

  • PKESK: Uses asymmetric OpenPGP key material to protect a session key, and
  • SKESK: Uses passphrases to protect the symmetric key material, instead of OpenPGP asymmetric key material (this is less commonly used).

An arbitrary number of PKESKs and SKESKs can be used for the same message. It is also possible to mix those, resulting in a message which can be decrypted using either one of the designated OpenPGP keys or any of the passwords used to encrypt the message. This is useful to make a message available to a number of known recipients, with the option to provide the password to future recipients.

PKESK: Session key encrypted to an asymmetric OpenPGP key

To encrypt an OpenPGP message for a recipient, the session-key is encrypted to the recipients public key. The resulting encrypted session key is packed into a PKESK packet, which holds essential metadata, like an identifier of the recipients encryption (sub)-key.

This procedure is repeated for each recipient of the message, and all resulting PKESK packets are prepended to the SEIPD packet (see below) containing the actual message.

Typically, the sender would also include themselves as a recipient, in order to be able to decrypt the sent message at a later point in time.

SKESK: Session key encrypted to a passphrase

As an alternative (or augmentation) to PKESK packets, a message can also be encrypted to a symmetric passphrase. This is done using a SKESK packet, which basically uses an S2K mechanism to derive a symmetric key from a passphrase, which is then either used directly as the session-key, or more commonly, used as a key-encapsulation-key (KEK) to encrypt the session-key.

Also see https://flowcrypt.com/docs/guide/send-and-receive/send-password-protected-emails.html

As for protection of secret key material, it is important to chose appropriate S2K parameters when generating an SKESK packet. The specification currently recommends to use either Iterated and Salted S2K or Argon2.

:class: warning

Add further guidance for recommended S2K parameters, like iteration count or Argon2 configuration. Perhaps in a dedicated "S2K Parameters" section, which can be reused for the encryption chapter and when we talk about secret key encryption in TSKs.

Symmetric encryption of data, SEIPD

Symmetrically Encrypted Integrity Protected Data (SEIPD) packets represent the symmetric aspect of OpenPGP's encryption mechanism. The function of these packets is entirely independent of (asymmetric) OpenPGP keys.

A SEIPD packet contains the actual payload: the ciphertext of the encrypted message. For a large encrypted message, the SEIPD packet will also be large.

SEIPD packets are the successor to the [Symmetrically Encrypted Data](https://www.ietf.org/archive/id/draft-ietf-openpgp-crypto-refresh-12.html#name-symmetrically-encrypted-dat) packet, which is obsolete.

Two versions of the SEIPD packet (differentiated by the version number) have been specified. Version 1, introduced in RFC4880, is used in OpenPGP v4 while SEIPD version 2 was introduced with OpenPGP v6. Both versions can be used with either OpenPGP v4 or v6 keys, although OpenPGP v4 keys need to announce support for SEIPD version 2 via the Feature signature subpacket.

When decrypted, the data contained in a SEIPD packet forms an OpenPGP message. That is, the decrypted data consists of a series of OpenPGP packets.

In both versions of SEIPD, the decryptor must have obtained a session key in a previous step, before processing the SEIPD packet. Using this session key, the decryptor can decrypt the SEIPD packet and process the plaintext data that it contains.

v1 SEIPD, based on MDC

The version 1 SEIPD mechanism is supported by all modern OpenPGP version 4 implementations. It was introduced in RFC 4880 as a replacement for the SED (Symmetricaly Encrypted Data) packet. SEIPDv1 provides integrity protection of the ciphertext using a SHA-1 checksum of the plaintext as modification detection code.

Version 1 SEIPD can only be combined with version 3 PKESK and/or version 4 SKESK packets.

In this version of the SEIPD packet, the session-key is used directly as message-key, meaning the payload is encrypted symmetrically using the session-key.

When communicating with a mix of recipients, some of whose OpenPGP software only supports OpenPGP version 4, then this mechanism must be used.

:name: fig-encryption-seipdv1-pkesk
:alt: Depicts a dotted hexagon labeled "Plaintext", from which a curved arrow passes another dotted hexagon "Session Key" and finally points to a "SEIPDv1" packet. Two more curved arrows originate from the session key and pass Alice' and Bob's encryption key, ending in two PKESK packets.

With SEIPDv1, the session-key is directly used as message-key to encrypt the payload

(SEIPDv2)=

v2 SEIPD, based on AEAD

The version 2 SEIPD mechanism was introduced in OpenPGP version 6. Consequently, it can only be used for encryption when all recipients explicitly announce support for it using a Feature signature subpacket. It provides integrity protection of the ciphertext using AEAD (authenticated encryption with additional data). v2 SEIPD can only be combined with either version 6 PKESK and/or version 6 SKESK packets.

In version 2 SEIPD, the session key is transformed into a message key, based on a per-message salt value stored separately in the v2 SEIPD packet. The message key is then used in an AEAD scheme to encrypt the message payload.

The session-key can use a different symmetric algorithm than the message-key.
:name: fig-encryption-seipdv2-pkesk
:alt: TODO

With SEIPDv2, the message-key is derived from the session-key in an extra step.

This additional step introduces key-separation into the protocol, which protects against certain attacks, such as an OpenPGP SEIP downgrade attack.

:class: warning

Explain, that with SEIPDv2, a session-key can essentially protect more than one message by reusing the same session-key and *ESK packets with a fresh, per-message salt.

This might very well go into the advanced topics section though.

Advanced topics

Encrypt to multiple/single subkey per certificate?

A recipients certificate may possibly contain more than one usable encryption subkey. This raises the question, should the message be encrypted for all of them?

There is the argument, that a powerful attacker might have managed to add an attacker-controlled encryption subkey to the victims certificate. In this case, only encrypting to the "newest" encryption key would help uncovering such an attack, although a powerful attacker could just MitM any sent messages and just add a PKESK for the victim-controlled encryption keys to hide the fact that the sender used a different key.

On the other hand, a user might have multiple encryption subkeys on purpose. Picture for example a scenario where the same certificate is used on multiple devices, but each devices has dedicated encryption subkeys to allow for smoother revocation in case of a lost device. In this scenario, it is important that the sender encrypts the message to all available encryption subkeys.

"Negotiating" algorithms based on recipients preference subpackets

Prevent "downgrade" -> Policy

Each implementation should define a "minimum" level of security when it comes to algorithms and key lengths. If the lowest common denominator of symmetric encryption algorithms preferred by a set of recipients provides too little security, the implementation should either use a configured fallback algorithm instead, or fail to produce a message at all.

Implications of how a recipient cert is "addressed" (fingerprint/key-ID vs. user-ID) (preferences, expiration, revocation)

:class: warning

This has been described elsewhere already.
See 9.7.3

AEAD modes in v2 SEIPD: GCM

:class: warning

Produce text around discussion: https://mailarchive.ietf.org/arch/msg/openpgp/ZTYD5VJsG1k2jJBbn5zIAf5o7d4/