Store pubkey cache decompressed on disk #3505

@michaelsproul

Description

Currently Lighthouse takes a long time to start up while it reloads the public key cache from disk.

The fundamental reason for this is that it stores the BLS pubkeys in their compressed form and then has to decompress every one of them for the in-memory cache:

/// Wrapper for a public key stored in the database.
///
/// Keyed by the validator index as `Hash256::from_low_u64_be(index)`.
struct DatabasePubkey(PublicKeyBytes);
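
For context, reloading the cache means doing roughly the following for every stored key (a simplified sketch, assuming the decompress method on the bls wrapper types; the real cache code also tracks validator indices):

use bls::{Error, PublicKey, PublicKeyBytes};

// Simplified sketch: every compressed 48-byte key read from disk must go
// through an elliptic-curve point decompression before it can be used,
// which is what makes startup slow.
fn rebuild_cache(stored: Vec<PublicKeyBytes>) -> Result<Vec<PublicKey>, Error> {
    stored.into_iter().map(|bytes| bytes.decompress()).collect()
}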

Just changing that PublicKeyBytes to PublicKey would not be sufficient, because the two types share the same compressed SSZ representation:

/// Contains the functions required for a `ssz::Encode` implementation.
///
/// Does not include the `Impl` section since it gets very complicated when it comes to generics.
macro_rules! impl_ssz_encode {
    ($byte_size: expr) => {
        fn is_ssz_fixed_len() -> bool {
            true
        }

        fn ssz_fixed_len() -> usize {
            $byte_size
        }

        fn ssz_bytes_len(&self) -> usize {
            $byte_size
        }

        fn ssz_append(&self, buf: &mut Vec<u8>) {
            buf.extend_from_slice(&self.serialize())
        }
    };
}

impl TPublicKey for blst_core::PublicKey {
    fn serialize(&self) -> [u8; PUBLIC_KEY_BYTES_LEN] {
        self.compress()
    }

    // ...
}

So we would need to create new methods on bls::GenericPublicKey, like serialize_uncompressed and deserialize_uncompressed, which use the 96-byte serialization of the PublicKey rather than the 48-byte compressed serialization. For blst we can use the PublicKey::serialize function, and for milagro we can use PublicKey::as_uncompressed_bytes. Both libraries also include deserialization functions that operate on the uncompressed bytes.
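
For blst, the additions might look roughly like this (a sketch only; the trait changes, the 96-byte constant name, and the error conversion are assumptions rather than a final API):

pub const PUBLIC_KEY_UNCOMPRESSED_BYTES_LEN: usize = 96;

impl TPublicKey for blst_core::PublicKey {
    // ... existing methods, including the 48-byte compressed `serialize` ...

    fn serialize_uncompressed(&self) -> [u8; PUBLIC_KEY_UNCOMPRESSED_BYTES_LEN] {
        // blst's inherent `serialize` produces the 96-byte uncompressed
        // encoding, in contrast to `compress`, which produces 48 bytes.
        blst_core::PublicKey::serialize(self)
    }

    fn deserialize_uncompressed(bytes: &[u8]) -> Result<Self, Error> {
        // Assumes the wrapper's `Error` converts from `BLST_ERROR`, as it
        // does for the existing compressed deserialization path.
        blst_core::PublicKey::deserialize(bytes).map_err(Into::into)
    }
}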

Once these methods are in place in the bls wrapper lib, we can call them from the DatabasePubkey wrapper. We'll need two versions of DatabasePubkey in order to implement the schema migration from compressed keys to uncompressed keys.
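
Something along these lines, where the type names and the upgrade hook are illustrative only and serialize_uncompressed is the method proposed above:

use bls::{Error, PublicKey, PublicKeyBytes};

/// Old on-disk format: the 48-byte compressed key (current schema).
struct LegacyDatabasePubkey(PublicKeyBytes);

/// New on-disk format: the 96-byte uncompressed serialization of the key.
struct DatabasePubkey([u8; 96]);

impl LegacyDatabasePubkey {
    /// Decompress once during the schema migration and re-store the key
    /// uncompressed, so later startups can skip point decompression entirely.
    fn upgrade(&self) -> Result<DatabasePubkey, Error> {
        let pubkey: PublicKey = self.0.decompress()?;
        Ok(DatabasePubkey(pubkey.serialize_uncompressed()))
    }
}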

By storing the bytes uncompressed, we'll double the disk space required for the pubkey cache, from 500k * 48 bytes = 24 MB to 48 MB. Given the overall size of the database, I believe this is an acceptable trade-off. We could attempt to apply a compression algorithm with faster decompression, like zstd, to the data, but seeing as the input is pseudo-random bytes, I suspect such naive compression would be ineffective.

Version

Lighthouse v3.0.0

Labels

backwards-incompat (Backwards-incompatible API change), crypto (An issue/PR that touches cryptography code), database, optimization (Something to make Lighthouse run more efficiently)
