Ledger and Snapshots#
The ledger and snapshot files written by CCF nodes to disk should be managed by operators to allow for safe backup of the service and application state as well as fast join and recovery procedures. This section describes how these files are generated and how operators should manage them effectively.
See the Audit section to read about offline ledger auditability.
The ledger is the persistent replicated append-only record of the transactions that have been executed by the CCF service. It is written by the primary node when a transaction is executed and replicated to all backups which maintain their own duplicated copy. Each node in a network creates and maintains its own local copy of the ledger. Committed entries are always byte identical between a majority of nodes, but a node may be more or less up to date, and uncommitted entries may differ.
On each node, the ledger is written to disk in a directory specified by the
ledger.directory configuration entry.
It is also possible to specify optional read-only ledger directories
ledger.read_only_directories. This enables CCF to have access to historical transactions, for example after joining from a snapshot (see Historical Transactions). Note that only committed ledger files (those whose name ends with
.committed) can be read from this directory. This option can be used to specify a shared directory that all nodes in the network can access to serve historical ledger entries.
The ledger growths as transactions mutate CCF’s key-value store. The ledger is split into multiple files (or chunks) written by a node in the directory specified by the
ledger.directory configuration entry. Even though there are multiple ledger files on disk, there is only one unique logical ledger file for the lifetime of a CCF service (and across recoveries). The logical ledger can be reconstituted by parsing the ledger files in sequence, based on the sequence number included in their file names.
The size of each ledger file is controlled by the
ledger.chunk_size configuration entry.
Ledger files containing only committed entries are named
<end_seqno> the sequence number of the first and last transaction in the ledger, respectively. These files are closed and immutable and it is safe to replicate them to backup storage. They are identical across nodes, provided
ledger.chunk_size has been set to the same value.
Ledger files that still contain some uncommitted entries are named
ledger_<start_seqno> for the most recent one. These files are typically held open by the
cchost process, which may modify their content, or even erase them completely. Uncommitted ledger files may differ arbitrarily across nodes.
Removing uncommitted ledger files from the
ledger.directory directory may cause a node to crash. It is however safe to move committed ledger files to another directory, accessible to a CCF node via the
ledger.read_only_directories configuration entry.
It is important to note that while all entries stored in ledger files ending in
.committed are committed, not all committed entries are stored in such a file at any given time. A number of them are typically in the in-progress files, waiting to be flushed to a
.committed file once the size threshold (
ledger.chunk_size) is met.
The listing below is an example of what a ledger directory may look like:
$ cchost # with ledger.ledger-dir = $LEDGER_DIR
$ ls -la $LEDGER_DIR
-rw-rw-r-- 1 user user 1.6M Jan 31 14:00 ledger_1-7501.committed
-rw-rw-r-- 1 user user 1.1M Jan 31 14:00 ledger_92502-97520.committed
-rw-rw-r-- 1 user user 553K Jan 31 14:00 ledger_97521 # File still in progress
While the Disaster Recovery procedure is in progress, new ledger files are suffixed with
.recovery. These files are automatically renamed (i.e. recovery suffix removed) once the recovery procedure is complete.
.recoveryfiles are automatically discarded on node startup so that a failed recovery attempt does not prevent further recoveries.
A new ledger chunk can also be created by the
trigger_ledger_chunkgovernance action, which will automatically produce a new chunk at the following signature transaction.
When a node is added to an existing service, the entire transaction history is automatically replicated to this new node. Similarly, on recovery, the transaction history since the creation of the service has to be replayed. Depending on the number of historical transactions, adding a node/recovering a service can take some non-negligible period of time, preventing the new node to quickly take part in the consensus and compromising the availability of the service.
To avoid this, it is possible for a new node to be added (or a service to be recovered) from an existing snapshot of the recent CCF state. In this case, only historical transactions between the sequence number at which the snapshot was taken and the latest state will be replicated.
Snapshots are generated at regular intervals by the current primary node and stored under the directory specified via the
snapshots.directory configuration entry (defaults to
snapshots/). The transaction interval at which snapshots are generated is specified via the
snapshots.tx_count configuration entry (defaults to a new snapshot generated every
10,000 committed transactions). Snapshots can also be generated by the
trigger_snapshot governance action, i.e. by submitting a proposal. A snapshot will then be generated at the next signature transaction.
Because the generation of a snapshot requires a new ledger chunk to be created (see File Layout), all nodes in the network must be started with the same
To guarantee that the identity of the primary node that generated the snapshot can be verified offline, the SHA-256 digest of the snapshot (i.e. evidence) is recorded in the snapshot_evidence table. The snapshot evidence will be signed by the primary node on the next signature transaction (see ledger_signatures).
Committed snapshot files are named
<seqno> the sequence number of the state of the key-value store at which they were generated and
<evidence_seqno> the sequence number at which the snapshot evidence was recorded.
Uncommitted snapshot files, i.e. those whose evidence has not yet been committed, are named
snapshot_<seqno>_<evidence_seqno>. These files will be ignored by CCF when joining or recovering a service as no evidence can attest of their validity.
Join or Recover From Snapshot#
Once a snapshot has been generated by the primary, operators can copy or mount the read-only snapshot directory to the new node directory before it is started. On start-up, the new node will automatically resume from the latest committed snapshot file in the
snapshots.directory directory. If no snapshot file is found, all historical transactions will be replicated to that node.
It is important to note that new nodes cannot join a service if the snapshot they start from is older than the snapshot the primary node started from. For example, if a new node resumes from a snapshot generated at
seqno 50 and joins from a (primary) node that originally resumed from a snapshot at
seqno 100, the new node will throw a
StartupSeqnoIsOld error shortly after starting up. It is expected that operators copy the latest committed snapshot file to new nodes before start up.
Nodes that started from a snapshot can still process historical queries if the historical ledger files (i.e. the ledger files preceding the snapshot) are made accessible to the node via the
ledger.read_only_directories option to
cchost. Although the read-only ledger directory must be specified to the node on start-up, the historical ledger file contents can be copied to this directory after the node is started (see Data Persistence).
Before these ledger files are present the node will be functional, participating in consensus and able to accept new transactions, but historical queries targeting the missing entries will permanently stall. Calls to the historical query APIs will return loading responses, as these APIs do not currently distinguish between temporarily missing and permanently missing files. It is the responsibility of the operator to ensure that the ledger files visible to all nodes are complete, including back-filling missing files when required.
To facilitate audit and verification of the integrity of the ledger, individual ledger files always end on a signature transaction.
For operator convenience, all committed ledger files (
.committedsuffix) are the same on all up-to-date nodes. More precisely, among up-to-date nodes:
Committed ledger files start and end at the same
Committed ledger files with the same name are byte-identical.
Snapshots are always generated for the
seqnoof a signature transaction (but not all signature transactions trigger the generation of snapshot).
The generation of a snapshot triggers the creation of a new ledger file. This is a corollary of 2. and 3., since new nodes should be able to join from a snapshot only and generate further ledger files that are the same as on the other nodes.