feat: speed benchmark and network creation optimization #1320

Open

samcm wants to merge 15 commits into main from feat/faster

Conversation


@samcm samcm commented Feb 24, 2026

Summary

Iterative optimization of network creation speed, targeting sub-60s for a minimal single-participant network.

Optimizations implemented:

  • Skip keystore generation when all participants have validator_count=0 (avoids pulling protolambda/eth2-val-tools container)
  • Faster polling intervals - interval=0.5s on CL ready conditions and EL admin_nodeInfo wait (was using 1s default)
  • Skip enode extraction for single-participant networks (plan.wait on admin_nodeInfo is unnecessary when no other EL nodes need the bootnode)
  • Optimized genesis generation - extract genesis_validators_root and osaka_time within the genesis generator container itself
  • Speed benchmark CI job with TIMING markers throughout the critical path for phase-level profiling
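The keystore-skip condition from the first bullet can be sketched as a simple predicate. This is a hypothetical model, not the package's actual Starlark API; the participant shape and function name are assumptions.

```python
def should_skip_keystores(participants):
    # Keystores (and the protolambda/eth2-val-tools container pull) are
    # only needed when at least one participant actually runs validators.
    return all(p.get("validator_count", 0) == 0 for p in participants)
```

For example, `should_skip_keystores([{"validator_count": 0}])` returns `True`, while any participant with a nonzero count makes it `False`.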

Benchmark infrastructure:

  • speed_benchmark CI job measures wall-clock time against 60s target
  • set -euo pipefail ensures kurtosis failures are caught
  • TIMING markers at: keystore_generation, genesis_generation, el_launch, cl_launch, vc_launch, participant_network, run
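A minimal parser for those markers might look like the following. The exact log format (`TIMING: <phase> <seconds>s`) is an assumption about what the markers emit, not confirmed from the CI script.

```python
import re

# Assumed marker shape: "TIMING: <phase> <seconds>s", one per line.
TIMING_RE = re.compile(r"TIMING:\s*(\w+)\s+([\d.]+)s")

def parse_timings(log_text):
    # Collect phase -> seconds from TIMING markers in a CI log dump.
    return {m.group(1): float(m.group(2)) for m in TIMING_RE.finditer(log_text)}
```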

Timing markers from CI (3-participant minimal.yaml):

Phase                 Duration
keystore_generation   ~3.8s
genesis_generation    ~3.7s
el_launch             ~15.4s
cl_launch             ~41.9s
vc_launch             ~5.9s
total                 ~71s

CL launch is the dominant bottleneck. Single-participant benchmarks pending.
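As a quick sanity check on the table above, the per-phase figures sum to the reported total and show how much of it CL launch accounts for:

```python
# Values copied from the timing table above.
phases = {
    "keystore_generation": 3.8,
    "genesis_generation": 3.7,
    "el_launch": 15.4,
    "cl_launch": 41.9,
    "vc_launch": 5.9,
}
total = sum(phases.values())          # ~70.7s, matching the ~71s total
cl_share = phases["cl_launch"] / total  # ~59% of wall-clock time
```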

samcm added 15 commits February 24, 2026 13:33
Add a speed_benchmark job to per-PR CI that measures network creation
time against a 60s target. Includes TIMING: markers throughout the
critical path (keystore gen, genesis gen, EL/CL/VC launch) for
phase-level profiling.

- Set validator_count=0 and use_separate_vc=false in the speed test to
  skip the entire keystore pipeline and VC launch
- Add interval=0.5s to CL ready conditions (was using the 1s default)
- Add interval=0.5s to the EL admin_nodeInfo wait (was using the 1s default)
- Extract genesis_validators_root and osaka_time within the genesis
  generator container to avoid needing jq in the read step
- Add wait=None to the osaka_time read step for potential parallelization

Short-circuit the entire keystore pipeline (avoid pulling and starting
the protolambda/eth2-val-tools container) when all participants have
validator_count=0. Also fix lint formatting.

The Fulu fork (epoch 0 by default) requires supernodes/validators/peerdas,
which the minimal speed test doesn't configure. Set fulu_fork_epoch
to far-future to skip this validation.
- Skip plan.wait() on admin_nodeInfo when num_participants==1 (the enode
  is only needed as a bootnode for participants 1+)
- Fix speed.yaml: use FAR_FUTURE_EPOCH for fulu_fork_epoch, add a
  fixed genesis_time to skip the timestamp container
- Fix benchmark script: add set -euo pipefail to catch kurtosis
  failures through the tee pipe

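The single-participant skip above reduces to a one-line predicate. The helper name here is hypothetical, not the package's API:

```python
def needs_bootnode_wait(num_participants):
    # The boot node's enode only matters when there is a second EL node
    # to bootstrap, so a single-participant network can skip the
    # admin_nodeInfo wait entirely.
    return num_participants > 1
```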
A genesis_time in 2030 means CL nodes wait years for genesis and
never produce blocks. The dynamic computation adds ~2-5s but is
necessary for a functional network.

The speed test needs actual validators to be a meaningful benchmark.
Using defaults (128 validators, separate VC for lighthouse).

geth/reth/nethermind x lighthouse/teku/nimbus with a 120s target.

Skip sequential enode extraction (EL[1..8]) and ENR/identity
extraction (CL[1..8]) during the launch phase, since only the
boot node's enode/ENR is needed as a bootnode for subsequent nodes.

Collect the deferred enodes and identities after all VCs are
launched, when nodes are already warm and responding faster.
This moves ~16s of EL wait and ~25s of CL wait out of the
critical path, replacing it with ~10s of post-launch collection.

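The launch-then-collect split can be modeled as below. `launch` and `fetch_identity` are stand-ins for the real Starlark launch and extraction steps, so treat this as a sketch of the ordering, not the actual implementation:

```python
def launch_with_deferred_identities(nodes, launch, fetch_identity):
    # Launch the boot node (index 0) and extract its identity right away,
    # since later nodes need it as a bootnode. Defer identity extraction
    # for everyone else until after launch, when nodes are warm and
    # respond faster.
    boot_identity = None
    deferred = []
    for i, node in enumerate(nodes):
        launch(node, bootnode=boot_identity)
        if i == 0:
            boot_identity = fetch_identity(node)
        else:
            deferred.append(node)
    # Post-launch collection pass, off the critical path.
    return [boot_identity] + [fetch_identity(n) for n in deferred]
```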
- erigon and nimbus-eth1 use WS_RPC_PORT_ID instead of RPC_PORT_ID
- Only geth, erigon, dummy, ethrex extract the ENR from admin_nodeInfo
- Auto-format with kurtosis lint

Add an image warmup phase that uses plan.add_services to pull all
unique EL/CL/VC images in parallel before any launch phase begins.
Docker deduplicates concurrent pulls at the layer level, so this
single parallel pull warms the cache for all subsequent add_service
and add_services calls. The throwaway warmer services are stopped
immediately after the pulls complete.

…missing

Defer ready_conditions for CL[1..8] so add_services returns after Docker
pull+start without waiting for health checks. The health wait moves to
collect_identities(), which now uses plan.wait (retries) instead of
plan.request (one-shot). CL nodes boot in the background during VC launch.

Also switch speed benchmarks from --image-download always to missing so
Docker skips manifest re-verification for already-cached images.

With --image-download missing, kurtosis skips pulling cached images.
Pre-pulling all client + genesis images in parallel before the kurtosis
run means add_service/add_services calls find everything cached and
only need to create+start containers (no network I/O).
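The pre-pull step can be sketched as a parallel map over the deduplicated image list. `pull` is a hypothetical wrapper around `docker pull`; the point is that Docker deduplicates concurrent layer downloads, so one parallel pass warms the cache for the whole run:

```python
from concurrent.futures import ThreadPoolExecutor

def prepull_images(images, pull):
    # Deduplicate first, then pull every unique image in parallel so the
    # subsequent kurtosis run (with --image-download missing) finds
    # everything already cached.
    unique = sorted(set(images))
    if unique:
        with ThreadPoolExecutor(max_workers=len(unique)) as pool:
            list(pool.map(pull, unique))
    return unique
```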