feat(spam-stream): per-signer concurrent sends by jelias2 · Pull Request #590 · flashbots/contender

jelias2 · 2026-06-10T14:37:07Z

Summary

spam-stream's drive_stream currently sends serially — send_one(...).await per spec — so throughput is capped at roughly one RPC round-trip per tx (~20/s). When the stream supplies specs faster than that (e.g. relaying interop executing messages), a send queue builds and end-to-end latency balloons.

This makes the send path concurrent while keeping nonce handling correct.

Approach: per-signer worker pool

Spawn one worker per pool signer. Each incoming spec is routed (round-robin by idx, matching make_strict_call's idx % signers.len() selection) to the worker that owns its signer, and workers send concurrently.

Why per-signer rather than a semaphore over all sends: a bounced/rejected send must reuse its nonce (otherwise a gap stalls every later tx from that signer). That reclaim is only correct if a given signer's sends are serialized. Pinning each signer to one worker gives exactly that:

Per-signer serial → nonce assignment + reclaim-on-rejection are race-free with no shared nonce map and no locks (each worker owns its signer's nonce locally).
Cross-signer concurrent → throughput scales with pool size.
Workers build/sign against a shared Arc<TestScenario> using a worker-local nonce (bypassing prepare_tx_request's &mut), with the gas price shared via an atomic refreshed once per interval.
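The per-signer model described above can be sketched with the standard library alone. This is a minimal illustration, not the code in spam_stream.rs: it swaps OS threads and `std::sync::mpsc` in for the async tasks and channels the PR uses, `try_send_tx` is a hypothetical stand-in for the RPC send, and `run_pool` is an illustrative name. It shows the three properties the section relies on: round-robin routing by idx, a worker-local nonce that is reused on rejection, and a gas price read from a shared atomic.

```rust
use std::sync::atomic::{AtomicU64, Ordering};
use std::sync::mpsc;
use std::sync::Arc;
use std::thread;

/// Hypothetical stand-in for the RPC send. Returns false to model a
/// rejected send; here every fifth spec is rejected so nonce reuse is exercised.
fn try_send_tx(signer: usize, nonce: u64, gas_price: u64, idx: usize) -> bool {
    let _ = (signer, nonce, gas_price);
    idx % 5 != 0
}

/// Runs one worker per signer over `n_specs` specs and returns each
/// worker's (accepted, rejected, final_nonce).
fn run_pool(n_workers: usize, n_specs: usize) -> Vec<(usize, usize, u64)> {
    // Shared gas price; a real refresher would store a new value once per interval.
    let gas_price = Arc::new(AtomicU64::new(1_000_000_000));
    let mut txs = Vec::new();
    let mut handles = Vec::new();

    for signer in 0..n_workers {
        let (tx, rx) = mpsc::channel::<usize>();
        let gas_price = Arc::clone(&gas_price);
        txs.push(tx);
        handles.push(thread::spawn(move || {
            // The nonce lives in the worker: no shared map, no lock.
            let mut nonce = 0u64;
            let (mut sent, mut failed) = (0usize, 0usize);
            for idx in rx {
                let price = gas_price.load(Ordering::Relaxed);
                if try_send_tx(signer, nonce, price, idx) {
                    nonce += 1; // accepted: advance
                    sent += 1;
                } else {
                    failed += 1; // rejected: keep the nonce, so no gap forms
                }
            }
            (sent, failed, nonce)
        }));
    }

    // Round-robin dispatch: spec idx always lands on worker idx % n_workers.
    for idx in 0..n_specs {
        txs[idx % n_workers].send(idx).expect("worker exited early");
    }
    drop(txs); // closing the channels lets the workers finish

    handles
        .into_iter()
        .map(|h| h.join().expect("worker panicked"))
        .collect()
}
```

Calling `run_pool(1, n)` degenerates to a single serial sender, which mirrors the `--pool-size 1` case described under Compatibility.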

Compatibility

Concurrency == pool size. --pool-size 1 reproduces the original serial behavior exactly.
stdin spec format and stdout StreamEvent schema are unchanged (the Go-side/reactive consumer correlates by idx, so out-of-order emits are fine).
No public API changes; the change is contained to spam_stream.rs.

Validation

Relaying interop executing messages at a sustained rate above the serial ceiling, with a 32-signer pool: cross-chain inclusion latency dropped from p50 80s (queue-bound) to p50 4s, with send→receipt p50 ~1.5s and 0 failures. At --pool-size 1 behavior is unchanged.

Draft: opening for review/CI. Local cargo build --release is clean; cargo fmt applied.

drive_stream sent serially (send_one().await per spec), capping throughput at ~one RPC round-trip per tx (~20/s). Replace it with a per-signer worker pool: spawn one worker per pool signer, route each spec to the worker that owns its signer (round-robin by idx, matching make_strict_call's signer selection), and let workers send concurrently. Each signer's sends stay serial within its worker, so nonce assignment and reclaim-on-rejection (reuse the nonce, no gap) remain correct with no shared nonce state or locks. Concurrency == pool size; a pool of 1 reproduces the original serial behavior. Workers build/sign against a shared Arc<TestScenario> with a worker-local nonce (bypassing prepare_tx_request's &mut), and the gas price is shared via an atomic refreshed once per interval. Validated relaying interop executing messages: at a sustained rate above the serial ceiling, cross-chain inclusion latency dropped from p50 80s (queue-bound) to p50 4s with a 32-signer pool.

…igner concurrency

Extract the two correctness invariants of the per-signer worker model into pure helpers and unit-test them (no RPC needed):
 - worker_index(idx, n): round-robin routing; assert idx and idx+n map to the same worker (so the same signer) and that it matches make_strict_call's idx % signers.len() pick — the property that keeps each signer serial.
 - next_nonce(nonce, submitted): advance on accept, reuse on rejection (no gap).

Also assert pool signers are distinct (per-signer workers depend on it).

Update docs/stream-mode.md: data-flow now shows the per-signer worker pool (concurrency == --pool-size, pool of 1 = serial) and the reuse table reflects workers building directly with a worker-local nonce instead of prepare_tx_request.
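For readers without the diff at hand, the two helpers named in this commit message might look roughly like the following. The names `worker_index` and `next_nonce` come from the commit message; the parameter and return types are assumptions, and the bodies are inferred from the described behavior rather than copied from the PR.

```rust
/// Round-robin routing: spec `idx` goes to worker `idx % n_workers`,
/// the same pick the commit message attributes to make_strict_call.
/// Assumes `n_workers > 0` (types here are an assumption).
fn worker_index(idx: usize, n_workers: usize) -> usize {
    idx % n_workers
}

/// Advance the nonce only when the send was accepted; on rejection return
/// the same nonce so the next send reuses it and no gap forms.
fn next_nonce(nonce: u64, submitted: bool) -> u64 {
    if submitted {
        nonce + 1
    } else {
        nonce
    }
}
```

Because both functions are pure, the invariants can be checked without an RPC endpoint: `worker_index(idx, n)` and `worker_index(idx + n, n)` must agree, and `next_nonce(k, false)` must return `k`.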

…urrency knob

The per-signer concurrent-sends change resolves the last open question (concurrency bounded by pool size). Drop the resolved questions, keep the one real follow-up (Spammer-trait reuse) under Follow-ups, and note in the CLI table that --pool-size sets send concurrency (no new flag).

bitwiseguy · 2026-06-23T05:42:25Z

-                    }
+                // Route to the worker that owns this idx's signer.
+                let w = worker_index(idx, n_workers);
+                if worker_txs[w].send((idx, spec)).await.is_err() {


This send(...).await silently blocks the whole dispatch loop whenever a worker's channel saturates. Under asymmetric per-signer drain (e.g. a node that throttles one sender's pending txs), healthy workers are stalled too, without any signal to the producer. The reader/drive_stream hop handles this in forward_lines by emitting a Backpressure event once per saturation episode before blocking. The worker hop is the one place where backpressure isn't currently observable. Maybe worth mirroring the same pattern so a saturated signer shows up as a backpressure event instead of a silent drop in throughput?

Fixed in fcff894. Worker dispatch now uses try_send first, emits the existing Backpressure event once per worker saturation episode, then blocks only after making saturation observable.
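The try-then-block pattern described in this reply can be sketched as follows. This is an illustration under stated assumptions, not the code from fcff894: it uses `std::sync::mpsc::sync_channel` in place of the async channel in the PR, and `Event`, `dispatch`, and the `saturated` flag are hypothetical names (the real Backpressure event's fields are not shown in this thread).

```rust
use std::sync::mpsc::{SyncSender, TrySendError};

/// Hypothetical event type standing in for the stream's Backpressure event.
#[derive(Debug, PartialEq)]
enum Event {
    Backpressure { worker: usize },
}

/// Queue `item` for `worker`. If the worker's channel is full, record one
/// Backpressure event per saturation episode, then fall back to a blocking
/// send. Returns Err(()) if the worker has gone away.
fn dispatch<T>(
    tx: &SyncSender<T>,
    worker: usize,
    item: T,
    saturated: &mut bool,
    events: &mut Vec<Event>,
) -> Result<(), ()> {
    match tx.try_send(item) {
        Ok(()) => {
            // The channel had room again: the saturation episode is over.
            *saturated = false;
            Ok(())
        }
        Err(TrySendError::Full(item)) => {
            if !*saturated {
                // First full channel of this episode: make it observable.
                *saturated = true;
                events.push(Event::Backpressure { worker });
            }
            // Only now block until the worker drains a slot.
            tx.send(item).map_err(|_| ())
        }
        Err(TrySendError::Disconnected(_)) => Err(()),
    }
}
```

Keeping one `saturated` flag per worker is what limits the event to once per episode: repeated full channels stay silent until a non-blocking send succeeds again.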

bitwiseguy · 2026-06-23T05:55:17Z

+    let mut sent = 0usize;
+    let mut failed = 0usize;
+    for handle in worker_handles {
+        if let Ok((s, f)) = handle.await {


A panic in a worker thread will be swallowed. Might want to log it to make that failure visible.

match handle.await {
    Ok((s, f)) => {
        sent += s;
        failed += f;
    }
    Err(e) => warn!("stream: send worker panicked: {e}"),
}

Fixed in fcff894. Worker joins now match on JoinError and log panics with warn!, while still tallying successful worker results.
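The same join-and-tally shape can be tried without an async runtime. The sketch below is an assumption-laden stand-in: it uses `std::thread::JoinHandle` (whose `join` returns `Err` on panic) instead of the async join handle and JoinError in the PR, `eprintln!` instead of `warn!`, and a hypothetical `tally` function that also counts panicked workers.

```rust
use std::thread::JoinHandle;

/// Sum each worker's (sent, failed) result. A worker that panicked is
/// logged and counted instead of being silently dropped from the totals.
fn tally(handles: Vec<JoinHandle<(usize, usize)>>) -> (usize, usize, usize) {
    let (mut sent, mut failed, mut panicked) = (0, 0, 0);
    for handle in handles {
        match handle.join() {
            Ok((s, f)) => {
                sent += s;
                failed += f;
            }
            Err(_) => {
                // Stand-in for the warn! call in the review suggestion.
                panicked += 1;
                eprintln!("stream: send worker panicked");
            }
        }
    }
    (sent, failed, panicked)
}
```

The point of matching on both arms is that healthy workers still contribute their counts while a panic leaves a visible trace.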

jelias2 marked this pull request as ready for review June 10, 2026 14:38

jelias2 requested a review from zeroXbrock as a code owner June 10, 2026 14:38

jelias2 and others added 4 commits June 10, 2026 10:47

docs(stream-mode): drop the Follow-ups section

46eba76

Merge branch 'main' into jelias/spam-stream-concurrent-sends

0b9e7de

bitwiseguy approved these changes Jun 23, 2026


fix clippy and stream worker backpressure

fcff894

bitwiseguy merged commit bb8ac45 into flashbots:main Jun 23, 2026
7 checks passed


bitwiseguy merged 6 commits into flashbots:main from jelias2:jelias/spam-stream-concurrent-sends


3 participants
