Add CallEdgeTracker trait for distributed deadlock detection #4735
Open
cloutiertyler wants to merge 28 commits into jdetter/tpcc
Conversation
Before making a cross-database reducer call, register an edge A -> B with a CallEdgeTracker. If adding the edge would create a cycle (distributed deadlock), the tracker returns an error. The caller retries with exponential backoff (5 attempts), then fails with a deadlock error.

New trait CallEdgeTracker in core::host::call_edge_tracker with:
- register_edge(call_id, caller, callee) -> Result<()>
- unregister_edge(call_id) -> Result<()>
- unregister_all_edges() -> Result<()> (crash cleanup on startup)

NoopCallEdgeTracker for standalone (always allows calls). The cloud implementation will call control DB reducers for cycle detection. Also added register/unregister_reducer_call_edge methods to the ControlStateWriteAccess trait (no-op in standalone).
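The trait and retry loop described above can be sketched as follows. This is a minimal illustration, not the PR's actual code: the `CycleDetected` error struct, the string-typed database names, and the backoff constants are assumptions standing in for the real `NodesError::CycleDetected` and identity types.

```rust
use std::time::Duration;

// Hypothetical error type standing in for NodesError::CycleDetected.
#[derive(Debug)]
struct CycleDetected;

// Sketch of the CallEdgeTracker trait; real signatures may differ.
trait CallEdgeTracker: Send + Sync {
    fn register_edge(&self, call_id: u64, caller: &str, callee: &str) -> Result<(), CycleDetected>;
    fn unregister_edge(&self, call_id: u64) -> Result<(), CycleDetected>;
    fn unregister_all_edges(&self) -> Result<(), CycleDetected>;
}

// Standalone no-op implementation: every call is allowed.
struct NoopCallEdgeTracker;

impl CallEdgeTracker for NoopCallEdgeTracker {
    fn register_edge(&self, _: u64, _: &str, _: &str) -> Result<(), CycleDetected> { Ok(()) }
    fn unregister_edge(&self, _: u64) -> Result<(), CycleDetected> { Ok(()) }
    fn unregister_all_edges(&self) -> Result<(), CycleDetected> { Ok(()) }
}

// Caller-side retry: 5 attempts with exponential backoff, then give up.
fn register_with_backoff(
    tracker: &dyn CallEdgeTracker,
    call_id: u64,
    caller: &str,
    callee: &str,
) -> Result<(), CycleDetected> {
    let mut delay = Duration::from_millis(10); // illustrative initial delay
    for attempt in 0..5 {
        match tracker.register_edge(call_id, caller, callee) {
            Ok(()) => return Ok(()),
            Err(e) if attempt == 4 => return Err(e),
            Err(_) => {
                std::thread::sleep(delay);
                delay *= 2;
            }
        }
    }
    unreachable!("loop always returns on the final attempt")
}
```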
Edge tracking uses the CallEdgeTracker trait (in core) instead of ControlStateWriteAccess (in client-api) due to a circular dependency. Added a TODO to consolidate once the trait is moved to a shared crate.
Convert call_reducer_on_db and call_reducer_on_db_2pc from async to synchronous blocking HTTP. This avoids async runtime conflicts on the WASM executor thread.
- Add reqwest::blocking::Client to ReplicaContext
- Add execute_blocking_http helper (runs on a fresh OS thread)
- Add resolve_base_url_blocking to ReducerCallRouter
- Make CallEdgeTracker methods synchronous
- Enable the reqwest "blocking" feature
Add call_edge_tracker field to HostController and ModuleLauncher so the tracker flows from the top-level Node/StandaloneEnv down to each ReplicaContext. Added set_call_edge_tracker method for runtime configuration.
- ReducerCallRouter::resolve_base_url now returns Result<String> directly (blocking) instead of a BoxFuture. All implementations are synchronous.
- HostController uses a OnceLock for the router (set once at startup, lock-free reads afterward). Falls back to the LocalReducerRouter default.
- Removed the async BoxFuture and the resolve_base_url_blocking variant.
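The set-once router pattern can be sketched like this. The `Router` trait, error type, and fallback URL below are hypothetical stand-ins for the real `ReducerCallRouter` and `LocalReducerRouter`; the point is the `OnceLock` usage: one write at startup, lock-free reads afterward, with a default when unset.

```rust
use std::sync::OnceLock;

// Hypothetical minimal trait standing in for ReducerCallRouter.
trait Router: Send + Sync {
    fn resolve_base_url(&self, database: &str) -> Result<String, String>;
}

// Default used when no router has been installed (like LocalReducerRouter).
struct LocalReducerRouter;

impl Router for LocalReducerRouter {
    fn resolve_base_url(&self, _database: &str) -> Result<String, String> {
        Ok("http://127.0.0.1:3000".to_string()) // illustrative local address
    }
}

static ROUTER: OnceLock<Box<dyn Router>> = OnceLock::new();

// Set once at startup; subsequent calls are ignored by OnceLock::set.
fn set_router(router: Box<dyn Router>) {
    let _ = ROUTER.set(router);
}

// Lock-free read after initialization, falling back to the local default.
fn resolve(database: &str) -> Result<String, String> {
    match ROUTER.get() {
        Some(router) => router.resolve_base_url(database),
        None => LocalReducerRouter.resolve_base_url(database),
    }
}
```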
After the reducer commit releases the lock, modify the first pending TxData in the barrier queue to include the st_2pc_state deletion. When the barrier clears, a single commitlog entry contains both the reducer's row changes and the COMMIT marker (st_2pc_state delete). The st_2pc_state row never enters committed_state during normal operation -- it only exists in the commitlog for crash recovery.
Replace NoopCallEdgeTracker with InMemoryCallEdgeTracker that maintains an in-memory adjacency list of active call edges and runs DFS cycle detection on each registration. Works for standalone where all databases share the same process.
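The in-memory tracker can be sketched as an edge map plus a DFS reachability check: adding caller -> callee creates a cycle exactly when caller is already reachable from callee. Database names as strings and the `CycleDetected` error are assumptions; the real types in the PR may differ.

```rust
use std::collections::{HashMap, HashSet};
use std::sync::Mutex;

// Hypothetical error standing in for NodesError::CycleDetected.
#[derive(Debug)]
struct CycleDetected;

#[derive(Default)]
struct InMemoryCallEdgeTracker {
    // call_id -> (caller, callee); Mutex allows use across threads.
    edges: Mutex<HashMap<u64, (String, String)>>,
}

impl InMemoryCallEdgeTracker {
    /// Registers caller -> callee, or fails if that edge would close a cycle.
    fn register_edge(&self, call_id: u64, caller: &str, callee: &str) -> Result<(), CycleDetected> {
        let mut edges = self.edges.lock().unwrap();
        let would_cycle = {
            // Adjacency list of the currently active edges.
            let mut adj: HashMap<&str, Vec<&str>> = HashMap::new();
            for (from, to) in edges.values() {
                adj.entry(from).or_default().push(to);
            }
            // DFS from callee: reaching caller means the new edge forms a cycle.
            let mut stack = vec![callee];
            let mut seen = HashSet::new();
            let mut found = false;
            while let Some(node) = stack.pop() {
                if node == caller {
                    found = true; // distributed deadlock would form
                    break;
                }
                if seen.insert(node) {
                    if let Some(nexts) = adj.get(node) {
                        stack.extend(nexts.iter().copied());
                    }
                }
            }
            found
        };
        if would_cycle {
            return Err(CycleDetected);
        }
        edges.insert(call_id, (caller.to_string(), callee.to_string()));
        Ok(())
    }

    fn unregister_edge(&self, call_id: u64) {
        self.edges.lock().unwrap().remove(&call_id);
    }
}
```

The check runs under the same lock as the insertion, so two concurrent registrations cannot both slip past the cycle test.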
execute_blocking_http now takes a RequestBuilder instead of a built Request. Both build() and execute() happen inside the scoped OS thread, which has no tokio context. In debug builds, reqwest 0.12 panics if blocking I/O operations run inside a tokio block_on context.
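The thread-escape pattern reduces to something like the helper below: run the whole blocking closure (standing in for building and sending the reqwest request) on a fresh OS thread via `std::thread::scope`, so none of it executes inside a tokio context. This is a sketch of the pattern only, not the real `execute_blocking_http` signature.

```rust
// Runs `f` on a new OS thread and blocks until it finishes. Because the
// closure executes on its own thread, any blocking I/O inside it (such as a
// reqwest::blocking send) runs outside the caller's tokio runtime context.
fn execute_blocking<T: Send>(f: impl FnOnce() -> T + Send) -> T {
    std::thread::scope(|s| s.spawn(f).join().expect("blocking task panicked"))
}
```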
modify_first_barrier_pending used Arc::get_mut which always failed because the committing code still held a reference to the same Arc. This meant the st_2pc_state DELETE was silently dropped, causing "Delete for non-existent row" crashes on commit log replay after a restart. Fix: derive Clone for TxData and use Arc::make_mut, which does copy-on-write when the Arc is shared.
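The `Arc::get_mut` vs `Arc::make_mut` difference can be demonstrated in isolation. The `TxData` struct here is a stand-in for the real one; only the `Vec` field and the `Clone` derive matter for the illustration.

```rust
use std::sync::Arc;

// Stand-in for the real TxData; the fix derives Clone so that
// Arc::make_mut can copy-on-write when the Arc is shared.
#[derive(Clone, Debug, PartialEq)]
struct TxData {
    rows: Vec<&'static str>,
}

// Returns (modified copy's row count, other holder's row count).
fn demo_make_mut() -> (usize, usize) {
    let mut pending = Arc::new(TxData { rows: vec!["reducer insert"] });
    let other = Arc::clone(&pending); // committing code still holds a reference

    // get_mut fails whenever another Arc points at the same allocation,
    // which is why the st_2pc_state DELETE was silently dropped.
    assert!(Arc::get_mut(&mut pending).is_none());

    // make_mut clones the inner TxData and mutates the fresh copy.
    Arc::make_mut(&mut pending).rows.push("st_2pc_state delete");
    (pending.rows.len(), other.rows.len())
}
```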
The DELETE entry for st_2pc_state was constructed with empty placeholder fields (only prepare_id set). During transaction replay, delete_equal_row uses whole-row equality via eq_row_in_page, so the empty-field DELETE never matched the full-field INSERT, causing "Delete for non-existent row" errors that bricked the database on restart. Build the St2pcStateRow once and reuse it for both the INSERT marker and the DELETE entry so they match exactly during replay.
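A toy version of the mismatch and the fix, assuming whole-row equality as the commit message describes. The field names below are illustrative, not the actual st_2pc_state schema.

```rust
// Illustrative row type; replay matches DELETEs against INSERTs by
// whole-row equality, so every field must agree.
#[derive(Clone, PartialEq, Debug)]
struct St2pcStateRow {
    prepare_id: u64,
    coordinator_url: String,
}

// The old bug: a DELETE built with empty placeholder fields never equals
// the full-field INSERT under whole-row equality.
fn placeholder_delete(prepare_id: u64) -> St2pcStateRow {
    St2pcStateRow { prepare_id, coordinator_url: String::new() }
}

// The fix: build the row once and reuse it for both the INSERT marker and
// the DELETE entry, so they match exactly during replay.
fn marker_rows(prepare_id: u64, coordinator_url: &str) -> (St2pcStateRow, St2pcStateRow) {
    let row = St2pcStateRow {
        prepare_id,
        coordinator_url: coordinator_url.to_string(),
    };
    (row.clone(), row)
}
```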
…module maybe_create_schedule was calling the blocking connect_metrics_module while holding the parking_lot::Mutex, inside an async Axum handler. If the connection timed out or failed, .unwrap() panicked with the lock held, leaving the schedule unset and returning a 500 to the last driver to register — causing it to exhaust its retry attempts and fail. Switch to connect_metrics_module_async and release the mutex before the network call, re-acquiring it only to write the final schedule. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
When multiple concurrent 2PC transactions share a participant database, each sets its own durability barrier. If one aborts, the old code only dropped pending transactions when ALL barriers were gone. Otherwise, tainted TxData from the aborted 2PC stayed in the pending list and got flushed when a later barrier cleared, writing corrupted data to the commitlog. Since abort is always followed by a module restart that rebuilds committed state from disk, unconditionally drop all barriers and all pending transactions. Other in-flight 2PC async tasks will find the barrier already gone and no-op, which is correct since the module is about to restart.
With concurrent 2PC transactions on the same participant, modify_first_barrier_pending would add the st_2pc_state DELETE to the wrong TxData entry. The pending queue could contain entries from multiple 2PC transactions, and first() picked the oldest one rather than the current transaction's reducer commit. This caused the DELETE to be written to the commitlog in a different transaction than the INSERT, leading to "Delete for non-existent row" errors during replay. Replace modify_first_barrier_pending with modify_barrier_pending_at, which finds the entry by its tx_offset (barrier_offset + 1, the reducer commit offset assigned while the write lock was held).
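The offset-based lookup can be sketched with hypothetical minimal types; the real pending queue and TxData are richer, but the change is the same: select the entry by tx_offset instead of taking the first one.

```rust
// Hypothetical stand-in for a pending commitlog entry behind a barrier.
struct PendingTx {
    tx_offset: u64,
    rows: Vec<String>,
}

// Finds the pending entry with the given tx_offset (barrier_offset + 1)
// and appends the st_2pc_state DELETE to it. Returns false if no entry
// with that offset is pending.
fn modify_barrier_pending_at(pending: &mut [PendingTx], tx_offset: u64, delete_row: String) -> bool {
    match pending.iter_mut().find(|p| p.tx_offset == tx_offset) {
        Some(entry) => {
            entry.rows.push(delete_row);
            true
        }
        None => false,
    }
}
```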
Summary
- CallEdgeTracker trait for tracking cross-database call edges (A -> B)
- NoopCallEdgeTracker for standalone (no-op, always succeeds)
- Edge registration around call_reducer_on_db and call_reducer_on_db_2pc
- NodesError::CycleDetected and errno::CYCLE_DETECTED (22) for the wasm ABI

Design
When database A calls a reducer on database B, the edge A -> B is registered before the HTTP request is sent. If register_edge detects a cycle, it returns CycleDetected and the call is aborted before it can deadlock. After the call completes, the edge is unregistered.

The CallEdgeTracker is stored on ReplicaContext so both the actor code and HTTP handlers can access it.

Test plan