feat(communication): Agent Mesh Network — comunicação P2P descentralizada by nikolasdehor · Pull Request #591 · SynkraAI/aiox-core

nikolasdehor · 2026-03-12T18:33:30Z

Summary

Comunicação P2P descentralizada entre agentes
Roteamento BFS, detecção de partições por DFS
Rate limiting por token bucket
Pub/sub por tópicos com message queues e TTL

Testes

103 testes unitários passando

Reabertura do PR #577 (fechado acidentalmente)

Summary by CodeRabbit

New Features

Implemented a peer-to-peer mesh network for agent communication with peer management and automatic topology discovery
Added message routing with shortest-path algorithms and request/response patterns
Introduced pub/sub functionality with topic-based broadcasting
Enabled network health monitoring and automatic partition detection
Implemented message queuing with TTL enforcement and disk persistence

Tests

Added comprehensive test suite covering mesh network operations

Chores

Updated manifest configuration

…tre agentes Implementa modulo de comunicacao peer-to-peer para agentes se descobrirem, formarem grupos ad-hoc e rotearem mensagens atraves da mesh sem orquestrador central. Funcionalidades: peer discovery, direct messaging, broadcast/topics, roteamento multi-hop (BFS), heartbeat/pruning, message queue com TTL, rate limiting (token bucket), deteccao de particoes (DFS), persistencia em disco e metricas de rede.

vercel · 2026-03-12T18:33:34Z

@nikolasdehor is attempting to deploy a commit to the Pedro Valério Lopez's projects Team on Vercel.

A member of the Team first needs to authorize it.

coderabbitai · 2026-03-12T18:33:47Z

Walkthrough

This PR introduces a comprehensive Agent Mesh Network implementation for peer-to-peer communication, featuring peer management, BFS-based message routing, pub/sub messaging, request/response patterns, queue management, health monitoring, and disk persistence, along with a backward-compatibility wrapper module and extensive test coverage.

Changes

Cohort / File(s)	Summary
Agent Mesh Network Implementation `.aiox-core/core/communication/agent-mesh-network.js`	Comprehensive new class with 1115+ lines implementing peer management (join/leave/getPeer/listPeers), direct and routed messaging via BFS, pub/sub with topic-based subscriptions, request/reply with promise-based correlation, message queuing with TTL enforcement, network health monitoring, partition detection via DFS, disk-based persistence (topology.json), heartbeat/liveness detection, rate limiting per peer, and extensive event emission for observability. Exports AgentMeshNetwork class, MessageType, PeerState, MeshEvent, and DEFAULT_OPTIONS constants.
Backward Compatibility Wrapper `aios-core/core/communication/agent-mesh-network.js`	New CommonJS wrapper module that re-exports the implementation from `.aiox-core/core/communication/agent-mesh-network` for backward compatibility.
Manifest Updates `.aiox-core/install-manifest.yaml`	Timestamp bump (2026-03-11 → 2026-03-12), addition of new agent-mesh-network.js file entry, removal of development/tasks/review-prs.md entry, and size adjustments across multiple manifest entries.
Test Suite `tests/core/communication/agent-mesh-network.test.js`	Comprehensive test coverage (1113+ lines) verifying peer lifecycle, adjacency management, topic subscriptions, direct/routed/broadcast messaging, request/reply patterns, routing algorithms, topology correctness, message queuing behavior, partition detection, heartbeat operation, disk persistence, and error handling scenarios.

Sequence Diagrams

sequenceDiagram
    participant Peer1
    participant Mesh as AgentMeshNetwork
    participant Peer2
    participant Peer3

    Peer1->>Mesh: join()
    Mesh->>Mesh: Create bidirectional adjacency
    Mesh->>Peer1: Auto-subscribe to topics
    
    Peer2->>Mesh: join()
    Mesh->>Mesh: Create bidirectional adjacency
    Mesh->>Peer2: Auto-subscribe to topics
    
    Peer1->>Mesh: send(message to Peer3)
    Mesh->>Mesh: BFS shortest path<br/>(Peer1 → Peer2 → Peer3)
    Mesh->>Peer2: Route message
    Mesh->>Peer3: Deliver message
    Peer3->>Peer3: Emit receive event

sequenceDiagram
    participant Publisher as Peer1<br/>(Publisher)
    participant Mesh as AgentMeshNetwork
    participant Sub1 as Peer2<br/>(Subscriber)
    participant Sub2 as Peer3<br/>(Subscriber)

    Publisher->>Mesh: broadcast(topic='orders', data)
    Mesh->>Mesh: Filter subscribers<br/>by topic
    Mesh->>Sub1: Emit message event<br/>(topic='orders')
    Mesh->>Sub2: Emit message event<br/>(topic='orders')
    Sub1->>Sub1: Handle message
    Sub2->>Sub2: Handle message

sequenceDiagram
    participant Client as Requester<br/>(Peer1)
    participant Mesh as AgentMeshNetwork
    participant Server as Responder<br/>(Peer2)

    Client->>Mesh: request(targetPeer, data)
    Mesh->>Mesh: Create pendingRequest<br/>with timeout
    Mesh->>Server: Send request message
    Server->>Mesh: reply(requestId, response)
    Mesh->>Mesh: Resolve pending<br/>request promise
    Mesh->>Client: Return response data

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 50.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title clearly describes the main change: implementing a decentralized P2P communication system (Agent Mesh Network) between agents, which directly aligns with the substantial new implementation across multiple files.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment

📝 Coding Plan

Generate coding plan for human review comments

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 8

🧹 Nitpick comments (1)

.aios-core/core/communication/agent-mesh-network.js (1)
1-2: Use the repo's absolute import convention for this compatibility shim.

Hardcoding a three-level relative require() makes this re-export brittle to future moves and diverges from the repository's JS import rule. Please resolve the implementation through the package/root alias used elsewhere. As per coding guidelines, "Use absolute imports instead of relative imports in all code".
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.aios-core/core/communication/agent-mesh-network.js around lines 1 - 2,
Replace the brittle three-level relative require in the compatibility shim by
importing via the repository's absolute/package root alias (instead of
"../../../.aiox-core/..."); update the module.exports line to require the core
module using the project alias used elsewhere (e.g., the package/root alias that
maps to the .aiox-core package) so module.exports still re-exports the same
module (agent-mesh-network) but via the absolute import convention.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In @.aiox-core/core/communication/agent-mesh-network.js:
- Around line 155-172: The join(agentId, meta = {}) function currently converts
meta.topics and meta.capabilities without validating types, which allows strings
like "deploy" to become sets/arrays of characters; add input validation in join
to explicitly require meta.topics to be an array (of strings) when provided and
meta.capabilities to be an array (of strings) when provided, throwing a
descriptive Error if they are malformed; only construct topics as new
Set(meta.topics) and capabilities as meta.capabilities (or []) after validation
so the peer object (id, capabilities, topics, state, joinedAt, lastSeen,
messageCount) is never populated with corrupted values.
- Around line 366-389: The broadcast() loop currently delivers directly to every
toId with PeerState.ACTIVE, bypassing routing/partition checks; change it to use
the same route-or-queue logic as send(): for each target (from the peers
map/targets array) perform the routing/path check used by send() (or call the
send() helper that encapsulates that logic) and only call _deliverMessage(msg)
when a valid route exists, otherwise call _enqueueMessage(toId, msg); make sure
to reuse the same symbols/behavior as send(), _deliverMessage, _enqueueMessage
and PeerState.ACTIVE so broadcasts respect mesh partitions and queued delivery
semantics.
- Around line 846-855: The save/load flow currently serializes only peer records
and then calls join(peerId, ...) which recreates peers by connecting them into
the active mesh and ignores saved topology and per-peer state; update save() to
serialize the mesh topology (peer adjacency/links, routing/multi-hop entries,
and each peer's saved state like offline/timeout flags) and update load() to
reconstruct Peer instances and routing tables from that data without invoking
join() for each peer; instead, restore connections by reinstating saved
adjacency/routing entries and applying saved state to each Peer (use existing
peer map this.peers, any routing tables, and state setters like setPeerState or
equivalent) so partitions, multi-hop routes, and offline/timeout statuses are
preserved after restart.
- Around line 654-675: getNetworkHealth() and getMeshStats() currently call
detectPartitions(), which has side-effects (increments partitionsDetected and
emits 'partition-detected'), so mere reads inflate counters and spam listeners;
change detectPartitions() to support a non-mutating mode (e.g., add a parameter
like emitEvents = true or create a pure helper detectPartitionsSnapshot()) that
returns the partition list without incrementing partitionsDetected or emitting
events, and update getNetworkHealth() and getMeshStats() to call this
non-mutating variant so reads do not change state or emit events (keep existing
behavior for callers that need real detection side-effects).
- Around line 910-923: _deliverMessage currently increments messageCount and
emits MESSAGE_RECEIVED but never refreshes the recipient peer's lastSeen or
triggers processing of any queued messages, so reactivated peers remain TIMEOUT
with stuck backlogs; update _deliverMessage to set toPeer.lastSeen = Date.now()
(or call the existing method that marks peers seen) when toPeer exists and,
after marking active, call this._drainQueue(toPeer) (or the generic _drainQueue
method) to flush queued messages; make the same adjustments in the other
message-delivery paths referenced (the similar blocks around the other delivery
sites) so every code path that delivers toPeer also refreshes lastSeen and
invokes _drainQueue for that peer.
- Around line 89-141: The constructor currently ignores
DEFAULT_OPTIONS.autoStart so automatic heartbeat never starts; update the
constructor (in agent-mesh-network.js) to inspect this.options.autoStart after
options merge and, if truthy, call the existing startHeartbeat() method and set
the _started flag (and ensure _heartbeatTimer is managed by startHeartbeat());
this will ensure periodic pruning/TTL expiry runs automatically for new Mesh
instances without requiring callers to manually invoke startHeartbeat().
- Around line 420-444: The request() implementation registers the pending
request after calling send(), so if delivery is synchronous and the receiver
replies immediately the pendingRequests entry won't exist; fix by generating or
capturing the request id up-front, call this.pendingRequests.set(requestId,
{resolve, reject, timer}) before invoking this.send(fromId, toId, message, {
type: MessageType.REQUEST }), and ensure the timeout/cleanup uses that same
requestId (instead of sent.id); keep using this.options.requestTimeout and
preserve the resolve/reject handlers and timer logic from request().

In `@tests/core/communication/agent-mesh-network.test.js`:
- Around line 11-17: The test currently imports the implementation directly from
the internal .aiox-core path; change the import to use the public compatibility
entrypoint (require('.aios-core')) so the suite covers the compatibility
wrapper. Replace the existing
require('../../../.aiox-core/core/communication/agent-mesh-network') with a
require('.aios-core') (or require('.aios-core').core/communication if the
package exposes nested exports) and then pull AgentMeshNetwork and its exports
(MessageType, PeerState, MeshEvent, DEFAULT_OPTIONS) from that module so the
test verifies the public compatibility path is exercised.

---

Nitpick comments:
In @.aios-core/core/communication/agent-mesh-network.js:
- Around line 1-2: Replace the brittle three-level relative require in the
compatibility shim by importing via the repository's absolute/package root alias
(instead of "../../../.aiox-core/..."); update the module.exports line to
require the core module using the project alias used elsewhere (e.g., the
package/root alias that maps to the .aiox-core package) so module.exports still
re-exports the same module (agent-mesh-network) but via the absolute import
convention.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: c5aa4d29-1676-48c3-b207-98444f5c148a

📥 Commits

Reviewing files that changed from the base of the PR and between f74e3e7 and bd018e0.

📒 Files selected for processing (4)

.aios-core/core/communication/agent-mesh-network.js
.aiox-core/core/communication/agent-mesh-network.js
.aiox-core/install-manifest.yaml
tests/core/communication/agent-mesh-network.test.js

coderabbitai · 2026-03-12T18:47:03Z