Building an Agent OS: A Technical Blueprint for Multi-Agent Systems with OpenClaw

高效码农

2 months ago

OpenClaw Multi-Agent System: A Full Technical Breakdown of Building an “Agent OS” Under a Single Gateway

Snippet

This article presents a complete engineering breakdown of a five-role multi-agent collaboration system built on OpenClaw. Using a single Gateway process, 10 binding routes, per-account-channel-peer session isolation, layered memory architecture, and rule-driven orchestration, the system enables stable cross-platform collaboration between Discord and Telegram without context contamination.

This Is Not “Five Bots” — It’s an Agent Operating System

When people hear “five AI roles,” they often assume:

So you’re just running five separate bots?

Technically yes — but architecturally no.

This system is not a loose collection of AI accounts. It is a structured, engineered collaboration framework composed of:

1 Gateway process
5 independent agents
5 isolated workspaces
2 connected platforms (Discord + Telegram)
10 explicit binding routes
Distinct DM and Group execution strategies
Layered memory architecture with retrieval control

Think of it as a company with departments, communication protocols, role boundaries, and operational governance — not five freelancers in a chat room.

System Architecture: Single Gateway + Multi-Agent + Multi-Workspace + Multi-Channel

1. One Gateway Carries All Capabilities

The entire system runs on a single OpenClaw Gateway process responsible for:

Message ingestion
Routing
Session management
Tool invocation
Memory indexing and retrieval
State maintenance

Why not deploy one service per role?

Because centralization provides:

Decision	Engineering Benefit
Single Gateway	Only one service to maintain
Unified configuration	Centralized global strategy
Same runtime	Low-latency inter-agent coordination

A distributed setup would introduce unnecessary synchronization and state complexity.

2. The Five Roles and Their Functional Boundaries

Role	Responsibility
Commander	Global awareness, task decomposition, assignment, closure
Strategist	Strategic analysis, solution evaluation, risk prediction
Engineer	Technical execution, implementation, maintenance
Creator	Content generation, communication clarity, external output
Think Tank	Audit, validation, quality control, compliance review

Each role operates inside its own dedicated workspace:


workspace-engineer
workspace-strategist
workspace-creator
...

Each workspace contains:

Independent personality definitions
Independent rule files
Independent memory stores
Independent daily logs

There is no cross-contamination between roles.

Dual-Platform Access: Discord + Telegram

The same Gateway connects simultaneously to:

Discord
Telegram

Each role is mapped per platform:

5 roles × 2 platforms = 10 binding mappings

Example:


discord + account_engineer → engineer
telegram + account_creator → creator

This is not duplicated deployment.

It is one shared intelligence cluster accessed through multiple communication layers.

The Routing Core: Bindings as the Entry Logic

The routing model follows:


channel + accountId → agentId

This ensures:

Immediate, precise message dispatch
No broadcast-to-all behavior
No response collisions between agents

Bindings function as the triage desk of the system. If routing is not deterministic at entry, multi-agent collaboration becomes chaotic.

Session Isolation: Why Context Never Leaks

The key configuration:


session.dmScope = per-account-channel-peer

This isolates DM context across three dimensions:

Account
Channel
Peer user

Practical impact:

Discord and Telegram histories never mix
Different users’ private sessions are fully isolated
Multi-account context contamination is prevented

This is the officially recommended strategy for multi-account scenarios in OpenClaw and has proven to be the most stable configuration.

Group Collaboration Model: Rule-Driven, Not Free-Form

Trigger Design

Commander


requireMention = false

Listens globally in group chats
Evaluates task type
Assigns roles
Closes loops

Other Roles


requireMention = true

Must be explicitly @mentioned
Prevents noise
Avoids overlapping responses

Mention Patterns

Example:


@Engineer
@engineer

Ensures stable invocation syntax.

Collaboration Flow

A user asks a question in a group.
Commander evaluates the request.
Commander @mentions the appropriate role.
The role executes its domain task.
Commander consolidates and closes.

The group dynamic shifts from chaotic multi-response chatter to structured relay execution.

Why Discord Is the Primary Collaboration Environment

Both Discord and Telegram are supported.

However, Discord offers superior orchestration conditions:

Parallel operation of five accounts
Stable mention system
Visible threaded dialogue chains
groupPolicy = open flexibility

Telegram operates under:


allowlist + mention gate

It serves as a more controlled production channel, while Discord functions as the collaborative arena.

Dual Governance Model: Configuration Layer + Rule Layer

This system does not rely solely on prompts.

Nor solely on configuration.

It enforces control at two levels.

1. Configuration-Level Constraints

Includes:

groupPolicy
dmPolicy
requireMention
bindings
dmScope
Agent ping-pong limit = 0

Why ping-pong = 0?

To prevent infinite agent-to-agent conversational loops such as:

A: Thank you
B: You’re welcome
A: Appreciate it
B: Anytime

Setting this to zero disables automatic inter-agent recursion.

2. Rule-Driven Behavioral Governance

Each workspace contains standardized governance files:

File	Function
SOUL.md	Personality and quality baseline
AGENTS.md	Operational process and collaboration standards
ROLE-COLLAB-RULES.md	Role boundaries and red lines
TEAM-RULEBOOK.md	Global hard rules
TEAM-DIRECTORY.md	Role mapping registry
IDENTITY.md	Capability definition
USER.md	User profile and preferences
TOOLS.md	Tool permissions
MEMORY.md	Long-term stable memory
GROUP_MEMORY.md	Safe reusable group knowledge
HEARTBEAT.md	Self-check and recovery
memory/YYYY-MM-DD.md	Daily operational logs

Standardization ensures predictable behavior.

Memory Architecture: Layered, Lazy-Loaded, Archived

Memory is treated as a resource management problem.

Tokens are finite.

Every memory injection consumes reasoning bandwidth.

Layer 1: Daily Logs (Short-Term)


memory/YYYY-MM-DD.md

Captures daily execution details.

Layer 2: Long-Term Memory (MEMORY.md)

Stores only:

Verified preferences
Stable decisions
Reusable operational knowledge

Layer 3: Group Long-Term Memory (GROUP_MEMORY.md)

Contains only:

Reusable content
Non-private information

Private data never enters group memory.

Layer 4: Cold Archive

Old data is periodically moved out of active context to prevent context bloat.

Retrieval Mechanism


memory_search
memory_get

Semantic retrieval followed by targeted loading.

Never load everything at once.

Context windows are limited resources.

DM Mode vs. Group Mode

A role must behave differently depending on context.

DM Mode

End-to-end expert execution
Full solution delivery
No collaboration overhead

Group Mode

Domain-limited execution
Commander orchestrates
Strict boundary adherence

Role-Specific Behavioral Constraints

Engineer

Deliverables must be executable
Must be verifiable
Must support rollback

Strategist

Must present assumptions
Must define validation paths

Think Tank

Must classify issues
Must propose repair plans

Creator

Expression must not sacrifice accuracy
Output must remain actionable

Behavior is defined by rule files — not model improvisation.

This Is an Engineering System, Not a Prompt Experiment

Multi-agent systems are not about:

Spawning more bots
Layering complex prompts

They are about:

Architecture
Routing
Isolation
Memory discipline
Governance

The gap between “it runs” and “it runs well” is engineering design.

FAQ

Can this be achieved purely with prompts?

No. Models drift and forget constraints. Platform-level configuration is necessary.

Why not let every role listen in groups?

That creates noise and response collisions.

Why disable ping-pong?

To prevent infinite automated inter-agent exchanges.

Why layered memory?

To prevent context overflow and privacy leakage.

How to Build a Similar System

Deploy a single Gateway process.
Create isolated workspaces per role.
Define deterministic bindings.
Set dmScope = per-account-channel-peer.
Implement mention-based triggering.
Build structured governance files.
Implement layered memory architecture.
Disable automatic agent ping-pong.
Separate DM and group behavioral logic.
Periodically archive inactive memory.

Final Thoughts

A multi-agent collaboration system is not about quantity.

It is about structure.

Transforming a single assistant into an Agent Operating System requires disciplined architecture, deterministic routing, context isolation, memory governance, and clear collaboration boundaries.

OpenClaw provides the foundation.

Stability and scalability come from engineering.