How to Secure OpenClaw Agents in Production: Complete Governance Guide (2026)

The production problem

OpenClaw security failures rarely start with one dramatic exploit. They start with small assumptions. A bot can run `exec`, read files, install skills, and act with real credentials. Teams often secure the host, then forget to secure the decision path that decides whether actions should run at all.

That gap matters. If policy runs only in prompts or after execution, unsafe actions already crossed the line. Real production security needs a hard pre-dispatch decision point: policy check first, execution second.

Microsoft's OpenClaw runtime analysis from February 19, 2026 makes the same boundary problem explicit. Their guidance is clear: do not run OpenClaw on normal workstations and treat it as untrusted code execution with persistent credentials.

Source	Strong coverage	What is still missing
OpenClaw Gateway Security Guide	Threat model basics, mention gating, high-risk tool restrictions, and sandbox configuration choices.	No deterministic external policy decision path with explicit ALLOW/DENY/THROTTLE/REQUIRE_HUMAN/CONSTRAIN outcomes on every tool call.
Docker: Run OpenClaw Securely in Docker Sandboxes	Strong runtime isolation and network proxy controls, including API-key handling outside the sandboxed agent process.	No governance policy lifecycle (risk tags, approval routing, simulation by policy profile, central decision semantics).
Microsoft Security: Running OpenClaw Safely	Clear risk framing for self-hosted runtimes: credentials, persistent memory manipulation, and skill malware supply chain.	No implementation path for policy-before-dispatch controls, runtime decision APIs, or repeatable operator commands for daily governance.

Reference architecture

The architecture that consistently works has two boundaries: runtime isolation and governance control. Isolation limits blast radius. Governance decides if an action executes.

Execution Path

text

OpenClaw Gateway
  -> CordClaw plugin (before_tool_execution)
  -> POST http://127.0.0.1:19090/check
  -> cordclaw-daemon (mapper + cache + circuit breaker)
  -> gRPC Safety Kernel check
  -> decision: ALLOW | DENY | THROTTLE | REQUIRE_HUMAN | CONSTRAIN
  -> plugin enforces outcome before tool execution

The plugin also hooks `after_tool_execution` and sends audit events to the daemon. That gives fast local observability, while Cordum provides durable governance records for team-scale operations.

5-minute secure setup

This uses the current `setup/install.sh` flow in `D:\\Cordum\\Cord-Claw`. Start with the installer, then verify one allowed and one denied decision before any real production workflow.

Install

bash

cd setup
OPENCLAW_SKIP=true ./install.sh

# Standalone mode (no local Cordum stack)
CORDUM_UPGRADE=false OPENCLAW_SKIP=true ./install.sh

# Full stack mode (recommended for teams)
CORDUM_UPGRADE=true OPENCLAW_SKIP=true ./install.sh

# Pick baseline profile during install
CORDCLAW_PROFILE=moderate OPENCLAW_SKIP=true ./install.sh

After install, run a deterministic smoke test. A safe command should pass. A destructive command should deny. If both pass, your policy path is not active.

Verify

bash

curl -sS http://127.0.0.1:19090/health | jq .
openclaw cordclaw status

openclaw cordclaw simulate --tool exec --command "echo hello"
openclaw cordclaw simulate --tool exec --command "rm -rf /"

curl -sS "http://127.0.0.1:19090/audit?limit=20" | jq .

Policy profile matrix

CordClaw ships three baseline profiles in `setup/templates/`. They are not cosmetic labels. They encode different risk postures across `exec`, file operations, outbound communication, and scheduling.

Profile	Behavior	Best fit
strict	Approvals required for all `exec`, `file-write`, `message-send`, and `memory-write`; denies `cron-create`; constrains browser actions.	Regulated workflows and production mutation-heavy environments.
moderate	Denies destructive/cloud/remote-access `exec`; requires approval for package installs, deploy-like commands, and outbound messaging.	Default baseline for most teams shipping OpenClaw to production.
permissive	Still denies destructive and secrets-tagged activity; requires approval for cloud/infrastructure execution; allows most other actions.	Low-risk internal automation with strong host isolation and fast iteration needs.

Switch Profile

bash

cp ~/.cordclaw/templates/policy-strict.yaml ~/.cordclaw/config/safety.yaml
cd ~/.cordclaw
docker compose --env-file .env restart safety-kernel

openclaw cordclaw simulate --tool exec --command "echo profile check"
openclaw cordclaw simulate --tool exec --command "npm install lodash"
openclaw cordclaw simulate --tool exec --command "rm -rf /"

Most teams should launch with `moderate`. It blocks destructive and cloud-risk commands while keeping daily iteration usable.

Risk-tag mapping and tests

Policy quality depends on input quality. CordClaw does not pass raw prompt text to policy. The daemon maps structured tool metadata and command/path/url signals into risk tags. That mapping is where most false positives and false negatives start, so test it directly.

In current mapper code, command regexes identify destructive operations, package installs, remote access tools, cloud utilities, and secrets exposure behavior. Path patterns add tags for sensitive files such as `.env` and key material. URL parsing adds `insecure-transport` for non-HTTPS destinations.

Signal	Trigger examples	Typical policy effect
exec + destructive pattern	`rm -rf`, `mkfs`, `dd if`, or similar command fragments	Maps to `destructive` risk tag, usually denied in moderate/strict profiles.
exec + cloud or infra tools	`aws`, `gcloud`, `az`, `docker`, `kubectl`, `helm`	Maps to cloud/infrastructure tags, typically deny or require approval.
exec + package installation	`npm install`, `pip install`, `apt install`	Maps to `package-install`, commonly approval-gated in moderate profile.
file path patterns	Reads/writes touching `.env`, `.pem`, `.key`, credentials, or system paths	Maps to `secrets` or `system-config`, blocked or heavily restricted by policy.
URL transport quality	Tool call references non-HTTPS URL	Adds `insecure-transport` tag for policy checks and denial rules.

Run explicit simulation vectors for each tag family before rollout. This prevents profile drift where rules look correct in YAML but do not trigger on real command shapes.

Risk-Tag Simulation Set

bash

# Destructive shell should deny in moderate/strict
openclaw cordclaw simulate --tool exec --command "rm -rf /tmp/demo"

# Cloud control path should deny or require approval based on profile
openclaw cordclaw simulate --tool exec --command "aws iam list-users"

# Package install should require approval in moderate
openclaw cordclaw simulate --tool exec --command "npm install lodash"

# Secrets path read should deny in permissive/strict depending on rule set
openclaw cordclaw simulate --tool read --path "~/.ssh/id_rsa"

# Insecure URL fetch should include insecure-transport risk tag
openclaw cordclaw simulate --tool web_fetch --url "http://example.com/data.json"

Human approvals that hold up

Approval flows fail when they are loosely attached to execution. In this path, approval is a policy decision outcome (`REQUIRE_HUMAN`) emitted before execution. The plugin blocks the tool call and returns an approval reference in the operator message.

Keep approvals narrow. Route high-risk actions only: package installs, deploy-like commands, outbound messaging, or cloud control-plane operations. Leave low-risk reads automated.

Small humor, large truth: if everything needs approval, nothing is actually reviewed.

Fail-mode matrix

Production outages are where policy systems earn trust or lose it. You need explicit behavior for two failure boundaries, not one.

Plugin Config

yaml

plugins:
  entries:
    cordclaw:
      enabled: true
      config:
        daemonUrl: "http://127.0.0.1:19090"
        timeoutMs: 500
        failMode: "deny"
        logDecisions: true
        bypassTools: []

Daemon Resilience Config

bash

# Required daemon env
export CORDCLAW_KERNEL_ADDR=127.0.0.1:50051
export CORDCLAW_API_KEY=<cordum-api-key>
export CORDCLAW_TENANT_ID=default

# Cache + resilience defaults
export CORDCLAW_CACHE_TTL=5m
export CORDCLAW_CACHE_MAX_SIZE=10000
export CORDCLAW_LISTEN_ADDR=127.0.0.1:19090

# Kernel outage behavior: graduated | closed | open
export CORDCLAW_FAIL_MODE=graduated

# TLS (production)
export CORDCLAW_KERNEL_INSECURE=false
export CORDCLAW_KERNEL_TLS_CA=/etc/ssl/certs/cordum-ca.pem

Layer	Control	Effect	Recommended default
OpenClaw plugin -> daemon	`failMode: deny\|allow` in plugin config	If daemon is unreachable, plugin returns `DENY` or `ALLOW` immediately with governance status `offline`.	Use `deny` in production except tightly scoped read-only bots.
daemon -> Safety Kernel	`CORDCLAW_FAIL_MODE=graduated\|closed\|open`	`closed` denies on kernel outage; `open` allows; `graduated` denies when no cached decision exists and marks governance degraded.	Use `graduated` for resilience with bounded risk, or `closed` for strict environments.
Cordum output safety	`OUTPUT_POLICY_ENABLED=true` + output rules	Post-execution output scanning can quarantine/redact secrets, PII, and injection payloads.	Enable for production. Treat pre-dispatch and post-dispatch as two required layers.

Audit evidence and retention

The daemon exposes fast local decision history via `/audit`. That is useful for immediate debugging and triage. It is not durable storage.

Current daemon behavior keeps an in-memory ring up to 1000 entries. When the process restarts, history resets. For compliance-grade trails, export decisions and correlate with Cordum run metadata.

Audit Queries

bash

openclaw cordclaw audit --limit 20
curl -sS "http://127.0.0.1:19090/audit?limit=20" | jq .

Integrate with Cordum

Pre-dispatch controls prevent unsafe actions. You still need post-dispatch output controls for secrets, PII, and injection strings that can appear in results. Cordum output policy is the second layer.

Output Safety Baseline

yaml

# Scheduler feature flag
OUTPUT_POLICY_ENABLED=true

# Example output policy rule
output_rules:
  - id: output-secret-leak
    enabled: true
    severity: critical
    match:
      topics: ["job.*"]
      detectors: ["secret_leak"]
    decision: quarantine
    reason: "Potential credential in output."

Keep OSS and enterprise claims clean. In current docs, multi-tenant API keys and advanced RBAC belong to enterprise components, while core pre-dispatch policy and output scanning are platform capabilities.

Limitations and tradeoffs

Read before rollout

`docs/POLICY_GUIDE.md`, `docs/THREAT_MODEL.md`, and `docs/TROUBLESHOOTING.md` in the CordClaw repo are still placeholders. Treat code and tests as canonical for now.
Daemon audit storage is in-memory (`auditSize=1000`), so local audit is operational telemetry, not durable evidence.
Plugin fallback (`failMode`) and daemon fallback (`CORDCLAW_FAIL_MODE`) are different controls and must be tuned together.
Even with governance, weak model/tool combinations increase risk. Isolation, governance, and model quality all matter.

Production gate checklist

Gate Script

bash

# 1) Isolation and identity
# - OpenClaw runs in isolated environment
# - non-privileged dedicated credentials only

# 2) Governance path is alive
curl -sS http://127.0.0.1:19090/health | jq .
openclaw cordclaw status

# 3) Decision behavior tests
openclaw cordclaw simulate --tool exec --command "echo ok"
openclaw cordclaw simulate --tool exec --command "rm -rf /"
openclaw cordclaw simulate --tool exec --command "npm install lodash"

# 4) Fail-mode drill
# stop safety kernel, verify expected plugin+daemon behavior

# 5) Output safety enabled
# verify OUTPUT_POLICY_ENABLED=true and output_rules loaded

# 6) Audit export path tested
curl -sS "http://127.0.0.1:19090/audit?limit=50" | jq .

A release is ready when all six checks pass in staging and the fail-mode drill behaves exactly as designed.

FAQ

Frequently Asked Questions

What is the biggest mistake when securing OpenClaw in production?

Treating sandboxing as the full solution. Isolation protects the host boundary, but you still need deterministic policy decisions on each agent action before execution.

Should I run plugin fail mode as allow or deny?

For production, default to `deny`. Use `allow` only for tightly scoped, low-impact agents where temporary governance outages are acceptable and documented.

How do I choose between daemon fail modes graduated and closed?

Use `closed` for strict compliance environments. Use `graduated` when you need better availability and are comfortable denying only uncached decisions during kernel outages.

Is the daemon audit endpoint enough for compliance evidence?

Not by itself. The current daemon audit is in-memory and capped. Export and correlate decisions with centralized Cordum run data for durable evidence.

Can I use CordClaw without the full Cordum stack?

Yes. In standalone mode, point the daemon to an existing Safety Kernel with `CORDCLAW_KERNEL_ADDR`, `CORDCLAW_API_KEY`, and `CORDCLAW_TENANT_ID`.

Next step

Pick one real OpenClaw workflow, run it through `moderate` profile with simulation tests, execute one planned kernel outage drill, and document the exact pass/fail criteria in your runbook before production traffic.

Open CordClaw Output Safety Docs

Continue with AI Agent Governance Platform Setup: Zero to Governed with CordClaw.
Use How to Add Governance to OpenClaw for a shorter migration path.
Review Configuration and Operations before broad rollout.

How to Secure OpenClaw Agents in Production

The production problem

Top 3 articles: coverage vs gaps

Reference architecture

5-minute secure setup

Policy profile matrix

Risk-tag mapping and tests

Human approvals that hold up

Fail-mode matrix

Audit evidence and retention

Integrate with Cordum

Limitations and tradeoffs

Production gate checklist

FAQ

Frequently Asked Questions

Next step

Related posts