What is the OpenClaw Gateway and why does it matter?

The Gateway is OpenClaw's control plane — the single entry point that manages every client connection, authenticates requests, routes messages to the correct channel adapter (Slack, WhatsApp, web UI, API), and enforces access policies before anything reaches the agent runtime. Think of it like a Kubernetes API server but purpose-built for agent communication. NIST SP 800-204A calls this the 'policy enforcement point' pattern. Without proper Gateway configuration, your agent is effectively unprotected and you're joining the 30,000+ exposed instances Censys found in March 2026.

Why should the Gateway bind to 127.0.0.1 instead of 0.0.0.0?

Binding to 127.0.0.1 (loopback) means the Gateway only accepts connections from the local machine. A reverse proxy like Nginx or Caddy then handles external traffic with TLS termination, rate limiting, header validation, and WebSocket upgrade handling. Binding to 0.0.0.0 exposes the raw Gateway directly to the internet — which is exactly how 30,000+ instances ended up publicly reachable. CVE-2026-25253 (CVSS 9.8) exploits this specific misconfiguration. CIS Benchmark v1.6.0 for Docker and NIST SP 800-190 both explicitly require localhost binding for management and runtime services.

How does OpenClaw handle authentication at the Gateway level?

The Gateway uses bearer token authentication via signed JSON Web Tokens (JWT). Every incoming connection must present a valid token before a WebSocket session is established. The Gateway validates the token against its configured secret, checks the exp (expiration) and scope (channel permissions) claims, verifies the jti (JWT ID) against the revocation list, and only then routes the connection to the appropriate channel and agent. OWASP's 2025 API Security Top 10 lists 'Broken Authentication' as the number-one API vulnerability (API2:2025) — proper Gateway token validation addresses this directly as a hard gate.

What are OpenClaw channels and how does routing work?

Channels are the Gateway's abstraction for different communication interfaces — Slack, WhatsApp, web UI, REST API. Each channel has its own WebSocket namespace, message format adapter (Slack Block Kit, WhatsApp templates, markdown, JSON), and rate limit profile. The Gateway inspects incoming connections by path and headers, matches them to a registered channel, validates channel-specific credentials (like Slack's HMAC-SHA256 signing signature or Meta's webhook verify token), applies channel-specific middleware, and then forwards the normalized message to the agent runtime.

What reverse proxy should I use in front of the OpenClaw Gateway?

Nginx and Caddy are the two production-ready options and we ship configurations for both in beeeowl deployments. Nginx gives you granular control over WebSocket upgrade handling, connection limits, header manipulation, and rate limiting zones — the right choice if you already have Nginx experience or need custom configuration. Caddy provides automatic TLS via Let's Encrypt ACME with minimal configuration — the right choice for teams without dedicated infrastructure engineers who want 'secure by default.' Both support the WebSocket proxying and header injection that OpenClaw's Gateway expects.

What does the full production defense-in-depth topology look like?

Six layers. Internet is untrusted. Cloudflare CDN and WAF (optional) handle volumetric attacks, bot filtering, and geo-blocking. Nginx or Caddy reverse proxy handles TLS 1.3 termination, WebSocket upgrade, rate limiting, and security headers. OpenClaw Gateway on loopback handles JWT validation, channel routing, session management, and middleware chain (audit logging, rate limiting, content filtering). OpenClaw Agent Runtime runs in a Docker container with read_only filesystem, cap_drop ALL, and no-new-privileges. Composio OAuth Vault brokers credentials to external APIs. Each layer denies by default and only forwards explicitly permitted traffic, following NIST SP 800-207 Zero Trust Architecture.

AI Infrastructure

OpenClaw Gateway Architecture: Understanding the Control Plane of Your AI Agent

The OpenClaw Gateway is the control plane that sits between every client and the agent runtime. Binding to loopback and fronting with a reverse proxy isn't optional — it's the one config line separating secure deployments from the 30,000+ instances Censys found exposed in March 2026.

Jashan Preet Singh

Co-Founder, beeeowl|February 14, 2026|15 min read

OpenClaw Gateway Architecture: Understanding the Control Plane of Your AI Agent

TL;DR The OpenClaw Gateway is the control plane that sits between every client connection and your AI agent. It manages WebSocket sessions over RFC 6455, routes messages across channels (Slack, WhatsApp, web UI, API), enforces JWT-based authentication with scope claims, and controls what reaches the agent runtime through a configurable middleware chain. Censys published scans in March 2026 showing over 30,000 OpenClaw instances exposed on the public internet with default 0.0.0.0 binding and no authentication. Shadowserver Foundation tracked a 340% increase in scanning activity targeting OpenClaw's default ports between January and March 2026. CVE-2026-25253 — a remote code execution vulnerability with CVSS 9.8 — exploits exactly this misconfiguration. The fix is trivial: bind to 127.0.0.1 loopback only, front with Nginx or Caddy, enforce JWT auth with jti/scope/exp claims, and apply the full NIST SP 800-190 container control set. This article is the complete Gateway architecture deep-dive with the production Nginx and Caddy configurations we ship in every beeeowl deployment, the middleware chain, the audit logging patterns for SOC 2, and the six-layer defense-in-depth topology that every production deployment should implement.

Censys published scan data in March 2026 showing over 30,000 OpenClaw instances running on the public internet with the default 0.0.0.0 binding and no authentication. Shadowserver Foundation tracked a 340% increase in scanning activity targeting OpenClaw’s default ports (3000 and 8080) between January and March 2026. CVE-2026-25253 — a remote code execution vulnerability with a CVSS score of 9.8 — exploits exactly this misconfiguration. Palo Alto Networks’ Unit 42 confirmed active exploitation within five days of disclosure; the exploit is roughly 15 lines of Python. The Gateway is the control plane of your OpenClaw deployment — the single entry point that sits between every client connection and your agent runtime, enforces JWT authentication, routes messages across channels (Slack, WhatsApp, web UI, API), and applies the middleware chain before anything reaches the agent. Configuring it correctly is the difference between a secure private AI deployment and an open door to your executive’s entire connected tool stack. This article is the complete architecture deep-dive with the production Nginx and Caddy configurations we ship in every beeeowl deployment.

What exactly is the OpenClaw Gateway?

The Gateway is the control plane of your OpenClaw deployment — the single process that sits between every client and the agent runtime, deciding what gets through and what gets dropped. It manages WebSocket connections, enforces JWT authentication, routes messages to the correct channel adapter, applies a configurable middleware chain (audit logging, rate limiting, content filtering, context enrichment), and only then forwards validated traffic to the agent runtime. If you block the Gateway, you block all agent access. If the Gateway is misconfigured, you expose everything downstream.

Think of it like a Kubernetes API server, but purpose-built for agent communication. Every interaction — whether it’s a Slack message, a WhatsApp query, a web UI session, or a REST API call — flows through the Gateway first. NIST SP 800-204A (Security Strategies for Microservices) calls this the “policy enforcement point” pattern: a single ingress that centralizes access control so individual services don’t need to implement their own. The pattern is well-established in microservices architecture and well-documented in the security literature. The Gateway applies the same principles to AI agents specifically.

If you’ve operated API gateways from Kong, NGINX Plus, or Envoy in a microservices deployment, the mental model is identical. The difference is that the OpenClaw Gateway is purpose-built for agent communication — it understands conversational state, tool execution permissions, multi-turn session context, and the specific message formats used by Slack Block Kit, WhatsApp templates, and web UI markdown in ways that a generic API gateway doesn’t. It’s agent-aware middleware, not just HTTP plumbing.

How do WebSocket connections work in the Gateway?

The Gateway maintains persistent WebSocket connections with every connected client using RFC 6455 (The WebSocket Protocol). When a user opens the web UI, sends a Slack message, or triggers a WhatsApp interaction, the Gateway establishes a bidirectional WebSocket session that persists for the duration of the conversation, with heartbeat monitoring and automatic cleanup on disconnect.

Here’s why WebSockets matter here instead of plain HTTP: agent interactions aren’t request-response. An agent might take 30 seconds to research a question, call three external APIs via Composio (Gmail, Salesforce, and Slack), stream a response back token by token as the underlying LLM generates it, and handle follow-up clarifications all within the same session. HTTP polling would be wasteful and add latency. Server-Sent Events would be one-directional and can’t carry inbound client messages. WebSockets give you full-duplex communication — the client sends messages, the agent streams responses, and the Gateway monitors both directions in real time with consistent low overhead.

According to the 2025 IETF RFC 6455 compliance report, WebSocket adoption in production systems has grown 280% since 2022. The protocol is mature, well-understood, and supported by every modern reverse proxy (Nginx, Caddy, HAProxy, Envoy, Traefik). There’s no good reason to avoid it for agent use cases.

The Gateway’s connection lifecycle looks like this, from handshake through termination:

# Gateway connection lifecycle
connection_lifecycle:
  1_handshake:
    protocol: "HTTP/1.1 Upgrade to WebSocket"
    validation:
      - verify_bearer_token (JWT signature + exp)
      - check_token_expiration
      - validate_channel_scope (claim must allow the requested channel)
      - check_revocation_list (jti must not be revoked)
      - enforce_rate_limit (per-subject and per-IP)
  2_session_established:
    assign: "session_id (UUID v4)"
    bind_to: "channel + agent_id"
    heartbeat_interval: "30s"
    idle_timeout: "300s"
    max_session_duration: "8h"
  3_message_routing:
    inbound:  "client -> gateway -> channel_adapter -> middleware -> agent"
    outbound: "agent -> middleware -> channel_adapter -> gateway -> client"
  4_termination:
    triggers:
      - client_disconnect
      - idle_timeout_exceeded
      - token_expiration (even mid-session)
      - admin_forced_disconnect
      - max_session_duration_reached

Each connection gets a unique session ID, a channel binding, and a heartbeat monitor. If the client disappears — network drop, browser close, phone going to sleep — the Gateway detects the missed heartbeat within 30 seconds and cleans up the session. No orphaned connections accumulating memory. If the token expires mid-session, the Gateway terminates the WebSocket and the client has to re-authenticate.

What are channels and how does the Gateway route between them?

Channels are the Gateway’s abstraction layer for different communication interfaces. Each channel represents a distinct way users interact with the agent — and each has its own protocol adapter, message format, authentication requirements, and rate limiting profile. This separation is what lets a single agent handle Slack messages, WhatsApp queries, web UI sessions, and REST API calls simultaneously without per-channel spaghetti in the agent runtime.

A typical production deployment configures three or four channels. Here’s what we ship in a typical beeeowl deployment:

# gateway-config.yaml — Channel definitions
gateway:
  channels:
    slack:
      enabled: true
      adapter: "slack_events_api"
      webhook_path: "/channels/slack/events"
      signing_secret: "$SLACK_SIGNING_SECRET"
      rate_limit: "60/min"
      message_format: "slack_blocks"

    whatsapp:
      enabled: true
      adapter: "whatsapp_cloud_api"
      webhook_path: "/channels/whatsapp/webhook"
      verify_token: "$WHATSAPP_VERIFY_TOKEN"
      rate_limit: "30/min"
      message_format: "whatsapp_template"

    web:
      enabled: true
      adapter: "websocket_native"
      path: "/channels/web/ws"
      allowed_origins:
        - "https://agent.yourcompany.com"
      rate_limit: "120/min"
      message_format: "markdown"

    api:
      enabled: true
      adapter: "rest_json"
      path: "/channels/api/v1"
      rate_limit: "100/min"
      message_format: "json"

When a request arrives, the Gateway inspects the path and headers to determine which channel it belongs to. The Slack channel validates Slack’s signing signature (HMAC-SHA256 of the request body and timestamp) before processing — this prevents replay attacks and spoofed Slack events. The WhatsApp channel verifies Meta’s webhook verification token during the initial handshake and the signature on every subsequent event. The web channel checks the Origin header against the allowlist to prevent cross-site WebSocket hijacking. The API channel requires JWT auth on every request because there’s no persistent session to authenticate once. Each channel applies its own middleware stack before the message reaches the agent.

This architecture follows what the Cloud Native Computing Foundation (CNCF) calls the “sidecar-less service mesh” pattern — routing intelligence lives in the Gateway rather than in per-service proxies. For a single-agent deployment (which is what most executive beeeowl setups are), this is cleaner than running Istio or Linkerd alongside your agent, because you don’t have multiple services to mesh together. The Gateway is the mesh.

How does token-based authentication work at the Gateway?

Every connection to the Gateway must authenticate before any agent interaction occurs. The Gateway uses bearer token authentication — a signed JSON Web Token (JWT) presented in the initial HTTP Upgrade request for WebSocket channels or in the Authorization: Bearer header for REST API calls. No valid token, no connection, no agent access. It’s a hard gate, not a soft warning.

Here’s the authentication setup flow:

# Generate a Gateway secret on deployment day
export GATEWAY_SECRET="$(openssl rand -hex 32)"

# Create a signed JWT with channel scope
# Using the openclaw CLI tool
openclaw gateway token create \
  --secret "$GATEWAY_SECRET" \
  --scope "channels:web,channels:api" \
  --subject "user:cto@yourcompany.com" \
  --expiry "24h" \
  --issuer "beeeowl-deployment"

The token includes three critical claims that the Gateway validates on every connection attempt: scope (which channels this token can access), sub (who this token belongs to), and exp (when it expires). Plus the jti (JWT ID) that enables targeted revocation.

{
  "header": {
    "alg": "HS256",
    "typ": "JWT"
  },
  "payload": {
    "sub": "user:cto@yourcompany.com",
    "iss": "beeeowl-deployment",
    "scope": "channels:web,channels:api",
    "exp": 1743379200,
    "iat": 1743292800,
    "jti": "a1b2c3d4-e5f6-7890-abcd-ef1234567890"
  }
}

OWASP’s 2025 API Security Top 10 lists “Broken Authentication” as the number-one API vulnerability (API2:2025). The Gateway’s token validation addresses this directly — no valid token, no WebSocket upgrade, no agent access. It’s the first line of defense and the highest-value line because every other control in the stack assumes authenticated context. The jti (JWT ID) claim enables targeted token revocation. If a device is lost or a token is compromised, you revoke the specific jti without invalidating every active session. The Gateway checks the revocation list on each connection attempt and at configurable intervals during long-lived WebSocket sessions.

Why is binding to loopback non-negotiable for production?

This is the single most important configuration decision in any OpenClaw deployment, and it’s the one that separates the 30,000+ exposed instances from properly hardened production deployments. The Gateway’s host parameter determines which network interfaces it listens on. There are two options, and only one of them is acceptable for anything touching the internet.

Side-by-side comparison of 0.0.0.0 versus 127.0.0.1 Gateway binding — top section highlighted in red showing DANGEROUS default dev config with YAML showing host 0.0.0.0 listens on all interfaces and auth enabled false with no token required plus description listing publicly reachable from any internet host Shodan and Censys can find it in minutes attackers connect directly to execute commands and CVE-2026-25253 requiring only 15 lines of Python to achieve RCE, bottom section highlighted in teal showing SAFE beeeowl production config with YAML showing host 127.0.0.1 loopback only and auth enabled true with JWT required plus description listing invisible to external scanners only reachable via Nginx or Caddy reverse proxy JWT validation required on every connection and CIS Benchmark v1.6.0 plus NIST SP 800-190 compliant — One line of config separates secure deployments from the 30,000+ exposed instances Censys found in March 2026.

0.0.0.0 — listen on all interfaces (every IP the machine has, including public)
127.0.0.1 — listen only on loopback (only the local machine can connect)

Here’s the config difference, which is exactly one line:

# DANGEROUS: Default development configuration
gateway:
  host: "0.0.0.0"           # ← listens on all interfaces
  port: 3000
  auth:
    enabled: false          # ← no token required

# PRODUCTION: Loopback-only with auth enforced
gateway:
  host: "127.0.0.1"         # ← loopback only
  port: 3000
  auth:
    enabled: true           # ← JWT required
    secret: "$GATEWAY_SECRET"
    token_expiry: "24h"
    require_scope: true

Censys published scan data in March 2026 showing over 30,000 OpenClaw instances running with the default 0.0.0.0 binding and no authentication. Those instances are directly reachable from anywhere on the internet. CVE-2026-25253 — a critical remote code execution vulnerability with a CVSS score of 9.8 — exploits exactly this misconfiguration. Palo Alto Networks’ Unit 42 confirmed active exploitation of CVE-2026-25253 within five days of disclosure. The exploit is roughly 15 lines of Python — not a sophisticated attack, just automated scanning plus a straightforward command injection.

CIS Benchmark v1.6.0 for Docker (published by the Center for Internet Security) explicitly states: services should bind to localhost unless external access is required and mediated by a reverse proxy. NIST SP 800-190 (Application Container Security Guide) reinforces this — container-based services must not expose management or runtime ports directly to untrusted networks. Binding to 127.0.0.1 means the Gateway is invisible to the outside world. External traffic reaches it only through a reverse proxy that you control — with TLS termination, rate limiting, IP filtering, and request validation all happening before a single byte touches the Gateway process.

Shodan and Censys make finding these misconfigured instances trivial. Attackers have automated the discovery. Shadowserver Foundation reported a 340% increase in scanning activity targeting OpenClaw’s default ports between January and March 2026. The time between “I deployed OpenClaw with default settings” and “my agent container is compromised” is measured in hours on the current internet threat model, not days or weeks.

How should you configure the reverse proxy?

The reverse proxy is the public-facing layer. It terminates TLS, upgrades HTTP connections to WebSocket where appropriate, injects security headers, enforces rate limits, and forwards validated traffic to the Gateway on loopback. Every beeeowl deployment ships with either Nginx or Caddy depending on client preference. Both are production-ready, both handle WebSocket upgrade correctly, and both integrate cleanly with Let’s Encrypt for automatic TLS renewal.

Here’s a production Nginx configuration we ship for clients who want granular control:

# /etc/nginx/sites-available/openclaw-gateway
upstream openclaw_gateway {
    server 127.0.0.1:3000;
    keepalive 64;
}

server {
    listen 443 ssl http2;
    server_name agent.yourcompany.com;

    ssl_certificate /etc/letsencrypt/live/agent.yourcompany.com/fullchain.pem;
    ssl_certificate_key /etc/letsencrypt/live/agent.yourcompany.com/privkey.pem;
    ssl_protocols TLSv1.3;
    ssl_ciphers ECDHE-ECDSA-AES256-GCM-SHA384;

    # Security headers per OWASP Secure Headers Project
    add_header Strict-Transport-Security "max-age=31536000; includeSubDomains" always;
    add_header X-Content-Type-Options "nosniff" always;
    add_header X-Frame-Options "DENY" always;
    add_header Content-Security-Policy "default-src 'self'" always;
    add_header Referrer-Policy "no-referrer" always;

    # WebSocket upgrade handling
    location /channels/ {
        proxy_pass http://openclaw_gateway;
        proxy_http_version 1.1;
        proxy_set_header Upgrade $http_upgrade;
        proxy_set_header Connection "upgrade";
        proxy_set_header Host $host;
        proxy_set_header X-Real-IP $remote_addr;
        proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
        proxy_set_header X-Forwarded-Proto $scheme;

        # Connection timeouts for long-lived WebSockets
        proxy_read_timeout 3600s;
        proxy_send_timeout 3600s;

        # Rate limiting
        limit_req zone=gateway_limit burst=20 nodelay;
    }

    # Block direct access to internal endpoints
    location /internal/ {
        deny all;
        return 404;
    }
}

# Rate limit zone definition
limit_req_zone $binary_remote_addr zone=gateway_limit:10m rate=10r/s;

If you prefer Caddy for its simplicity and automatic HTTPS via Let’s Encrypt ACME:

# Caddyfile
agent.yourcompany.com {
    reverse_proxy /channels/* 127.0.0.1:3000 {
        header_up X-Real-IP {remote_host}
        header_up X-Forwarded-Proto {scheme}

        # WebSocket support is automatic in Caddy
        flush_interval -1
        transport http {
            keepalive 30s
            keepalive_idle_conns 64
        }
    }

    header {
        Strict-Transport-Security "max-age=31536000; includeSubDomains"
        X-Content-Type-Options "nosniff"
        X-Frame-Options "DENY"
        Content-Security-Policy "default-src 'self'"
        Referrer-Policy "no-referrer"
    }

    rate_limit {remote_host} 10r/s
    respond /internal/* 404
}

Caddy handles TLS certificate provisioning automatically through ACME (Automatic Certificate Management Environment) — no certbot cron jobs, no manual renewals, no reload scripts. For teams without dedicated infrastructure engineers, that’s a meaningful operational advantage. We ship Caddy by default for most beeeowl clients unless they have existing Nginx configurations they want to extend.

How does message routing work inside the Gateway?

Once a connection is authenticated and assigned to a channel, the Gateway enters its routing phase. Every inbound message passes through a four-stage pipeline before reaching the agent, and the response takes a symmetric four-stage pipeline back to the client.

Stage 1 — Channel adapter. Normalizes the message format. A Slack Block Kit payload, a WhatsApp text message, and a web UI markdown string all get converted into a common internal format that the agent runtime understands without needing channel-specific parsing logic.

Stage 2 — Middleware chain. Configurable processors that inspect, modify, or reject messages. Common middleware includes content filtering (blocking obvious prompt injection attempts), message logging (audit trail to a persistent log), rate limiting (per-subject and per-IP), and context enrichment (attaching user metadata from the token claims). Each middleware runs in sequence and any can reject the message.

Stage 3 — Agent router. Determines which agent instance handles the request. In a single-agent deployment (most beeeowl clients), this is a direct forward to the configured agent. In multi-agent configurations, routing rules match the message to the appropriate agent based on channel, user, or content classification. See our guide to single-agent vs multi-agent: when you need more than one.

Stage 4 — Response pipeline. The agent’s response travels back through the chain in reverse: agent output goes through outbound middleware (audit logging, content policy checks, PII scanning), then through the channel adapter (converting back to Slack Blocks, WhatsApp format, or markdown), then to the WebSocket connection.

# Middleware pipeline configuration
gateway:
  middleware:
    inbound:
      - name: "audit_logger"
        config:
          destination: "/var/log/openclaw/audit.jsonl"
          include_message_body: false
          include_token_claims: true

      - name: "rate_limiter"
        config:
          window: "60s"
          max_requests: 60
          key: "token.sub"

      - name: "content_filter"
        config:
          block_patterns:
            - "ignore previous instructions"
            - "system prompt"
          action: "reject_with_warning"

    outbound:
      - name: "audit_logger"
        config:
          destination: "/var/log/openclaw/audit.jsonl"
          log_tool_calls: true
          log_response_metadata: true

      - name: "pii_scanner"
        config:
          action: "redact"
          patterns: ["ssn", "credit_card", "email"]

The audit logging middleware is particularly important for regulated environments. SOC 2 Type II requires evidence that access to systems is logged and reviewable. The AICPA’s 2025 guidance on AI system controls specifically mentions agent interaction logging as an expected control for organizations using autonomous AI systems. The EU AI Act’s Article 13 transparency requirements align directly with this pattern. See our audit logging guide for the full compliance story.

What does the complete production architecture look like?

Six layers, each with a single responsibility. Every layer independently validates — no layer trusts the one above it.

Each layer has a single responsibility, and each denies by default. Cloudflare (optional) handles volumetric attacks, bot filtering, and geographic blocking at the edge. The reverse proxy handles TLS termination and connection management. The Gateway handles authentication, channel routing, and the middleware chain. The agent runtime handles execution inside a Docker sandbox. Composio handles credential management for external services so the agent never sees raw tokens.

NIST’s Zero Trust Architecture framework (SP 800-207) calls this “defense in depth with implicit deny” — each layer denies by default and only forwards explicitly permitted traffic to the next layer. If Cloudflare’s WAF catches a malicious request, the reverse proxy never sees it. If the reverse proxy rate-limits a client, the Gateway never processes it. If the Gateway rejects a token, the agent never receives the message. If the agent tries to call an unauthorized tool, Composio’s credential vault refuses to inject the credential. Every layer independently validates — no layer trusts the one above it.

This is what we configure in every beeeowl deployment. Not a single layer left at default settings. The Gateway binds to loopback. Authentication is mandatory. Middleware logging is enabled. The reverse proxy enforces TLS 1.3 with strong cipher suites. Docker containers run with read-only filesystems and cap_drop: ALL. Composio handles credential brokering. Every choice is documented in the deployment configuration the client receives so they can audit exactly what we shipped.

What happens when you skip the Gateway hardening?

The answer is already public, and it’s the single best argument for not doing DIY deployment without dedicated security expertise. Censys found over 30,000 instances in their March 2026 scan. Palo Alto Networks’ Unit 42 confirmed active exploitation of CVE-2026-25253 within five days of disclosure. The exploit is roughly 15 lines of Python — not a sophisticated attack, just automated scanning plus command injection against a known default configuration.

When the Gateway binds to 0.0.0.0 without authentication, an attacker doesn’t need to bypass your reverse proxy (because there isn’t one). They connect directly to the Gateway, send an agent execution request with injected commands, and get a shell on your agent container. If the Docker socket is mounted — which it is in many quickstart tutorials — they escalate to the host system. From there, the entire connected tool stack (Gmail, Slack, Salesforce, HubSpot, financial tools) is reachable because the agent has live credentials or Composio access.

Shadowserver Foundation reported a 340% increase in scanning activity targeting OpenClaw’s default ports between January and March 2026. Attackers have automated the discovery. Shodan and Censys make it trivial to find these instances — a single query returns hundreds of targets with no effort. The operational cost of running the scanning automation is near zero, and the payoff per successful compromise is large enough that scanning continues 24/7 indefinitely.

The Gateway isn’t just a routing layer. It’s the perimeter of your AI agent’s world. Every message, every tool call, every response passes through it. Configuring it correctly is the difference between a private, sovereign AI deployment and an open door to your executive’s connected tool stack. That’s why we don’t ship default configurations. Every beeeowl deployment gets a Gateway bound to loopback, mandatory token authentication with scoped JWT claims, a reverse proxy with TLS 1.3 and security headers, middleware audit logging, channel-specific rate limiting, and Docker container isolation following the full NIST SP 800-190 control set. One day of structured setup. Shipped within a week. No exposed instances.

Full deployment pricing on our pricing page, role-specific workflow examples on our use cases page, and the related security deep-dives on the Anthropic OAuth ban lesson and the case for private AI.

Ready to deploy private AI?

Get OpenClaw configured, hardened, and shipped to your door — operational in under a week.

Request Your Deployment Book a 20-Minute Call

AI Infrastructure

Air-Gapped OpenClaw: Running a Fully Disconnected AI Agent on a Mac Mini for Classified, Defense, and Regulated Workflows

An air-gapped Mac Mini OpenClaw deployment runs without any internet connection — local LLM inference, on-device document storage, no Composio external APIs. The only practical OpenClaw tier for SCIF-adjacent rooms, defense contractors, and classified IP environments.

Jashan Preet Singh

Apr 28, 20269 min read

AI Infrastructure

Always-On AI: Power Profile, Thermal Management, and 24/7 Uptime Engineering for Office-Deployed Mac Mini OpenClaw Systems

M4 Pro idles at ~7W and peaks at ~65W — fanless-quiet, thermally trivial, and cheaper to run 24/7 than a 60W lightbulb. Here's the office-deployment engineering for UPS sizing, surge protection, and the residential vs office circuit considerations.

Amarpreet Singh

Apr 28, 20269 min read

AI Infrastructure

M4 Pro Memory Bandwidth and Local LLM Inference: Why Apple Silicon Outperforms x86 Cloud Instances on Private AI Workloads

M4 Pro delivers 273 GB/s unified memory bandwidth — 3-5x what typical x86 cloud VPS instances ship. For Mistral 7B and Llama 3.1 8B local inference, that translates to 30-50 tokens/sec on a Mac Mini in your office, no GPU rental required.

Amarpreet Singh

Apr 28, 20269 min read

What exactly is the OpenClaw Gateway?

How do WebSocket connections work in the Gateway?

What are channels and how does the Gateway route between them?

How does token-based authentication work at the Gateway?

Why is binding to loopback non-negotiable for production?

How should you configure the reverse proxy?

How does message routing work inside the Gateway?

What does the complete production architecture look like?

What happens when you skip the Gateway hardening?

Ready to deploy private AI?

Related Articles

Air-Gapped OpenClaw: Running a Fully Disconnected AI Agent on a Mac Mini for Classified, Defense, and Regulated Workflows

Always-On AI: Power Profile, Thermal Management, and 24/7 Uptime Engineering for Office-Deployed Mac Mini OpenClaw Systems

M4 Pro Memory Bandwidth and Local LLM Inference: Why Apple Silicon Outperforms x86 Cloud Instances on Private AI Workloads