OpenClaw execution policy & runner profiles: risk-tiered routing, tenant quotas, human gates

Key takeaways

Route by risk class first: map inbound events and job kinds to tiers (read-only diagnostics, build, release) before you pick a runner pool—routing is policy, not load balancing alone.
Runner profiles are contracts: CPU class, Xcode range, network egress, keychain access, and max lease time should be declared and enforced, not implied from “whatever image booted.”
Quotas are fairness + safety: per-tenant concurrency, disk budgets, and backoff when inode pressure rises keep noisy neighbors from starving everyone else.
Human gates are scheduling primitives: approvals for signing, notarization, or prod deploy belong in the workflow graph with SLA and audit metadata, not as Slack theater after the fact.

Keyboard and abstract technology background suggesting controlled automation — Illustrative only: the real control plane is your policy graph, queues, and runner leases—not the hero photo.

1. Why “controlled execution surface” matters

Cloud Mac capacity is attractive because it is predictable metal with a familiar toolchain. The failure mode is treating it like a timeshare shell: any webhook or chat command can enqueue arbitrary work, runners inherit broad credentials, and incidents become forensic puzzles. A controlled surface means every job carries a tier label, lands on a profiled runner, consumes metered quota, and escalates through explicit gates when the tier demands it. That is how you keep automation fast for developers without handing attackers a warm Xcode session.

2. Risk-tiered routing

Start from the trigger, not the queue depth. Low tier might allow read-only git metadata, lint, and unit tests on shared warm pools. Mid tier compiles and archives with registry egress but no App Store connect roles. High tier runs signing, notarization, or anything that mutates customer-visible state—typically fewer runners, shorter leases, and stricter network allow lists. The router should reject downgrades: a job tagged high cannot silently land on a mid pool because that pool had idle CPU.

Ingress hygiene still matters at every tier; align webhook verification and enqueue-only handlers with Chaining OpenClaw webhooks with cloud Mac runners: low-trust inbound validation, execution isolation, idempotent retries—and how to design observability and audit fields so the router never sees a parsed payload that bypassed signature checks.

Tier	Typical jobs	Router must enforce
Low	Static analysis, docs, dry-run simulators	No signing identities; outbound only to SCM and package mirrors you allow
Mid	Archive builds, test bundles, perf benches	Ephemeral keychains; artifact digests logged; bounded disk scratch
High	Release signing, notarytool, store uploads	Human approval token, dual control, or break-glass role with time box

3. Runner capability profiles

A profile is the unit operators provision: image digest, Xcode major, Rosetta on/off, Homebrew cache policy, and whether the lease may attach hardware-backed signing. Schedulers should refuse jobs whose declared requirements exceed the profile, instead of “best effort” on whatever host is free. Profiles also make cost visible: high-trust pools are smaller, so product teams see why release lanes wait while low-tier lanes stay elastic.

Publish profiles as data your clients can validate: maximum concurrent simulators, allowed xcodebuild destinations, default timeout multipliers, and whether Docker or colima is permitted on-box. When OpenClaw issues a lease, return the profile ID in the response so downstream steps can assert they are still on the same contract—drift between “what the UI promised” and “what booted” is where security reviews lose patience.

Keep isolation semantics aligned with how you already separate ingress from execution—reuse the same vocabulary as the webhook note above so runbooks stay short.

4. Tenant quotas and noisy-neighbor control

Per-tenant limits are not only billing; they cap blast radius. Concurrency limits stop one integration from occupying every warm runner during a conference demo week. Disk and inode budgets stop container layers, Derived Data, and unified logs from exhausting the filesystem—pair operational cleanup with the same language as Apple Silicon cloud Mac runner disk & inode governance: Derived Data, container layers, unified logs & caches—quota alerts, tiered cleanup, plan storage planning so alerts mean the same thing in CI and on the scheduler dashboard. When a tenant hits quota, return a structured reason (quota class, reset window) so OpenClaw automations can backoff with jitter instead of hammering the queue.

5. Human gates without blocking flow

Gates should be first-class states in the job graph: waiting_approval, approved_by, expiry, and escalation path. Prefer short-lived approval tokens bound to a change ticket over long-lived shared passwords. Auditors care that the same identifiers appear in your IdP event, queue record, and runner lease metadata—mirroring the correlation discipline you already want on the HTTP edge.

Set explicit SLAs: if nobody approves within N minutes, fail closed into a dead-letter state with enough context to retry safely, or escalate to a secondary on-call group. “Pending forever” is worse than a hard failure because it hides queue pressure and trains users to click approve without reading diffs.

6. Closing

Risk tiers, profiles, quotas, and human gates compose a single story: cloud Mac is powerful compute, and the product is the policy that wraps it. Implement routing and profiles before you optimize minutes saved per build; otherwise you optimize the wrong surface area.