Tutorial 06 Execution Sandboxing¶

Package: agentmesh-runtime · Time: 30 minutes · Prerequisites: Python 3.11+

What You'll Learn¶

4-tier privilege ring model for agent isolation
Resource limits and capability guards
Termination control and kill switch integration

Isolate AI agents at runtime using privilege rings, saga transactions, and kill switches.

1. Introduction¶

AI agents that can read files, call APIs, and execute code need strict boundaries. Without sandboxing, a misbehaving agent can:

Exfiltrate data. Read secrets and send them to external endpoints.
Corrupt state. Write to databases or files it should never touch.
Consume resources. Spin up infinite loops that exhaust CPU and memory.
Cascade failures. A failed step in a multi-agent workflow leaves the system in a broken half-finished state.

The Agent Runtime (pip install agentmesh-runtime) solves this with four layers of defense:

┌─────────────────────────────────────────────────┐
│             Execution Ring Model                │
│  Ring 0 (Root) → Ring 3 (Sandbox)               │
├─────────────────────────────────────────────────┤
│           Capability Guards                     │
│  Per-agent tool allow/deny lists                │
├─────────────────────────────────────────────────┤
│          Saga Orchestration                     │
│  Multi-step transactions with auto-rollback     │
├─────────────────────────────────────────────────┤
│          Session Isolation                      │
│  VFS namespacing, snapshots, vector clocks      │
├─────────────────────────────────────────────────┤
│          Emergency Controls                     │
│  Kill switch, rate limiting, breach detection   │
└─────────────────────────────────────────────────┘

Prerequisites¶

Python ≥ 3.11
pip install agentmesh-runtime (v2.0.2+)
For capability guards: pip install agent-os-kernel

2. Quick Start: Ring-Based Access Control¶

Get sandboxing running in under 20 lines:

from hypervisor import Hypervisor, ExecutionRing
from hypervisor.rings.classifier import ActionClassifier
from hypervisor.rings.enforcer import RingEnforcer
from hypervisor.models import ActionDescriptor

# 1. Create the runtime
hv = Hypervisor()

# 2. Classify an action — the classifier maps actions to rings
classifier = ActionClassifier()

read = ActionDescriptor(action_id="read_dataset", name="Read Dataset",
                        execute_api="/data/read", is_read_only=True)
print(classifier.classify(read).ring)      # ExecutionRing.RING_3_SANDBOX

delete = ActionDescriptor(action_id="delete_database", name="Delete Database",
                          execute_api="/db/drop")
print(classifier.classify(delete).ring)    # ExecutionRing.RING_1_PRIVILEGED

# 3. Enforce the ring — block agents that lack privilege
enforcer = RingEnforcer()
agent_ring = ExecutionRing.from_eff_score(eff_score=0.72)
print(agent_ring)  # ExecutionRing.RING_2_STANDARD

# Agent in Ring 2 tries a Ring 1 action → blocked
# Agent in Ring 2 tries a Ring 3 action → allowed

That's it. The classifier decides which ring an action belongs to, and the enforcer checks whether the agent's effective score grants sufficient privilege.

3. The 4-Ring Model¶

The runtime uses a hardware-inspired 4-ring privilege model. Each ring defines what an agent can do, how many calls it can make, and what level of trust is required.

        ┌───────────────────────┐
        │   Ring 0 — Root       │  eff_score: N/A (SRE Witness required)
        │   Runtime config      │  Requires SRE Witness
        │                       │  Rate: unlimited
        ├───────────────────────┤
        │   Ring 1 — Privileged │  eff_score ≥ 0.95 + consensus
        │   Non-reversible ops  │  Write, deploy, delete
        │   (deploy, delete)    │  Rate: 1000 calls/min
        ├───────────────────────┤
        │   Ring 2 — Standard   │  eff_score ≥ 0.60
        │   Reversible actions  │  Read + limited write
        │   (write files, APIs) │  Rate: 100 calls/min
        ├───────────────────────┤
        │   Ring 3 — Sandbox    │  Default for unknown agents
        │   Read-only, research │  No network, no writes
        │   (safe exploration)  │  Rate: 10 calls/min
        └───────────────────────┘

3.1 Ring Assignment from Effective Score¶

The ExecutionRing enum maps directly from an agent's effective score (eff_score), which combines trust, reputation, and behavioral signals:

from hypervisor.models import ExecutionRing

# Ring assignment is automatic based on eff_score
ring = ExecutionRing.from_eff_score(eff_score=0.98, has_consensus=True)
assert ring == ExecutionRing.RING_1_PRIVILEGED

ring = ExecutionRing.from_eff_score(eff_score=0.75)
assert ring == ExecutionRing.RING_2_STANDARD

ring = ExecutionRing.from_eff_score(eff_score=0.40)
assert ring == ExecutionRing.RING_3_SANDBOX

Note: Ring 0 is never assigned by score alone. It requires an SRE Witness attestation and is reserved for runtime-level configuration.

3.2 Action Classification¶

Every action is classified by risk weight and reversibility to determine which ring it requires:

from hypervisor import ExecutionRing
from hypervisor.rings.classifier import ActionClassifier
from hypervisor.models import ActionDescriptor, ReversibilityLevel

classifier = ActionClassifier()

# Read-only operations → Ring 3 (sandbox)
read = ActionDescriptor(action_id="read_dataset", name="Read Dataset",
                        execute_api="/data/read", is_read_only=True)
assert classifier.classify(read).ring == ExecutionRing.RING_3_SANDBOX

# Reversible writes (with an undo endpoint) → Ring 2 (standard)
write = ActionDescriptor(action_id="write_file", name="Write File",
                         execute_api="/files/write", undo_api="/files/restore",
                         reversibility=ReversibilityLevel.FULL)
result = classifier.classify(write)
assert result.ring == ExecutionRing.RING_2_STANDARD
assert result.reversibility == ReversibilityLevel.FULL

# Destructive, non-reversible operations → Ring 1 (privileged)
delete = ActionDescriptor(action_id="delete_database", name="Delete Database",
                          execute_api="/db/drop")
result = classifier.classify(delete)
assert result.ring == ExecutionRing.RING_1_PRIVILEGED
assert result.reversibility == ReversibilityLevel.NONE

# Override classification for custom actions
classifier.set_override("my_custom_action", ring=ExecutionRing.RING_2_STANDARD, risk_weight=0.5)

3.3 Ring Elevation (Privilege Escalation)¶

Sometimes an agent needs temporary access to a higher ring. The RingElevationManager handles time-bounded privilege escalation:

from hypervisor.rings.elevation import (
    RingElevationManager,
    RingElevation,
    ElevationDenialReason,
)

manager = RingElevationManager()

# Request elevation from Ring 2 → Ring 1
elevation = manager.request_elevation(
    agent_did="did:example:agent-42",
    session_id="session-001",
    current_ring=ExecutionRing.RING_2_STANDARD,
    target_ring=ExecutionRing.RING_1_PRIVILEGED,
    ttl_seconds=300,  # 5-minute window (max: 3600s)
    reason="Deploying approved release v2.1.0",
    attestation="signed-approval-token-from-sre",
)

if elevation.is_active:
    # Agent now has Ring 1 privileges for 5 minutes
    effective = manager.get_effective_ring(
        agent_did="did:example:agent-42",
        session_id="session-001",
        base_ring=ExecutionRing.RING_2_STANDARD,
    )
    assert effective == ExecutionRing.RING_1_PRIVILEGED

# Revoke early if needed
manager.revoke_elevation(elevation.elevation_id)

Public Preview: Elevation requests are always denied. The denial reason is ElevationDenialReason.COMMUNITY_EDITION. Upgrade to Enterprise for dynamic ring escalation.

3.4 Breach Detection¶

The RingBreachDetector monitors for agents attempting actions above their ring level:

from hypervisor.rings.breach_detector import (
    RingBreachDetector,
    BreachEvent,
    BreachSeverity,
)

detector = RingBreachDetector()

# The detector fires events when an agent in Ring 3 attempts a Ring 1 action
# Severity depends on the gap between agent ring and action ring:
#   1-ring gap  → WARNING
#   2-ring gap  → HIGH
#   3-ring gap  → CRITICAL (Ring 3 agent trying Ring 0 action)

4. Capability Guards¶

While rings control privilege levels, Capability Guards control which specific tools an agent can call. This is a second, orthogonal layer of defense.

The CapabilityGuardMiddleware (from agent-os) enforces per-agent tool allow/deny lists:

from agent_os.integrations.maf_adapter import (
    CapabilityGuardMiddleware,
    GovernancePolicyMiddleware,
    create_governance_middleware,
)

# Option 1: Explicit allow list (whitelist) — only these tools are permitted
guard = CapabilityGuardMiddleware(
    allowed_tools=["web_search", "file_read", "calculator"],
)

# Option 2: Deny list (blacklist) — everything except these tools
guard = CapabilityGuardMiddleware(
    denied_tools=["execute_code", "delete_file", "send_email"],
)

# Option 3: Factory function for full governance stack
middleware = create_governance_middleware(
    policy_directory="policies/",
    allowed_tools=["web_search", "file_read"],
    denied_tools=["execute_code", "delete_file"],
    enable_rogue_detection=True,
)

4.1 Per-Ring Tool Restrictions¶

Combine rings with capability guards for defense-in-depth:

from hypervisor.models import ExecutionRing
from agent_os.integrations.maf_adapter import CapabilityGuardMiddleware

# Define tool sets per ring
RING_TOOL_POLICIES = {
    ExecutionRing.RING_3_SANDBOX: CapabilityGuardMiddleware(
        allowed_tools=["web_search", "file_read"],
    ),
    ExecutionRing.RING_2_STANDARD: CapabilityGuardMiddleware(
        allowed_tools=["web_search", "file_read", "file_write", "api_call"],
        denied_tools=["delete_file", "execute_code"],
    ),
    ExecutionRing.RING_1_PRIVILEGED: CapabilityGuardMiddleware(
        denied_tools=["drop_database"],  # everything else allowed
    ),
    ExecutionRing.RING_0_ROOT: CapabilityGuardMiddleware(
        # No restrictions — full access
    ),
}

def get_guard_for_agent(eff_score: float) -> CapabilityGuardMiddleware:
    """Return the capability guard matching an agent's privilege ring."""
    ring = ExecutionRing.from_eff_score(eff_score)
    return RING_TOOL_POLICIES[ring]

4.2 Integrating with an Agent Framework¶

from agent_os.integrations.maf_adapter import (
    create_governance_middleware,
    AuditTrailMiddleware,
    RogueDetectionMiddleware,
)

# Full governance middleware stack: policy + capability guard + audit + rogue detection
middleware = create_governance_middleware(
    policy_directory="policies/",
    allowed_tools=["web_search", "file_read"],
    denied_tools=["execute_code"],
    enable_rogue_detection=True,
)

# Attach to your agent framework — the middleware intercepts every tool call
# and blocks anything not in the allow list (or in the deny list)

5. Saga Orchestration¶

Multi-step agent workflows are dangerous: if step 3 of 5 fails, you're left with a half-finished state. The Saga Orchestrator wraps multi-step workflows in transactions with automatic compensation (rollback).

5.1 Core Concepts¶

Step 1: Create PR          ──→  Compensate: Close PR
Step 2: Run tests          ──→  Compensate: Cancel test run
Step 3: Deploy to staging  ──→  Compensate: Rollback deployment
Step 4: Notify team        ──→  Compensate: Send failure notice

If Step 3 fails:
  → Compensate Step 2 (cancel tests)
  → Compensate Step 1 (close PR)
  → Saga state: COMPENSATING → FAILED

5.2 Creating a Saga¶

from hypervisor.saga.orchestrator import SagaOrchestrator
from hypervisor.saga.state_machine import SagaState, StepState

orchestrator = SagaOrchestrator()

# Create a new saga for this session
saga = orchestrator.create_saga(session_id="session-deploy-42")

# Add steps with execute and undo APIs
orchestrator.add_step(
    saga_id=saga.saga_id,
    action_id="pr.create",
    agent_did="did:example:dev-agent",
    execute_api="/api/pr/create",
    undo_api="/api/pr/close",        # compensation action
    timeout_seconds=60,
    max_retries=2,
)

orchestrator.add_step(
    saga_id=saga.saga_id,
    action_id="tests.run",
    agent_did="did:example:ci-agent",
    execute_api="/api/tests/run",
    undo_api="/api/tests/cancel",
    timeout_seconds=300,
    max_retries=1,
)

orchestrator.add_step(
    saga_id=saga.saga_id,
    action_id="deploy.staging",
    agent_did="did:example:deploy-agent",
    execute_api="/api/deploy/staging",
    undo_api="/api/deploy/rollback",
    timeout_seconds=600,
)

5.3 Step and Saga State Machines¶

Each step transitions through a well-defined state machine:

StepState flow:
  PENDING → EXECUTING → COMMITTED
                     ↘ FAILED → COMPENSATING → COMPENSATED
                                            ↘ COMPENSATION_FAILED

The saga itself tracks the aggregate state:

from hypervisor.saga.state_machine import SagaState, StepState, STEP_TRANSITIONS

# Valid step transitions are enforced — invalid transitions raise errors
step = SagaStep(step_id="s1", action_id="pr.create", ...)
step.transition(StepState.EXECUTING)   # PENDING → EXECUTING ✓
step.transition(StepState.COMMITTED)   # EXECUTING → COMMITTED ✓
# step.transition(StepState.PENDING)   # COMMITTED → PENDING ✗ (raises error)

Saga-level states:

State	Meaning
`RUNNING`	Steps are being executed sequentially
`COMPENSATING`	A step failed; compensation is running in reverse
`COMPLETED`	All steps committed successfully
`FAILED`	All compensation finished (or some compensation failed)
`ESCALATED`	Compensation itself failed; human intervention required

5.4 Programmatic saga orchestration¶

Define saga steps directly with SagaOrchestrator and pair each forward action with an undo endpoint:

from hypervisor.saga.orchestrator import SagaOrchestrator

orchestrator = SagaOrchestrator()
saga = orchestrator.create_saga("session-deploy-42")

create_pr = orchestrator.add_step(
    saga_id=saga.saga_id,
    action_id="pr.create",
    agent_did="did:example:dev-agent",
    execute_api="/api/pr/create",
    undo_api="/api/pr/close",
    timeout_seconds=60,
    max_retries=2,
)

run_tests = orchestrator.add_step(
    saga_id=saga.saga_id,
    action_id="tests.run",
    agent_did="did:example:ci-agent",
    execute_api="/api/tests/run",
    undo_api="/api/tests/cancel",
    timeout_seconds=300,
)

If a step fails, call compensate() to roll back committed steps in reverse order.

async def compensator(step):
    return await call_undo_endpoint(step.undo_api)

failed = await orchestrator.compensate(saga.saga_id, compensator)

6. Session Isolation¶

When multiple agents collaborate in a shared session, each agent gets an isolated view of the workspace. No agent can read or modify another agent's files without explicit sharing.

6.1 Virtual File System (VFS) Namespacing¶

The SessionVFS provides per-agent isolated file views within a shared session:

from hypervisor.session.sso import SessionVFS, VFSPermissionError

vfs = SessionVFS()

# Agent A writes a file — only Agent A can see it
vfs.write(path="/workspace/plan.md", agent_did="did:agent-a", value="# My Plan")

# Agent A reads its own file — works fine
content = vfs.read(path="/workspace/plan.md", agent_did="did:agent-a")
assert content == "# My Plan"

# Agent B tries to read Agent A's file — blocked
try:
    vfs.read(path="/workspace/plan.md", agent_did="did:agent-b")
except VFSPermissionError:
    print("Access denied: Agent B cannot read Agent A's namespace")

# Agent B writes to the same path — it gets its own copy
vfs.write(path="/workspace/plan.md", agent_did="did:agent-b", value="# Different Plan")

# Each agent sees its own version
assert vfs.read("/workspace/plan.md", "did:agent-a") == "# My Plan"
assert vfs.read("/workspace/plan.md", "did:agent-b") == "# Different Plan"

# Delete is also scoped
vfs.delete(path="/workspace/plan.md", agent_did="did:agent-a")

6.2 Isolation Levels¶

Choose the right isolation level based on your consistency requirements:

from hypervisor.session.isolation import IsolationLevel

# Snapshot gives each agent a stable view for the operation.
level = IsolationLevel.SNAPSHOT
assert not level.requires_vector_clocks
assert level.allows_concurrent_writes
assert level.coordination_cost == "low"

# Read Committed makes committed writes visible across agents.
level = IsolationLevel.READ_COMMITTED
assert level.requires_vector_clocks

# Serializable is the strongest consistency level.
level = IsolationLevel.SERIALIZABLE
assert level.requires_vector_clocks
assert not level.allows_concurrent_writes
assert level.coordination_cost == "high"

6.3 Vector Clocks for Causal Ordering¶

When agents produce concurrent writes, vector clocks establish a causal order:

from hypervisor.session.vector_clock import VectorClock

# Each agent maintains its own logical clock
clock_a = VectorClock()
clock_b = VectorClock()

# Agent A performs an action, advancing its clock
clock_a.tick("did:agent-a")

# Did A's action causally precede B's current state?
happened_before = clock_a.happens_before(clock_b)
concurrent = clock_a.is_concurrent(clock_b)

6.4 Full Session Configuration¶

Bring it all together with a SharedSessionObject:

from hypervisor.session import SharedSessionObject
from hypervisor.models import SessionConfig, ConsistencyMode

config = SessionConfig(
    consistency_mode=ConsistencyMode.STRONG,
    max_participants=5,
    max_duration_seconds=3600,
    min_eff_score=0.60,
)

session = SharedSessionObject(
    session_id="session-deploy-42",
    config=config,
)

# Session provides:
#   session.vfs           # SessionVFS isolated file views

7. Emergency Controls¶

When an agent goes rogue, you need to stop it immediately, not after the next polling interval.

7.1 Kill Switch¶

The KillSwitch terminates an agent and triggers saga compensation for any in-flight work:

from hypervisor.security.kill_switch import KillSwitch, KillReason, KillResult

kill_switch = KillSwitch()

# Immediate termination — all in-flight saga steps are compensated
result: KillResult = kill_switch.kill(
    agent_did="did:example:rogue-agent",
    session_id="session-001",
    reason=KillReason.BEHAVIORAL_DRIFT,
    details="Agent started accessing files outside its namespace",
)

print(f"Kill ID:              {result.kill_id}")
print(f"Compensation triggered: {result.compensation_triggered}")
print(f"Handoffs succeeded:    {result.handoff_success_count}")
print(f"Timestamp:             {result.timestamp}")

Available kill reasons:

Reason	When to use
`BEHAVIORAL_DRIFT`	Agent deviates from expected behavior patterns
`RATE_LIMIT`	Agent exceeded its rate limit repeatedly
`RING_BREACH`	Agent attempted actions above its ring level
`MANUAL`	Human operator triggered the kill
`SESSION_TIMEOUT`	Session exceeded its `max_duration_seconds`

7.2 Graceful Shutdown with Handoff¶

Before killing, you can register substitute agents to take over in-flight work:

from hypervisor.security.kill_switch import KillSwitch, HandoffStatus

kill_switch = KillSwitch()

# Register a substitute agent that can take over work
kill_switch.register_substitute(
    session_id="session-001",
    agent_did="did:example:backup-agent",
)

# Now when the primary agent is killed, its saga steps are handed off
result = kill_switch.kill(
    agent_did="did:example:primary-agent",
    session_id="session-001",
    reason=KillReason.MANUAL,
    details="Planned maintenance rotation",
)

# Check handoff results
for handoff in result.handoffs:
    print(f"Step {handoff.step_id}: {handoff.status}")
    # HandoffStatus: PENDING, HANDED_OFF, FAILED, COMPENSATED

# Review kill history
history = kill_switch.get_kill_history(agent_did="did:example:primary-agent")

7.3 Rate Limiting¶

Prevent resource exhaustion with per-agent rate limits:

from hypervisor.security.rate_limiter import AgentRateLimiter, RateLimitExceeded

# Ring 3 agents: 10 calls per minute
sandbox_limiter = AgentRateLimiter(
    window_seconds=60.0,
    max_calls=10,
)

# Ring 2 agents: 100 calls per minute
standard_limiter = AgentRateLimiter(
    window_seconds=60.0,
    max_calls=100,
)

# Check before each action
status = sandbox_limiter.check_rate_limit(agent_did="did:example:new-agent")
if not status.allowed:
    print(f"Rate limited — retry after {status.retry_after_seconds}s")

# Reset limits (e.g., after an agent is promoted)
sandbox_limiter.reset(agent_did="did:example:new-agent")

7.4 Breach Detection Pipeline¶

Wire breach detection into your kill switch for automated response:

from hypervisor.rings.breach_detector import RingBreachDetector, BreachSeverity
from hypervisor.security.kill_switch import KillSwitch, KillReason
from hypervisor.security.rate_limiter import AgentRateLimiter

detector = RingBreachDetector()
kill_switch = KillSwitch()
limiter = AgentRateLimiter(window_seconds=60.0, max_calls=100)

async def on_agent_action(agent_did: str, session_id: str, action_id: str):
    """Example enforcement pipeline for every agent action."""

    # Layer 1: Rate limit check
    status = limiter.check_rate_limit(agent_did)
    if not status.allowed:
        kill_switch.kill(agent_did, session_id, KillReason.RATE_LIMIT)
        return

    # Layer 2: Ring enforcement (breach detection)
    # If a breach is CRITICAL severity → kill immediately
    # If WARNING → log and allow (the agent might be testing boundaries)

    # Layer 3: Capability guard check (handled by middleware)
    # Layer 4: Saga step execution (handled by orchestrator)

8. Production Deployment¶

8.1 Running the Runtime API Server¶

The runtime includes a FastAPI server for HTTP-based enforcement:

# Install with API extras
pip install "agentmesh-runtime[api]"

# Start the server
hypervisor serve --host 0.0.0.0 --port 8000

8.2 Docker Container¶

FROM python:3.11-slim

WORKDIR /app

RUN pip install "agentmesh-runtime[full,api]"

EXPOSE 8000

CMD ["hypervisor", "serve", "--host", "0.0.0.0", "--port", "8000"]

Build and run:

docker build -t agent-runtime:latest .
docker run -p 8000:8000 agent-runtime:latest

8.3 Kubernetes Deployment¶

Deploy the runtime as a sidecar alongside your agent pods:

# runtime-deployment.yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: agent-runtime
  labels:
    app: agent-runtime
spec:
  replicas: 2
  selector:
    matchLabels:
      app: agent-runtime
  template:
    metadata:
      labels:
        app: agent-runtime
    spec:
      containers:
        - name: runtime
          image: agent-runtime:latest
          ports:
            - containerPort: 8000
          resources:
            requests:
              memory: "256Mi"
              cpu: "250m"
            limits:
              memory: "512Mi"
              cpu: "500m"
          readinessProbe:
            httpGet:
              path: /health
              port: 8000
            initialDelaySeconds: 5
            periodSeconds: 10
          livenessProbe:
            httpGet:
              path: /health
              port: 8000
            initialDelaySeconds: 10
            periodSeconds: 30
          env:
            - name: HYPERVISOR_LOG_LEVEL
              value: "INFO"
---
apiVersion: v1
kind: Service
metadata:
  name: agent-runtime
spec:
  selector:
    app: agent-runtime
  ports:
    - port: 8000
      targetPort: 8000

8.4 Sidecar Pattern¶

For fine-grained per-pod enforcement, run the runtime as a sidecar:

# agent-pod-with-sidecar.yaml
apiVersion: v1
kind: Pod
metadata:
  name: agent-worker
spec:
  containers:
    # Your agent container
    - name: agent
      image: my-agent:latest
      env:
        - name: HYPERVISOR_URL
          value: "http://localhost:8000"

    # Runtime sidecar — enforces sandboxing for this pod
    - name: runtime-sidecar
      image: agent-runtime:latest
      ports:
        - containerPort: 8000
      resources:
        requests:
          memory: "128Mi"
          cpu: "100m"
        limits:
          memory: "256Mi"
          cpu: "250m"

8.5 Helm Chart Values¶

Create a values file for parameterized deployments:

# values.yaml
replicaCount: 2

image:
  repository: agent-runtime
  tag: "latest"
  pullPolicy: IfNotPresent

service:
  type: ClusterIP
  port: 8000

resources:
  requests:
    memory: "256Mi"
    cpu: "250m"
  limits:
    memory: "512Mi"
    cpu: "500m"

runtime:
  logLevel: INFO
  rateLimiting:
    ring3MaxCalls: 10
    ring2MaxCalls: 100
    ring1MaxCalls: 1000
    windowSeconds: 60
  session:
    maxDurationSeconds: 3600
    maxParticipants: 10
    defaultIsolation: snapshot
  saga:
    defaultTimeoutSeconds: 300
    maxRetries: 2

8.6 Observability¶

Monitor your runtime with the built-in event bus:

from hypervisor.observability.event_bus import HypervisorEventBus, EventType
from hypervisor.observability.causal_trace import CausalTraceId

event_bus = HypervisorEventBus()

# Subscribe to security events
@event_bus.subscribe(EventType.RING_BREACH)
async def on_breach(event):
    print(f"BREACH: {event.agent_did} attempted {event.action_id}")

@event_bus.subscribe(EventType.KILL_SWITCH)
async def on_kill(event):
    print(f"KILLED: {event.agent_did} — reason: {event.reason}")

# Trace causality across distributed saga steps
trace_id = CausalTraceId.generate()

Summary¶

Layer	Component	What It Does
Privilege	`ExecutionRing`	4-tier access model based on trust score
Privilege	`ActionClassifier`	Maps actions to rings by risk/reversibility
Privilege	`RingElevationManager`	Temporary privilege escalation with TTL
Detection	`RingBreachDetector`	Alerts on ring boundary violations
Tools	`CapabilityGuardMiddleware`	Per-agent tool allow/deny lists
Transactions	`SagaOrchestrator`	Multi-step workflows with auto-rollback
Isolation	`SessionVFS`	Per-agent virtual file system namespacing
Isolation	`VectorClock`	Causal ordering of concurrent operations
Emergency	`KillSwitch`	Immediate agent termination
Emergency	`AgentRateLimiter`	Per-agent call rate enforcement
Observability	`HypervisorEventBus`	Real-time event streaming

Next Steps¶

Audit trails: Use DeltaEngine for hash-chained, tamper-evident delta logging.
Deployment: Read the Azure Container Apps guide for cloud-native deployment patterns.