Skip to content

OWASP Agentic Security Initiative (ASI 2026) โ€” Reference Architecture

Disclaimer: This document is an internal self-assessment mapping, NOT a validated certification or third-party audit. It documents how the toolkit's capabilities align with the referenced standard. Organizations must perform their own compliance assessments with qualified auditors.

Version: 1.0 ยท Taxonomy: OWASP ASI 2026 (ASI01โ€“ASI11) Scope: Agent Governance Toolkit (AGT) mitigation patterns, code evidence, and gap analysis.


Executive Summary

The OWASP Agentic Security Initiative defines 11 risks (ASI01โ€“ASI11) specific to autonomous AI agent systems. This document maps every risk to concrete AGT mitigation patterns, links to implementation evidence, and provides an honest coverage assessment.

Coverage Summary

ASI ID Risk Title Coverage Primary AGT Component
ASI01 Agent Goal Hijack โœ… Full governanceMiddleware โ€” blockedPatterns
ASI02 Tool Misuse and Exploitation โœ… Full createGovernedTool โ€” allow/deny-lists
ASI03 Identity and Privilege Abuse โœ… Full PII redaction, RBAC in policy YAML
ASI04 Agentic Supply Chain โš ๏ธ Partial Policy YAML tool pinning; no SBOM
ASI05 Unexpected Code Execution โœ… Full Static reviewer detects pickle/eval
ASI06 Memory and Context Poisoning โš ๏ธ Partial Audit hash-chain; no memory sandbox
ASI07 Insecure Inter-Agent Communication โœ… Full Trust-gate with DID verification
ASI08 Cascading Agent Failures โœ… Full Circuit breaker, rate limiter
ASI09 Human-Agent Trust Exploitation โš ๏ธ Partial Audit trail; no UI-level guardrails
ASI10 Rogue Agents โœ… Full AgentBehaviorMonitor, quarantine
ASI11 Agent Untraceability โœ… Full Tamper-evident audit log (hash chain)

Overall: 8/11 Full, 3/11 Partial, 0 Gaps.


Top-Level Architecture

flowchart TD
    User([User / Copilot]) -->|prompt| GW[Governance Middleware]
    GW -->|blocked patterns check| PI{ASI01\nGoal Hijack?}
    PI -- clean --> TR[Tool Router]
    PI -- blocked --> DENY[Deny + Audit]
    TR -->|allow-list check| TG{ASI02\nTool Misuse?}
    TG -- allowed --> TOOL[Tool Execution]
    TG -- denied --> DENY
    TOOL --> CB{ASI08\nCircuit Breaker}
    CB -- ok --> AUDIT[Audit Middleware]
    CB -- open --> DENY
    AUDIT -->|hash-chain| LOG[(Audit Store)]
    TOOL -.->|inter-agent| TRUST{ASI07\nTrust Gate}
    TRUST -- verified --> SUB[Sub-Agent]
    TRUST -- rejected --> DENY
    SUB --> MON{ASI10\nBehavior Monitor}
    MON -- normal --> AUDIT
    MON -- anomaly --> QUARANTINE[Quarantine Agent]

Risk Details

ASI01 โ€” Agent Goal Hijack

Risk: Adversarial inputs override an agent's intended goal.

AGT Mitigation: The governanceMiddleware applies blockedPatterns (regex) to every inbound message before it reaches the LLM. Patterns are loaded from the policy YAML at runtime โ€” not hardcoded in source.

Evidence: - agent-governance-python/agent-os/src/agent_os/governance/middleware.py โ€” _check_blocked_patterns() - agent-governance-python/agentmesh-integrations/copilot-governance/src/reviewer.ts โ€” rule no-prompt-injection-guards

Coverage: โœ… Full


ASI02 โ€” Tool Misuse and Exploitation

Risk: An agent invokes tools in unintended or dangerous ways.

AGT Mitigation: createGovernedTool wraps every tool with allow-list / deny-list enforcement and per-tool rate limits. The static reviewer flags unguarded .execute() calls.

Evidence: - agent-governance-python/agent-os/src/agent_os/governance/tool_wrapper.py - agent-governance-python/agentmesh-integrations/copilot-governance/src/reviewer.ts โ€” rules unguarded-tool-execution, no-tool-allowlist

Coverage: โœ… Full


ASI03 โ€” Identity and Privilege Abuse

Risk: Agents acquire privileges beyond their role, exposing sensitive data.

AGT Mitigation: PII redaction middleware strips sensitive fields before forwarding. Policy YAML supports field-level pii_fields configuration.

Evidence: - agent-governance-python/agent-os/src/agent_os/governance/middleware.py โ€” _redact_pii() - agent-governance-python/agentmesh-integrations/copilot-governance/src/reviewer.ts โ€” rule missing-pii-redaction

Coverage: โœ… Full


ASI04 โ€” Agentic Supply Chain Vulnerabilities

Risk: Compromised plugins or sub-agents inject malicious behaviour.

AGT Mitigation: Policy YAML allowed_tools pins the exact set of permitted tool IDs. The static reviewer detects hardcoded deny-lists (which attackers can reverse-engineer) and recommends externalised config.

Known Gap: No SBOM generation or dependency vulnerability scanning is built into AGT. Recommend integrating with GitHub Advanced Security / Dependabot for dependency-level supply-chain coverage.

Evidence: - agent-governance-python/agentmesh-integrations/copilot-governance/src/reviewer.ts โ€” rule hardcoded-security-denylist - Policy YAML schema: allowed_tools, blocked_tools

Coverage: โš ๏ธ Partial


ASI05 โ€” Unexpected Code Execution (RCE)

Risk: Agent-driven code paths achieve arbitrary code execution.

AGT Mitigation: The static reviewer detects pickle.loads() without HMAC verification and flags it as critical. The governance policy blocks eval() and exec() in agent code via lint rules.

Evidence: - agent-governance-python/agentmesh-integrations/copilot-governance/src/reviewer.ts โ€” rule unsafe-deserialization

Coverage: โœ… Full


ASI06 โ€” Memory and Context Poisoning

Risk: Persistent memory stores are manipulated to corrupt future decisions.

AGT Mitigation: The audit hash-chain provides tamper detection for any persisted state. However, AGT does not yet sandbox agent memory stores or provide memory integrity checksums at the application layer.

Known Gap: No dedicated memory-sandbox or context-integrity module. Consider adding a ContextValidator that hashes memory snapshots.

Evidence: - agent-governance-python/agent-os/src/agent_os/audit/hash_chain.py

Coverage: โš ๏ธ Partial


ASI07 โ€” Insecure Inter-Agent Communication

Risk: Messages between agents lack authentication or integrity verification.

AGT Mitigation: The trust-gate requires DID-based identity verification before any agent-to-agent handoff. The static reviewer detects missing trust verification in multi-agent orchestration code.

Evidence: - agent-governance-python/agent-os/src/agent_os/trust/gate.py - agent-governance-python/agentmesh-integrations/copilot-governance/src/reviewer.ts โ€” rule missing-trust-verification

Coverage: โœ… Full


ASI08 โ€” Cascading Agent Failures

Risk: A failure in one agent propagates through the system.

AGT Mitigation: The circuit-breaker pattern opens after N consecutive failures, preventing cascade. Rate limiting caps per-minute tool invocations.

Evidence: - agent-governance-python/agentmesh-integrations/copilot-governance/src/reviewer.ts โ€” rule missing-circuit-breaker - agent-governance-python/agent-os/src/agent_os/governance/middleware.py โ€” _rate_limit_check()

Coverage: โœ… Full


ASI09 โ€” Human-Agent Trust Exploitation

Risk: Humans over-trust agent outputs and skip validation.

AGT Mitigation: Tamper-evident audit logs let reviewers verify what the agent actually did. The static reviewer flags code with no audit logging.

Known Gap: No UI-level confirmation dialogs or "human-in-the-loop" approval workflows are built into AGT. Consider adding a HumanApproval middleware for high-risk actions.

Evidence: - agent-governance-python/agentmesh-integrations/copilot-governance/src/reviewer.ts โ€” rule missing-audit-logging

Coverage: โš ๏ธ Partial


ASI10 โ€” Rogue Agents

Risk: An agent deviates from intended behaviour.

AGT Mitigation: AgentBehaviorMonitor tracks per-agent metrics (tool call rate, failure rate, privilege escalation attempts) and quarantines agents that exceed thresholds.

Evidence: - agent-governance-python/agent-os/src/agent_os/services/behavior_monitor.py - agent-governance-python/agentmesh-integrations/copilot-governance/src/reviewer.ts โ€” rule no-behavior-monitoring

Coverage: โœ… Full


ASI11 โ€” Agent Untraceability

Risk: Agent actions lack logging, provenance, or audit trails.

AGT Mitigation: The audit middleware produces a hash-chain log where each entry contains the SHA-256 of the previous entry, making tampering detectable. The static reviewer flags code without audit logging.

Evidence: - agent-governance-python/agent-os/src/agent_os/audit/hash_chain.py - agent-governance-python/agentmesh-integrations/copilot-governance/src/reviewer.ts โ€” rule missing-audit-logging

Coverage: โœ… Full


Deployment Architecture

flowchart LR
    subgraph GitHub Copilot
        EXT[Copilot Extension]
    end
    subgraph AGT Runtime
        GOV[Governance Middleware]
        AUD[Audit Middleware]
        TRUST[Trust Gate]
        MON[Behavior Monitor]
    end
    subgraph External
        LLM[LLM Provider]
        TOOLS[Tool APIs]
        STORE[(Audit Store)]
    end
    EXT --> GOV
    GOV --> LLM
    GOV --> TOOLS
    GOV --> AUD --> STORE
    GOV --> TRUST
    GOV --> MON

Lessons Learned

  1. Hardcoded deny-lists are discoverable. External security researchers reverse-engineered blocked-pattern lists from source code. Externalise security rules into runtime-loaded YAML configs.

  2. Stub verify() functions are a recurring root cause. Two separate incidents traced back to return True stubs in trust verification. The static reviewer now flags these as critical.

  3. Unbounded dictionaries cause memory DoS. Session caches and rate-limit buckets need explicit size limits and eviction policies.

  4. Backward compatibility matters. When migrating from ATโ†’ASI taxonomy, provide a legacy lookup map so existing integrations don't break silently.


Generated for Agent Governance Toolkit ยท OWASP ASI 2026 Taxonomy