Skip to main content

Phase Reference

Phase Summary

PhaseNameNIST AI RMFKey outputState fields updated
1AI System ScopingGovern + Mapsystem-definition-pack.md, stakeholder-impact-map.mdcurrentPhase, entryMode, securityPlanRef
2Risk ClassificationGovernRisk classification screening summary (appended to system-definition-pack.md)riskClassification
3RAI Standards MappingGovern + Measurerai-standards-mapping.mdstandardsMapped
4RAI Security Model AnalysisMeasurerai-security-model-addendum.mdsecurityModelAnalysisStarted, raiThreatCount
5RAI Impact AssessmentManagecontrol-surface-catalog.md, evidence-register.md, rai-tradeoffs.mdimpactAssessmentGenerated, evidenceRegisterComplete
6Review and HandoffManagerai-review-summary.md, backlog itemshandoffGenerated

Phase 1: AI System Scoping

NIST AI RMF alignment: Govern + Map

Purpose

Establish the AI system's boundaries, identify all AI and ML components, and map stakeholder roles and data flows. This phase provides the foundation for every subsequent assessment phase.

Inputs

  • User answers to scoping questions (capture mode)
  • PRD or BRD artifacts (from-prd mode)
  • Security plan state.json and aiComponents array (from-security-plan mode)

Process

The agent asks up to 7 questions per turn covering:

  • AI system purpose and intended outcomes
  • Technology stack, model types, and frameworks
  • Stakeholder roles (developers, operators, affected individuals, oversight bodies)
  • Data inputs, training data sources, and output destinations
  • Deployment model (cloud, edge, hybrid, on-device)
  • Intended and unintended use contexts

In from-security-plan mode, AI components from the security plan are pre-populated and the agent focuses on RAI-specific aspects not covered during security assessment.

Outputs

  • system-definition-pack.md: AI system inventory, component catalog, and deployment context
  • stakeholder-impact-map.md: Stakeholder roles, power dynamics, and impact pathways

State Transitions

FieldBeforeAfter
currentPhase12
entryModeset during initunchanged
securityPlanRefnullpath (if from-security-plan)

Phase 2: Risk Classification

NIST AI RMF alignment: Govern

Purpose

Screen the AI system for risk using NIST AI RMF 1.0 trustworthiness characteristics to determine the suggested assessment depth tier for subsequent phases. The prohibited uses gate executes first, then three risk indicators derived from NIST subcategories determine the suggested depth tier.

Inputs

  • System definition pack from Phase 1

Process

The agent evaluates three risk indicators derived from NIST AI RMF 1.0 trustworthiness characteristics:

IndicatorNIST sourceMethodDomain
safety_reliabilityValid and Reliable, SafeBinaryPhysical harm, psychological injury, operational disruption from unreliable outputs
rights_fairness_privacyAccountable and Transparent, Privacy-Enhanced, Fair with Harmful Bias ManagedCategoricalDiscrimination, rights restriction, equitable access, personal data misuse
security_explainabilitySecure and Resilient, Explainable and InterpretableContinuousAdversarial attacks, data poisoning, model theft, inability to explain decisions

Each indicator uses its own assessment method (binary, categorical, or continuous). When the user has supplied custom risk indicator extensions, those are incorporated as additional evaluation criteria alongside the default NIST-derived indicators.

Depth Tier Assignment

Activated countSuggested tier
0Basic
1Standard
2+Comprehensive

Outputs

  • Risk classification screening summary appended to system-definition-pack.md

State Transitions

FieldBeforeAfter
currentPhase23
riskClassificationnullrisk classification results object

Phase 3: RAI Standards Mapping

NIST AI RMF alignment: Govern + Measure

Purpose

Map each AI component against NIST AI RMF 1.0 trustworthiness characteristics and subcategories. Establish the evaluation framework used in Phases 4 and 5.

Inputs

  • System definition pack from Phase 1

Process

The agent maps components against seven NIST AI RMF 1.0 trustworthiness characteristics:

CharacteristicFocus area
Valid and ReliableAccurate outputs, consistent performance, degradation paths
SafePhysical and psychological harm prevention, failure modes
Secure and ResilientAdversarial robustness, data protection, recovery mechanisms
Accountable and TransparentOversight mechanisms, audit trails, decision traceability
Explainable and InterpretableModel interpretability, decision explanation, disclosure
Privacy-EnhancedData minimization, consent, inference prevention
Fair with Harmful Bias ManagedBias detection, equitable outcomes, allocation harms

For each characteristic-component pair, the agent identifies:

  • Applicable NIST AI RMF subcategories
  • Regulatory jurisdiction and framework obligations
  • Existing compliance posture

The Researcher Subagent is dispatched for runtime lookups of specific regulatory frameworks (WAF, CAF, ISO 42001, EU AI Act) when the assessment requires detail beyond embedded standards.

Outputs

  • rai-standards-mapping.md: Characteristic-by-component mapping with NIST subcategory references and compliance gaps

State Transitions

FieldBeforeAfter
currentPhase34
standardsMappedfalsetrue

Phase 4: RAI Security Model Analysis

NIST AI RMF alignment: Measure

Purpose

Identify AI-specific threats across all components using a structured threat taxonomy. Each threat receives a unique identifier and risk rating.

Inputs

  • System definition pack from Phase 1
  • RAI standards mapping from Phase 3
  • Security plan threat catalog (when using from-security-plan mode)

Process

The agent applies threat analysis across seven AI-specific categories:

CategoryThreat focus
Data poisoningManipulation of training or fine-tuning data
Model evasionAdversarial inputs designed to cause misclassification
Prompt injectionManipulation of LLM prompts to override instructions
Output manipulationAltering model outputs in transit or post-processing
Bias amplificationModel behavior that reinforces or amplifies existing biases
Privacy leakageExtraction of training data, PII, or sensitive information
Misuse escalationSystem capabilities repurposed for unintended harmful uses

Each threat receives a sequential identifier in T-RAI-{NNN} format. When a threat overlaps with a Security Planner operational bucket, it also receives a T-{BUCKET}-AI-{NNN} cross-reference ID. In from-security-plan mode, numbering continues from the security plan's threat count to maintain a unified threat registry.

Concern Level Assessment

Each threat is assigned a concern level based on contextual analysis rather than a fixed matrix:

Concern levelMeaning
LowThreat is unlikely to manifest or impact is minimal
ModerateThreat warrants attention and monitoring
HighThreat requires active mitigation before production deployment

Outputs

  • rai-security-model-addendum.md: Threat catalog with IDs, categories, descriptions, risk ratings, and recommended mitigations

State Transitions

FieldBeforeAfter
currentPhase45
securityModelAnalysisStartedfalsetrue
raiThreatCount0count of identified threats

Phase 5: RAI Impact Assessment

NIST AI RMF alignment: Manage

Purpose

Explore control surface completeness for each identified threat. Document evidence of existing mitigations, identify gaps, and analyze tradeoffs between competing trustworthiness characteristics.

Inputs

  • RAI security model addendum from Phase 4
  • RAI standards mapping from Phase 3
  • Evidence provided by the user or discovered in the codebase

Process

For each threat identified in Phase 4, the agent evaluates:

  • Whether a control or mitigation exists
  • What evidence supports the control's effectiveness
  • Whether the control introduces tradeoffs with other trustworthiness characteristics
  • What gaps remain and what remediation is recommended

Common tradeoff examples:

TradeoffExample
Transparency vs. PrivacyExplaining model decisions may reveal sensitive training data
Fairness vs. PerformanceDebiasing techniques may reduce model accuracy for some populations
Safety vs. InclusivenessConservative safety filters may disproportionately restrict certain user groups

Outputs

  • control-surface-catalog.md: Control inventory mapped to threats with effectiveness ratings
  • evidence-register.md: Evidence log documenting existing mitigations, gaps, and collection difficulty
  • rai-tradeoffs.md: Characteristic conflict analysis with resolution recommendations

The control surface catalog and evidence register are agentic artifacts that include only the AI-content transparency note. RAI tradeoffs is a human-facing artifact that also includes the human review checkbox. See Handoff Pipeline for the complete footer classification.

State Transitions

FieldBeforeAfter
currentPhase56
impactAssessmentGeneratedfalsetrue
evidenceRegisterCompletefalsetrue

Phase 6: Review and Handoff

NIST AI RMF alignment: Manage

Purpose

Generate a review summary covering observations across six dimensions and produce backlog items for identified gaps. Hand off to ADO or GitHub.

Inputs

  • All artifacts from Phases 1-5
  • User preferences for backlog format and handoff system

Process

The agent generates a review summary covering observations across six dimensions:

DimensionWhat it covers
Standards AlignmentWhether findings map to NIST AI RMF 1.0 trustworthiness characteristics and subcategories
Threat CompletenessCompleteness and accuracy of threat identification
Control EffectivenessCoverage and effectiveness of controls for identified threats
Evidence QualityQuality and availability of evidence supporting control effectiveness
Tradeoff ResolutionWhether competing characteristic tradeoffs have been analyzed and resolved
Risk ClassificationWhether risk indicators were evaluated with documented mitigations

The review summary presents observations and maturity indicators rather than numeric scores. Findings are organized to support handoff decisions and backlog prioritization.

Backlog Generation

Gaps identified across Phases 3-5 are converted to work items using the same dual-platform format as the Security Planner:

  • ADO work items use WI-RAI-{NNN} temporary IDs
  • GitHub issues use {{RAI-TEMP-N}} temporary IDs
  • Default autonomy tier is Partial: items are created but require user confirmation before submission

Outputs

  • rai-review-summary.md: Review summary with observations across six dimensions and maturity indicators
  • Backlog items in the user's preferred format

All Phase 6 artifacts are human-facing and include the AI-content transparency note and human review checkbox. The handoff summary and compact handoff summary also include the full RAI Planner disclaimer. See Handoff Pipeline for the complete footer classification.

State Transitions

FieldBeforeAfter
currentPhase66 (terminal)
handoffGenerated{ "ado": false, "github": false }{ "ado": true, "github": true }

🤖 Crafted with precision by ✨Copilot following brilliant human instruction, then carefully refined by our team of discerning human reviewers.