Skip to main content

Phase Reference

Phase Summary

PhaseNameNIST AI RMFKey outputState fields updated
1AI System ScopingGovern + Mapsystem-definition-pack.md, stakeholder-impact-map.mdcurrentPhase, entryMode, securityPlanRef
2Sensitive Uses AssessmentMapsensitive-uses-screening.md, use-misuse-inventory.mdsensitiveUsesComplete, sensitiveUsesCategories, restrictedUsesCleared, gateResults
3RAI Standards MappingGovern + Measurerai-standards-mapping.mdstandardsMapped
4RAI Security Model AnalysisMeasurerai-security-model-addendum.mdraiRiskSurfaceStarted, raiThreatCount
5RAI Impact AssessmentManagecontrol-surface-catalog.md, evidence-register.md, rai-tradeoffs.mdimpactAssessmentGenerated, evidenceRegisterComplete
6Review and HandoffManagerai-scorecard.md, backlog itemshandoffGenerated, scoredDimensions

Phase 1: AI System Scoping

NIST AI RMF alignment: Govern + Map

Purpose

Establish the AI system's boundaries, identify all AI and ML components, and map stakeholder roles and data flows. This phase provides the foundation for every subsequent assessment phase.

Inputs

  • User answers to scoping questions (capture mode)
  • PRD or BRD artifacts (from-prd mode)
  • Security plan state.json and aiComponents array (from-security-plan mode)

Process

The agent asks up to 7 questions per turn covering:

  • AI system purpose and intended outcomes
  • Technology stack, model types, and frameworks
  • Stakeholder roles (developers, operators, affected individuals, oversight bodies)
  • Data inputs, training data sources, and output destinations
  • Deployment model (cloud, edge, hybrid, on-device)
  • Intended and unintended use contexts

In from-security-plan mode, AI components from the security plan are pre-populated and the agent focuses on RAI-specific aspects not covered during security assessment.

Outputs

  • system-definition-pack.md: AI system inventory, component catalog, and deployment context
  • stakeholder-impact-map.md: Stakeholder roles, power dynamics, and impact pathways

State Transitions

FieldBeforeAfter
currentPhase12
entryModeset during initunchanged
securityPlanRefnullpath (if from-security-plan)

Phase 2: Sensitive Uses Assessment

NIST AI RMF alignment: Map

Purpose

Screen the AI system against Microsoft's sensitive uses categories and identify restricted uses that require organizational escalation. This phase acts as a gate: restricted use findings may pause the assessment pending organizational review.

Inputs

  • System definition pack from Phase 1
  • Stakeholder impact map from Phase 1

Process

The agent evaluates the system against sensitive uses categories, including:

  • Applications affecting access to employment, education, housing, or financial services
  • Healthcare and medical decision support
  • Criminal justice and law enforcement
  • Government services and benefits allocation
  • Content generation that could be mistaken for human-produced content
  • Surveillance or monitoring of individuals

For each applicable category, the agent assesses:

  • Vulnerable populations affected
  • Downstream effects on individuals and groups
  • Harm severity (negligible, moderate, significant, catastrophic)
  • Whether the use falls into Microsoft's restricted uses list

Outputs

  • sensitive-uses-screening.md: Category-by-category screening results with applicability and severity
  • use-misuse-inventory.md: Intended use scenarios, foreseeable misuse scenarios, and harm pathways

State Transitions

FieldBeforeAfter
currentPhase23
sensitiveUsesCompletefalsetrue
sensitiveUsesCategories[][identified categories]
restrictedUsesClearedfalsetrue or escalation pending
gateResults.sensitiveUsesnullpass or findings
gateResults.restrictedUsesnullpass or escalation

IMPORTANT

If the system triggers restricted use criteria, the agent presents findings and recommends organizational escalation before proceeding. The user decides whether to continue, modify the system scope, or halt the assessment.

Phase 3: RAI Standards Mapping

NIST AI RMF alignment: Govern + Measure

Purpose

Map each AI component against applicable RAI principles and NIST AI RMF subcategories. Establish the evaluation framework used in Phases 4 and 5.

Inputs

  • System definition pack from Phase 1
  • Sensitive uses screening results from Phase 2

Process

The agent maps components against six RAI principles:

PrincipleFocus area
FairnessBias detection, equitable outcomes, allocation harms
Reliability and SafetyConsistent performance, failure modes, degradation paths
Privacy and SecurityData protection, consent, inference prevention
InclusivenessAccessibility, diverse populations, language equity
TransparencyExplainability, disclosure, decision traceability
AccountabilityOversight mechanisms, audit trails, remediation channels

For each principle-component pair, the agent identifies:

  • Applicable NIST AI RMF subcategories
  • Regulatory jurisdiction and framework obligations
  • Existing compliance posture

The Researcher Subagent is dispatched for runtime lookups of specific regulatory frameworks (WAF, CAF, ISO 42001, EU AI Act) when the assessment requires detail beyond embedded standards.

Outputs

  • rai-standards-mapping.md: Principle-by-component mapping with NIST subcategory references and compliance gaps

State Transitions

FieldBeforeAfter
currentPhase34
standardsMappedfalsetrue

Phase 4: RAI Security Model Analysis

NIST AI RMF alignment: Measure

Purpose

Identify AI-specific threats across all components using a structured threat taxonomy. Each threat receives a unique identifier and risk rating.

Inputs

  • System definition pack from Phase 1
  • RAI standards mapping from Phase 3
  • Security plan threat catalog (when using from-security-plan mode)

Process

The agent applies threat analysis across seven AI-specific categories:

CategoryThreat focus
Data poisoningManipulation of training or fine-tuning data
Model evasionAdversarial inputs designed to cause misclassification
Prompt injectionManipulation of LLM prompts to override instructions
Output manipulationAltering model outputs in transit or post-processing
Bias amplificationModel behavior that reinforces or amplifies existing biases
Privacy leakageExtraction of training data, PII, or sensitive information
Misuse escalationSystem capabilities repurposed for unintended harmful uses

Each threat receives an identifier in RAI-T-{CATEGORY}-{NNN} format. In from-security-plan mode, numbering continues from the security plan's threat count to maintain a unified threat registry.

Severity Matrix

Risk is calculated using a likelihood-impact matrix:

Likelihood \ ImpactLowMediumHighCritical
Very likelyMediumHighCriticalCritical
LikelyLowMediumHighCritical
PossibleLowMediumMediumHigh
UnlikelyLowLowMediumMedium
RareLowLowLowMedium

Outputs

  • rai-security-model-addendum.md: Threat catalog with IDs, categories, descriptions, risk ratings, and recommended mitigations

State Transitions

FieldBeforeAfter
currentPhase45
raiRiskSurfaceStartedfalsetrue
raiThreatCount0count of identified threats

Phase 5: RAI Impact Assessment

NIST AI RMF alignment: Manage

Purpose

Evaluate control surface completeness for each identified threat. Document evidence of existing mitigations, identify gaps, and analyze tradeoffs between competing RAI principles.

Inputs

  • RAI security model addendum from Phase 4
  • RAI standards mapping from Phase 3
  • Evidence provided by the user or discovered in the codebase

Process

For each threat identified in Phase 4, the agent evaluates:

  • Whether a control or mitigation exists
  • What evidence supports the control's effectiveness
  • Whether the control introduces tradeoffs with other RAI principles
  • What gaps remain and what remediation is recommended

Common tradeoff examples:

TradeoffExample
Transparency vs. PrivacyExplaining model decisions may reveal sensitive training data
Fairness vs. PerformanceDebiasing techniques may reduce model accuracy for some populations
Safety vs. InclusivenessConservative safety filters may disproportionately restrict certain user groups

Outputs

  • control-surface-catalog.md: Control inventory mapped to threats with effectiveness ratings
  • evidence-register.md: Evidence log documenting existing mitigations, gaps, and collection difficulty
  • rai-tradeoffs.md: Principle conflict analysis with resolution recommendations

State Transitions

FieldBeforeAfter
currentPhase56
impactAssessmentGeneratedfalsetrue
evidenceRegisterCompletefalsetrue

Phase 6: Review and Handoff

NIST AI RMF alignment: Manage

Purpose

Produce the RAI scorecard summarizing assessment quality across five scored dimensions. Generate backlog items for unresolved gaps and hand off to ADO or GitHub.

Inputs

  • All artifacts from Phases 1-5
  • User preferences for backlog format and handoff system

Process

The agent scores the assessment across five dimensions on a 1-5 scale:

DimensionWhat it measures
Scope Boundary ClarityHow well the AI system boundaries and components are defined
Risk Identification QualityCompleteness and accuracy of threat identification
Control Surface AdequacyCoverage and effectiveness of controls for identified threats
Evidence SufficiencyQuality and availability of evidence supporting control effectiveness
Future Work GovernanceClarity of plans for ongoing monitoring, audit, and remediation

Scoring

  • Each dimension: 1-5 scale
  • Total: sum of all dimensions (maximum 25)
  • Outcomes:
    • Approved (20-25): Assessment is comprehensive; proceed with identified mitigations
    • Conditional (15-19): Assessment has gaps; proceed with conditions and timeline for remediation
    • Remediation Required (below 15): Significant gaps identified; remediation before proceeding

Backlog Generation

Gaps identified across Phases 2-5 are converted to work items using the same dual-platform format as the Security Planner:

  • ADO work items use WI-RAI-{NNN} temporary IDs
  • GitHub issues use {{RAI-TEMP-N}} temporary IDs
  • Default autonomy tier is Partial: items are created but require user confirmation before submission

Outputs

  • rai-scorecard.md: Five-dimension scoring with total, outcome, and summary narrative
  • Backlog items in the user's preferred format

State Transitions

FieldBeforeAfter
currentPhase66 (terminal)
handoffGeneratedfalsetrue
scoredDimensions.*nullscored values
scoredDimensions.totalnullsum
scoredDimensions.outcomenullApproved, Conditional, or Remediation Required

🤖 Crafted with precision by ✨Copilot following brilliant human instruction, then carefully refined by our team of discerning human reviewers.