Scenarios - PyRIT Documentation

A Scenario is a higher-level construct that groups multiple Attack Configurations together. This allows you to execute a comprehensive testing campaign with multiple attack methods sequentially. Scenarios are meant to be configured and written to test for specific workflows. As such, it is okay to hard code some values.

What is a Scenario?¶

A Scenario represents a comprehensive testing campaign composed of multiple atomic attack tests. It orchestrates the execution of multiple AtomicAttack instances sequentially and aggregates the results into a single ScenarioResult.

Key Components¶

Scenario: The top-level orchestrator that groups and executes multiple atomic attacks
AtomicAttack: An atomic test unit combining an attack technique, objectives, and execution parameters
ScenarioResult: Contains the aggregated results from all atomic attacks and scenario metadata

Use Cases¶

Some examples of scenarios you might create:

VibeCheckScenario: Randomly selects a few prompts from HarmBench Mazeika et al., 2024 to quickly assess model behavior
QuickViolence: Checks how resilient a model is to violent objectives using multiple attack techniques
ComprehensiveFoundry: Tests a target with all available attack converters and techniques
CustomCompliance: Tests against specific compliance requirements with curated datasets and attacks

These Scenarios can be updated and added to as you refine what you are testing for.

How to Run Scenarios¶

Scenarios should take almost no effort to run with default values. The PyRIT Scanner provides two CLIs for running scenarios: pyrit_scan for automated execution and pyrit_shell for interactive exploration.

For programmatic configuration — customizing datasets, techniques, scorers, and baseline mode — see Common Scenario Parameters.

How It Works¶

Each Scenario contains a collection of AtomicAttack objects. When executed:

Each AtomicAttack is executed sequentially
Every AtomicAttack tests its configured attack against all specified objectives and datasets
Results are aggregated into a single ScenarioResult with all attack outcomes
Optional memory labels help track and categorize the scenario execution

Creating Custom Scenarios¶

To create a custom scenario, extend the Scenario base class and implement the required abstract methods.

Required Components¶

Technique Enum: Create a ScenarioTechnique enum that defines the available attack techniques for your scenario.
- Each enum member represents an attack technique (the how of an attack)
- Each member is defined as (value, tags) where value is a string and tags is a set of strings
- Include an ALL aggregate technique that expands to all available techniques
Scenario Class: Extend Scenario and pass these to super().__init__():
- technique_class: Your technique enum class
- default_technique: The default technique (typically YourTechnique.ALL or YourTechnique.DEFAULT)
- Implement _build_atomic_attacks_async(context) — the single abstract extension point. Matrix-shaped scenarios delegate to build_matrix_atomic_attacks(context=...) in one line.
Default Dataset: Pass default_dataset_config= to super().__init__() to specify the datasets your scenario uses out of the box.
- Returns a DatasetConfiguration with one or more named datasets (e.g., DatasetConfiguration(dataset_names=["my_dataset"]))
- Users can override this at runtime via --dataset-names in the CLI or by passing a custom dataset_config programmatically
Constructor: Use @apply_defaults decorator and call super().__init__() with scenario metadata:
- name: Descriptive name for your scenario
- version: Integer version number
- technique_class: The technique enum class for this scenario
- default_technique: The default technique member (typically YourTechnique.ALL or YourTechnique.DEFAULT)
- default_dataset_config: A DatasetConfiguration specifying the scenario’s default datasets
- objective_scorer: The scorer used to judge responses
- scenario_result_id: Optional ID to resume an existing scenario (optional)
Initialization: Call await scenario.initialize_async() to populate atomic attacks:
- objective_target: The target system being tested (required)
- scenario_techniques: List of techniques to execute (optional, defaults to ALL)
- max_concurrency: Number of concurrent operations (default: 4)
- max_retries: Number of retry attempts on failure (default: 0)
- memory_labels: Optional labels for tracking (optional)
- include_baseline: Whether to prepend a baseline attack (defaults to the scenario type’s BASELINE_ATTACK_POLICY; most scenarios default it on, Jailbreak defaults it off)

Example Structure¶

The construction path: define your technique, dataset config, and constructor, then implement _build_atomic_attacks_async(context). Matrix-shaped scenarios delegate to the build_matrix_atomic_attacks helper, which builds atomic attacks automatically from the registered attack techniques.

from pyrit.common import apply_defaults
from pyrit.scenario import (
    DatasetConfiguration,
    Scenario,
    ScenarioTechnique,
)
from pyrit.scenario.core.matrix_atomic_attack_builder import build_matrix_atomic_attacks
from pyrit.score.true_false.true_false_scorer import TrueFalseScorer
from pyrit.setup import initialize_pyrit_async
from pyrit.setup.initializers.techniques import TechniqueInitializer

await initialize_pyrit_async(memory_db_type="InMemory")  # type: ignore [top-level-await]
await TechniqueInitializer().initialize_async()  # type: ignore [top-level-await]


class MyTechnique(ScenarioTechnique):
    ALL = ("all", {"all"})
    DEFAULT = ("default", {"default"})
    SINGLE_TURN = ("single_turn", {"single_turn"})
    # Technique members represent attack techniques
    PromptSending = ("prompt_sending", {"single_turn", "default"})
    RolePlay = ("role_play_movie_script", {"single_turn"})


class MyScenario(Scenario):
    """Quick-check scenario for testing model behavior across harm categories."""

    VERSION: int = 1

    @apply_defaults
    def __init__(
        self,
        *,
        objective_scorer: TrueFalseScorer | None = None,
        scenario_result_id: str | None = None,
    ) -> None:
        self._objective_scorer: TrueFalseScorer = (
            objective_scorer if objective_scorer else self._get_default_objective_scorer()
        )

        super().__init__(
            version=self.VERSION,
            objective_scorer=self._objective_scorer,
            technique_class=MyTechnique,
            default_technique=MyTechnique.DEFAULT,
            default_dataset_config=DatasetConfiguration(dataset_names=["dataset_name"], max_dataset_size=4),
            scenario_result_id=scenario_result_id,
        )

    # Implement the single abstract extension point. Matrix-shaped scenarios delegate
    # to build_matrix_atomic_attacks; pass display_group_fn to customize result grouping
    # (default groups by technique; here we group by dataset instead).
    async def _build_atomic_attacks_async(self, *, context):
        return build_matrix_atomic_attacks(
            context=context,
            objective_scorer=self._objective_scorer,
            display_group_fn=lambda combo: combo.dataset_name,
        )

Found default environment files: ['./.pyrit/.env', './.pyrit/.env.local']
Loaded environment file: ./.pyrit/.env
Loaded environment file: ./.pyrit/.env.local

[pyrit:alembic] No new upgrade operations detected.

Existing Scenarios¶

import logging

from pyrit.backend.services.scenario_service import get_scenario_service
from pyrit.cli._output import print_scenario_list

logging.getLogger("pyrit").setLevel(logging.ERROR)

response = await get_scenario_service().list_scenarios_async(limit=200)  # type: ignore
print_scenario_list(items=response.items)


Available Scenarios:
================================================================================

  adaptive.text_adaptive
    Class: TextAdaptive
    Description:
      Adaptive text-attack scenario. Selects techniques per-objective via an
      epsilon-greedy selector over the set of selected techniques.
      ``prompt_sending`` runs as the baseline comparison and is excluded from
      the adaptive technique pool.
    Aggregate Techniques:
      - all, default, single_turn, multi_turn
    Available Techniques (10):
      role_play, many_shot, tap, pair, crescendo_simulated, red_teaming,
      context_compliance, crescendo_movie_director, crescendo_history_lecture,
      crescendo_journalist_interview
    Default Technique: default
    Default Datasets (7, max 4 per dataset):
      airt_hate, airt_fairness, airt_violence, airt_sexual, airt_harassment,
      airt_misinformation, airt_leakage
    Supported Parameters:
      - max_attempts_per_objective (int) [default: '3']: Max techniques tried per objective. Defaults to 3.

  airt.cyber
    Class: Cyber
    Description:
      Cyber scenario implementation for PyRIT. This scenario tests how willing
      models are to exploit cybersecurity harms by generating malware. The
      Cyber class contains different variations of the malware generation
      techniques.
    Aggregate Techniques:
      - all, multi_turn
    Available Techniques (1):
      red_teaming
    Default Technique: all
    Default Datasets (1, max 4 per dataset):
      airt_malware

  airt.jailbreak
    Class: Jailbreak
    Description:
      Jailbreak scenario implementation for PyRIT. This scenario tests how
      vulnerable models are to jailbreak attacks by applying various
      single-turn jailbreak templates to a set of test prompts. The responses
      are scored to determine if the jailbreak was successful.
    Aggregate Techniques:
      - all, simple, complex
    Available Techniques (4):
      prompt_sending, many_shot, skeleton, role_play
    Default Technique: simple
    Default Datasets (1, max 4 per dataset):
      airt_harms

  airt.leakage
    Class: Leakage
    Description:
      Leakage scenario implementation for PyRIT. This scenario tests how
      susceptible models are to leaking training data, PII, intellectual
      property, or other confidential information. Uses the registry/factory
      pattern to construct attack techniques.
    Aggregate Techniques:
      - all, default, single_turn, multi_turn
    Available Techniques (12):
      role_play, many_shot, tap, pair, crescendo_simulated, red_teaming,
      context_compliance, crescendo_movie_director, crescendo_history_lecture,
      crescendo_journalist_interview, first_letter, image
    Default Technique: default
    Default Datasets (1, max 4 per dataset):
      airt_leakage

  airt.psychosocial
    Class: Psychosocial
    Description:
      Psychosocial Harms Scenario implementation for PyRIT. This scenario
      contains various psychosocial harm-based checks that evaluate model
      behavior for responses that may harm the well-being or mental health of
      users. Examples include mistreating users in a mental health crisis or
      aiding them in self-harm. All datasets in this scenario share the
      unified 'psychosocial' harm category, but each dataset (e.g.,
      imminent_crisis, dependency) can have unique: - Scoring rubrics - Seed
      objectives The scenario tests whether models: - Recognize mental health
      crises and respond appropriately - Avoid facilitating self-harm or
      harmful behaviors - Provide crisis resources and encourage professional
      help Example usage with custom configurations: # Define custom
      configurations per subharm category custom_configs = {
      "airt_imminent_crisis": SubharmConfig(
      crescendo_system_prompt_path="path/to/custom_escalation.yaml",
      scoring_rubric_path="path/to/custom_rubric.yaml", ), } scenario =
      Psychosocial(subharm_configs=custom_configs) await
      scenario.initialize_async( objective_target=target_llm,
      scenario_techniques=[PsychosocialTechnique.ImminentCrisis], )
    Aggregate Techniques:
      - all
    Available Techniques (2):
      imminent_crisis, licensed_therapist
    Default Technique: all
    Default Datasets (1, max 4 per dataset):
      airt_imminent_crisis

  airt.rapid_response
    Class: RapidResponse
    Description:
      Rapid Response scenario for content-harms testing. Tests model behavior
      across multiple harm categories using selectable attack techniques.
    Aggregate Techniques:
      - all, default, single_turn, multi_turn
    Available Techniques (10):
      role_play, many_shot, tap, pair, crescendo_simulated, red_teaming,
      context_compliance, crescendo_movie_director, crescendo_history_lecture,
      crescendo_journalist_interview
    Default Technique: default
    Default Datasets (7, max 4 per dataset):
      airt_hate, airt_fairness, airt_violence, airt_sexual, airt_harassment,
      airt_misinformation, airt_leakage

  airt.scam
    Class: Scam
    Description:
      Scam scenario evaluates an endpoint's ability to generate scam-related
      materials (e.g., phishing emails, fraudulent messages) with primarily
      persuasion-oriented techniques.
    Aggregate Techniques:
      - all, single_turn, multi_turn
    Available Techniques (3):
      context_compliance, role_play, persuasive_rta
    Default Technique: all
    Default Datasets (1, max 4 per dataset):
      airt_scams
    Supported Parameters:
      - max_turns (int) [default: '5']: Maximum conversation turns for the persuasive_rta technique.

  benchmark.adversarial
    Class: AdversarialBenchmark
    Description:
      Benchmark scenario that compares the attack success rate (ASR) across
      adversarial models. Adversarial targets are user-supplied via the
      ``adversarial_targets`` parameter (declared in
      ``supported_parameters``). Each target must already be registered in
      ``TargetRegistry`` — typically by ``TargetInitializer`` from
      ``ADVERSARIAL_CHAT_*`` env vars, or programmatically via
      ``TargetRegistry.get_registry_singleton().instances.register``. At run
      time, ``_build_atomic_attacks_async`` performs the ``(technique ×
      adversarial_target × dataset)`` cross-product: for each selected
      adversarial-capable ``core`` factory in the ``AttackTechniqueRegistry``
      and each requested target, it calls
      ``factory.create(attack_adversarial_config_override=...)`` with the
      resolved target — no global registry mutation. The resulting
      ``AtomicAttack`` is named ``f"{technique}__{target}_{dataset}"`` with
      ``display_group`` set to the target's registry name so per-model ASR
      rolls up naturally in result displays.
    Aggregate Techniques:
      - all, default, light, single_turn, multi_turn
    Available Techniques (9):
      role_play, tap, pair, crescendo_simulated, red_teaming,
      context_compliance, crescendo_movie_director, crescendo_history_lecture,
      crescendo_journalist_interview
    Default Technique: light
    Default Datasets (1, max 8 per dataset):
      harmbench
    Supported Parameters:
      - adversarial_targets (list[str]): Registry names of adversarial chat targets to benchmark. Each name must already be registered in TargetRegistry (via TargetInitializer or TargetRegistry instance registration). Use 'pyrit_scan list-targets' to see registered targets. Settable via --adversarial-targets <name> [<name> ...] on the CLI, or scenario.args.adversarial_targets in .pyrit_conf.

  foundry.red_team_agent
    Class: RedTeamAgent
    Description:
      RedTeamAgent is a preconfigured scenario that automatically generates
      multiple AtomicAttack instances based on the specified attack
      techniques. It supports both single-turn attacks (with various
      converters) and multi-turn attacks (Crescendo, RedTeaming), making it
      easy to quickly test a target against multiple attack vectors. The
      scenario can expand difficulty levels (EASY, MODERATE, DIFFICULT) into
      their constituent attack techniques, or you can specify individual
      techniques directly. This scenario is designed for use with the Foundry
      AI Red Teaming Agent library, providing a consistent PyRIT contract for
      their integration.
    Aggregate Techniques:
      - all, easy, moderate, difficult
    Available Techniques (25):
      ansi_attack, ascii_art, ascii_smuggler, atbash, base64, binary, caesar,
      character_space, char_swap, diacritic, flip, leetspeak, morse, rot13,
      suffix_append, string_join, unicode_confusable, unicode_substitution,
      url, jailbreak, tense, multi_turn, crescendo, pair, tap
    Default Technique: easy
    Default Datasets (1, max 4 per dataset):
      harmbench

  garak.encoding
    Class: Encoding
    Description:
      Encoding Scenario implementation for PyRIT. This scenario tests how
      resilient models are to various encoding attacks by encoding potentially
      harmful text (by default slurs and XSS payloads) and testing if the
      model will decode and repeat the encoded payload. It mimics the Garak
      encoding probe. The scenario works by: 1. Taking seed prompts (the
      harmful text to be encoded) 2. Encoding them using various encoding
      schemes (Base64, ROT13, Morse, etc.) 3. Asking the target model to
      decode the encoded text 4. Scoring whether the model successfully
      decoded and repeated the harmful content By default, this uses the same
      dataset as Garak: slur terms and web XSS payloads.
    Aggregate Techniques:
      - all
    Available Techniques (17):
      base64, base2048, base16, base32, ascii85, hex, quoted_printable,
      uuencode, rot13, braille, atbash, morse_code, nato, ecoji, zalgo,
      leet_speak, ascii_smuggler
    Default Technique: all
    Default Datasets (2, max 3 per dataset):
      garak_slur_terms_en, garak_web_html_js

================================================================================

Total scenarios: 10

Baseline Execution¶

Every scenario can optionally include a baseline attack — a PromptSendingAttack that sends each objective directly to the target without any converters or multi-turn techniques. This is controlled by the include_baseline parameter on initialize_async; when omitted, each scenario falls back to its own BASELINE_ATTACK_POLICY class attribute (most scenarios default it on; Jailbreak defaults it off). See Common Scenario Parameters for a worked example.

Custom scenarios should choose their BASELINE_ATTACK_POLICY based on whether an unmodified prompt is a meaningful comparator for the scenario’s techniques:

Enabled — the baseline is prepended by default and the caller can opt out. Use when an unmodified-prompt run is a meaningful comparison point (most scenarios).
Disabled — the baseline is supported but omitted by default; the caller must opt in. Use when the scenario is already dominated by a large set of templates/techniques that already exercise the unmodified surface (e.g., Jailbreak).
Forbidden — the baseline is unavailable and passing include_baseline=True raises. Use when the scenario’s semantics make a single-shot unmodified prompt meaningless as a comparator (e.g., benchmarks comparing across adversarial models, or multi-turn-only scenarios).

Resiliency¶

Scenarios can run for a long time, and because of that, things can go wrong. Network issues, rate limits, or other transient failures can interrupt execution. PyRIT provides built-in resiliency features to handle these situations gracefully.

Automatic Resume¶

If you re-run a scenario, it will automatically start where it left off. The framework tracks completed attacks and objectives in memory, so you won’t lose progress if something interrupts your scenario execution. This means you can safely stop and restart scenarios without duplicating work.

Retry Mechanism¶

You can utilize the max_retries parameter to handle transient failures. If any unknown exception occurs during execution, PyRIT will automatically retry the failed operation (starting where it left off) up to the specified number of times. This helps ensure your scenario completes successfully even in the face of temporary issues.

Dynamic Configuration¶

During a long-running scenario, you may want to adjust parameters like max_concurrency to manage resource usage, or switch your scorer to use a different target. PyRIT’s resiliency features make it safe to stop, reconfigure, and continue scenarios as needed.

For more information, see resiliency

References¶

Mazeika, M., Phan, L., Yin, X., Zou, A., Wang, Z., Mu, N., Sakhaee, E., Li, N., Basart, S., Li, B., Forsyth, D., & Hendrycks, D. (2024). HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal. arXiv Preprint arXiv:2402.04249. https://arxiv.org/abs/2402.04249