FEAT: Jailbreak Scenario #1329

ValbuenaVC · 2026-01-26T20:09:07Z

Description

Adds a jailbreak scenario to PyRIT that applies jailbreak templates to a set of test prompts and sends the resulting prompts to the target. Credit to @fdubut for developing the scenario.
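As a rough illustration of the "apply templates to test prompts" step, here is a minimal sketch that does not use PyRIT at all. The template names, the placeholder wrapper text, and the `apply_templates` helper are assumptions made for this example; the real scenario loads its templates from PyRIT's jailbreak dataset files.

```python
from itertools import product
from string import Template

# Hypothetical stand-ins for jailbreak templates. The wrapper text is a
# neutral placeholder, not a real template from PyRIT.
TEMPLATES = {
    "wrapper_a": Template("[WRAPPER A] $prompt [END A]"),
    "wrapper_b": Template("[WRAPPER B] $prompt"),
}


def apply_templates(templates, prompts):
    """Return one (template name, rendered prompt) pair per combination."""
    return [
        (name, template.substitute(prompt=prompt))
        for (name, template), prompt in product(templates.items(), prompts)
    ]


if __name__ == "__main__":
    for name, rendered in apply_templates(TEMPLATES, ["test prompt 1", "test prompt 2"]):
        print(name, "->", rendered)
```

Each rendered prompt would then be sent to the target as its own attack, so the number of requests grows as templates times prompts.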

Tests and Documentation

Adds test_jailbreak.py under the unit tests.

fdubut · 2026-01-26T23:46:37Z

Thanks @ValbuenaVC for picking this up! One improvement I had in mind was to create more strategies by running the different groups of jailbreaks we have in PyRIT. Right now I have only the one at the root of the directory, but we added quite a few more recently, and it would make sense to have one strategy per folder (and ALL to run them all).
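One way to picture "one strategy per folder, plus ALL" is the sketch below. It groups template paths by their top-level folder and treats a special value as "everything". The folder names, the `root` and `all` strategy labels, and both helper functions are hypothetical; they are not PyRIT's strategy classes.

```python
from collections import defaultdict
from pathlib import PurePosixPath

ROOT_STRATEGY = "root"  # hypothetical label for templates at the directory root
ALL_STRATEGY = "all"    # hypothetical label that selects every template


def group_templates_by_folder(relative_paths):
    """Map each top-level folder name to the template paths inside it."""
    groups = defaultdict(list)
    for raw in relative_paths:
        parts = PurePosixPath(raw).parts
        folder = parts[0] if len(parts) > 1 else ROOT_STRATEGY
        groups[folder].append(raw)
    return dict(groups)


def select_templates(relative_paths, strategy):
    """Return the templates for one folder strategy, or all of them."""
    if strategy == ALL_STRATEGY:
        return list(relative_paths)
    return group_templates_by_folder(relative_paths).get(strategy, [])


if __name__ == "__main__":
    paths = ["a.yaml", "folder_x/b.yaml", "folder_x/c.yaml", "folder_y/d.yaml"]
    print(group_templates_by_folder(paths))
    print(select_templates(paths, "folder_x"))
```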

fdubut

LGTM!

rlundeen2 · 2026-01-29T01:19:51Z

pyrit/datasets/seed_datasets/local/airt/jailbreak.prompt

@@ -0,0 +1,16 @@
+dataset_name: airt_jailbreak


Could we potentially have a more descriptive name? "Jailbreak" has a different meaning in PyRIT. Potentially "airt_jailbreak_scenario".

Or really "airt_harms.prompt" is also good

rlundeen2 · 2026-01-29T01:21:21Z

pyrit/scenario/scenarios/airt/jailbreak.py

+        # Will be resolved in _get_atomic_attacks_async
+        self._seed_groups: Optional[List[SeedAttackGroup]] = None
+
+    def _get_default_objective_scorer(self) -> TrueFalseScorer:


Not for this PR, but I wonder if we should just make _get_default_objective_scorer a non-abstract method on the base class.
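For context, the pattern being suggested could look like the sketch below: the base class supplies a concrete default, and subclasses override it only when they need something different. `ScenarioBase`, `DefaultScenario`, `CustomScenario`, and the string return values are placeholders invented for this example; they are not PyRIT's actual classes or scorers.

```python
from abc import ABC, abstractmethod


class ScenarioBase(ABC):
    """Hypothetical base class; only `name` must be implemented."""

    def _get_default_objective_scorer(self) -> str:
        # Concrete default, so subclasses are not forced to implement it.
        return "shared-default-scorer"

    @abstractmethod
    def name(self) -> str:
        ...


class DefaultScenario(ScenarioBase):
    """Relies on the inherited default scorer."""

    def name(self) -> str:
        return "default"


class CustomScenario(ScenarioBase):
    """Overrides the default only because it needs a different scorer."""

    def name(self) -> str:
        return "custom"

    def _get_default_objective_scorer(self) -> str:
        return "custom-scorer"
```

The trade-off is that a forgotten override no longer fails at class instantiation, so the shared default has to be a sensible choice for most scenarios.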

rlundeen2 · 2026-01-29T01:23:00Z

pyrit/scenario/scenarios/airt/jailbreak.py

+
+        return list(seed_groups)
+
+    def _get_all_jailbreak_templates(self) -> List[str]:


I recommend using/extending the TextJailBreak class instead of looking for the yaml directly.

I also wonder if the set of jailbreaks could be filtered further by the scenario strategy, so it's not necessarily always "all". It could be a random N, a subcategory, or some other selection.

This is probably important so we can have shorter or more targeted runs.
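A minimal sketch of that kind of filtering, assuming each template is described by a small dict with `name` and `category` keys (a made-up shape, not PyRIT's data model). The `filter_templates` helper and its parameters are likewise hypothetical.

```python
import random


def filter_templates(templates, *, category=None, sample_size=None, seed=None):
    """Narrow a template list by category, then optionally sample N of them."""
    selected = [
        t for t in templates if category is None or t["category"] == category
    ]
    if sample_size is not None and sample_size < len(selected):
        # A seeded generator keeps a "random N" run reproducible.
        selected = random.Random(seed).sample(selected, sample_size)
    return selected


if __name__ == "__main__":
    catalog = [
        {"name": "t1", "category": "group_a"},
        {"name": "t2", "category": "group_a"},
        {"name": "t3", "category": "group_b"},
        {"name": "t4", "category": "group_b"},
    ]
    print(filter_templates(catalog, category="group_a"))
    print(filter_templates(catalog, sample_size=2, seed=0))
```

Passing neither option returns the full list, which matches the current "all" behavior.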

rlundeen2 · 2026-01-29T01:25:25Z

pyrit/scenario/scenarios/airt/jailbreak.py

+        )
+
+        # Create the attack
+        attack = PromptSendingAttack(


(Not required) I wonder if we should offer an option to send each prompt multiple times.
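To make the idea concrete, here is a small sketch of an "attempts" option, written against a plain callable rather than PromptSendingAttack. The `send_repeatedly` helper, its `attempts` parameter, and the fake target are all assumptions for illustration and say nothing about how PyRIT would expose this.

```python
from typing import Callable, List


def send_repeatedly(send: Callable[[str], str], prompt: str, attempts: int = 1) -> List[str]:
    """Send the same prompt `attempts` times and collect every response."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    return [send(prompt) for _ in range(attempts)]


if __name__ == "__main__":
    calls = []

    def fake_target(prompt: str) -> str:
        # Stand-in for a real target; it just records and echoes the call.
        calls.append(prompt)
        return f"response {len(calls)} to {prompt!r}"

    for response in send_repeatedly(fake_target, "test prompt", attempts=3):
        print(response)
```

Because model responses are usually nondeterministic, collecting several responses per prompt lets a scorer judge whether any attempt succeeded, at the cost of proportionally more requests.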

Victor Valbuena added 2 commits January 26, 2026 20:06

Scaffolding

022f70a

Precommit

e85cdb9

Victor Valbuena added 5 commits January 27, 2026 00:46

fixtures and basic tests

fc260c3

basic tests

89a8079

basic tests

b18f224

last test

96ddf6c

jailbreak format test

eb4e936

ValbuenaVC marked this pull request as ready for review January 28, 2026 19:26

ValbuenaVC changed the title from "[DRAFT] FEAT: Jailbreak Scenario" to "FEAT: Jailbreak Scenario" Jan 28, 2026

Victor Valbuena and others added 4 commits January 28, 2026 21:01

sample jailbreak prompt

243ea0a

Merge branch 'main' into jailbreak

946fdde

real jailbreaks added

132caf5

Merge branch 'main' into jailbreak

c4e625f

fdubut approved these changes Jan 29, 2026


rlundeen2 reviewed Jan 29, 2026


rlundeen2 self-assigned this Jan 29, 2026

3 participants

