FEAT Beam Search for OpenAIResponseTarget #1346

Status: Open. riedgar-ms wants to merge 150 commits into Azure:main from riedgar-ms:riedgar-ms/beam-search-01.
Conversation

riedgar-ms (Contributor) commented Feb 2, 2026

Description

Use the Lark grammar feature of the OpenAIResponseTarget to create a beam search for PyRIT. This is a single turn attack, where a collection of candidate responses (the beams) are maintained. On each iteration, the model's response is allowed to extend a little for each beam. The beams are scored, with the worst performing ones discarded, and replaced with copies of higher scoring beams.
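In outline, the inner loop described above (extend each beam a little, score, prune, refill) can be sketched as follows. This is a hypothetical illustration, not the PR's actual implementation: `Beam`, `extend`, `score`, and `keep_fraction` are stand-in names.

```python
import copy
import random
from dataclasses import dataclass


@dataclass
class Beam:
    text: str
    score: float


def beam_search_step(beams, extend, score, keep_fraction=0.5):
    # Extend each candidate response a little, then score it.
    for beam in beams:
        beam.text = extend(beam.text)
        beam.score = score(beam.text)
    # Keep the best-scoring beams...
    beams.sort(key=lambda b: b.score, reverse=True)
    n_keep = max(1, int(len(beams) * keep_fraction))
    survivors = beams[:n_keep]
    # ...and refill the pool with copies of the survivors,
    # so the worst performers are discarded and replaced.
    while len(survivors) < len(beams):
        survivors.append(copy.deepcopy(random.choice(survivors[:n_keep])))
    return survivors
```

In the real attack the extension step is a model call constrained by a Lark grammar, and scoring is done by a PyRIT scorer; the sketch only shows the prune-and-copy control flow.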

Tests and Documentation

Basic unit tests of the new classes have been added, but since the attack depends on features currently available only in the OpenAIResponseTarget, there seemed little point in mocking that target. A notebook runs everything end-to-end.

Copilot AI review requested due to automatic review settings March 4, 2026 15:59
Copilot AI (Contributor) left a comment

Pull request overview

Copilot reviewed 9 out of 9 changed files in this pull request and generated no new comments.



Copilot AI review requested due to automatic review settings March 6, 2026 13:21
Copilot AI (Contributor) left a comment

Pull request overview

Copilot reviewed 9 out of 9 changed files in this pull request and generated 1 comment.



Comment on lines +203 to +206
Beam
BeamReviewer
BeamSearchAttack
TopKBeamReviewer
Copilot AI commented Mar 6, 2026

The four new entries (Beam, BeamReviewer, BeamSearchAttack, TopKBeamReviewer) are appended at the end of the alphabetically ordered list (after TreeOfAttacksWithPruningAttack), breaking the alphabetical ordering of the existing entries. They should be inserted at their alphabetically correct positions: Beam and BeamReviewer between AttackStrategy and ChunkedRequestAttack, and BeamSearchAttack and TopKBeamReviewer at their respective positions.

Contributor commented:

Looks like there's an error in the output?

riedgar-ms (Contributor, Author) replied:

It's a warning, but something went wrong on one of the API calls (most likely a grammar had an incorrectly escaped character, although I'm having a hard time running that down). Would you prefer something like that be logged as info rather than warning?

# Log the attack configuration
self._logger.info(f"Starting {self.__class__.__name__} with objective: {context.objective}")

beams = [Beam(id=context.conversation_id, text="", score=0.0) for _ in range(self._num_beams)]
Contributor commented:

Are we using the same conversation ID for all of them? It feels like we should create a fresh one per beam / per iteration.

riedgar-ms (Contributor, Author) replied:

They don't share an ID (I think that would be a key violation on the database); it's actually the opposite extreme, in that every time a beam updates it gets a fresh conversation id (hence my concerns above about whether I'm using the database correctly). That said, how this happens (when _propagate_beam_async() re-calls _setup_async()) is perhaps too well hidden. I can add comments, but would first like to straighten out the higher-level question of how I'm spawning new conversations with abandon.
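The ID-per-update behaviour described above can be illustrated with a minimal sketch. `BeamState` and `propagate` are hypothetical names chosen for illustration, not the PR's actual Beam class or `_propagate_beam_async()` method:

```python
import uuid


class BeamState:
    """Tracks the conversation id for one beam (illustrative only)."""

    def __init__(self, text=""):
        self.text = text
        self.conversation_id = str(uuid.uuid4())

    def propagate(self, new_text):
        # Each extension starts a brand-new conversation, mirroring how,
        # per the comment above, every beam update gets a fresh
        # conversation id rather than reusing the old one.
        self.conversation_id = str(uuid.uuid4())
        self.text = new_text
```

Giving each update a distinct conversation id avoids uniqueness violations in the database, at the cost of spawning many short conversations, which is the trade-off being questioned in this thread.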

"tool_choice": "required",
}

return self._objective_target.fresh_instance(extra_body_parameters=ebp, grammar_name=str(grammar_tool["name"]))
romanlutz (Contributor) commented Mar 7, 2026

The fact that our target is static in its configuration and doesn't allow for custom grammars to be fed in per request is rather annoying here...

The problem is that even if we managed to allow per-request overrides, the target identifier wouldn't reflect those unless we took care of that as well (which is doable). Perhaps fresh instances are the only way as it stands. I'm getting a whole new level of appreciation for why the openai client separates such parameters out into the send call rather than the constructor.

CC @rlundeen2

riedgar-ms (Contributor, Author) replied:

I'm somewhat frustrated that OpenAI made grammars a tool rather than a response_format (like a JSON schema).

In this specific case, there's certainly a reason for always keeping the same tools (to avoid thrashing the KV cache), but it does complicate matters for this scenario.
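For context on the grammar-as-a-tool complaint, the request body being assembled around the quoted `fresh_instance` call might look roughly like this. The exact field names reflect my understanding of the OpenAI Responses API custom-tool grammar format and should be checked against the current API reference; `beam_grammar` and the toy Lark grammar are made up for illustration:

```python
# A custom tool whose output is constrained by a Lark grammar
# (assumed Responses API shape; verify against the OpenAI docs).
grammar_tool = {
    "type": "custom",
    "name": "beam_grammar",
    "description": "Constrain the next beam extension.",
    "format": {
        "type": "grammar",
        "syntax": "lark",
        "definition": r'start: "yes" | "no"',
    },
}

# Extra body parameters forcing the model to use the grammar tool,
# as in the snippet quoted above.
body = {
    "tools": [grammar_tool],
    "tool_choice": "required",
}
```

Because the grammar lives in the tool list rather than in a per-request response_format, changing the grammar means changing the target's configuration, which is why the code above falls back to creating a fresh target instance per grammar.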
