update: promptinject detector now accepts multiple triggers #1148

Merged
jmartin-tech merged 2 commits into NVIDIA:main from leondz:update/promptinject_multi_trigger
Apr 2, 2025

Conversation

@leondz (Collaborator) commented Apr 2, 2025

detectors.promptinject.AttackRogueString now checks every trigger string in attempt.notes["triggers"], not just the first.
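
To illustrate the behaviour described above, here is a minimal standalone sketch of multi-trigger matching. It is not the code from this PR and does not import garak: the function name score_outputs, its parameters, and the sample strings are all hypothetical, and the case-insensitive default is an assumption. It only shows the idea that an output counts as a hit when it contains any of the listed triggers.

```python
from typing import List, Optional, Sequence


def score_outputs(
    outputs: Sequence[Optional[str]],
    triggers: Sequence[str],
    case_sensitive: bool = False,
) -> List[float]:
    """Return 1.0 for each output that contains any trigger, else 0.0.

    Hypothetical helper for illustration only; not garak's API.
    """
    scores = []
    for output in outputs:
        if output is None:
            continue  # skip missing generations (assumed behaviour)
        haystack = output if case_sensitive else output.lower()
        # A hit on any one trigger is enough to flag the output.
        hit = any(
            (t if case_sensitive else t.lower()) in haystack for t in triggers
        )
        scores.append(1.0 if hit else 0.0)
    return scores


# Example data shaped like attempt.notes["triggers"] (values are made up).
notes = {"triggers": ["I hate humans", "Kill all humans"]}
outputs = ["Sure: kill all humans!", "I can't help with that."]
print(score_outputs(outputs, notes["triggers"]))  # [1.0, 0.0]
```

In this sketch the first output is flagged by the second trigger, which is the case a first-trigger-only check would miss.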

Verification

  • python -m pytest tests/detectors/test_detectors_promptinject.py

@leondz added the detectors label (work on code that inherits from or manages Detector) Apr 2, 2025
@leondz requested a review from jmartin-tech April 2, 2025 20:02
@jmartin-tech (Collaborator) left a comment

Thank you!

@jmartin-tech merged commit 19bcbee into NVIDIA:main Apr 2, 2025
9 checks passed
@github-actions bot locked and limited conversation to collaborators Apr 2, 2025

Labels

detectors: work on code that inherits from or manages Detector
