task: remove fschat #1567
Merged
23 commits
1e9e6d8 remove fschat requirements for red team `Judge` (jmartin-tech)
6731f42 Refactor gcg to eliminate fschat dependency. Cull dead code. (erickgalinkin)
167085b Functional GCG implementation sans fschat (erickgalinkin)
ee09560 Remove fschat dependency from AutoDAN (erickgalinkin)
767b418 SPDX Sprinkles (erickgalinkin)
6a0dc95 Surface AutoDAN and GCG options in probe. (erickgalinkin)
c516ddf Merge branch 'main' into fix/remove-fschat (erickgalinkin)
fbecc88 Added type hint to GCGAttack.run; fixed return value in generate_gcg.py. (erickgalinkin)
b27ac9b Fix `self.generator` bug in `suffix.GCG` probe. (erickgalinkin)
73cc774 Update garak/probes/dan.py (erickgalinkin)
c88ae5a Update garak/probes/suffix.py (erickgalinkin)
d0bca64 Make GCG probe have easy way to configure goal and target. Have GCG r… (erickgalinkin)
5eb3fdd Update garak/resources/autodan/autodan.py (erickgalinkin)
caa9e8f Update garak/resources/autodan/genetic.py (erickgalinkin)
c1e43fe Various fixes from PR comments. (erickgalinkin)
5804f6e Merge remote-tracking branch 'origin/fix/remove-fschat' into fix/remo… (erickgalinkin)
fe2180e Pull `system_prompt` from probe instead of config. (erickgalinkin)
4d770f1 `try/catch` writing AutoDAN and GCG prompts. (erickgalinkin)
ebb6fc2 Remove fschat from TAP -- implementation not yet working. (erickgalinkin)
9a286ce Improve json extraction for TAP. (erickgalinkin)
893d037 Do a better job returning valid lists from `get_first_k`. (erickgalinkin)
180a9f0 Add improved guards against weirdness in `get_first_k` and `extract_j… (erickgalinkin)
ad0d3f8 Warn when `width` and `attack_max_attempts` are less than three -- th… (erickgalinkin)
For a future revision: this probe takes two model configs, so the attack model parameters should likely be renamed to consolidate on
`red_team_model_*` params for consistency with other probes.
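
One way such a rename could be staged without breaking existing configs is sketched below. This is only an illustration: the `attack_model_` prefix, the `consolidate_params` helper, and the example keys are all hypothetical and are not taken from garak; only the `red_team_model_` prefix comes from the comment above.

```python
import warnings

OLD_PREFIX = "attack_model_"    # hypothetical: assumed prefix of the current params
NEW_PREFIX = "red_team_model_"  # prefix suggested in the review comment


def consolidate_params(params: dict) -> dict:
    """Return a copy of params with OLD_PREFIX keys renamed to NEW_PREFIX.

    If both the old-style and new-style key are present, the new-style
    value is kept and the old-style key is dropped.
    """
    renamed = {}
    for key, value in params.items():
        if key.startswith(OLD_PREFIX):
            new_key = NEW_PREFIX + key[len(OLD_PREFIX):]
            if new_key in params:
                # An explicit new-style key takes precedence.
                continue
            warnings.warn(
                f"{key} is deprecated; use {new_key}",
                DeprecationWarning,
                stacklevel=2,
            )
            renamed[new_key] = value
        else:
            renamed[key] = value
    return renamed
```

Accepting the old names with a deprecation warning would let existing probe configs keep working while the consolidated names are adopted.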