
bugfix: reduce latent optimisation permutation explosion#1181

Merged
jmartin-tech merged 5 commits into NVIDIA:main from leondz:update/latentinjection_perms
Apr 29, 2025
Conversation

@leondz
Collaborator

@leondz leondz commented Apr 25, 2025

requires #1152
resolves #1161

Create snippets by sampling and deduplicating with a set, instead of sampling from the full list of permutations.

Verification

List the steps needed to make sure this thing works

  • garak -m test -p latentinjection.LatentWhoisSnippet,latentinjection.LatentWhoisSnippetFull
  • garak -m test -p latentinjection.LatentInjectionFactSnippetEiffel,latentinjection.LatentInjectionFactSnippetLegal
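The sampling-with-a-set approach described above can be sketched as follows (a minimal illustration only, not the actual garak code; the function name and parameters are hypothetical):

```python
import random

def sample_unique_contexts(snippets, n_contexts, k, max_attempts=1000, seed=None):
    # Repeatedly draw a random ordering of k snippets; set membership
    # silently drops duplicates, so we never materialise the full
    # permutation space up front.
    rng = random.Random(seed)
    seen = set()
    attempts = 0
    while len(seen) < n_contexts and attempts < max_attempts:
        attempts += 1
        seen.add(tuple(rng.sample(snippets, k)))  # one random ordering
    return sorted(seen)  # sorted for a reproducible return order
```

The point of the `max_attempts` guard is that when the requested count approaches the number of distinct orderings, pure rejection sampling can stall; bounding the loop keeps the probe's setup time predictable.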

@leondz leondz changed the title Update/latentinjection perms bugfix: reduce latentoptimisation permutation explosion Apr 25, 2025
@leondz leondz marked this pull request as ready for review April 25, 2025 11:36
@leondz leondz requested a review from erickgalinkin April 25, 2025 11:37
@leondz leondz mentioned this pull request Apr 26, 2025
4 tasks
@jmartin-tech jmartin-tech force-pushed the update/latentinjection_perms branch from 892ef0b to e7b2fcf Compare April 28, 2025 17:21

@jmartin-tech jmartin-tech left a comment

Testing with single generation reports expected counts:

latentinjection.LatentInjectionFactSnippetEiffel                            base.TriggerListDetector: PASS  ok on  256/ 256
latentinjection.LatentInjectionFactSnippetLegal                             base.TriggerListDetector: PASS  ok on  256/ 256
latentinjection.LatentWhoisSnippet                                          base.TriggerListDetector: PASS  ok on  256/ 256
latentinjection.LatentWhoisSnippetFull                                      base.TriggerListDetector: PASS  ok on  640/ 640

self.contexts = set()  # goal: a set of n whois reports, with an injection marker in one of them
max_context_count = min(ceil(sqrt(self.soft_probe_prompt_cap)), 100)

This still seems odd: what does soft_probe_prompt_cap have to do with the number of valid contexts?

Not a blocker, just seems like an odd hidden side-effect for soft_probe_prompt_cap.
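One plausible reading of the `ceil(sqrt(...))` bound (an assumption about intent, not something stated in the diff): if the total prompt count grows roughly quadratically in the number of contexts, then capping contexts at the square root of the cap keeps prompt counts near `soft_probe_prompt_cap`. A sketch, with a hypothetical helper name:

```python
from math import ceil, sqrt

def bounded_context_count(soft_probe_prompt_cap, hard_limit=100):
    # If total prompts grow roughly as contexts^2 (each context crossed
    # with roughly context-many injection variants), then capping the
    # context count at ceil(sqrt(cap)) keeps contexts^2 near the soft
    # cap; the hard_limit of 100 mirrors the min(..., 100) in the diff.
    return min(ceil(sqrt(soft_probe_prompt_cap)), hard_limit)
```

Under this reading the coupling is not a side-effect so much as a budget: the cap is being apportioned across the two multiplicative factors.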

@jmartin-tech
Collaborator

Noted impact of setting soft_probe_prompt_cap > 16k:

latentinjection.LatentWhoisSnippet                                          base.TriggerListDetector: PASS  ok on 4000/4000
latentinjection.LatentWhoisSnippetFull                                      base.TriggerListDetector: PASS  ok on 4000/4000

@jmartin-tech
Collaborator

Testing surfaced some concerns during local model execution; I am not sure this is due to the change here, however:

probes.latentinjection.LatentWhois:   7%|███▊                                                 | 12/168 [00:08<01:29,  1.75it/s]This is a friendly reminder - the current text generation call will exceed the model's predefined maximum length (1024). Depending on the model, you may observe exceptions, performance degradation, or nothing at all.
/pytorch/aten/src/ATen/native/cuda/Indexing.cu:1369: indexSelectSmallIndex: block: [3,0,0], thread: [96,0,0] Assertion `srcIndex < srcSelectDimSize` failed.
/pytorch/aten/src/ATen/native/cuda/Indexing.cu:1369: indexSelectSmallIndex: block: [3,0,0], thread: [97,0,0] Assertion `srcIndex < srcSelectDimSize` failed.

Further investigation in progress.

@jmartin-tech
Collaborator

Issue is confirmed to exist in released v0.10.3.1 when targeting gpt2:

python -m garak -m huggingface -n gpt2 -p latentinjection.LatentWhois -g 1
garak LLM vulnerability scanner v0.10.3.1 ( https://github.com/NVIDIA/garak ) at 2025-04-29T16:28:14.400861
📜 logging to /home/testing/.local/share/garak/garak.log
🦜 loading generator: Hugging Face 🤗 pipeline: gpt2
Device set to use cuda
📜 reporting to /home/testing/.local/share/garak/garak_runs/garak.a4d3d7a7-8c36-4b16-8247-4f19ac32a4d0.report.jsonl
🕵️  queue of probes: latentinjection.LatentWhois
probes.latentinjection.LatentWhois:  14%|███████▊                                               | 4/28 [00:00<00:04,  4.82it/s]This is a friendly reminder - the current text generation call will exceed the model's predefined maximum length (1024). Depending on the model, you may observe exceptions, performance degradation, or nothing at all.
/pytorch/aten/src/ATen/native/cuda/Indexing.cu:1369: indexSelectSmallIndex: block: [3,0,0], thread: [96,0,0] Assertion `srcIndex < srcSelectDimSize` failed.
/pytorch/aten/src/ATen/native/cuda/Indexing.cu:1369: indexSelectSmallIndex: block: [3,0,0], thread: [97,0,0] Assertion `srcIndex < srcSelectDimSize` failed.

Just noting findings, this will not block merge.

@jmartin-tech jmartin-tech changed the title bugfix: reduce latentoptimisation permutation explosion bugfix: reduce latent optimisation permutation explosion Apr 29, 2025
@jmartin-tech jmartin-tech merged commit 316ded9 into NVIDIA:main Apr 29, 2025
9 checks passed
@github-actions github-actions bot locked and limited conversation to collaborators Apr 29, 2025

Development

Successfully merging this pull request may close these issues.

bug: latentinjection.FactSnippetMixin can give dupe environments

2 participants