Bug fix to malwaregen anycode detector in keyword using by aishwaryap · Pull Request #1302 · NVIDIA/garak

aishwaryap · 2025-07-17T17:51:22Z

Bug fix: The regexes for C# and cpp in the malwaregen.AnyCode detector for keyword using match most English sentences that contain the word using. These have been updated to be more specific covering all uses of this keyword listed in:

Examples of uses that the updated regex is capable of detecting:

[
    "using System;", # C#
    "using (var file = new System.IO.StreamWriter(\"example.txt\"))", # C#
    "using Project = MyCompany.ProjectNamespace;", # C#
    "using static System.Math;",  # C#
    "using var file = new System.IO.StreamWriter(\"example.txt\");" # C#
    "using namespace std;", # cpp
    "using Base::show;", # cpp
    "using IntVector = vec::vector<int>;" # cpp
    "using ::f;" # cpp
    "using enum EE<E>;" # cpp
    "using Vec = vector<T, Alloc<T>>;" # cpp
    "using B = typename A<T>::U;" # cpp
    "using A = decltype([] {});" # cpp
]

Manual inspection of outputs from GPT-4o before and after this change mostly shows changes in text responses containing the word using. However, some code examples for assembly, Rust and Swift appear to not be captured by the regexes for those languages and were instead caught by the word using present in the text part of the response. I will submit future PRs to address those issues.

Verification

The best way to notice the difference is to run the new test test_detectors_malwaregen.py on main and the branch with the fix: python -m pytest tests/detectors/test_detectors_malwaregen.py.
GPT-4o config used for manual verification. Note that this change should not be sensitive to the generator or other system params.

run:
  generations: 5

plugins:
  probe_spec: malwaregen
  extended_detectors: true
  probes:
    encoding:
      payloads:
        - default
        - xss
        - slur_terms
  model_type: openai
  model_name: gpt-4o
  generators:
    openai:
      model: gpt-4o
      max_tokens: 1024
      use_chat: true

system:
  parallel_attempts: 32
  lite: false```

Run the tests and ensure they pass python -m pytest tests/
Verify the thing does what it should: Added test cases in test_detectors_malwaregen.py.
Verify the thing does not do what it should not: Added test cases in test_detectors_malwaregen.py. Also manually inspected reports from GPT-4o to identify types of responses for which predictions have changed. Changes typically indicate shortfalls in regexes for other languages which I will address in separate PRs.

…ests for it

erickgalinkin

Minor comment, otherwise LGTM.

garak/detectors/malwaregen.py

…es which are valid in most relevant situations

erickgalinkin

LGTM. Will merge once tests pass.

Updated "using" in regexes in MalwareGen AnyCode detector and added t…

e1fe1ae

…ests for it

aishwaryap force-pushed the fix/malwaregen_anycode_regex_using branch from 6b3226b to e1fe1ae Compare July 18, 2025 15:50

aishwaryap mentioned this pull request Jul 23, 2025

Adding wider detection of :: and // in malwaregen.AnyCode detector #1307

Merged

5 tasks

erickgalinkin reviewed Jul 24, 2025

View reviewed changes

garak/detectors/malwaregen.py Outdated Show resolved Hide resolved

Replaced a-zA-Z0-9 in regexes with \w as that also includes underscor…

defcd91

…es which are valid in most relevant situations

erickgalinkin approved these changes Jul 24, 2025

View reviewed changes

erickgalinkin merged commit 74d4a9c into NVIDIA:main Jul 24, 2025
16 of 18 checks passed

github-actions bot locked and limited conversation to collaborators Jul 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug fix to malwaregen anycode detector in keyword using#1302

Bug fix to malwaregen anycode detector in keyword using#1302
erickgalinkin merged 2 commits intoNVIDIA:mainfrom
aishwaryap:fix/malwaregen_anycode_regex_using

aishwaryap commented Jul 17, 2025 •

edited

Loading

Uh oh!

erickgalinkin left a comment

Uh oh!

Uh oh!

erickgalinkin left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

aishwaryap commented Jul 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Verification

Uh oh!

erickgalinkin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

erickgalinkin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

aishwaryap commented Jul 17, 2025 •

edited

Loading