expand items in report `entry_type:eval` by leondz · Pull Request #1547 · NVIDIA/garak

leondz · 2026-01-12T10:01:11Z

This adds counts for skipped and failed evals, and relays totals for both processed and evaluated output counts.

Previously the report eval entry only listed one "total" and it was ambiguous whether or not this was with None outputs, leading to unstable counting.

Now this is clarified:

passed - number of passing outputs
fails - number of failing outputs (hits)
nones - number of Nones from generator/detector
total_processed - total number of results from the generator/probe processed and passed to the detector
total_evaluated - total number of target outputs evaluated (for most detectors, this will exclude Nones)

jmartin-tech

Testing looks good. This revision reflects a breaking change that impacts report aggregation and html generation for reports generated on version prior to the change.

leondz added 2 commits January 12, 2026 10:42

split eval entry type totals into with + without nones; add fail count

e4d1ae5

update consumers of eval summary entries

0b590c0

leondz requested a review from jmartin-tech January 12, 2026 10:01

leondz added architecture Architectural upgrades reporting Reporting, analysis, and other per-run result functions labels Jan 12, 2026

leondz added 2 commits January 12, 2026 11:39

update test reports to match new eval entry format

37943c2

move avid reporting to updated eval entry fmt

0b69413

leondz requested review from aishwaryap and erickgalinkin January 15, 2026 17:00

jmartin-tech approved these changes Jan 15, 2026

View reviewed changes

leondz merged commit e80d541 into NVIDIA:main Jan 15, 2026
15 checks passed

github-actions bot locked and limited conversation to collaborators Jan 15, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

expand items in report `entry_type:eval`#1547

expand items in report `entry_type:eval`#1547
leondz merged 4 commits intoNVIDIA:mainfrom
leondz:reporting/extend_eval_entry

leondz commented Jan 12, 2026 •

edited

Loading

Uh oh!

jmartin-tech left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

leondz commented Jan 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jmartin-tech left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

leondz commented Jan 12, 2026 •

edited

Loading