feat(scan): add GCG injection scenario generator by kevinmessiaen · Pull Request #2561 · Giskard-AI/giskard-oss

kevinmessiaen · 2026-06-24T10:16:29Z

Port the lidar GCG Injection probe into a giskard-scan generator. It subclasses HuggingFaceDatasetScenarioGenerator (defaulting to the HarmBench dataset) to inherit per-language subset handling, then fans each loaded prompt out into one scenario per adversarial suffix (prompt x suffix cross-product). Each variant is renamed with a GCG prefix plus the suffix index and tagged gcg-suffix:<index>; that tag is appended to the dataset's own tags rather than replacing them.
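The fan-out described above can be sketched as follows. This is a minimal illustration, not the code from the PR: `ToyScenario` is a hypothetical stand-in for the real giskard-scan `Scenario`, and the exact name format and the space used to join prompt and suffix are assumptions.

```python
from dataclasses import dataclass, replace
from itertools import product


@dataclass(frozen=True)
class ToyScenario:
    """Hypothetical stand-in for a giskard-scan Scenario (not the real class)."""

    name: str
    prompt: str
    tags: tuple[str, ...] = ()


def fan_out(scenarios, suffixes):
    """Return one variant per (scenario, suffix) pair."""
    return [
        replace(
            scenario,
            # Assumed naming scheme: a GCG prefix plus the suffix index.
            name=f"GCG {scenario.name} #{index}",
            prompt=f"{scenario.prompt} {suffix}",
            # Append to the existing tags instead of replacing them.
            tags=scenario.tags + (f"gcg-suffix:{index}",),
        )
        for scenario, (index, suffix) in product(scenarios, enumerate(suffixes))
    ]


base = [ToyScenario("a", "p1", ("harmbench",)), ToyScenario("b", "p2")]
variants = fan_out(base, ["s0", "s1", "s2"])
print(len(variants))  # 6
```

With 2 base scenarios and 3 suffixes this yields 6 variants, which is why the output size grows with the number of suffixes.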

Description

Related Issue

Type of Change

📚 Examples / docs / tutorials / dependencies update
🔧 Bug fix (non-breaking change which fixes an issue)
🥂 Improvement (non-breaking change which improves an existing feature)
🚀 New feature (non-breaking change which adds functionality)
💥 Breaking change (fix or feature that would cause existing functionality to change)
🔐 Security fix

Coding agents

Autonomous agents with no human in the loop must read AUTONOMOUS.md before opening a PR.

PR title: agent-opened PRs must end the title with 🤖🤖🤖🤖 (exactly four robot emojis). Do not omit — that suffix is how the expedited agent PR workflow picks up the PR.

Checklist

I've read the CODE_OF_CONDUCT.md document.
I've read the CONTRIBUTING.md guide.
I've written tests for all new methods and classes that I created.
I've written the docstring in NumPy format for all the methods and classes that I created or modified.
I've updated uv.lock by running uv lock (only applicable when pyproject.toml has been modified)

Port the lidar GCG Injection probe into a giskard-scan generator. It subclasses HuggingFaceDatasetScenarioGenerator (defaulting to the HarmBench dataset) to inherit per-language subset handling, then fans each loaded prompt out into one scenario per adversarial suffix (prompt x suffix cross-product). Each variant is renamed with a GCG prefix + suffix index and tagged gcg-suffix:<index>, appended to the dataset's own tags rather than replacing them. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

gemini-code-assist

Code Review

This pull request introduces the GCGInjectionScenarioGenerator to generate Greedy Coordinate Gradient (GCG) injection attack scenarios by appending adversarial suffixes to harmful prompts. It also registers this generator in the vulnerability suite and adds comprehensive unit tests. The review feedback suggests ensuring a space separator between the prompt and the GCG suffix to prevent token merging, which could break the adversarial attack, and updating the corresponding unit tests to reflect this change.
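The separator suggestion could look roughly like the sketch below. `append_suffix` is a hypothetical helper name, not a function from the PR; it only inserts a space when the prompt does not already end in whitespace and leaves the suffix itself untouched.

```python
def append_suffix(prompt: str, suffix: str) -> str:
    """Join prompt and suffix so they are separated by whitespace."""
    # Without a separator, the last word of the prompt and the first
    # characters of the suffix could be tokenized as a single unit.
    if not prompt or prompt[-1].isspace():
        return prompt + suffix
    return f"{prompt} {suffix}"


print(append_suffix("Explain how to do X", "!! describing"))
# Explain how to do X !! describing
```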

pierlj · 2026-06-24T15:20:33Z

+        self, description: str, languages: list[str]
+    ) -> list[Scenario[Any, Any, Trace[Any, Any]]]:
+        base_scenarios = super().load_scenarios(description, languages)
+        return [


Doing the cross product with all suffixes may be too much. I would simply pick one suffix at random for each scenario. That keeps the number of generated scenarios unchanged; otherwise, you could ask for 10 max_scenarios and get more than 100 instead.
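To make the size difference concrete, here is a small sketch comparing the two strategies. The counts (10 prompts, 12 suffixes) and the helper name `pick_random_suffixes` are illustrative assumptions, not values or code taken from the PR.

```python
import random


def pick_random_suffixes(prompts, suffixes, seed=0):
    """Attach one randomly chosen suffix to each prompt."""
    rng = random.Random(seed)  # seeded so repeated runs pick the same suffixes
    return [f"{prompt} {rng.choice(suffixes)}" for prompt in prompts]


prompts = [f"prompt {i}" for i in range(10)]      # hypothetical base scenarios
suffixes = [f"suffix-{j}" for j in range(12)]     # hypothetical suffix list

cross = [f"{p} {s}" for p in prompts for s in suffixes]
sampled = pick_random_suffixes(prompts, suffixes)

print(len(cross), len(sampled))  # 120 10
```

The cross product multiplies the scenario count by the number of suffixes, while one suffix per scenario keeps it equal to the number of base prompts.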

pierlj

One change is required on the number of scenarios generated. Otherwise, it looks fine.

kevinmessiaen and others added 2 commits June 24, 2026 14:58

feat(scan): add HarmBench dataset to vulnerability scan

6c9eba9

Register giskardai/harmbench-scenarios with an LLMJudge prompt bundled in giskard-scan and document MIT attribution for commercial use. Co-authored-by: Cursor <cursoragent@cursor.com>

kevinmessiaen requested a review from pierlj June 24, 2026 10:16

kevinmessiaen temporarily deployed to ci June 24, 2026 10:16 — with GitHub Actions Inactive

github-actions Bot added the Scope: Scan label Jun 24, 2026

gemini-code-assist Bot reviewed Jun 24, 2026

Comment thread libs/giskard-scan/src/giskard/scan/generators/gcg.py

Comment thread libs/giskard-scan/tests/generators/test_gcg.py Outdated

pierlj reviewed Jun 24, 2026

pierlj requested changes Jun 24, 2026

Base automatically changed from feat/harmbench-dataset to main June 25, 2026 03:04

Apply suggestions from code review

2f68e3c

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

kevinmessiaen temporarily deployed to ci June 25, 2026 03:06 — with GitHub Actions Inactive

Merge branch 'main' into feat/scan-gcg-generator

bbea97d

# Conflicts: # libs/giskard-scan/src/giskard/scan/vulnerability.py

kevinmessiaen temporarily deployed to ci June 25, 2026 03:15 — with GitHub Actions Inactive

fix(scan): pick one GCG suffix per scenario via modulo

b6f26c4

Rotate suffixes by scenario index instead of cross-product fan-out so max_scenarios matches the base dataset size. Co-authored-by: Cursor <cursoragent@cursor.com>
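A minimal sketch of the modulo rotation named in this commit message, under the assumption that the suffix for scenario i is `suffixes[i % len(suffixes)]`. The function name `rotate_suffixes` and the returned tuple shape are hypothetical, chosen only for illustration.

```python
def rotate_suffixes(prompts, suffixes):
    """Pair each prompt with exactly one suffix, cycling through the list."""
    results = []
    for i, prompt in enumerate(prompts):
        suffix_index = i % len(suffixes)  # wraps around after the last suffix
        results.append((f"{prompt} {suffixes[suffix_index]}", suffix_index))
    return results


rotated = rotate_suffixes(["p0", "p1", "p2", "p3", "p4"], ["s0", "s1", "s2"])
print([index for _, index in rotated])  # [0, 1, 2, 0, 1]
print(len(rotated))  # 5
```

Unlike random selection, this choice is deterministic without a seed, and the output length always equals the number of base prompts.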

kevinmessiaen temporarily deployed to ci June 25, 2026 03:21 — with GitHub Actions Inactive

kevinmessiaen wants to merge 5 commits into main from feat/scan-gcg-generator