How do you ensure the adversarial validator actually finds gaps?

The validator's system prompt explicitly forbids approval language and requires finding a minimum number of gaps. Key techniques: state 'You MUST identify at least 5 missing items', forbid phrases like 'looks comprehensive', and require checking against specific frameworks (OWASP, CWE, STRIDE).

Orchestrating Subagents for AI Security Reviews: A Complete Implementation Guide

Q: How do subagents communicate with the main agent in Claude Code?

The main agent delegates tasks to subagents using the Task tool. Communication happens through: (1) the task prompt from main agent to subagent, (2) shared files that both can read, and (3) the result returned from subagent to main agent. Subagents cannot directly communicate with each other.

Q: Can subagents run in parallel for faster security reviews?

Yes. Multiple subagents can run concurrently, which is particularly useful for security scanning. You could run input-validation-scanner, authentication-scanner, and authorization-scanner simultaneously, each checking different categories from your checklist.

Q: How does the sast-triage subagent work with existing SAST tools?

Run your SAST tool (Semgrep, SonarQube, CodeQL) first to generate findings. Then use the sast-triage subagent to classify each finding by tracing data flows and evaluating context. This typically reduces actionable findings by 30-70% by filtering false positives, while documenting the reasoning for each classification.

1From Monolithic Agent to Orchestrated Team

In Part 2, we established that effective AI security reviews require self-scaffolding workflows: generating checklists, creating execution plans, documenting findings, and self-auditing completeness. But running all of this through a single agent conversation has limitations:

Context pollution — Deep code exploration fills the context window, pushing out important workflow state
No enforced constraints — Nothing prevents the "read-only scanner" from accidentally modifying files
Serial execution — Each checklist item must complete before the next begins
Inconsistent behavior — The adversarial validator might become agreeable mid-conversation

Subagents solve these problems. Claude Code's subagent feature lets you create specialized AI assistants that run in isolated context windows with enforced tool restrictions and custom system prompts. This document shows you how to create and orchestrate subagents for security reviews.

The series Introduction identified sixteen failure modes that affect agentic workflows. The sub-agent architecture presented here directly addresses several of them: role confusion is solved by the seven archetypes with enforced tool restrictions, behavioral drift is structurally mitigated by isolating critical functions in dedicated sub-agents with hard constraints rather than relying on a single long-running conversation, and the separation of scanning, validation, and auditing into distinct contexts partially addresses scope creep and coordination divergence through structural boundaries.

Part 2: Workflow

→

Part 3: Subagents

→

Part 4: Remediation

The review workflow produces findings.md—a document of identified vulnerabilities. Part 4 extends the workflow to actually fix those issues with remediation subagents.

2Why Subagents Improve Reviews

Subagents aren't just a convenience feature—they fundamentally change what's possible in automated security reviews:

Capability	Single Agent	With Subagents
Tool Restrictions	Must trust agent to follow instructions	Scanner physically cannot write files
Context Management	All exploration fills main context	Exploration stays in subagent context
Parallel Execution	Sequential only	Multiple scanners run simultaneously
Behavioral Consistency	Drift over long conversations	Fresh system prompt each invocation
Cost Control	One model for everything	Haiku for scanning, Sonnet for analysis
Adversarial Validation	Hard to maintain hostility	Validator's prompt enforces gap-finding

Key Insight

The validator subagent's system prompt contains explicit instructions like "You MUST identify at least 5 missing items" and "You are FORBIDDEN from saying 'looks comprehensive'". Because this prompt is applied fresh at each invocation, the validator can't drift toward agreeableness the way a single long-running agent might.

3The Five Review Subagents

The review phase uses five specialized subagents. Four handle the core review workflow, plus an optional triage agent for processing SAST output:

🔍

security-scanner

Model: claude-haiku • Read-only

Fast, thorough code exploration. Searches for vulnerability patterns, traces data flows, and reports findings with exact file locations. Cannot modify any files.

Read Grep Glob Bash (read only)

View Full Subagent Definition

---
name: security-scanner
description: "Use PROACTIVELY to scan codebase for security vulnerabilities. 
Read-only exploration of code patterns, dependencies, and configurations."
tools: Read, Grep, Glob, Bash
model: haiku
---
You are a security-focused code scanner. Your job is to systematically 
search for security-relevant patterns WITHOUT making any changes.

When given a checklist item, you:
1. Search for relevant code patterns using grep/glob
2. Trace data flows from input to sensitive operations
3. Document file locations and line numbers precisely
4. Return structured findings with evidence snippets

You NEVER modify files. You ONLY read and report.

For each pattern found, report:
- File path and line number
- Code snippet (5 lines of context)
- Why this is relevant to the checklist item
- Initial risk assessment (needs-review/likely-safe/likely-vulnerable)

When scanning for a category, be thorough:
- Check all file types that could contain relevant code
- Search for variations and aliases of patterns
- Follow imports to find actual implementations
- Note configuration files that affect behavior

⚔️

checklist-validator

Model: claude-sonnet • Read-only • Adversarial

Hostile reviewer that finds gaps in checklists. Explicitly prompted to find problems, not confirm completeness. Must identify missing items before approval.

Read

View Full Subagent Definition

---
name: checklist-validator
description: "MUST BE USED after generating a checklist. Adversarial 
reviewer that finds gaps and missing items. Will not approve without 
finding issues."
tools: Read
model: sonnet
---
You are a hostile security reviewer. Your job is to find gaps in checklists.

CRITICAL RULES:
1. You MUST identify at least 5 missing items or categories
2. You are FORBIDDEN from saying "looks comprehensive" or "good coverage"
3. Check against: OWASP Top 10, CWE Top 25, STRIDE, ASVS
4. Consider technology-specific issues for the stated stack
5. Think about the specific threat model mentioned

For each gap found, provide:
- Category it belongs to
- Item ID (use format [VALIDATION-XX])
- Full item in the same verbosity format as existing items
- Why this was likely missed (common blind spot? stack-specific?)

After listing gaps, rate the original checklist:
- Coverage score (1-10)
- Most critical missing category
- Recommendations for improvement

If you genuinely cannot find 5 gaps after checking all frameworks, you 
must explicitly state: "I checked [list frameworks] and found only [N] 
gaps because [specific reason]." This should be extremely rare.

📝

findings-writer

Model: claude-sonnet • Read + Write

Documentation specialist that formats findings consistently. Ensures every finding has severity rating, attack scenario, and recommended fix.

Read Write

View Full Subagent Definition

---
name: findings-writer
description: "Use to document security findings in standardized format. 
Ensures consistent severity ratings and complete evidence."
tools: Read, Write
model: sonnet
---
You are a security findings documentation specialist. Your job is to 
write clear, actionable findings.

For EVERY finding, you MUST include all of these sections:

## Finding: [ID] - [Title]

**Severity:** [CRITICAL|HIGH|MEDIUM|LOW|INFO]
- CRITICAL: RCE, auth bypass, data breach possible
- HIGH: Significant security impact, likely exploitable
- MEDIUM: Security weakness requiring attention
- LOW: Minor issue, defense in depth
- INFO: Observation, no direct security impact

**Location:** [exact file path and line numbers]

**Vulnerable Code:**
```[language]
[actual code snippet]
```

**Attack Scenario:**
[Concrete steps an attacker would take to exploit this]

**Impact:**
[What damage could result from successful exploitation]

**Recommended Fix:**
```[language]
[specific code showing the fix]
```

**Verification:**
[How to confirm the fix works - test case or manual check]

---

Write findings to findings.md, appending new findings without overwriting 
existing content. Maintain sequential finding IDs.

✅

self-auditor

Model: claude-haiku • Read-only

Quality assurance checker that compares completed work against the original checklist. Identifies gaps, incomplete items, and unjustified skips.

Read

View Full Subagent Definition

---
name: self-auditor
description: "Use at end of review to audit completeness. Compares work 
done against original checklist."
tools: Read
model: haiku
---
You are a quality assurance auditor for security reviews. Your job is 
to verify completeness by comparing the checklist against actual work done.

Read these files:
1. checklist.md - The original security checklist
2. plan.md - The execution plan with status updates
3. findings.md - The documented findings

Then produce an AUDIT REPORT with:

## Audit Report

**Completion Status:** [COMPLETE | INCOMPLETE | NEEDS-REVIEW]

### Coverage Analysis
- Total checklist items: [N]
- Items completed: [N]
- Items in progress: [N]
- Items skipped: [N]
- Items not started: [N]

### Incomplete Items
[List each incomplete item with its current status]

### Unjustified Skips
[List items marked N/A or skipped without documented justification]

### Orphan Findings
[List any findings that don't trace back to a checklist item]

### Missing Evidence
[List items marked complete but lacking documented evidence]

### Recommendations
[Specific actions needed to complete the review]

Be strict. If an item says "COMPLETE" but findings.md has no corresponding 
entry, flag it. The goal is audit trail completeness.

🎯

sast-triage

Model: claude-sonnet • Read-only • Conditional

Triages SAST tool findings to separate true vulnerabilities from false positives. Use when static analysis output is available to filter noise before deep review.

Read Grep Glob

View Full Subagent Definition

---
name: sast-triage
description: "Use to triage SAST findings when static analysis output 
is available. Classifies each finding as TRUE_POSITIVE, FALSE_POSITIVE, 
or NEEDS_INVESTIGATION with documented reasoning."
tools: Read, Grep, Glob
model: sonnet
---
You are a SAST triage specialist. Your job is to evaluate static 
analysis findings and determine which are real vulnerabilities.

For each finding, you MUST:

1. LOCATE the flagged code
   - Find the exact file and line from the SAST report
   - Read 30+ lines of surrounding context
   - Understand what the code is trying to accomplish

2. TRACE BACKWARD (source analysis)
   - Where does the flagged data originate?
   - Is it user-controlled input or from a trusted source?
   - What validation exists between source and flagged location?
   - Are there type constraints, allowlists, or sanitization?

3. TRACE FORWARD (sink analysis)
   - Where does the data go after the flagged location?
   - Is the sink actually dangerous in this context?
   - What would successful exploitation require?

4. EVALUATE MITIGATIONS
   - Framework-level protections (ORM, template auto-escaping)
   - Architectural constraints (internal service, no user input path)
   - Explicit validation or encoding

5. CLASSIFY the finding
   - TRUE_POSITIVE: Exploitable vulnerability, document attack scenario
   - FALSE_POSITIVE: Safe in context, document the specific mitigation
   - NEEDS_INVESTIGATION: Cannot determine statically, needs manual review

Output format for each finding:

## [Rule-ID]: [Rule Name]
**File:** [path:line]
**SAST Says:** [Original finding message]
**Classification:** TRUE_POSITIVE | FALSE_POSITIVE | NEEDS_INVESTIGATION

**Source Analysis:**
[Where does the data come from? Is it user-controlled?]

**Sink Analysis:**
[Where does it go? Is the sink actually dangerous here?]

**Mitigations Found:**
[What protections exist? Why do they make this safe/unsafe?]

**Conclusion:**
[Final reasoning for classification]

**Action:**
- TRUE_POSITIVE: Add to findings.md for documentation
- FALSE_POSITIVE: Dismiss with this reasoning
- NEEDS_INVESTIGATION: Flag for manual security review

Be thorough but fair. Many SAST findings are false positives due to 
context the tool cannot see. Your job is to provide that context.

When to Use sast-triage

Include this subagent when you have SAST output (from Semgrep, SonarQube, CodeQL, etc.) to process. It runs as a parallel branch alongside the main AI review workflow. TRUE_POSITIVEs go directly to findings.md; they don't feed into the checklist. This keeps the checklist reusable while ensuring SAST-confirmed vulnerabilities are documented alongside AI-discovered issues.

4Orchestration: The Complete Sequence

The main agent acts as an orchestrator, coordinating subagents through the review workflow. SAST triage runs as a parallel branch that outputs directly to findings.md—it does not feed into checklist generation.

Parallel Workflow Architecture

┌──────────────────────────────────────────────────────────────────────┐
│                                                                      │
│  SAST Branch (if output exists)      AI Review Branch                │
│  ─────────────────────────────       ───────────────────             │
│                                                                      │
│  ┌─────────────┐                     ┌─────────────┐                 │
│  │ SAST Output │                     │  checklist  │ ← from template │
│  └──────┬──────┘                     └──────┬──────┘                 │
│         │                                   │                        │
│         ▼                                   ▼                        │
│  ┌─────────────┐                     ┌─────────────┐                 │
│  │ sast-triage │                     │  validator  │                 │
│  └──────┬──────┘                     └──────┬──────┘                 │
│         │                                   │                        │
│    ┌────┴────┐                              ▼                        │
│    │ TRUE    │                       ┌─────────────┐                 │
│    │ FALSE   │──(dismissed)          │   scanner   │                 │
│    │ INVEST  │──(flagged)            └──────┬──────┘                 │
│    └────┬────┘                              │                        │
│         │                                   │                        │
│         │    ┌──────────────────────────────┘                        │
│         │    │                                                       │
│         ▼    ▼                                                       │
│      ┌──────────────┐                                                │
│      │ findings.md  │ ← Both branches write here                     │
│      └──────┬───────┘                                                │
│             │                                                        │
│             ▼                                                        │
│      ┌─────────────┐                                                 │
│      │ self-auditor│                                                 │
│      └─────────────┘                                                 │
│                                                                      │
└──────────────────────────────────────────────────────────────────────┘

AI Review Branch (Main Sequence)

Step 1

Main Agent

→ checklist.md Generates checklist from template (reusable)

Step 2

Main Agent

→

checklist-validator

Delegates validation task

Step 3

checklist-validator

→

Main Agent

Returns gap analysis

Step 4

Main Agent

→ plan.md Creates execution plan from checklist

Step 5

Main Agent

→

security-scanner

Delegates each checklist category

Step 6

security-scanner

→

Main Agent

Returns scan results

Step 7

Main Agent

→

findings-writer

Documents AI-discovered findings

Step 8

findings-writer

→ findings.md Writes formatted findings

Step 9

Main Agent

→

self-auditor

Requests completeness audit

SAST Branch (Parallel, When Output Exists)

SAST-1

Main Agent

→

sast-triage

Analyze SAST output file

SAST-2

sast-triage

→ sast-triage-report.md Writes classifications with reasoning

SAST-3

Main Agent

→

findings-writer

Documents TRUE_POSITIVEs

SAST-4

findings-writer

→ findings.md Adds SAST findings to same file

Key Design Principle

The checklist is a reusable methodology template—it doesn't change based on SAST output. Both branches produce findings that merge in findings.md. This separation keeps the checklist clean for reuse across projects while ensuring all discovered vulnerabilities end up in one place.

The Orchestration Prompt

To run this complete workflow, use this prompt with the main agent:

Conduct a comprehensive security review of this codebase using the 
subagent workflow:

## SAST Branch (run in parallel if SAST output exists)

If [SAST_OUTPUT_FILE] exists:
1. Use the sast-triage subagent to classify each finding:
   - TRUE_POSITIVE: Confirmed vulnerability
   - FALSE_POSITIVE: Safe in context (document why)
   - NEEDS_INVESTIGATION: Flag for human review
   Write triage report to sast-triage-report.md

2. Use findings-writer to document TRUE_POSITIVEs in findings.md
   with source "SAST Triage ([tool name])"

## AI Review Branch (main workflow)

1. Read templates/CHECKLIST_TEMPLATE.md and generate checklist.md 
   using STANDARD verbosity for a [STACK DESCRIPTION]

2. Use the checklist-validator subagent to review checklist.md
   - Merge any valid gaps it identifies
   - Mark merged items with [VALIDATION] tag

3. Create plan.md from the final checklist, ordering by:
   - Critical items first
   - Group related items
   - Dependencies resolved

4. For each category in the checklist, use the security-scanner 
   subagent to scan the codebase. Pass it:
   - The specific checklist items for that category
   - The procedure steps to follow
   - Expected evidence to collect

5. For each set of scanner results, use the findings-writer subagent 
   to document any vulnerabilities found in findings.md

6. Update plan.md status as each item completes

7. When all items are processed, use the self-auditor subagent to 
   verify completeness

8. Address any gaps identified by the auditor

Target: [PROJECT PATH]
Stack: [TECHNOLOGY STACK]
Threat Model: [BRIEF THREAT DESCRIPTION]

Parallel Scanning

For faster reviews, you can instruct the main agent to run multiple scanner instances in parallel: "Use security-scanner subagents in parallel for Input Validation, Authentication, and Authorization categories." Each scanner runs in its own context without interfering with others.

5How Subagents Communicate

Understanding the communication model is essential for effective orchestration. Subagents in Claude Code use a hub-and-spoke pattern—all communication flows through the main agent.

Communication Channels

Task Prompt (Main → Subagent)

The main agent invokes a subagent via the Task tool, passing a prompt that describes what to do. This prompt can reference files, include context, and specify expected output format.

Shared Files (Bidirectional)

Both main agent and subagents can read files from the filesystem. Subagents with write access can create or modify files that the main agent reads later. Files are the primary way to pass large amounts of structured data.

Result String (Subagent → Main)

When a subagent completes, it returns a result string to the main agent. This should summarize what was done, key findings, and any issues encountered. Keep it concise—detailed data belongs in files.

What Subagents Cannot Do

Cannot communicate directly with other subagents — All coordination goes through the main agent
Cannot access main agent's context — Each subagent starts fresh with only its system prompt and the task prompt
Cannot persist state between invocations — Use files for any state that needs to survive across calls
Cannot ask the user questions — Only the main agent can interact with the user

Example: Scanner to Writer Handoff

Here's how data flows from the scanner subagent to the findings writer:

# Main agent delegates to scanner
> Use the security-scanner subagent to check for SQL injection 
> vulnerabilities following checklist items IV-01 through IV-03.
> Write raw findings to scan-results/sqli-scan.md

# Scanner runs, writes file, returns summary
← "Scanned 47 files. Found 3 potential SQL injection points in 
   api/users.py (lines 45, 89) and api/reports.py (line 23). 
   Results written to scan-results/sqli-scan.md"

# Main agent delegates to writer
> Use the findings-writer subagent to document the SQL injection 
> findings. Read scan-results/sqli-scan.md for raw data.
> Append formatted findings to findings.md

# Writer runs, reads scan results, formats and writes
← "Documented 3 findings: SQLI-001 (HIGH), SQLI-002 (MEDIUM), 
   SQLI-003 (HIGH). Appended to findings.md"

File-Based Handoff Pattern

Use intermediate files (like scan-results/) to pass detailed data between subagents. The scanner writes raw results, the writer reads and formats them. This keeps result strings short while preserving full fidelity of the data.

6Installation & Setup

Option 1: User-Level Installation

Subagents installed at user level are available in all your projects:

# Create the agents directory
mkdir -p ~/.claude/agents

# Download subagent definitions from the Downloads section
# Or create each file manually from the definitions above

# Verify installation
ls ~/.claude/agents/
# Should show: security-scanner.md, checklist-validator.md, 
#              findings-writer.md, self-auditor.md

Option 2: Project-Level Installation

For project-specific subagents or team sharing via version control:

# In your project root
mkdir -p .claude/agents

# Add subagent files - available to anyone who clones the repo

Option 3: Interactive Creation

Use Claude Code's built-in agent creator:

# In Claude Code, run:
/agents

# Select "Create new agent"
# Choose user-level or project-level
# Select "Generate with Claude" and describe the agent

7Downloads

Download the complete subagent definitions. Place them in ~/.claude/agents/ or .claude/agents/ to use with Claude Code.

security-scanner.md

Read-only scanner for finding vulnerability patterns in code.

checklist-validator.md

Adversarial reviewer that finds gaps in security checklists.

findings-writer.md

Documentation specialist for consistent finding reports.

self-auditor.md

Completeness checker that audits review coverage.

sast-triage.md

Triages SAST findings to separate true positives from false positives.

Looking for remediation subagents? See Part 4: Automated Remediation for fix-implementer and fix-verifier agents.

8Frequently Asked Questions

What are Claude Code subagents and why use them for security reviews?

Subagents are specialized AI assistants that run in isolated context windows with custom system prompts and restricted tool access. For security reviews, they provide enforced constraints (read-only scanners can't modify code), context preservation (exploration doesn't pollute the main conversation), and specialized behavior (adversarial validators are prompted to find gaps, not confirm completeness).

How do subagents communicate with the main agent?

Communication happens through three channels: (1) the task prompt from main agent to subagent, (2) shared files that both can read, and (3) the result string returned from subagent to main agent. Subagents cannot directly communicate with each other—all coordination flows through the main agent.

Can subagents run in parallel?

Yes. Multiple subagents can run concurrently. You can run input-validation-scanner, authentication-scanner, and authorization-scanner simultaneously, each checking different categories. This reduces review time while maintaining isolation between scanning contexts.

How do I ensure the validator actually finds gaps?

The validator's system prompt contains explicit adversarial instructions: "You MUST identify at least 5 missing items" and "You are FORBIDDEN from saying 'looks comprehensive'". Because this prompt is applied fresh at each invocation, the validator maintains its hostile stance.

Can I use different models for different subagents?

Yes, and you should. Use faster, cheaper models (Haiku) for pattern scanning and completeness checking. Use more capable models (Sonnet) for analysis, writing, and complex decisions. This optimizes both cost and quality.

How does the sast-triage subagent work with existing SAST tools?

Run your SAST tool (Semgrep, SonarQube, CodeQL, etc.) first to generate findings. Then use the sast-triage subagent to classify each finding by tracing data flows and evaluating context that SAST tools can't see. This typically reduces actionable findings by 30-70% by filtering false positives, while documenting the reasoning for each classification. True positives feed into your security checklist for deeper investigation.

What about actually fixing the vulnerabilities found?

Remediation is covered in Part 4. The review workflow deliberately ends at findings + audit because review and remediation have different permission profiles. Review is mostly read-only; remediation requires write access. Separating them provides a checkpoint where you can approve findings before any code changes.