Prompt Injection Red-Team Review with Claude Code: Artifact Delivery

A production playbook for prompt injection red-team review in cross-industry operations using Claude Code: artifact delivery, run-scoped inputs, logs, typed results, and artifacts.

Audience: AI security teams

The problem

AI security teams need prompt injection red-team review to run repeatedly against prompts, tool policies, transcripts, and suspicious inputs. In cross-industry operations, the pain is not one good answer; it is repeatability, auditability, exception handling, and evidence that survives handoff.

Implementation path

Require Claude Code to write customer-visible files under /skill/output/artifacts, validate filenames and sizes, then return signed artifact metadata in argo.result.v1.

Tradeoffs and failure modes

Artifact policy constrains file output, but customers receive files that are durable, typed, and safe to download. For prompt injection red-team review, the practical test is whether a second run can be debugged, retried, and consumed by a product without reading the raw agent transcript.

Artifact manifest

artifacts:
  - red-team-prompt-review-summary.md
  - red-team-prompt-review-evidence.csv
  - red-team-prompt-review-review.json
signed_urls: true
retention: org_policy

Run this on Argo