Long-Running Agent Job with Claude Code: Human Review Queue

A production playbook for long-running agent job in cross-industry operations using Claude Code: human review queue, run-scoped inputs, logs, typed results, and artifacts.

Audience: AI product teams running multi-minute workflows

The problem

AI product teams running multi-minute workflows need long-running agent job to run repeatedly against documents, repos, tool access, and output contracts. In cross-industry operations, the pain is not one good answer; it is repeatability, auditability, exception handling, and evidence that survives handoff.

Implementation path

Split the long-running agent job result into automatable fields and review-only exceptions, then send low-confidence cases to a human queue with evidence artifacts attached.

Tradeoffs and failure modes

Human review slows a subset of runs, but it lets the workflow ship before every edge case is fully automated. For long-running agent job, the practical test is whether a second run can be debugged, retried, and consumed by a product without reading the raw agent transcript.

Review handoff

review_status: needs_review | approved | rejected
review_reason: string
source_evidence: artifact_url[]
agent: Claude Code
workflow: long-running-agent-job

Run this on Argo