Bulk PDF Processing with Codex: SKILL.md Template

A production playbook for bulk PDF processing in cross-industry operations using Codex: skill.md template, run-scoped inputs, logs, typed results, and artifacts.

Audience: Document automation teams with batch jobs

The problem

Document automation teams with batch jobs need bulk PDF processing to run repeatedly against hundreds of PDFs, schemas, review rules, and output files. In cross-industry operations, the pain is not one good answer; it is repeatability, auditability, exception handling, and evidence that survives handoff.

Implementation path

Put the operating procedure in SKILL.md, keep examples beside the skill, attach hundreds of PDFs, schemas, review rules, and output files per run, and let Argo turn the folder into a repeatable Codex execution.

Tradeoffs and failure modes

A skill folder is less flexible than an open chat, but it gives the product a versioned workflow that can be tested and rolled back. For bulk PDF processing, the practical test is whether a second run can be debugged, retried, and consumed by a product without reading the raw agent transcript.

SKILL.md starter

# SKILL.md
You run bulk PDF processing using Codex.
Read only /skill/.argo/inputs.
Write artifacts to /skill/output/artifacts.
Return argo.result.v1 with body.type = "bulk_pdf_processing".

Run this on Argo