AI PM Playbook

Define when and how humans review AI output — before building any UI. Skip this and you default to "ship it and see," which is how AI products lose user trust.

Upstream: autonomy levels and human review rules from the AI PRD. Downstream: review failures and review-quality signals feed the PRD risk table, launch gate, and weekly post-launch review.

Review mode per action

Pick a mode per AI action. Different actions in one product can use different modes. Higher risk moves up the table.

Mode	Human role	Use when
Human-in-the-loop	Approves before the output or action takes effect	Customer-facing, high-impact, irreversible, or low-confidence
Human-on-the-loop	Monitors live, can intervene or roll back	Reversible, monitored, low-to-medium risk
Human-after-the-loop	Reviews samples and trends after the fact	Low-risk, reversible, with strong online metrics
No human loop	None	Low-risk, reversible, well-tested, and observable
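
One possible reading of the table above as a decision rule, sketched in Python. The names (`ActionProfile`, `suggest_mode`), the three-level `risk` field, and the order in which the rules are checked are all assumptions made for illustration; the table itself does not define precedence beyond "higher risk moves up". Anything that does not clearly qualify for a lighter mode falls back to human-in-the-loop.

```python
from dataclasses import dataclass
from enum import Enum


class ReviewMode(Enum):
    IN_THE_LOOP = "human-in-the-loop"
    ON_THE_LOOP = "human-on-the-loop"
    AFTER_THE_LOOP = "human-after-the-loop"
    NO_LOOP = "no human loop"


@dataclass
class ActionProfile:
    """Hypothetical description of one AI action; field names are illustrative."""
    risk: str  # assumed scale: "low", "medium", or "high"
    reversible: bool
    customer_facing: bool = False
    low_confidence: bool = False
    monitored_live: bool = False
    strong_online_metrics: bool = False
    well_tested: bool = False
    observable: bool = False


def suggest_mode(a: ActionProfile) -> ReviewMode:
    # Any high-stakes property forces approval before the action takes effect.
    if a.risk == "high" or not a.reversible or a.customer_facing or a.low_confidence:
        return ReviewMode.IN_THE_LOOP
    # Lightest mode only when every condition in the last table row holds.
    if a.risk == "low" and a.well_tested and a.observable:
        return ReviewMode.NO_LOOP
    if a.risk == "low" and a.strong_online_metrics:
        return ReviewMode.AFTER_THE_LOOP
    if a.monitored_live:
        return ReviewMode.ON_THE_LOOP
    # Assumption: default to the strictest mode when nothing lighter is justified.
    return ReviewMode.IN_THE_LOOP
```

The output is a starting suggestion for the "Mode" column below, not a substitute for the "Why this mode" justification.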

AI action	Mode	Reviewer	Why this mode	Upgrade / downgrade trigger

Hard boundaries — actions AI must never take

Reversible auto-actions and rollback

Action	Rollback mechanism	Undo window	Who monitors

Required review points

Review point	Trigger	Reviewer	What they check	If rejected

Reviewer capacity and SLA

Outputs needing review: per hour / day at target usage
Review throughput per reviewer: items/hour at target quality, from a timed sample
Reviewers staffed: count and coverage across shifts
Review SLA: max time from output to decision
Overflow behavior: what happens when the queue exceeds SLA (hold output, fall back to suggest-only, or page a lead)
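
The capacity fields above reduce to simple arithmetic. A minimal sketch in Python, assuming a steady arrival rate; the function names and the `utilization` parameter (the share of each hour a reviewer actually spends reviewing) are assumptions for illustration, and the numbers in the usage lines are made up.

```python
import math


def reviewers_needed(outputs_per_hour: float,
                     items_per_reviewer_hour: float,
                     utilization: float = 0.8) -> int:
    """Reviewers required on shift to keep pace with outputs needing review."""
    if items_per_reviewer_hour <= 0 or not 0 < utilization <= 1:
        raise ValueError("throughput must be positive and utilization in (0, 1]")
    effective_throughput = items_per_reviewer_hour * utilization
    return math.ceil(outputs_per_hour / effective_throughput)


def queue_exceeds_sla(queue_length: int,
                      reviewers_on_shift: int,
                      items_per_reviewer_hour: float,
                      sla_minutes: float) -> bool:
    """True if the newest queued item would wait longer than the review SLA."""
    if reviewers_on_shift <= 0:
        return queue_length > 0
    drain_minutes = 60 * queue_length / (reviewers_on_shift * items_per_reviewer_hour)
    return drain_minutes > sla_minutes


# 120 outputs/hour, 40 items/hour per reviewer from a timed sample, 75% utilization
print(reviewers_needed(120, 40, utilization=0.75))  # 4
# 60 items queued, 2 reviewers at 30 items/hour, 30-minute SLA
print(queue_exceeds_sla(60, 2, 30, sla_minutes=30))  # True
```

When `queue_exceeds_sla` returns true, the overflow behavior chosen above is what should fire.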

Guarding against rubber-stamping

Review-quality signal: flag if approval rate > X% or median time-per-review < Y seconds
Blind audit: a second reviewer re-checks a random N% of approvals; track disagreement rate
Seeded checks: inject known-bad outputs at a low rate; a reviewer who approves them is not reviewing
Reviewer calibration (expert domains): inter-reviewer agreement target a reviewer must meet before working solo
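
The review-quality signal and the seeded checks can be computed from a reviewer's decision log. A minimal sketch in Python, assuming each review is recorded as an `(approved, seconds_spent)` pair; the function names are hypothetical, and the thresholds stay as parameters because X and Y are for the team to set.

```python
from statistics import median


def rubber_stamp_flags(reviews, max_approval_rate, min_median_seconds):
    """Return the review-quality signals that fired for one reviewer.

    reviews: list of (approved: bool, seconds_spent: float) tuples.
    """
    if not reviews:
        return []
    approval_rate = sum(1 for approved, _ in reviews if approved) / len(reviews)
    median_seconds = median(seconds for _, seconds in reviews)
    flags = []
    if approval_rate > max_approval_rate:
        flags.append("approval_rate")
    if median_seconds < min_median_seconds:
        flags.append("time_per_review")
    return flags


def seeded_miss_rate(seeded_approvals):
    """Share of injected known-bad outputs that the reviewer approved.

    seeded_approvals: list of bools, True where a known-bad item was approved.
    """
    if not seeded_approvals:
        return 0.0
    return sum(seeded_approvals) / len(seeded_approvals)


reviews = [(True, 4), (True, 5), (True, 6), (False, 30)]
print(rubber_stamp_flags(reviews, max_approval_rate=0.7, min_median_seconds=10))
# ['approval_rate', 'time_per_review']
```

A fired flag is a prompt to look closer (for example through the blind audit), not proof that the reviewer is rubber-stamping.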

Review UI requirements

Output shown: full, diff, or summary
Evidence and sources shown: what the AI based its answer on, inline
Confidence shown: whether the reviewer sees a confidence score, and whether it is calibrated (see PRD)
Edit in place: can the reviewer modify before approving
Target time per review: ties to the capacity math above

Escalation path

Trigger: reviewer flags output as uncertain
Next handler: who or what handles it next
Resolution: the outcome, and how it is recorded

Audit trail

AI output (full), plus input and evidence used
Reviewer identity
Decision (approve / reject / edit) and edits made
Confidence and mode at time of decision
Timestamp
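
The fields above map naturally onto one immutable record per decision. A minimal sketch in Python; the class and field names (`ReviewAuditRecord`, `edited_output`, and so on) are assumptions for illustration, and where the record is stored is left open.

```python
import json
from dataclasses import asdict, dataclass, field
from datetime import datetime, timezone
from enum import Enum
from typing import Optional


class Decision(str, Enum):
    APPROVE = "approve"
    REJECT = "reject"
    EDIT = "edit"


@dataclass(frozen=True)  # frozen: a record cannot be changed after it is written
class ReviewAuditRecord:
    ai_input: str
    ai_output: str               # full output, not a summary
    evidence: tuple              # sources the AI used, e.g. document ids
    reviewer_id: str
    decision: Decision
    edited_output: Optional[str]  # what the reviewer changed it to, if edited
    confidence: Optional[float]   # confidence at time of decision
    mode: str                     # review mode at time of decision
    timestamp: str = field(
        default_factory=lambda: datetime.now(timezone.utc).isoformat()
    )

    def __post_init__(self):
        if self.decision is Decision.EDIT and self.edited_output is None:
            raise ValueError("an 'edit' decision must record the edited output")

    def to_json(self) -> str:
        data = asdict(self)
        data["decision"] = self.decision.value
        return json.dumps(data)


record = ReviewAuditRecord(
    ai_input="Where is my order?",
    ai_output="Your order ships Friday.",
    evidence=("order-db:row-1",),
    reviewer_id="reviewer-17",
    decision=Decision.APPROVE,
    edited_output=None,
    confidence=0.82,
    mode="human-in-the-loop",
)
print(record.to_json())
```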

Feedback captured from review

Feedback type	How captured	How used
Rejection reason	dropdown + free text	added to eval set
Edit diff	stored automatically	correction patterns, fine-tuning data
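
Storing the edit diff automatically can be as simple as diffing the AI output against the reviewer's version at approval time. A minimal sketch using Python's standard `difflib`; the function name `edit_diff` and the `ai_output` / `reviewer_edit` labels are illustrative assumptions.

```python
import difflib


def edit_diff(ai_output: str, reviewer_output: str) -> str:
    """Unified diff between the AI output and the reviewer's edited version.

    Returns an empty string when the reviewer changed nothing.
    """
    diff = difflib.unified_diff(
        ai_output.splitlines(),
        reviewer_output.splitlines(),
        fromfile="ai_output",
        tofile="reviewer_edit",
        lineterm="",
    )
    return "\n".join(diff)


print(edit_diff("Refund in 5 days.\nThanks.", "Refund in 7 days.\nThanks."))
```

Line-level diffs are coarse for long single-paragraph outputs; a word-level diff may suit those better.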

Human review workflow