AI PM Playbook

Use this to define what you're building, how the AI behaves, and what "good" looks like. The AI job statement is the most important line in this document.

Upstream: this should follow an approved opportunity brief. Downstream: use this PRD to create the eval plan, human review workflow, cost model, observability plan, and launch gate. If engineers or coding agents need a tighter handoff, use the optional build brief.

Problem

Goals

Non-goals

Target users

Current workflow

Proposed workflow

AI job statement

The AI [does what] using [inputs] to produce [outputs] for [user] inside [workflow], subject to [constraints].

Constraints and guardrails

Guardrail	Hard limit	Enforced by	On violation	Monitored as (target)
Input scope	only in-scope requests	intent classifier / router	decline + fallback	out-of-scope rate; router false-route rate
Prompt injection	ignore instructions inside user or retrieved content	system prompt + input sanitization	flag, do not act	flagged-input rate; missed-attack rate (FN)
Grounding	every factual claim traces to a cited source; no fabricated citations	output validator checks cited IDs exist	suppress output	ungrounded-output rate; fabrications passing validator (FN)
Output schema	matches the output contract	schema validation	fallback (see Failure behavior)	malformed-output rate
Action scope	no irreversible action without approval	autonomy map + approval gate	stop, escalate to human	blocked-action rate; false-block rate (FP)
Safety topics	refuse defined categories (self-harm, legal, etc.)	safety classifier	escalate to human	escalation rate; missed-trigger rate (FN)
Cost / rate ceiling	max tokens + retries per task	runtime budget cap	stop, fallback	cap-hit rate

Model requirements

Parameter	Value	Notes
Model / provider	e.g., Claude Sonnet via Anthropic API
Token budget per task	e.g., 2k input, 500 output
Multi-model routing	e.g., cheap model for classification, expensive model for generation
Context window needs	e.g., must handle 50-page documents

System persona

Tone: e.g., professional, direct, no hedging
Constraints: e.g., never speculate beyond source material, always cite the document section
Persona boundaries: e.g., does not give opinions, does not role-play

Data provenance

Retrieval sources: e.g., customer knowledge base, internal docs
Data permissions: e.g., customer data processed under DPA, no cross-tenant access
Retention policy: e.g., prompts and outputs logged for 30 days, then deleted

Input contract

Input	Format	Required	Max size	Fallback if missing

Output contract

Output field	Type	Always present	Example

Autonomy level

Draft: AI produces output, human reviews before anything happens
Suggest: AI recommends an action, human accepts or rejects
Act: AI takes action, human can undo
Autonomous: AI takes action, no human in the loop

AI action	Autonomy level	Justification
e.g., draft support response	e.g., draft	e.g., customer-facing, must be reviewed before sending
e.g., categorize incoming ticket	e.g., act	e.g., low risk, reversible

Agent tool boundaries

Tool / capability	Allowed	Constraints
e.g., read customer records	yes/no	e.g., read-only, current tenant only
e.g., send email	yes/no	e.g., draft only, requires human approval
e.g., modify database	yes/no	e.g., never

Escalation: When the agent encounters something outside its scope, what happens? e.g., hand off to human, surface uncertainty, stop and ask.

Example inputs and outputs

Case	Input	Expected output
Happy path	typical input	what good looks like
Rejection	input the AI should refuse	e.g., refusal with escalation to human
Edge case	unusual but valid input	acceptable behavior

Human review rules

Risks and mitigations

Risk	Scenario	User impact	Business impact	Likelihood	Severity	Mitigation	Detection signal	Owner
Incorrect output	plausible but wrong output
Over-trust	user treats AI as authoritative
Data leakage	wrong user/tenant sees data
Permission failure	acts outside the user's authz scope
Prompt injection	instructions in user/retrieved content hijack behavior
Bias / unfair treatment	worse outcomes for a protected group
Regulatory exposure	output or data handling breaches a rule
Unsafe autonomy	AI takes action beyond scope
Cost spike	usage or retries exceed budget
Silent degradation	quality drops without alert

Agentic risks, if relevant:

Risk	Scenario	Mitigation	Owner
Goal hijacking
Tool misuse
Identity abuse
Memory poisoning
Error cascading

AI product requirements document

Problem

Goals

Non-goals

Target users

Current workflow

Proposed workflow

AI job statement

Constraints and guardrails

Model requirements

System persona

Data provenance

Input contract

Output contract

Autonomy level

Agent tool boundaries

Example inputs and outputs

Human review rules

Risks and mitigations

Quality bar

Latency target

Cost constraint

Failure behavior

Observability requirements

Launch gates

Open questions

Read alongside this template

Bad to good AI PRD

Agentic products

Customer Support Copilot: AI PRD