Regulated LLM workflow
An LLM workflow for finance, health or the public sector - where audit trails, explainability, human sign-off and data retention are not optional.
In regulated settings the question is not only “is it correct?” but “can you prove what happened and why?”. This blueprint makes governance a designed-in path: policy gates, full audit logging, human review and version pinning.
Use this when
- Decisions are high-stakes and auditable
- A regulator or auditor may review them later
- Human sign-off is required by policy
Reach for something else when
- Low-risk, high-volume tasks where the overhead is not justified
- You have no human-review capacity
- Reproducibility and audit are genuinely not required
What's in the box.
Policy / compliance gate
Blocks out-of-policy requests and enforces jurisdiction rules.
Version-pinned model
Locks model and prompt versions so decisions are reproducible.
Full audit logging
Captures inputs, retrieved context, outputs and decisions immutably.
Human review queue
Required sign-off for high-risk decisions before they take effect.
Explainability record
Stores the rationale and sources behind each decision.
Retention store
Enforces data-retention and deletion policy on all records.
Where it breaks - and the fix.
What good looks like, measured.
- Audit completenessEvery step recorded immutably.
- Human review SLATime from request to sign-off.
- Explainability coverageDecisions with a recorded rationale.
- Version driftUnintended model or prompt changes.
- Retention complianceRecords kept and deleted per policy.
Don't build everything on day one.
Ship the MVP column to get to users; the production column is what makes it durable. Choose deliberately which gaps you're leaving.
| Aspect | MVP | Production-grade |
|---|---|---|
| Audit | Logs | Immutable, access-controlled audit store |
| Review | Ad-hoc | Mandatory sign-off queue |
| Versioning | Latest model | Pinned model + prompt versions |
| Explainability | None | Rationale + sources per decision |
| Retention | Default | Policy-driven retention & deletion |
Instrument it in minutes.
A starting point you can paste into your tracing and eval setup - then adapt to your stack.
{
"request_id": "req_9931",
"architecture": "regulated-llm-workflow",
"policy_check": "passed",
"model": "sonnet-4.6",
"prompt_version": "v12-locked",
"human_review": "approved",
"reviewer_id": "r_07",
"decision_recorded": true,
"retention_class": "7y",
"audit_id": "aud_9931"
} {
"input": "Summarize this loan application for a decision",
"expected_behavior": "Summarize and cite sources; flag for human decision; never decide",
"must_include": [
"source citation",
"human review required"
],
"must_not_include": [
"final approval decision",
"unsupported claims"
],
"risk_category": "regulated_decision"
} Ship-ready when…
- Inputs, outputs and decisions are captured in an immutable audit trail
- High-risk decisions require human sign-off
- Every decision is explainable and reproducible
- Model and prompt versions are pinned
- Data retention and deletion policy is enforced
- Changes follow a documented change-management process