AI ENGINEERING / PRODUCTION & GOVERNANCE

Production & Governance

Shipping and operating AI: cost, latency, guardrails, PII and safety, human-in-the-loop, accountability, and deployment. The gap between a demo and an AI that runs on Monday morning.

Foundation · 2

Production note
Production gotchas: what the demo never showed you
A demo proves an AI system can work once. Production proves it works on Monday morning, under load, on the inputs nobody scripted, when the token bill is real and the agent can delete things. The gap between the two is where AI systems break — not on capability, but on the cost ceiling nobody set, the latency nobody budgeted, the prompt injection nobody screened, the PII that walked to a third party, the irreversible action with no approval step, and the kill switch that didn't exist when it mattered. Ten gotchas that separate a demo from a system you can run, each with the trap, the fix, and the question to answer before you ship.
Decision framework
Production Style Guide: the gate an AI clears before it runs unattended
The opinionated rules Cleon applies before an AI system runs on real traffic — the pre-ship gate as a binary checklist (cost ceiling, latency fallback, input guardrail, PII masked, a human on irreversible actions, the audit trail, a rollback ready, the eval gate passed), and the in-platform-versus-build-it matrix that says what Agentforce and the Einstein Trust Layer give you by construction versus what you assemble off-platform, dimension by dimension. The discipline document that turns the production gotchas into rules and the production-readiness principles into a checklist: an unmet row blocks the ship. And because this is the last page of the AI Engineering catalog, it ties the five subcategories together — agents, grounding, prompting, evaluation, production — into the single arc the whole discipline traces.

Reference · 5

How-to · 1

How-to
Deploying to production: the safe path from a passing eval to live traffic
The eval is green — now how do you actually ship it without learning the hard way that green offline is not green in production? Six steps that take a prompt, model, or agent change from a passing test to live traffic with a way back: build and test in an isolated environment (Agentforce DX moves agent metadata between scratch orgs, sandboxes, and prod; off-platform, a staging environment), pass the eval gate before merge, version the change so you know exactly what shipped, roll out gradually behind a canary instead of flipping 100 percent at once, keep a one-step rollback ready, and monitor on live traffic after — because the silent degrade a frozen set can't see is caught by online eval and tracing. The throughline: deployment is not the finish line. It's where evaluation and observability start doing their real work.

Production & Governance

Foundation · 2

Production gotchas: what the demo never showed you

Production Style Guide: the gate an AI clears before it runs unattended

Reference · 5

What is production readiness? The gap between a demo that works and an AI that runs on Monday

Cost and latency: the levers, in order of force

Input and output guardrails: the safety layer around a shipped agent

PII and data governance: masking, retention, and the audit trail

Human-in-the-loop and accountability: who is on the hook when the agent acts

How-to · 1

Deploying to production: the safe path from a passing eval to live traffic