Six Phases at a Glance
Each phase earns the right to the next
Discovery
Inventory use cases, classify data, choose deployment surface.
Gate
Executive approval in writing
Foundation
SSO, API gateway, observability, acceptable use policy.
Gate
Security validates + first request flows end-to-end
Pilot
Build use cases, MCP servers, Claude Code rollout.
Gate
Value + security + budget -- all three
Hardening
Pentest, PII filtering, audit trails, load test.
Gate
CISO sign-off + 2x peak load passed
Rollout
Wave by business unit, training, CoE, prompt libraries.
Gate
Adoption targets met per business unit
Optimization
Cost tuning, model upgrades, use case intake, monitoring.
Gate
QBR re-approval. This phase does not end.
Constant discipline
Testing
Embedded in every phase -- not just Phase 4
Constant discipline
Ownership Clarity
Three columns: partner / client / joint
Most enterprise Claude deployments do not fail on technology. The models work. The API works. The integrations work. They fail on sequencing.
A team runs a brilliant pilot that proves real value, then spends nine months trying to get it into production because the pilot was never built to be hardened. A different team flips Claude on for the entire company in a week, then watches adoption stall because nobody was trained and nothing was governed. A third team gets everything into production and declares victory, then discovers a quarter later that costs have tripled and output quality has drifted because no one owned optimization.
These are not technology failures. They are order-of-operations failures. The deployment got the right components in the wrong sequence, or skipped a phase that felt optional and turned out to be load-bearing.
This piece is the map. It lays out the six phases of an enterprise Claude deployment and, more importantly, the gate that sits between each one. The gate is the thing most roadmaps leave out. A phase is not done when the work is done. It is done when a specific, verifiable condition is met that earns the right to start the next phase. Skip a gate and the failure does not show up immediately. It shows up two phases later, when it is expensive to fix.
Over the next several weeks, this series will take each of the six phases in turn, going deep on the decisions, the failure modes, and the operational specifics that a single overview cannot hold. This piece is the map. Here is the arc at altitude.
Phase 1: Discovery and Assessment
The work: inventory candidate use cases, classify the data each one touches, assess the existing infrastructure, and choose a deployment surface (direct API, Bedrock, or Vertex).
The gate: an executive sponsor approves the platform choice, the prioritized pilot use cases, and the budget. Not a verbal nod in a hallway. A documented decision with a name attached.
Where teams get it wrong: they skip discovery entirely because it feels like overhead. They jump to a pilot on whatever use case is loudest, without classifying the data or confirming the use case is even a good fit for an LLM. Discovery is two weeks that saves two quarters. The teams that skip it build pilots on the wrong problems and learn it the hard way.
Phase 1 -- Discovery and Assessment
The Work
- Inventory candidate use cases
- Classify data sensitivity per use case
- Assess existing infrastructure
- Choose deployment surface: direct API, Bedrock, or Vertex
The Gate
Executive sponsor approves in writing
- Platform choice documented
- Pilot use cases prioritized
- Budget committed
Teams that skip discovery and jump to the loudest use case build pilots on the wrong problems. Two weeks of discovery saves two quarters of rework.
Phase 2: Foundation and Access Layer
The work: stand up the platform layer before the use cases. SSO and RBAC, an API gateway with cost attribution and logging, model routing rules, an acceptable use policy, and observability.
The gate: the security team validates access controls, and the first real request flows through the production path end to end, authenticated, logged, and cost-attributed.
Where teams get it wrong: they build use cases first because use cases are visible and infrastructure is not. Six months later finance cannot tell which team is spending what, security cannot audit who accessed which model, and the whole deployment gets frozen pending a governance review that should have happened at the start. Foundation is unglamorous and it is the phase that determines whether everything above it can scale.
Phase 2 -- Foundation and Access Layer
Before use casesSSO and RBAC
Authentication and role-based access before the first user touches anything.
API Gateway
Cost attribution, logging, and routing rules across all Claude workloads.
Model Routing Rules
Which model, at what tier, for which use case. Defined in policy, not ad hoc.
Acceptable Use Policy
Written, approved, and distributed before the first real request.
Observability
Latency, cost, error rate, and audit trail visible from day one.
Gate: security team validates controls + first real request flows end-to-end, authenticated, logged, cost-attributed
Building use cases before foundation means no cost attribution, no audit trail, and a governance freeze six months later when finance and security finally ask questions.
Phase 3: Pilot Implementation
The work: build the top prioritized use cases for real, with proper error handling and evaluation. Build the MCP servers that connect Claude to internal systems. Roll Claude Code out to a pilot group of engineers.
The gate: pilot users confirm genuine value, the MCP servers pass a security review, and the cost lands within the projected budget. All three. Value without security is a liability. Value at triple the projected cost is not a viable deployment.
Where teams get it wrong: pilot purgatory. The pilot proves value, everyone is excited, and then it sits there for months because it was built as a demo, not as the first increment of a production system. A pilot that cannot be hardened is not a pilot. It is a prototype that has to be rebuilt.
Phase 3 -- Pilot Gate: All Three Required
Value
Pilot users confirm measurable productivity or quality improvement. Not just 'it seems useful.'
Security
MCP servers pass review: input validation, parameterized queries, audit logging, tool annotations.
Cost
Actual cost lands within the projected budget. Triple the estimate is not a viable deployment model.
All three gates must pass. Not two out of three.
Value without security is a liability. Value at triple the budget is not a viable deployment.
Pilot purgatory: the pilot proves value but was built as a demo, not as the first increment of a production system. A prototype that cannot be hardened is not a pilot.
Phase 4: Hardening and Compliance
The work: penetration test the Claude-integrated surfaces, remediate the findings, implement PII filtering, configure immutable audit trails, validate data residency, and load test at projected scale.
The gate: the CISO signs off, compliance attestation is complete for the applicable frameworks, and the system passes a load test at two times projected peak volume.
Where teams get it wrong: they skip hardening because the pilot worked. The logic is seductive and wrong. A pilot proves the use case is valuable. It proves nothing about whether the system survives a prompt injection attack, handles PII correctly, or holds up under real load. Hardening is the phase that converts a thing that works in a demo into a thing that survives in production. It is also the phase under the most pressure to skip, because by now everyone wants to ship.
Phase 4 -- Hardening and Compliance Checklist
Penetration test all Claude-integrated surfaces
SecurityRemediate pentest findings before cutover
SecurityPII filtering configured and tested end-to-end
ComplianceImmutable audit trails configured and retained
ComplianceData residency validated per applicable frameworks
ComplianceLoad test at 2x projected peak volume
ReliabilityGate: CISO sign-off required
Plus completed compliance attestation and passage of the 2x peak load test.
Skipping hardening because the pilot worked. A successful pilot proves the use case is valuable. It proves nothing about prompt injection, PII handling, or production load.
Phase 5: Enterprise Rollout
The work: roll out in waves by business unit, with audience-specific training, prompt libraries, a self-service MCP server catalog, and a Center of Excellence to own standards.
The gate: adoption targets are met per business unit, typically a majority of provisioned users active within thirty days, and ROI is demonstrated for the pilot use cases before the next wave begins.
Where teams get it wrong: the big-bang rollout. They flip Claude on for the entire company at once with no training, no prompt libraries, and no support structure. Provisioned seats are not adoption. Without enablement, most users try it twice, get a mediocre result because they do not know how to work with the tool, and quietly stop. Rollout is a change-management exercise wearing a technology costume.
Phase 5 -- Enterprise Rollout: Wave Model
Majority of provisioned users active within 30 days. ROI demonstrated for pilot use cases.
Adoption targets met. Prompt libraries shared across units. Support structure operating.
Center of Excellence operating. All units have trained users and access to shared resources.
Big-bang rollout: Claude on for the entire company at once with no training, no prompt libraries, and no support structure. Provisioned seats are not adoption.
Phase 6: Optimization and Evolution
The work: cost optimization, model upgrade evaluation, new use case intake, and continuous platform monitoring. This phase does not end.
The gate: a quarterly business review that re-approves continuation based on demonstrated ROI, and an annual strategy refresh aligned to where the platform is going.
Where teams get it wrong: they treat production as the finish line. They declare victory and reassign the team. Then costs balloon because nobody tunes the model and effort mix, quality drifts because nobody re-tests against new model versions, and the deployment slowly falls behind the platform it is built on. Optimization is not a phase you complete. It is a practice you sustain, and it is the subject of its own treatment in this series.
Phase 6 -- Optimization and Evolution
This phase does not endCost Optimization
Tune model and effort mix. Review token spend by use case. Identify where cheaper models can substitute.
Model Upgrade Evaluation
Run prompt regression suite against new model. Require quality and cost parity before cutover.
Use Case Intake
Review new use case requests against governance criteria. Prioritize and sequence the roadmap.
Platform Monitoring
Watch the changelog. Triage deprecations. Maintain migration playbooks for active beta features.
Gate: quarterly business review re-approves continuation on demonstrated ROI. Annual strategy refresh.
Treating production as the finish line. Costs balloon, quality drifts, and the deployment slowly falls behind the platform it is built on when nobody owns optimization.
Two Disciplines That Run Through Every Phase
The six phases are sequential. Two disciplines are not. They run through all of them, and a deployment that treats either as a phase rather than a constant will fail.
The first is testing. Testing is not a gate at the end of Phase 4. It is embedded in every phase: smoke tests in Foundation, acceptance tests and prompt regression suites in Pilot, penetration and load tests in Hardening, wave validation in Rollout, model-upgrade regression in Optimization. A deployment that defers testing to a single hardening pass discovers its problems all at once, late, when they are most expensive.
The second is ownership clarity. Every task in every phase has an owner, and the honest version of that map has three columns, not one. Some work is the delivery partner's. Some is the client's alone, because the authority cannot be delegated: the CISO sign-off, the executive budget decision, the security team's penetration test. Some is genuinely joint. Deployments that blur these columns stall at exactly the moments that require a decision nobody clearly owns.
Discipline 1: Testing -- Embedded in Every Phase
Foundation
Smoke tests
Pilot
Acceptance + prompt regression
Hardening
Penetration + load
Rollout
Wave validation
Optimization
Model-upgrade regression
A deployment that defers testing to a single hardening pass discovers all problems at once, when they are most expensive to fix.
Discipline 2: Ownership Clarity -- Three Columns, Not One
Delivery Partner
- MCP server builds
- Integration code
- Prompt engineering
- Technical architecture
Client Only
- CISO sign-off
- Executive budget approval
- Security team pentest
- Compliance attestation
Joint
- Use case selection
- Pilot scope definition
- Rollout sequencing
- QBR evaluation
Deployments that blur these columns stall at exactly the moments that require a decision nobody clearly owns.
The Map Is Not the Territory
This is the arc. Six phases, six gates, two constant disciplines. The phases are not a checklist to rush through. They are a sequence where each one earns the right to the next, and the gates are the mechanism that enforces the sequence.
The detail lives in the phases. Each one has decisions, failure modes, and operational specifics that deserve their own treatment. Over the next several weeks, this series will take them one at a time, starting with Discovery and ending with the optimization practice that never stops. Start here with the shape of the whole thing, because the single most common reason enterprise Claude deployments fail is that someone treated a load-bearing phase as optional and only found out which one it was after it was too late to put it back.
Follow along for the deep-dives. Next in the series: Phase 1, Discovery and Assessment.
Work with Riptide
Ready to put a governance framework behind your Claude deployment?
Our Claude Enterprise Readiness Assessment maps your file structure, permissions model, and MCP surface in three weeks.
Book a discovery callAndrew Poole
Founder of Riptide Consulting, an Anthropic-first AI engineering firm based in Carlsbad, CA. Building the intelligence layer for enterprise and growth-stage companies on the Anthropic platform.