The $2M model that never made it to production

Simor Consulting | 09 Jun, 2026 | 05 Mins read

A retail chain with 400 stores spent two years and $2.1 million building an inventory optimization model. The model was technically excellent. It reduced predicted stockouts by thirty-two percent and predicted overstock situations by twenty-seven percent in offline evaluation against two years of historical data. The data science team published internal papers on the approach. The VP of supply chain sponsored the project through three budget cycles. The model won an internal innovation award.

It never ran on a single real store.

The project was canceled eighteen months after the model was technically complete. The cancellation was not because the model performed poorly. It was because the organization could not operationalize it. The model required daily data feeds from seven source systems, produced recommendations in a format that no existing system could consume, and needed a human review step that no operational team had bandwidth to perform. The model existed. The deployment pathway did not.

How the project was structured

The project was organized as a data science initiative. The team reported to the chief data officer. The team’s success criteria were model accuracy metrics: precision, recall, F1 score, and a custom inventory cost function. The team optimized these metrics rigorously. They tested multiple architectures, ran ablation studies, and validated against multiple time periods. By every measure in their success criteria, the project was a success.

The team’s charter did not include deployment. Deployment was the responsibility of the supply chain technology team, which was a separate organization with separate leadership, separate priorities, and a separate budget. The data science team delivered a trained model and a prediction API. The supply chain technology team was expected to integrate the API into their inventory management system.

The integration never happened. The supply chain technology team had a backlog of fourteen projects that were funded, staffed, and committed to business stakeholders. The inventory optimization model was not on their roadmap. When the data science team delivered the model, the supply chain technology team had no capacity to integrate it and no incentive to reprioritize. The model was added to their backlog at position fifteen.

The integration gap

When we were brought in to assess the project, we found that the technical integration was solvable. The prediction API was well-documented. The data feeds required standard ETL work. The human review step could have been automated for low-risk recommendations. None of these were hard engineering problems.

The hard problem was organizational. The model’s deployment required coordinated changes across four teams: the data science team that owned the model, the supply chain technology team that owned the inventory system, the data engineering team that owned the data pipelines, and the store operations team that would consume the recommendations. None of these teams had the deployment in their quarterly plan. None of them had been allocated budget for the integration work. None of them had a product owner who was accountable for getting the model into production.

The data science team assumed that a technically superior model would create its own deployment urgency. It did not. The supply chain technology team had their own definition of urgency, which was driven by store operations pain points, not model accuracy metrics. The model’s value proposition — reduced stockouts and overstock — was real but indirect. It did not map to a specific operational failure that a specific team was being measured on.

This diagram requires JavaScript.

Enable JavaScript in your browser to use this feature.

The structural failure

The project failed because of a missing role: the deployment product owner. In successful ML deployments, there is a person — or a team — whose job is to bridge the gap between model development and operational integration. This person is not a data scientist. They are not an engineer. They are a product manager who owns the end-to-end journey from trained model to business impact.

The deployment product owner’s responsibilities include: securing integration capacity from the operational technology team before the model is complete, defining the deployment success criteria alongside the model accuracy criteria, negotiating the data pipeline changes with the data engineering team, designing the human review workflow with the operations team, and measuring the business impact after deployment.

Without this role, the model development and the deployment integration happen on separate tracks with separate timelines and separate accountability. The model reaches technical completion. The integration does not start. The gap between them is where projects die.

What a rescue would have required

We proposed a rescue plan that would have taken twelve weeks and $180,000. The plan included: appointing a deployment product owner from the supply chain organization, integrating the data feeds using existing ETL tooling, deploying the prediction API to the existing inventory management system’s integration layer, automating the human review step for recommendations below a risk threshold, and measuring the business impact in ten pilot stores over four weeks.

The rescue plan was not approved. By the time we were engaged, the project had been politically toxic for two quarters. Sponsoring a rescue would have required a VP to admit that the original project had been misstructured. The easier political path was cancellation, which is what happened.

The model artifacts, the documentation, and the internal papers were archived. The data science team was reassigned. The $2.1 million was written off as an innovation investment that did not pan out.

The lessons

The first lesson: model accuracy is not a deployment criterion. A model that achieves perfect accuracy but cannot be integrated into an operational system has zero business value. The deployment pathway must be designed, funded, and staffed in parallel with model development, not after model development is complete.

The second lesson: the team that builds the model must have a contractual handoff to the team that will deploy it, and that handoff must include integration requirements, timeline commitments, and shared accountability for business outcomes. Without this contract, the handoff is aspirational and the deployment is uncertain.

The third lesson: ML projects need two success criteria. The first is model quality — does the model make accurate predictions. The second is deployment quality — does the model’s output reach the people and systems that can act on it. A project that passes the first and fails the second has failed.

The decision heuristic

Before starting an ML project, identify the team that will integrate the model into the operational workflow and confirm that they have the capacity, the budget, and the accountability to do so. If you cannot identify this team, or if they cannot commit to a timeline, do not start the model development. The model will reach completion and sit on a shelf. Fund the deployment or do not fund the project. There is no middle ground where a technically excellent model creates its own deployment momentum. It does not happen.

Shipping a production AI system?

Find the control gaps before they turn into incidents. Take the AI Production Scorecard for a fast baseline across the seven layers, or book an architecture review and we will turn it into a hardening plan.

Take the AI Production Scorecard Book an Architecture Review

This comment section requires JavaScript.

Enable JavaScript in your browser to use this feature.

Similar Articles

DataOps MLOps

MLOps vs DataOps: Understanding the Differences and Overlaps

08 Feb, 2024 | 03 Mins read

DataOps and MLOps both aim to improve reliability and efficiency in data-centric workflows, but they address different parts of the data science lifecycle. Understanding their boundaries helps organiz

Case Study AI Infrastructure

How a retailer reduced inference latency 90% with feature store caching

21 Apr, 2026 | 04 Mins read

A mid-market e-commerce retailer with roughly $200M in annual revenue had invested eighteen months building a product recommendation engine. The models were accurate. Offline evaluation showed meaning

Case Study Data Architecture

The data pipeline that cost $50K/month — and the audit that found why

22 Apr, 2026 | 04 Mins read

A financial services firm running analytics on trade settlement data came to us with a specific complaint: their cloud data platform cost had tripled in eighteen months, and nobody could explain why.

Case Study Data Architecture

Migrating from batch to streaming: a 6-month journey

28 Apr, 2026 | 05 Mins read

A logistics company processing two million shipments per day ran their entire operational reporting stack on nightly batch ETL. Every morning at 6 AM, operations managers reviewed dashboards built on

Case Study Knowledge Layer

When RAG failed: a knowledge retrieval project post-mortem

29 Apr, 2026 | 05 Mins read

A legal technology company had invested six months building a retrieval-augmented generation system to help contract attorneys find relevant precedent clauses across a corpus of 180,000 executed agree

Case Study Data Architecture

From 3-hour dashboards to 3-minute insights: a BI modernization story

05 May, 2026 | 05 Mins read

A manufacturing company with facilities in twelve countries ran its operational reporting on a traditional BI stack: a data warehouse, an ETL pipeline, and a dashboard tool that had been deployed six

Case Study AI Infrastructure

The vector database that couldn't scale — and what we did instead

12 May, 2026 | 05 Mins read

A media company with a library of twelve million articles, transcripts, and research documents had built a semantic search system on a managed vector database. The system was designed to let journalis

Case Study AI Infrastructure

Building an AI operating system for a 10,000-person company

19 May, 2026 | 05 Mins read

A diversified industrial company with 10,000 employees across manufacturing, logistics, and field services had accumulated forty-seven separate AI projects over three years. Each business unit had bui

Tooling MLOps

Feature store comparison: Feast, Tecton, Hopsworks

20 May, 2026 | 05 Mins read

Feature stores solve a specific problem: the features you use to train a model must be the same features you use to serve it. When the training pipeline computes features differently than the serving

Case Study Data Architecture

How we killed our ETL pipeline (and productivity went up)

26 May, 2026 | 05 Mins read

A B2B SaaS company running a customer success platform had a data pipeline that consumed sixty percent of the data engineering team's time. Not feature work. Not analytics. Pipeline maintenance. The p

Case Study AI Governance

A compliance-first AI rollout in financial services

03 Jun, 2026 | 05 Mins read

A regional bank with $12 billion in assets wanted to use machine learning to improve its commercial loan underwriting process. The existing process was manual, relying on credit analysts who spent fou

Tooling MLOps

Model serving: vLLM, TGI, Triton — which fits your stack?

18 Jun, 2026 | 05 Mins read

Serving a language model in production is an infrastructure problem, not a model problem. The model weights are the same regardless of how you serve them. What differs is throughput (how many requests

Case Study Data Architecture

Data mesh in practice: year 2 retrospective

16 Jun, 2026 | 05 Mins read

An insurance company with $400 million in premium volume adopted data mesh two years ago. The central data team had become a bottleneck. Every business unit — claims, underwriting, actuarial, and dist

Tooling MLOps

CI/CD for ML: MLflow vs Weights & Biases vs Neptune

25 Jun, 2026 | 05 Mins read

Machine learning teams face a version control problem that Git does not solve. Git tracks code changes, but ML experiments change more than code — they change hyperparameters, datasets, model architec

Case Study AI Infrastructure

When your AI vendor goes bankrupt — surviving platform lock-in

23 Jun, 2026 | 05 Mins read

A healthcare analytics company received notice on a Tuesday afternoon that their primary AI infrastructure vendor was filing for Chapter 7 bankruptcy. The platform hosted their patient risk stratifica

Case Study AI Infrastructure

Real-time fraud detection: from proof-of-concept to production in 90 days

30 Jun, 2026 | 05 Mins read

A payment processor handling twelve million transactions per day had a fraud detection system that was accurate but slow. The system reviewed transactions in batch, four times per day. A fraudulent tr

Case Study Knowledge Layer

Consolidating 47 data sources into one knowledge layer

01 Jul, 2026 | 05 Mins read

A global professional services firm with 8,000 consultants maintained institutional knowledge across forty-seven separate systems. Project proposals lived in a document management system. Client engag

Case Study AI Governance

The GDPR audit that reshaped our entire ML pipeline

07 Jul, 2026 | 05 Mins read

A European fintech with twelve million customers received a GDPR audit notice from their national data protection authority. The audit focused on the company's machine learning pipeline, which powered

Case Study AI Governance

How a healthcare org deployed LLMs without violating HIPAA

14 Jul, 2026 | 05 Mins read

A hospital system with twelve facilities and 14,000 clinical staff wanted to use large language models to assist with clinical documentation. Physicians spent an average of two hours per day on docume

Case Study Data Architecture

Legacy mainframe to cloud-native: the data migration they said was impossible

21 Jul, 2026 | 06 Mins read

An insurance company running on an IBM mainframe had accumulated forty years of policy data in VSAM files and DB2 tables. The mainframe processed 600,000 transactions per day across policy administrat

Case Study AI Governance

Building trust in AI recommendations — the change management story

28 Jul, 2026 | 06 Mins read

A consumer goods company built an AI system that recommended reorder quantities for 12,000 SKUs across 340 distribution points. The system optimized for a multi-objective function that balanced invent

MLOps Infrastructure

Scaling Machine Learning Infrastructure: From POC to Production

10 May, 2024 | 04 Mins read

# Scaling Machine Learning Infrastructure: From POC to Production Moving a machine learning model from notebook to production exposes gaps that notebooks hide. Data scientists produce working models

MLOps Kubernetes

Deploying ML Models on Kubernetes: Best Practices

06 May, 2024 | 03 Mins read

# Deploying ML Models on Kubernetes: Best Practices ML models in production need orchestration, scaling, and monitoring infrastructure. Kubernetes provides these capabilities, though the learning cur

Machine Learning MLOps

Incremental ML: Continuous Learning Systems

12 Jul, 2024 | 11 Mins read

Traditional ML trains on historical data, deploys, and waits until performance degrades. This fails in dynamic environments where data patterns evolve. Incremental ML continuously updates models as ne

Serverless MLOps

Serverless Machine Learning: Patterns with AWS Lambda, GCP Cloud Run & Azure Functions

18 Jul, 2025 | 05 Mins read

A social media analytics company watched their Kubernetes cluster fail to handle traffic spikes from trending topics. The cluster would scale from 50 to 500 pods in minutes, but not fast enough to pre

Observability MLOps

AI Observability: Monitoring Drift, Data Quality & Model Performance

12 Sep, 2025 | 02 Mins read

An insurance company's premium pricing model had been quietly going haywire for two weeks. Young drivers in high-risk areas were getting bargain prices while safe drivers faced astronomical quotes. By

Case Study RAG

Case Study: End-to-End RAG Platform for Customer Support

05 Dec, 2025 | 05 Mins read

A SaaS company with 200 support agents and 10,000+ knowledge base articles had an 18-hour average response time and 23% first-contact resolution. Their largest enterprise client threatened to cancel a

Knowledge Layer Case Study

Case Study: Building a Production AI Knowledge Layer for Financial Services

01 Mar, 2026 | 10 Mins read

A regional bank's investment research team spent 60% of their time gathering information and 40% doing analysis. Analysts had to search through regulatory filings, internal research memos, market data

Agent Orchestration Case Study

Case Study: Multi-Agent System for Supply Chain Optimization

13 Jun, 2026 | 12 Mins read

A mid-size automotive parts manufacturer with operations spanning 15 countries and relationships with over 200 suppliers faced a supply chain coordination problem that was consuming too much of their