Simor

Data Infrastructure for Production AI

Practical writing on AI data engineering, feature stores, and the infrastructure choices that determine whether AI systems work in production.

The data pipeline that cost $50K/month — and the audit that found why
The data pipeline that cost $50K/month — and the audit that found why
22 Apr, 2026 | 04 Mins read

A financial services firm running analytics on trade settlement data came to us with a specific complaint: their cloud data platform cost had tripled in eighteen months, and nobody could explain why.

How a retailer reduced inference latency 90% with feature store caching
How a retailer reduced inference latency 90% with feature store caching
21 Apr, 2026 | 04 Mins read

A mid-market e-commerce retailer with roughly $200M in annual revenue had invested eighteen months building a product recommendation engine. The models were accurate. Offline evaluation showed meaning

Why most AI transformations fail (it's not the technology)
Why most AI transformations fail (it's not the technology)
20 Apr, 2026 | 04 Mins read

The CTO of a mid-size financial services firm told me they had spent $4 million on AI tooling in eighteen months. They had three large language model providers under contract, a vector database cluste

Knowledge Graphs and Vector Search: Complementary, Not Competitive
Knowledge Graphs and Vector Search: Complementary, Not Competitive
19 Apr, 2026 | 11 Mins read

The framing of knowledge graphs versus vector databases as competing technologies is a symptom of hype cycles that simplify complex architectural decisions for public discourse. Practitioners argue ab

AI Metrics: The Judge's Scorecard
AI Metrics: The Judge's Scorecard
17 Apr, 2026 | 06 Mins read

Figure skating judges do not give one score. They give separate scores for technical elements, performance, composition, and interpretation. Each dimension captures something different. A skater can l

Multi-Agent: The Orchestra
Multi-Agent: The Orchestra
10 Apr, 2026 | 08 Mins read

An orchestra does not have one musician playing everything. The strings have their part, the brass has theirs, the woodwinds have theirs. They do not all play the same notes. They play different notes

Evaluating LLM Providers for Enterprise: A Framework Beyond Benchmark
Evaluating LLM Providers for Enterprise: A Framework Beyond Benchmark
08 Apr, 2026 | 10 Mins read

Benchmark scores tell you how a model performs on problems that someone else chose. Your enterprise systems present different problems: your proprietary terminology, your specific data distributions,

Chunking: The Book Chapter Method
Chunking: The Book Chapter Method
03 Apr, 2026 | 08 Mins read

You have a 600-page book on regulatory compliance. You do not read it front to back. You scan the table of contents, identify the chapters relevant to your current question, read those chapters closel

Tool Calling and Function Calling: Connecting AI to Enterprise Systems
Tool Calling and Function Calling: Connecting AI to Enterprise Systems
28 Mar, 2026 | 14 Mins read

A language model that only generates text is not enough for most enterprise problems. The real value emerges when an AI system can look up your customer record, check inventory levels across warehouse