Simor

Data Infrastructure for Production AI

Practical writing on AI data engineering, feature stores, and the infrastructure choices that determine whether AI systems work in production.

RAG frameworks head-to-head: LlamaIndex vs Haystack vs Semantic Kernel
RAG frameworks head-to-head: LlamaIndex vs Haystack vs Semantic Kernel
04 Jun, 2026 | 05 Mins read

Retrieval-augmented generation is simple in theory: retrieve relevant documents, stuff them into a prompt, get a grounded answer. In practice, the retrieval step is where most RAG applications fail. T

A compliance-first AI rollout in financial services
A compliance-first AI rollout in financial services
03 Jun, 2026 | 05 Mins read

A regional bank with $12 billion in assets wanted to use machine learning to improve its commercial loan underwriting process. The existing process was manual, relying on credit analysts who spent fou

Responsible AI by Design: Integrating Ethics into AI Architecture
Responsible AI by Design: Integrating Ethics into AI Architecture
02 Jun, 2026 | 09 Mins read

Responsible AI is not a checklist you complete before deployment. It is a set of architectural decisions that you make throughout the design process, each of which involves trade-offs that are real an

The paradox of AI automation: more tools, less productivity?
The paradox of AI automation: more tools, less productivity?
01 Jun, 2026 | 05 Mins read

A data engineering team I worked with had adopted six AI-powered tools in twelve months. An automated code reviewer, a data quality scanner, a pipeline orchestrator with intelligent retry, a natural l

Migration playbook: batch to streaming in 5 phases
Migration playbook: batch to streaming in 5 phases
31 May, 2026 | 06 Mins read

The case for streaming is straightforward: data that arrives in minutes instead of hours enables decisions that were previously impossible. Fraud detection catches transactions before they clear. Pers

The great model commoditization: what happens when everyone has GPT-5
The great model commoditization: what happens when everyone has GPT-5
30 May, 2026 | 03 Mins read

OpenAI shipped GPT-5. Anthropic shipped Claude 4. Google shipped Gemini Ultra 2. Within six weeks of each other, the three leading model providers released frontier models that are, by most benchmarks

Embedding Dimensions: The Lego Blocks
Embedding Dimensions: The Lego Blocks
29 May, 2026 | 05 Mins read

Lego blocks come in standard sizes. A 2x4 stud configuration connects with other 2x4 configurations. A 1x2 connects with other 1x2s. The shape determines which pieces fit together. You do not need to

The observability stack: Datadog vs Grafana vs Monte Carlo
The observability stack: Datadog vs Grafana vs Monte Carlo
28 May, 2026 | 05 Mins read

Observability is not one problem — it is three. Infrastructure observability watches your servers, containers, and network. Application observability watches your code, APIs, and user-facing behavior.

AI spending is up 300% — where is it actually going?
AI spending is up 300% — where is it actually going?
27 May, 2026 | 03 Mins read

Enterprise AI spending increased roughly 300% year-over-year according to multiple industry surveys released this quarter. The headline number gets attention, but the breakdown is where the actionable