Simor

Data Infrastructure for Production AI

Practical writing on AI data engineering, feature stores, and the infrastructure choices that determine whether AI systems work in production.

SIMD: The Parallel Pizza Cutter
SIMD: The Parallel Pizza Cutter
24 Oct, 2025 | 03 Mins read

Picture a pizza shop on Friday night. Method one: single pizza cutter, cut one line at a time, eight cuts for eight slices. Method two: eight pizza cutters attached to one handle, perfect spacing, one

Multimodal AI Systems: Combining Text, Image & Audio Data
Multimodal AI Systems: Combining Text, Image & Audio Data
24 Oct, 2025 | 06 Mins read

Human communication is multimodal: we gesture while speaking, draw diagrams while explaining, and understand meaning through the interplay of sensory inputs. Yet most AI systems operate in silos—compu

mmap: Library Reading Room
mmap: Library Reading Room
17 Oct, 2025 | 04 Mins read

Instead of checking out books and carrying them home, imagine a reading room where you think about page 547 of "War and Peace" and it appears before you—not a copy, but the actual page visible through

Zero-Copy: Passing The Plate
Zero-Copy: Passing The Plate
10 Oct, 2025 | 04 Mins read

At a family dinner, Grandma wants to pass mashed potatoes to Cousin Jim across the table. The inefficient approach: Grandma scoops potatoes onto her plate, passes to Uncle Bob, who scoops onto his pla

mTLS: Secret Handshake
mTLS: Secret Handshake
03 Oct, 2025 | 04 Mins read

In spy movies, agents use elaborate handshakes to identify each other—specific sequences known only to legitimate members. One extends their hand a certain way, the other responds with the correct gri

Fine-Tuning LLMs: Parameter-Efficient Techniques (LoRA, QLoRA, PEFT)
Fine-Tuning LLMs: Parameter-Efficient Techniques (LoRA, QLoRA, PEFT)
03 Oct, 2025 | 05 Mins read

Fine-tuning a 70B parameter model costs $50K+ and requires weeks of training on expensive hardware. This is the reality for teams building domain-specific language models. Traditional full-parameter f

Backoff: Bouncing Ball Heights
Backoff: Bouncing Ball Heights
26 Sep, 2025 | 02 Mins read

Drop a rubber ball from shoulder height. It bounces back, but not as high. Each bounce is lower than the last—vigorous at first, then gradually settling, until it barely leaves the ground before final

Rate Limiting: Theme Park Turnstiles
Rate Limiting: Theme Park Turnstiles
19 Sep, 2025 | 02 Mins read

Disney World on a summer morning. Thousands of families rushing toward gates. Without control, it would be a stampede. Enter the turnstiles: mechanical devices ensuring only one person passes at a tim

Event-Driven Architectures for Real-Time Analytics
Event-Driven Architectures for Real-Time Analytics
19 Sep, 2025 | 02 Mins read

A food delivery platform's real-time dashboard froze during Friday dinner rush. Restaurants could not see incoming orders. Dispatchers could not assign drivers. Customer service was blind to delivery