Practical articles on AI, DevOps, Cloud, Linux, and infrastructure engineering.
Prompt Versioning and Regression Testing. Practical guidance for reliable, scalable platform operations.
LLM Gateway Design for Multi-Provider Inference.
A field report from rolling out retrieval-augmented generation in production, including cache bugs, bad embeddings, and how we fixed them.
AI Inference Cost Optimization.