Practical articles on AI, DevOps, Cloud, Linux, and infrastructure engineering.
A field report from rolling out retrieval-augmented generation in production, including cache bugs, bad embeddings, and how we fixed them.
Use prompts to get reliable, safe outputs from LLMs for runbooks, code, and ops tasks.
LLM Gateway Design for Multi-Provider Inference. Practical guidance for reliable, scalable platform operations.