_d
devops/ness
Blog
Reading ListAbout
Subscribe

Blog

Practical articles on AI, DevOps, Cloud, Linux, and infrastructure engineering.

Tag: #sreClear filters
Cloud Disaster Recovery Runbook Design: How Small Teams Rehearse Multi-Region Failover
••0 months ago

Cloud Disaster Recovery Runbook Design: How Small Teams Rehearse Multi-Region Failover

A practical disaster recovery runbook guide for small cloud teams that need realistic failover steps, clear ownership, and repeatable rehearsals instead of shelfware documents.

KU
Kiril urbonas
Read article
End-of-Week Engineering: Why Smart Tech Teams Don’t Ship Major Changes on Friday
••last month

End-of-Week Engineering: Why Smart Tech Teams Don’t Ship Major Changes on Friday

A practical risk-management framework for release timing, Friday deployment policies, progressive delivery, and how elite teams protect reliability and people.

KU
Kiril Urbonas
Read article
SRE Error Budgets in Practice: Shipping Fast Without Burning Reliability
••last month

SRE Error Budgets in Practice: Shipping Fast Without Burning Reliability

A practical way to define SLOs and error budgets, connect them to release decisions, and avoid reliability debates without data.

KU
Kiril Urbonas
Read article
3 posts