_d
devops/ness
Blog
Reading ListAbout
Subscribe

Blog

Practical articles on AI, DevOps, Cloud, Linux, and infrastructure engineering.

Tag: #monitoringClear filters
A Pragmatic Multi-Region Strategy for Small Teams
••last month

A Pragmatic Multi-Region Strategy for Small Teams

How a small team moved from single-region risk to a simple active/passive multi-region setup without doubling complexity.

KU
Kiril urbonas
Read article
How We Stopped Terraform Drift from Surprising On-Call
••last month

How We Stopped Terraform Drift from Surprising On-Call

A real story of removing console-only changes, adding drift detection, and getting Terraform back in charge.

KU
Kiril urbonas
Read article
A Pragmatic Multi-Region Strategy for Small Teams
••last month

A Pragmatic Multi-Region Strategy for Small Teams

How a small team moved from single-region risk to a simple active/passive multi-region setup without doubling complexity.

KU
Kiril urbonas
Read article
End-of-Week Engineering: Why Smart Tech Teams Don’t Ship Major Changes on Friday
••last month

End-of-Week Engineering: Why Smart Tech Teams Don’t Ship Major Changes on Friday

A practical risk-management framework for release timing, Friday deployment policies, progressive delivery, and how elite teams protect reliability and people.

KU
Kiril Urbonas
Read article
SRE Error Budgets in Practice: Shipping Fast Without Burning Reliability
••last month

SRE Error Budgets in Practice: Shipping Fast Without Burning Reliability

A practical way to define SLOs and error budgets, connect them to release decisions, and avoid reliability debates without data.

KU
Kiril Urbonas
Read article
Operational Checklist: AI Inference Cost Optimization
••last month

Operational Checklist: AI Inference Cost Optimization

AI Inference Cost Optimization. Practical guidance for reliable, scalable platform operations.

KU
Kiril Urbonas
Read article
How We Stopped Terraform Drift from Surprising On-Call
••last month

How We Stopped Terraform Drift from Surprising On-Call

A real story of removing console-only changes, adding drift detection, and getting Terraform back in charge.

KU
Kiril urbonas
Read article
Operational Checklist: SLO-Based Monitoring for APIs
••last month

Operational Checklist: SLO-Based Monitoring for APIs

SLO-Based Monitoring for APIs. Practical guidance for reliable, scalable platform operations.

KU
Kiril Urbonas
Read article
A Pragmatic Multi-Region Strategy for Small Teams
••last month

A Pragmatic Multi-Region Strategy for Small Teams

How a small team moved from single-region risk to a simple active/passive multi-region setup without doubling complexity.

KU
Kiril urbonas
Read article
Operational Checklist: Secure Container Supply Chain Controls
••last month

Operational Checklist: Secure Container Supply Chain Controls

Secure Container Supply Chain Controls. Practical guidance for reliable, scalable platform operations.

KU
Kiril Urbonas
Read article
How We Stopped Terraform Drift from Surprising On-Call
••last month

How We Stopped Terraform Drift from Surprising On-Call

A real story of removing console-only changes, adding drift detection, and getting Terraform back in charge.

KU
Kiril urbonas
Read article
A Pragmatic Multi-Region Strategy for Small Teams
••2 months ago

A Pragmatic Multi-Region Strategy for Small Teams

How a small team moved from single-region risk to a simple active/passive multi-region setup without doubling complexity.

KU
Kiril urbonas
Read article
Page 2 of 25 · 291 posts
Previous
123...25
Next