Tagged articles
405 articles
Page 5 of 5
MaGe Linux Operations
MaGe Linux Operations
May 27, 2016 · Operations

Why Google Relies on Software Engineers to Run Its Services: Inside SRE

The article explains Google’s Site Reliability Engineering (SRE) philosophy, how it empowers software engineers to automate operations, the balance between development and reliability, the concept of error budgets, and the cultural shift that turned DevOps into a core practice for large‑scale services.

DevOpsError BudgetOperations Automation
0 likes · 10 min read
Why Google Relies on Software Engineers to Run Its Services: Inside SRE
Efficient Ops
Efficient Ops
Jul 27, 2015 · Operations

What Google SREs Do: Inside the Role that Powers Reliable Services

This article explains the responsibilities, requirements, and daily work of Google Site Reliability Engineers, contrasts them with Software Engineers, outlines key internal infrastructure components, and discusses the future direction of operations engineering in the cloud era.

GoogleInfrastructureOperations
0 likes · 11 min read
What Google SREs Do: Inside the Role that Powers Reliable Services
MaGe Linux Operations
MaGe Linux Operations
Apr 28, 2015 · Operations

How Yelp Achieved Zero‑Downtime HAProxy Reloads Using Linux qdisc

Yelp’s infrastructure team tackled HAProxy’s reload‑induced packet loss by leveraging Linux’s plug qdisc and iptables to delay SYN packets during reloads, enabling zero‑downtime service updates and improving reliability despite the kernel’s brief binding window.

HAProxyLinux qdiscNetwork Traffic Control
0 likes · 7 min read
How Yelp Achieved Zero‑Downtime HAProxy Reloads Using Linux qdisc