SRE Weekly - SRE Weekly Issue #292

View on sreweekly.com

A message from our sponsor, Rootly:

Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right team, postmortem timeline, setting up reminders, and more. Book a demo:
https://rootly.io/?utm_source=sreweekly

Articles

The lessons:

  1. Acknowledge human error as a given and aim to compensate for it
  2. Conduct blameless post-mortems
  3. Avoid the “deadly embrace”
  4. Favor decentralized IT architectures

There have been quite a few of these “lessons learned” articles that I’ve passed over, but I feel like this one is worth reading.

Anurag Gupta — Shoreline.io

Niall Murphy

Could us-east-1 go away? What might you do about it? Let’s catastrophize!

I love catastrophizing!

Tim Bray

When evaluating options, this article focuses on reliability, both of the service itself and the options it provides for building reliable services on it.

Quentin Rousseau — Rootly

This article is published by my sponsor, Rootly, but their sponsorship did not influence its inclusion in this issue.

This one answers the questions: what are failure domains, and how can we structure them to improve reliability?

brandon willett

It’s a great list of questions, and it covers a lot of ground. SREs wear many hats.

Opsera

I’ve always been curious about how Prometheus and similar time-series DBs compress metric data. Now I know!

Alex Vondrak — Honeycomb

This one has some unconfirmed (but totally plausible!) deeper details about what might have gone wrong in the Facebook outage, sourced from rumors.

rachelbythebay

There’s a really intriguing discussion in here about why organizations might justify a choice of profit at the expense of safety, and how the deck is stacked.

Rob Poston

Outages







This email was sent to you
why did I get this?    unsubscribe from this list    update subscription preferences
SRE Weekly · PO Box 253 · South Lancaster, MA 01561-0253 · USA

Older messages

SRE Weekly Issue #291

Monday, October 11, 2021

View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right

SRE Weekly Issue #290

Monday, October 4, 2021

View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right

SRE Weekly Issue #289

Monday, September 27, 2021

View on sreweekly.com A message from our sponsor, StackHawk: Semgrep and StackHawk are showing you what's new with automated security testing on September 30. Grab your spot: https://sthwk.com/

SRE Weekly Issue #288

Monday, September 20, 2021

View on sreweekly.com A message from our sponsor, StackHawk: Want to see what's new with automated security tooling? Tune in on September 30 to see how StackHawk and Semgrep are making it possible

SRE Weekly Issue #287

Monday, September 13, 2021

View on sreweekly.com A message from our sponsor, StackHawk: Trying to figure out how to keep your APIs secure? You're not the only one. See how DataRobot is automating API security testing with

You Might Also Like

Weekend Reading — The Agentic AI revolution

Saturday, March 8, 2025

Street Art Utopia “Darth Fisher sculpture by Frankey for Amsterdam Light Festival in Amsterdam, The Netherlands. Photos by Janus van den Eijnden.” Tech Stuff Claude Code overview The Agentic AI

Daily Coding Problem: Problem #1712 [Easy]

Saturday, March 8, 2025

Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. Given the root of a binary search tree, and a target K , return two nodes in the

⚙️ AI agents are failing the compliance industry

Saturday, March 8, 2025

March 08, 2025 | Read Online Subscribe | Advertise Good Morning. Welcome to this special edition of The Deep View, brought to you in collaboration with Delve. 2025 is the year of AI agents. While

Ranked | The World's Best-Selling Cars From 2024 🚗

Saturday, March 8, 2025

Which vehicles are dominating global sales? See our latest graphic ranking the world's best-selling cars from 2024. View Online | Subscribe | Download Our App See new charts from hundreds of

🐍 New Python tutorials on Real Python

Saturday, March 8, 2025

Hey there, There's always something going on over at Real Python as far as Python tutorials go. Here's what you may have missed this past week: Quiz: What Is the Python Global Interpreter Lock

Want to get ready for how AI will change your career?

Saturday, March 8, 2025

Professionals and parents ask me this a lot ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏

FIN7, FIN8, and Others Use Ragnar Loader for Persistent Access and Ransomware Operations

Saturday, March 8, 2025

THN Daily Updates Newsletter cover ⚡ LIVE WEBINAR ➟ ASPM: The Future of AppSec -- Boom or Bust? Discover How ASPM is Redefining Application Security with Smarter, Unified Solutions. Download Now

🍿 Will We Ever Own Movies Again? — How Accurate Are Cell Phone Network Coverage Maps

Saturday, March 8, 2025

Also: 6 Iconic Games You Didn't Know Were Actually Knockoffs How-To Geek Logo March 8, 2025 Did You Know Although best known for her role as Trinity in the iconic 1999 sci-fi film The Matrix, that

2.4M new accounts in Feb

Saturday, March 8, 2025

YouTube for enterprise onboarding, new UX audits grants program, OG Tracker Feb report, and more ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏

SWLW #641: The Meeting Manifesto, Avoiding Strategy Drift, and more.

Saturday, March 8, 2025

Weekly articles & videos about people, culture and leadership: everything you need to design the org that makes the product. A weekly newsletter by Oren Ellenbogen with the best content I found