SRE Weekly - SRE Weekly Issue #222

A message from our sponsor, StackHawk:

The last thing we need is more noise from more tooling. With the new Findings Management feature, you can add AppSec tests to your CI pipeline without being innundated with alerts.
https://www.stackhawk.com/blog/appsec-findings-management?utm_source=SREWeekly

Articles

This article in a nutshell:

Kolton Andrus — Gremlin

I hadn’t heard of this distinction before. If you haven’t either, click through to find out more.

Ayende Rahien — RavenDB

In our experience, the three big sources of production stress are:

  • Toil
  • Bad monitoring
  • Immature incident handling procedures

Cheryl Kang — Google

ProPublica picks apart the incident in exhaustive detail, showing how multiple problems interwoven in the organization contributed to this tragedy.

Robert Faturechi, Megan Rose and T. Christian Miller — ProPublica

There’s a great review of Rasmussen’s safety boundary model, which I wasn’t previously familiar with. A system moves between three boundaries:

  • the boundary to economic failure
  • the boundary of unacceptable work load
  • the boundary of functionally acceptable performance

Lorin Hochstein

This one includes a really nifty graph showing how reliable your N backend microservices need to be in order to hit a given reliability target R.

Bill Duncan

Here are the results of the survey I linked here a couple weeks ago. There are some interesting and surprising results, well worth a read.

Rich Burroughs — FireHydrant

A commonly-used CA’s Root certificate expired, causing some havoc. Even though Sectigo did everything right, some software didn’t handle the transition to the new root well.

Paul Ducklin — Naked Security

Outages







This email was sent to you
        
SRE Weekly · PO Box 253 · South Lancaster, MA 01561-0253 · USA

Older messages

SRE Weekly Issue #221

Monday, June 1, 2020

View on sreweekly.com Don't forget, Catchpoint's SRE From Home event is happening this Friday. The speaker list has some names you'll recognize from articles linked here in previous issues.

SRE Weekly Issue #220

Monday, May 25, 2020

View on sreweekly.com A message from our sponsor, StackHawk: Hi, SRE Weekly. We're your new newsletter sponsor, StackHawk. We believe that application security is an important part of reliability

SRE Weekly Issue #219

Monday, May 18, 2020

View on sreweekly.com Articles Download our new on-call book [Atlassian] Check out this new 100-page ebook on incident response from Atlassian, great for folks setting up a brand new on-call structure

SRE Weekly Issue #218

Monday, May 11, 2020

View on sreweekly.com Articles Checklists and Runbooks An airplane pilot's take on runbooks, by way of comparison to aviation checklists. Bill Duncan Old box, dumb code, few thousand connections,

SRE Weekly Issue #217

Monday, May 4, 2020

View on sreweekly.com A message from our sponsor, VictorOps: Our people and tools need to be connected now more than ever before. That's why VictorOps is offering free, 90-day extended Enterprise

You Might Also Like

Software Testing Weekly - Issue 261

Wednesday, March 12, 2025

New AI solutions for testing 👀 View on the Web Archives ISSUE 261 March 12th 2025 COMMENT Welcome to the 261st issue! I have nothing more to add to the genuinely great news that came out recently. I

JSK Daily for Mar 11, 2025

Tuesday, March 11, 2025

JSK Daily for Mar 11, 2025 View this email in your browser A community curated daily e-mail of JavaScript news How to Enforce Type Safety in FormData with TypeScript When working with the FormData

Binary Data, Tail Calls, Pickles, and More

Tuesday, March 11, 2025

Bytes Objects: Handling Binary Data in Python #672 – MARCH 11, 2025 VIEW IN BROWSER The PyCoder's Weekly Logo Bytes Objects: Handling Binary Data in Python In this tutorial, you'll learn about

Shaking The Wasp’s Nest 🐝

Tuesday, March 11, 2025

How Gamergate swarmed into our online lives. Here's a version for your browser. Hunting for the end of the long tail • March 11, 2025 Today In Tedium: You probably have noticed, just like me, that

Daily Coding Problem: Problem #1714 [Easy]

Tuesday, March 11, 2025

Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. You are given an N by N matrix of random letters and a dictionary of words. Find

Mapped | The State of Democracy Around the World 🌐

Tuesday, March 11, 2025

After a historic election year, we show the state of democracy worldwide as it declines to its lowest level in two decades. View Online | Subscribe | Download Our App NEW REPORT: The Age of Data >

Stories, Free Tool & CRM Template

Tuesday, March 11, 2025

Notion stories, smart tools, and a free template to organize your contacts 🔥 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏

LW 173 - How to become a Shopify Developer in 2025

Tuesday, March 11, 2025

How to become a Shopify Developer in 2025 ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ Shopify Development news and articles Issue 173 - 03/11

This free AI tool beats Perplexity

Tuesday, March 11, 2025

Ubuntu vs. Debian; The new HR; YouTube randomizer -- ZDNET ZDNET Tech Today - US March 11, 2025 webfeetgettyimages-10141124 DuckDuckGo's AI beats Perplexity in one big way - and it's free to

⚙️ AI bubble bursts (?)

Tuesday, March 11, 2025

Plus: We talk to the CEO of Read AI ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌