SRE Weekly - SRE Weekly Issue #228

View on sreweekly.com

SRE From Home is back! It’s happening this Thursday, and I’ll be on the Ask an SRE panel answering your questions. And don’t miss the talks by lots of great folks, some of whom have had articles featured here previously!

A message from our sponsor, StackHawk:

StackHawk is built on the open source ZAP application security scanner, the most widely used AppSec tool out there. Now the founder of ZAP has joined our team to bring AppSec to developers. Read all about it.
https://www.stackhawk.com/blog/zap-founder-decides-to-join-stackhawk?utm_source=SREWeekly

Articles

They don’t. They just don’t.

[…] as deployments grow beyond a certain size it’s almost impossible to execute them successfully.

Alex Yates — Octopus Deploy

Whoops, forgot to include this one last week.

On June 30, Google’s email delivery service was targeted in what we believe was an attempt to bypass spam classification. The result was delayed message processing and increased message queuing.

My favorite part is the focus on blame awareness:

But it’s not enough to just be blameless—it’s also important to be blame-aware. Being blame-aware means that we are aware of our biases and how they may impact our ability to view an incident impartially.

Isabella Pontecorvo — PagerDuty

Netflix has a team dedicated to the overall reliability of their service.

Practically speaking, this includes activities such as systemic risk identification, handling the lifecycle of an incident, and reliability consulting.

Hank Jacobs– Netflix

Another good reference if you’re looking to bootstrap SRE at your organization.

Rich Burroughs — FireHydrant

Bill Duncan’s back with an easy and very close approximation for the “Tail at Scale” formula. The question it answers is: how many nines do you need on all of your backend microservices for X nines on the frontend?

Bill Duncan

Tons of great links in here with enticing descriptions to make you want to read them. Includes books, tools, hiring, certification, and general SRE goodness.

Emily Arnot — Blameless

SRE is all about keeping the user experience working, and working with product-focused folks can really help. For more on this, check out my former coworker Jen Wohlner’s awesome SRECon19 talk on SRE & product management.

Samantha Coffman — HelloFresh

Outages

  • Cloudflare
    • Cloudflare had a 50% drop in traffic served by their network subsequent to a BGP issue. Linked is their analysis including snippets of router configurations. Lots of services suffered contemporaneous outages possibly stemming from Cloudflare’s, including Discord, Postmates, Hosted Graphite, and DownDetector.John Graham-Cumming — Cloudflare
  • Twitter
    • Twitter had a major security breach, and as part of their response, they temporarily cut off large parts of their service. Click for their post about what happened.
  • GitHub
  • WhatsApp
  • Hulu
  • Snapchat
  • Microsoft Outlook
    • Notably, the outage involved the Outlook application that people run on their computer, not the cloud version.
  • Fastly






This email was sent to you
why did I get this?    unsubscribe from this list    update subscription preferences
SRE Weekly · PO Box 253 · South Lancaster, MA 01561-0253 · USA

Older messages

SRE Weekly Issue #227

Monday, July 13, 2020

View on sreweekly.com A message from our sponsor, StackHawk: When a team introduces security bugs, they don't know because nothing tells them. We test for everything else… why not security bugs?

SRE Weekly Issue #226

Monday, July 6, 2020

View on sreweekly.com A message from our sponsor, StackHawk: When a team introduces security bugs, they don't know because nothing tells them. We test for everything else… why not security bugs?

SRE Weekly Issue #225

Monday, June 29, 2020

View on sreweekly.com A message from our sponsor, StackHawk: Application security is shifting to a model where the engineers who write the code also take ownership of the security. Read our docs to

SRE Weekly Issue #224

Monday, June 22, 2020

View on sreweekly.com Happy Juneteenth (a couple days late)! Let's all work to strengthen the SRE profession by working to improve inclusion and diversity. A message from our sponsor, StackHawk: Do

SRE Weekly Issue #223

Monday, June 15, 2020

View on sreweekly.com A message from our sponsor, StackHawk: DevSecCon24 starts tonight at 10pm ET and runs for 24 hours. Tune in for great talks on building and deploying secure, resiliant software.

You Might Also Like

Software Testing Weekly - Issue 261

Wednesday, March 12, 2025

New AI solutions for testing 👀 View on the Web Archives ISSUE 261 March 12th 2025 COMMENT Welcome to the 261st issue! I have nothing more to add to the genuinely great news that came out recently. I

JSK Daily for Mar 11, 2025

Tuesday, March 11, 2025

JSK Daily for Mar 11, 2025 View this email in your browser A community curated daily e-mail of JavaScript news How to Enforce Type Safety in FormData with TypeScript When working with the FormData

Binary Data, Tail Calls, Pickles, and More

Tuesday, March 11, 2025

Bytes Objects: Handling Binary Data in Python #672 – MARCH 11, 2025 VIEW IN BROWSER The PyCoder's Weekly Logo Bytes Objects: Handling Binary Data in Python In this tutorial, you'll learn about

Shaking The Wasp’s Nest 🐝

Tuesday, March 11, 2025

How Gamergate swarmed into our online lives. Here's a version for your browser. Hunting for the end of the long tail • March 11, 2025 Today In Tedium: You probably have noticed, just like me, that

Daily Coding Problem: Problem #1714 [Easy]

Tuesday, March 11, 2025

Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. You are given an N by N matrix of random letters and a dictionary of words. Find

Mapped | The State of Democracy Around the World 🌐

Tuesday, March 11, 2025

After a historic election year, we show the state of democracy worldwide as it declines to its lowest level in two decades. View Online | Subscribe | Download Our App NEW REPORT: The Age of Data >

Stories, Free Tool & CRM Template

Tuesday, March 11, 2025

Notion stories, smart tools, and a free template to organize your contacts 🔥 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏

LW 173 - How to become a Shopify Developer in 2025

Tuesday, March 11, 2025

How to become a Shopify Developer in 2025 ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ Shopify Development news and articles Issue 173 - 03/11

This free AI tool beats Perplexity

Tuesday, March 11, 2025

Ubuntu vs. Debian; The new HR; YouTube randomizer -- ZDNET ZDNET Tech Today - US March 11, 2025 webfeetgettyimages-10141124 DuckDuckGo's AI beats Perplexity in one big way - and it's free to

⚙️ AI bubble bursts (?)

Tuesday, March 11, 2025

Plus: We talk to the CEO of Read AI ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌ ‌