SRE Weekly Issue #283
I’m on vacation enjoying the sunny beaches in Maine with my family, so I prepared this week’s issue in advance. No outages section, save for one big one I noticed due to direct personal experience. See you all next week!
Articles
We needed a way to deploy our new service seamlessly, and to roll back that deploy should something go wrong. Ultimately many, many things did go wrong, and every bit of failure tolerance put into the system proved to be worth its weight in gold because none of this was visible to customers.
Geoffrey Plouviez — Cloudflare
I especially like the idea of tailoring retrospective documents to disparate audiences — you may have more than you realize.
Emily Arnott — Blameless
An analysis of two incidents from the venerable John Allspaw. These are from 2012 back when he was at Etsy, and yet there’s still a ton we can learn now by reading them.
John Allspaw — Etsy
An account of how Gojek responds to production issues, and why the RCA is a critical part of the process.
Sooraj Rajmohan — Gojek
Type carefully… or rather, design resilient systems.
JJ Tang — Rootly
Requiring development teams to fully own their services can lead to siloing and redundancy. Heroku works to ameliorate that by embedding SREs in development teams.
Johnny Boursiquot — Salesforce (presented at QCon)
I’ve shared some articles here suggesting doing away with incident metrics like MTTR entirely. This author says that they are useful, but the numbers must be properly contextualized.
Vanessa Huerta Granda — Learning From Incidents
Everything could be fine, or we could be failing to report or missing problems altogether: we’re flying blind.
Chris Evans — incident.io
Outages