SRE Weekly - SRE Weekly Issue #416
What can we, in turn, learn from some of the most honest and blameless—and public—postmortems of the last few years?
They cover incidents from GitLab, Tarsnap, Roblox, and Cloudflare with great summaries and takeaways.
The Hacker News
My favorite part of this interview is when Vanessa describes parenting twin babies as constant incident response.
Shane Hastie — InfoQ
Here follow some lessons I’ve learned from the trenches in small start-ups and larger engineering teams, to improve your on-call shift experience and remediation time for production issues and make sure you’re spending on-call efforts on what has the most impact.
Alex Wauters
Doing your chaos experiments in a non-production environment can feel safer, but what are you giving up?
Sam Rossoff — Gremlin
Sometimes, shell is just the right tool for the job.
Amin Astaneh — Certo Modo
Catherine from Mastodon summarized this incident report beautifully:
this is one of the most violently unhinged CSB reports i’ve ever read […]
while investigating an explosion at a facility, CSB staff tried to prevent another explosion of the same kind in the same facility, and being unable to convince the workers to not cause it, ended up hiding behind a shipping container
U.S. Chemical Safety and Hazard Investigation Board
This one’s about why people tend to want a “SPoG” and what we should want instead. Bonus points for the Star Trek reference.
Nočnica Mellifera — Checkly
Right in the middle of migrating from one datacenter to an HA pair of new datacenters, one of the new ones failed. They had to quickly do a partial rollback of the migration to ride out the outage.
Gauthier François — Doctolib
Today, we are thrilled to announce the release of bpftop, a command-line tool designed to streamline the performance optimization and monitoring of eBPF programs.
Jose Fernandez — Netflix
|
Older messages
SRE Weekly Issue #415
Monday, March 11, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: Join FireHydrant and talk shop with your DevOps peers on March 28! You'll gain a better understanding of what makes a fatigue-free on-
SRE Weekly Issue #414
Monday, March 4, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: 91% of engineering leaders say they want a better alerting tool. The other 9% couldn't take the survey on their Blackberry. Meet
SRE Weekly Issue #1
Monday, February 26, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: Check out how global payments company Dock uses FireHydrant to streamline and consolidate their incident management stack and reduce what
SRE Weekly Issue #413
Monday, February 26, 2024
View on sreweekly.com Sorry about the automation fail and resend! That definitely wasn't issue #1. A message from our sponsor, FireHydrant: Check out how global payments company Dock uses
SRE Weekly Issue #412
Monday, February 19, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: FireHydrant's new and improved MTTX analytics dashboard is here! See which services are most affected by incidents, where they take
You Might Also Like
This Week in Rust #588
Saturday, March 1, 2025
Email isn't displaying correctly? Read this e-mail on the Web This Week in Rust issue 588 — 26 FEB 2025 Hello and welcome to another issue of This Week in Rust! Rust is a programming language
WebAIM February 2025 Newsletter
Friday, February 28, 2025
WebAIM February 2025 Newsletter Read this newsletter online at https://webaim.org/newsletter/2025/february Feature Global Digital Accessibility Salary Survey Results The results of the WebAIM and GAAD
JSK Daily for Feb 28, 2025
Friday, February 28, 2025
JSK Daily for Feb 28, 2025 View this email in your browser A community curated daily e-mail of JavaScript news Introducing the New Angular TextArea Component It is a robust and flexible user interface
Daily Coding Problem: Problem #1704 [Medium]
Friday, February 28, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Amazon. At a popular bar, each customer has a set of favorite drinks, and will happily
iOS Dev Weekly – Issue 701
Friday, February 28, 2025
What does Dave write about when he has a fever? 🤒 Let's find out!
Feature | The Best Visualizations from February on Voronoi 🏆
Friday, February 28, 2025
See the most popular, most discussed, and most liked visualizations on our new data storytelling app Voronoi from February. View Online | Subscribe About a year ago, we launched Voronoi, our free new
Issue #582: Phaser Launcher, DOOM in TypeScript types, and A Prison for Dreams
Friday, February 28, 2025
View this email in your browser Issue #582 - February 28th 2025 Weekly newsletter about Web Game Development. If you have anything you want to share with our community please let me know by replying to
Stop Android photo surveillance 🔍
Friday, February 28, 2025
Cheaper streaming 📺; 1Password nightmare 💀 -- ZDNET ZDNET Week in Review - US February 28, 2025 machine eye A new Android feature is scanning your photos for 'sensitive content' - how to stop
Why Natural Language Coding Isn’t for Everyone—Yet
Friday, February 28, 2025
Top Tech Content sent at Noon! Boost Your Article on HackerNoon for $159.99! Read this email in your browser How are you, @newsletterest1? 🪐 What's happening in tech today, February 28, 2025? The
iOS Cocoa Treats
Friday, February 28, 2025
View in browser Hello, you're reading Infinum iOS Cocoa Treats, bringing you the latest iOS related news straight to your inbox every week. Animatable Protocol: Taming Unruly SwiftUI Animations In