SRE Weekly - SRE Weekly Issue #416
What can we, in turn, learn from some of the most honest and blameless—and public—postmortems of the last few years?
They cover incidents from GitLab, Tarsnap, Roblox, and Cloudflare with great summaries and takeaways.
The Hacker News
My favorite part of this interview is when Vanessa describes parenting twin babies as constant incident response.
Shane Hastie — InfoQ
Here follow some lessons I’ve learned from the trenches in small start-ups and larger engineering teams, to improve your on-call shift experience and remediation time for production issues and make sure you’re spending on-call efforts on what has the most impact.
Alex Wauters
Doing your chaos experiments in a non-production environment can feel safer, but what are you giving up?
Sam Rossoff — Gremlin
Sometimes, shell is just the right tool for the job.
Amin Astaneh — Certo Modo
Catherine from Mastodon summarized this incident report beautifully:
this is one of the most violently unhinged CSB reports i’ve ever read […]
while investigating an explosion at a facility, CSB staff tried to prevent another explosion of the same kind in the same facility, and being unable to convince the workers to not cause it, ended up hiding behind a shipping container
U.S. Chemical Safety and Hazard Investigation Board
This one’s about why people tend to want a “SPoG” and what we should want instead. Bonus points for the Star Trek reference.
Nočnica Mellifera — Checkly
Right in the middle of migrating from one datacenter to an HA pair of new datacenters, one of the new ones failed. They had to quickly do a partial rollback of the migration to ride out the outage.
Gauthier François — Doctolib
Today, we are thrilled to announce the release of bpftop, a command-line tool designed to streamline the performance optimization and monitoring of eBPF programs.
Jose Fernandez — Netflix
|
Older messages
SRE Weekly Issue #415
Monday, March 11, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: Join FireHydrant and talk shop with your DevOps peers on March 28! You'll gain a better understanding of what makes a fatigue-free on-
SRE Weekly Issue #414
Monday, March 4, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: 91% of engineering leaders say they want a better alerting tool. The other 9% couldn't take the survey on their Blackberry. Meet
SRE Weekly Issue #1
Monday, February 26, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: Check out how global payments company Dock uses FireHydrant to streamline and consolidate their incident management stack and reduce what
SRE Weekly Issue #413
Monday, February 26, 2024
View on sreweekly.com Sorry about the automation fail and resend! That definitely wasn't issue #1. A message from our sponsor, FireHydrant: Check out how global payments company Dock uses
SRE Weekly Issue #412
Monday, February 19, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: FireHydrant's new and improved MTTX analytics dashboard is here! See which services are most affected by incidents, where they take
You Might Also Like
📳 Galaxy Z Flip 6 Review — How to Watch the 2024 Summer Olympics for Free
Friday, July 26, 2024
Also: Fixing Spotify's Repeating Ads, and More! How-To Geek Logo July 26, 2024 Did You Know The rectangular area of a flag found in the upper left corner (top hoist corner) of the flag, such as the
Your monthly update has arrived
Friday, July 26, 2024
What's new in Google Play and Android July 2024 The Collections surface engages users with content Introducing Collections, a new on-device surface for your content Collections present users with
iOS Dev Weekly - Issue 671
Friday, July 26, 2024
There are two types of apps on the visionOS App Store. Will you create an app that makes people reach for the headset? 🥽 View on the Web Archives ISSUE 671 July 26th 2024 Comment In the last two weeks
Ranked | The 10 Busiest Ports in the World, by Cargo Traffic 🚢
Friday, July 26, 2024
As critical nodes for trade and commercial activity, we show the top 10 busiest ports in the world by cargo volume. View Online | Subscribe Presented by: Is Your Portfolio Powering the Future? >>
Let the Games Begin
Friday, July 26, 2024
Week of July 22, 2024 Let the Games Begin Week of July 22, 2024 By MG Siegler • 26 Jul 2024 View in browser View in browser Mark Zuckerberg loves two things above all else right now: llamas and
Daily Coding Problem: Problem #1508 [Hard]
Friday, July 26, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Uber. Given an array of integers, return a new array such that each element at index i
OpenAI announces SearchGPT - Weekly News Roundup - Issue #477
Friday, July 26, 2024
Plus: Will billionaires live forever; a police robot dog jamming wireless networks; Alphabet to invest $5B into Waymo; warnings about “model collapse”; a new partnership for AI security; and more! ͏ ͏
Using Data as a Product Manager
Friday, July 26, 2024
If you had your choice between a little data or a lot of data on which to guide decisions, which would you pick?
Last Mile of Blockchains: RPC and Node-as-a-Service
Friday, July 26, 2024
Top Tech Content sent at Noon! Find the hottest jobs from top tech companies Read this email in your browser How are you, @newsletterest1? 🪐 What's happening in tech today, July 26, 2024? The
⚙️ Generative AI is making workers less productive
Friday, July 26, 2024
Plus: Runway trained video generator on thousands of YouTube videos