SRE Weekly - SRE Weekly Issue #337
Thanks for all the vacation well-wishes! It was really great and relaxing. Take vacations, it’s important for reliability!
While I was out, I shipped the past two issues with content prepared in advance, and without the Outages section. This gave me a chance to really think hard about the value of the Outages section versus the time and effort I put into it.
I’ve decided to put the Outages section on hiatus for the time being. For notable outages, I’ll include them in the main section, on a case-by-case basis. Read on if you’re interested in what went into this decision.
The Outages section has always been of lower quality than the rest of the newsletter. I have no scientific process for choosing which Outages make the cut — mostly it’s just whatever shows up in my Google search alerts and seems “important”, minus a few arbitrary categories that don’t seem particularly interesting like telecoms and games. I do only a cursory review of the outage-related news articles I link to, and often they’re on poor-quality sites with a ton of intrusive ads. Gathering the list of Outages has begun taking more and more of my time, and I’d much rather spend that effort on curating quality content, so that’s what I’m going to do going forward.
Every one of these 10 items is enough reason to read this article! This makes me want to go investigate some incidents right now.
Fischer Jemison — Jeli
Slack shares with us in great detail why they use circuit breakers and how they rolled them out.
Frank Chen — Slack
My favorite part of this one is the section on expectations. We need to socialize this to help reduce the pressure on folks going on call for the first time.
Prakya Vasudevan — Squadcast
Status pages are marketing material. Prove me wrong.
Ellen Steinke — Metrist
incidents have unusually high information density compared with day-to-day work, and they enable you to piggy-back on the experience of others
Lisa Karlin Curtis — incident.io
These folks realized that they had two different use cases for the same data, real-time transactions and batch processing. Rather than try to find one DB that could support both, they fork two copies of the data.
Xi Chen and Siliang Cao — Grab
It’s all about gathering enough information that you can ask new questions when something goes wrong, rather than being stuck with only answers to the questions you thought to ask in advance.
Charity Majors
They needed the speed of local ephemeral SSDs but the reliability of network-based persistent disks. The solution: a linux MD option to mirror but prefer to read from the local disks. Neat!
Glen Oakley — Discord
OS upgrades can be risky. LinkedIn developed a system to unify OS upgrade procedures and make them much less risky.
Hengyang Hu, Dinesh Dhakal, and Kalyanasundaram Somasundaram — LinkedIn
|
Older messages
SRE Weekly Issue #336
Monday, August 29, 2022
View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and
SRE Weekly Issue #335
Monday, August 22, 2022
View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and
SRE Weekly Issue #334
Monday, August 15, 2022
View on sreweekly.com I'll be on vacation starting next Sunday (yay!). That means the next two issues will be prepared in advance, so there won't be an Outages section. A message from our
SRE Weekly Issue #333
Monday, August 8, 2022
View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and
SRE Weekly Issue #332
Monday, August 1, 2022
View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and
You Might Also Like
Daily Coding Problem: Problem #1708 [Medium]
Tuesday, March 4, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Indeed. Given a 32 -bit positive integer N , determine whether it is a power of four in
Underscore Naming, Flask-SQLAlchemy, Kivy, and More
Tuesday, March 4, 2025
Single and Double Underscore Naming Conventions in Python #671 – MARCH 4, 2025 VIEW IN BROWSER The PyCoder's Weekly Logo Single and Double Underscore Naming Conventions in Python In this video
Dial An Advertiser ☎️
Tuesday, March 4, 2025
Things like phone books existed before phone books. Here's a version for your browser. Hunting for the end of the long tail • March 4, 2025 I've decided to stop being so unfair to myself with
Ranked | The World's Top 20 Economies by GDP Growth (2015-2025) 📊
Tuesday, March 4, 2025
Halfway through the 2020s, here's a report card on the top 20 economies and their progress since 2015. View Online | Subscribe | Download Our App Presented by Hinrich Foundation NEW REPORT:
Open Source Isnt Dead...Its Just Forked
Tuesday, March 4, 2025
Top Tech Content sent at Noon! Augment Code: Developer AI for real eng work. Start for free Read this email in your browser How are you, @newsletterest1? 🪐 What's happening in tech today, March 4,
LW 172 - How to Make Compare at Pricing Show at Checkout
Tuesday, March 4, 2025
How to Make Compare at Pricing Show at Checkout Shopify Development news and articles Issue 172 -
Issue 165
Tuesday, March 4, 2025
💻🖱️ A single click destroyed this man's entire life. Fake murders get millions of YouTube views. Zuckerberg can now read your silent thoughts. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
This top multitool is under $30
Tuesday, March 4, 2025
Thinnest phone ever?📱; ArcoPlasma; Siri alternatives 🗣️ -- ZDNET ZDNET Tech Today - US March 4, 2025 GOTRAX 4 electric scooter I finally found a high-quality multitool for under $30 Compact and durable
Post from Syncfusion Blogs on 03/04/2025
Tuesday, March 4, 2025
New blogs from Syncfusion ® Stacked vs. Grouped Bar Charts in Blazor: Which is Better for Data Visualization? By Gowrimathi S Learn the difference between the stacked and grouped bar charts and choose
⚙️ GenAI Siri
Tuesday, March 4, 2025
Plus: TSMC's hundred billion dollar investment