SRE Weekly - SRE Weekly Issue #440
As part of designing their new paging product, incident.io created a set of end-to-end tests to exercise the system and alert on failures. Click through for details on how they designed the tests and lessons learned.
Rory Malcolm — incident.io
As Slack rolled out their new experience for large, multi-workspace customers, they had to re-work fundamental parts of their infrastructure, including database sharding.
Ian Hoffman and Mike Demmer — Slack
A third-party vendor’s Support Engineer [...] acknowledged that the root cause for both outages was a monitoring agent consuming all available resources.
Heroku
Resilience engineering is about focusing on making your organization better able to handle the unexpected, rather than preventing repetition of the same incident. This article gives a thought-provoking overview of the difference.
John Allspaw — InfoQ
Metrics are great for many other things, but they can't compete with traces for investigating problems.
Jean-Mark Wright
Through fictional storytelling, this article explains not just the benefits of retries, but how they can go wrong.
Denis Isaev — Yandex
Hot take? Sure, but they back it up with a well-reasoned argument.
Ethan McCue
A detailed look at the importance of backpressure and how to use it to reduce load effectively, as implemented in WarpStream.
Richard Artoul — WarpStream
|
Older messages
SRE Weekly Issue #439
Monday, August 26, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: Migrate off of PagerDuty, save money, and then have all of your configuration exported as Terraform modules? We did that. We know one of
SRE Weekly Issue #438
Tuesday, August 20, 2024
View on sreweekly.com Are there any blind or low-vision readers out there that would be willing to answer a few questions? I'm looking to learn more about your experience of reading a newsletter
SRE Weekly Issue #437
Monday, August 12, 2024
View on sreweekly.com This week's issue is entirely focused on the CrowdStrike incident: more details on what happened, analysis, and learnings. I'll be back next week with a selection of all
SRE Weekly Issue #436
Monday, August 5, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: Migrate off of PagerDuty, save money, and then have all of your configuration exported as Terraform modules? We did that. We know one of
SRE Weekly Issue #435
Monday, July 29, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: We've gone all out on our new integration with Microsoft Teams. If you're a MS Teams user, FireHydrant now supports the most
You Might Also Like
WP Weekly 212 - Ecosystem - Hosting AI, $5 Million Raised, GDPR Social Feeds
Monday, September 23, 2024
Read on Website WP Weekly 212 / Ecosystem Since Matt Mullenweg's Q&A session at WordCamp US concluded, the WordPress ecosystem has been in active discussion mode! Also in this issue: Many
Party In The Rear 📺
Monday, September 23, 2024
How the rear projection television got flattened. Here's a version for your browser. Hunting for the end of the long tail • September 22, 2024 Today in Tedium: These days, it's common to see a
SRE Weekly Issue #443
Monday, September 23, 2024
View on sreweekly.com I'm working on launching a new sibling project to SRE Weekly that will have a different format. I'm on the lookout for potential sponsors now, so if you're interested,
👎 Mistakes to Avoid When Setting Up a Wi-Fi Network — Handhelds Are the Future of Gaming
Sunday, September 22, 2024
Also: Starlink Bypassed My Country's Bad Internet, and More! How-To Geek Logo September 22, 2024 Did You Know The letter "J" is not found anywhere on the periodic table of elements,
C#524 Anatomy of the .NET dictionary
Sunday, September 22, 2024
Impress friends and colleagues knowing your key value pairs
PD#593 On Being A Senior Engineer
Sunday, September 22, 2024
There are not many modern books about being good senior engineer
RD#473 Clean React with TypeScript
Sunday, September 22, 2024
How to properly type React components
⚙️ Special Edition: The Deep View talks to Gary Marcus
Sunday, September 22, 2024
We met up with Dr. Gary Marcus to talk AI and regulation.
Mastering our mind for better ideas & Setapp Mobile beta is here
Sunday, September 22, 2024
Team messaging is broken, unlock your full potential today, Linear launches mobile apps, eight ways to banish misery, and a lot more in this week's issue of Creativerly. Creativerly Mastering our
Daily Coding Problem: Problem #1564 [Hard]
Sunday, September 22, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. Let A be an N by M matrix in which every row and every column is sorted. Given i