SRE Weekly - SRE Weekly Issue #318
Articles
This talk summary explores the concept that “error” is a concept applied to an event from the outside, rather than a simple fact. What can this tell us about our after-incident investigation process?
Fred Hebert
Here’s a deep dive into a performance degradation in Cloudflare last December that was related to missing error handling in a shell script.
  Alex Forster — Cloudflare
Atlassian is having a tough time. It seems as if they deleted a few hundred customers’ data and have to pull it out of their backups one at a time.
Here’s another article about the outage (Steven J. Vaughan-Nichols — The New Stack).
Gergely Orosz — Pragmatic Engineer
Cool trick: their client library can fall back to a backup domain if DNS ably.io fails.
Jo Stichbury — Ably
It still wasn’t quite DNS, it was an interesting situation with the Linux kernel’s martian packet detection algorithm.
Laurent Bernaille and David Lentz — DataDog
Aside from the terrifying risk of nuclear war, this sounds very similar to the kind of complex system failures SREs deal with routinely.
Zia Mian, M. V. Ramana — Scientific American
Both approaches have their pros and cons. The right strategy for your company or team depends, of course, on your needs and priorities.
Quentin Rousseau — Rootly
This article is published by my sponsor, Rootly, but their sponsorship did not influence its inclusion in this issue.
Outages
|
Older messages
SRE Weekly Issue #317
Monday, April 11, 2022
View on sreweekly.com Bit of a short issue this week, as I'm currently recovering from COVID-19. Please don't worry! I seem to have a very minor case, likely thanks in large part to vaccination
SRE Weekly Issue #316
Monday, April 4, 2022
View on sreweekly.com I'm on vacation, so I prepared this issue in advance. Practically speaking, that just means there's no Outages section this week. See you all next week! PS Okay, I know I
SRE Weekly Issue #316
Monday, April 4, 2022
View on sreweekly.com I'm on vacation, so I prepared this issue in advance. Practically speaking, that just means there's no Outages section this week. See you all next week! PS Okay, I know I
SRE Weekly Issue #315
Monday, March 28, 2022
View on sreweekly.com I'm going on vacation, so I'm going to prepare next week's issue in advance. It'll look much like most issues, except there won't be an Outages section. See
SRE Weekly Issue #314
Monday, March 21, 2022
View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and
You Might Also Like
📧 Did you want this discount?
Thursday, March 6, 2025
Hey, it's Milan. I want to make sure you see this today because it may be gone this weekend: There are 29 coupons left to join Pragmatic REST APIs with 30% off. After that, the price goes back to
Tiny Type On Yellow Pages ☎️
Thursday, March 6, 2025
That time phone books got a font upgrade. Here's a version for your browser. Hunting for the end of the long tail • March 5, 2025 Tiny Type On Yellow Pages Why AT&T had to redesign its primary
Simplify Kotlin Error Handling
Thursday, March 6, 2025
View in browser 🔖 Articles Goodbye try-catch, Hello runCatching! Exception handling in Kotlin just got cleaner! This article explores how runCatching can replace traditional try-catch blocks, making
JSK Daily for Mar 5, 2025
Wednesday, March 5, 2025
JSK Daily for Mar 5, 2025 View this email in your browser A community curated daily e-mail of JavaScript news Unions and intersections of object types in TypeScript In this blog post, we explore what
Daily Coding Problem: Problem #1709 [Medium]
Wednesday, March 5, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Facebook. Given an array of integers, write a function to determine whether the array
How Swiss Tables make Go 1.24 faster
Wednesday, March 5, 2025
Plus a way to call external library functions without Cgo. | #544 — March 5, 2025 Unsub | Web Version Together with pgAnalyze Go Weekly Faster Go Maps with Swiss Tables — One of Go's newest
Mapped | European Fertility Rates by Country 👶
Wednesday, March 5, 2025
The population replacement threshold is a fertility rate of 2.1. In 2025, all of Europe, except one small nation, is well below that level. View Online | Subscribe | Download Our App Invest in your
Trust in JS supply chain; sync vs. async code; JIT vulnerabilities; parseInt() and keycap emojis; V8
Wednesday, March 5, 2025
We have 10 links for you - the latest on JavaScript and tools Secure your JavaScript dependencies. socket.dev Sponsor Open source code makes up 90% of most codebases. Socket detects what traditional
The importance of flow state for developers
Wednesday, March 5, 2025
You are receiving this email because you subscribed to microservices.io. Considering migrating a monolith to microservices? Struggling with the microservice architecture? I can help: architecture
This beefy phone is a projector too 📽️
Wednesday, March 5, 2025
Biggest tech opps; How Firefox changed; Drone flying tips -- ZDNET ZDNET Tech Today - US March 5, 2025 GOTRAX 4 electric scooter A smartphone that's also a projector? I tested it, and it's