SRE Weekly - SRE Weekly Issue #320
Articles
Slack shared this write-up of their February outage, which involved complex systems interactions and cascading failure.
Laura Nolan — Slack
Go watch this lightning talk now! She had me hooked within the first ten seconds.
Hi, my name is Emily Ruppe, I work at Jeli.io, and I am a recovering incident commander, and I am sick of the phrase “to prevent this incident from ever happening again”.
Emily Ruppe — DevOpsDays Rockies
This is my personal story of starting the SRE organization at Uber.
This article was written by a former Uber employee and is posted on their personal blog.
Will Larson
This is total transparency at its finest. This write-up has all the details you could ever hope for on what went wrong, how they responded, and what comes next.
Sri Viswanath — Atlassian
The target audience is new SREs and executive sponsors who would keep hearing these terms repeatedly but not take the time to read 1000s of words at a time.
[source: author comment on Reddit]
Ash P. — SREPath
Dropbox wanted to be able to handle datacenter failure. To reach this goal, they moved from an active/active model to active/passive and spun up a new Disaster Readiness team to rework their failover system.
Krishelle Hardson-Hurley, Ross Delinger, and Tong Pham — Dropbox
HelloFresh drove the implementation of SLOs in their Kubernetes-based infrastructure using Prometheus and Sloth.
Chris Loukas — HelloFresh
A Roblox engineer outlines the way that Roblox handles reliability at scale.
Alberto Covarrubias — Roblox
[…] let’s look at some common on call antipatterns and some simple things we can do to alleviate their common pitfalls.
Nickolas Means — Sym
Outages
|
Older messages
SRE Weekly Issue #319
Monday, April 25, 2022
View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and
SRE Weekly Issue #318
Monday, April 18, 2022
View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging and
SRE Weekly Issue #317
Monday, April 11, 2022
View on sreweekly.com Bit of a short issue this week, as I'm currently recovering from COVID-19. Please don't worry! I seem to have a very minor case, likely thanks in large part to vaccination
SRE Weekly Issue #316
Monday, April 4, 2022
View on sreweekly.com I'm on vacation, so I prepared this issue in advance. Practically speaking, that just means there's no Outages section this week. See you all next week! PS Okay, I know I
SRE Weekly Issue #316
Monday, April 4, 2022
View on sreweekly.com I'm on vacation, so I prepared this issue in advance. Practically speaking, that just means there's no Outages section this week. See you all next week! PS Okay, I know I
You Might Also Like
📧 Did you want this discount?
Thursday, March 6, 2025
Hey, it's Milan. I want to make sure you see this today because it may be gone this weekend: There are 29 coupons left to join Pragmatic REST APIs with 30% off. After that, the price goes back to
Tiny Type On Yellow Pages ☎️
Thursday, March 6, 2025
That time phone books got a font upgrade. Here's a version for your browser. Hunting for the end of the long tail • March 5, 2025 Tiny Type On Yellow Pages Why AT&T had to redesign its primary
Simplify Kotlin Error Handling
Thursday, March 6, 2025
View in browser 🔖 Articles Goodbye try-catch, Hello runCatching! Exception handling in Kotlin just got cleaner! This article explores how runCatching can replace traditional try-catch blocks, making
JSK Daily for Mar 5, 2025
Wednesday, March 5, 2025
JSK Daily for Mar 5, 2025 View this email in your browser A community curated daily e-mail of JavaScript news Unions and intersections of object types in TypeScript In this blog post, we explore what
Daily Coding Problem: Problem #1709 [Medium]
Wednesday, March 5, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Facebook. Given an array of integers, write a function to determine whether the array
How Swiss Tables make Go 1.24 faster
Wednesday, March 5, 2025
Plus a way to call external library functions without Cgo. | #544 — March 5, 2025 Unsub | Web Version Together with pgAnalyze Go Weekly Faster Go Maps with Swiss Tables — One of Go's newest
Mapped | European Fertility Rates by Country 👶
Wednesday, March 5, 2025
The population replacement threshold is a fertility rate of 2.1. In 2025, all of Europe, except one small nation, is well below that level. View Online | Subscribe | Download Our App Invest in your
Trust in JS supply chain; sync vs. async code; JIT vulnerabilities; parseInt() and keycap emojis; V8
Wednesday, March 5, 2025
We have 10 links for you - the latest on JavaScript and tools Secure your JavaScript dependencies. socket.dev Sponsor Open source code makes up 90% of most codebases. Socket detects what traditional
The importance of flow state for developers
Wednesday, March 5, 2025
You are receiving this email because you subscribed to microservices.io. Considering migrating a monolith to microservices? Struggling with the microservice architecture? I can help: architecture
This beefy phone is a projector too 📽️
Wednesday, March 5, 2025
Biggest tech opps; How Firefox changed; Drone flying tips -- ZDNET ZDNET Tech Today - US March 5, 2025 GOTRAX 4 electric scooter A smartphone that's also a projector? I tested it, and it's