SRE Weekly Issue #320
Articles
Slack shared this write-up of their February outage, which involved complex systems interactions and cascading failure.
Laura Nolan — Slack
Go watch this lightning talk now! She had me hooked within the first ten seconds.
Hi, my name is Emily Ruppe, I work at Jeli.io, and I am a recovering incident commander, and I am sick of the phrase “to prevent this incident from ever happening again”.
Emily Ruppe — DevOpsDays Rockies
This is my personal story of starting the SRE organization at Uber.
This article was written by a former Uber employee and is posted on their personal blog.
Will Larson
This is total transparency at its finest. This write-up has all the details you could ever hope for on what went wrong, how they responded, and what comes next.
Sri Viswanath — Atlassian
The target audience is new SREs and executive sponsors who would keep hearing these terms repeatedly but not take the time to read 1000s of words at a time.
[source: author comment on Reddit]
Ash P. — SREPath
Dropbox wanted to be able to handle datacenter failure. To reach this goal, they moved from an active/active model to active/passive and spun up a new Disaster Readiness team to rework their failover system.
Krishelle Hardson-Hurley, Ross Delinger, and Tong Pham — Dropbox
HelloFresh drove the implementation of SLOs in their Kubernetes-based infrastructure using Prometheus and Sloth.
Chris Loukas — HelloFresh
A Roblox engineer outlines the way that Roblox handles reliability at scale.
Alberto Covarrubias — Roblox
[…] let’s look at some common on call antipatterns and some simple things we can do to alleviate their common pitfalls.
Nickolas Means — Sym
Outages