SRE Weekly - SRE Weekly Issue #233
Articles
In this post, I’ll share how we ensured that Meet’s available service capacity was ahead of its 30x COVID-19 usage growth, and how we made that growth technically and operationally sustainable by leveraging a number of site reliability engineering (SRE) best practices.
Samantha Schaevitz — Google
I love the concept of “battleshorts” just as much as I’ve been enjoying this series of articles analyzing STAMP.
Lorin Hochstein
Honeycomb had 5 incidents in just over a week, prompting not only their normal incident investigation process, but a meta-analysis of all five together.
Emily Nakashima — Honeycomb
Why is Chromium responsible for half of the DNS queries to the root nameservers? And why do they all return NXDOMAIN?
Matthew Thomas — APNIC
“That Moment” when your fire suppression system triggers and the fire department shows up. This is part war story and part description of incident response practices.
Ariel Pisetzky — Taboola
An overload in an internal blob storage system impacted many dependent services.
Sharding as a service, now there’s an interesting idea.
Gerald Guo, Thawan Kooburat — Facebook
In Kubernetes Operators: Automating the Container Orchestration Platform, authors Jason Dobies and Joshua Wood describe an Operator as “an automated Site Reliability Engineer for its application.” Given an SRE’s multifaceted experience and diverse workload, this is a bold statement. So what exactly can the Operator do?
Emily Arnot — Blameless
Outages
- Zoom
- Slack
- Let’s Encrypt
- NZX (New Zealand Stock Exchange)
- eBay
- Garmin
- Heroku
- Fastly
- Also this one.
Full disclosure: Fastly is my employer.
- Also this one.
- Cloudflare
|
Older messages
SRE Weekly Issue #231
Tuesday, August 25, 2020
View on sreweekly.com I have a special treat for you this week: 7 detailed incident reports! Just a note, I'll be on vacation next week, so I'll see you in two weeks on August 23. A message
SRE Weekly Issue #232
Tuesday, August 25, 2020
View on sreweekly.com A message from our sponsor, StackHawk: Is your company adopting GraphQL? Adding security testing is simple. Watch this 20 minute walk through to see how easy it is to get up and
SRE Weekly Issue #230
Monday, August 3, 2020
View on sreweekly.com Happy BTW: Wear a mask. A message from our sponsor, StackHawk: Add security testing to your CI pipelines with GitHub Actions. Check out this webinar recording (no email required)
SRE Weekly Issue #229
Monday, July 27, 2020
View on sreweekly.com A message from our sponsor, StackHawk: Read about how to build test driven security with StackHawk + Travis CI + Docker Compose. https://www.stackhawk.com/blog/test-driven-
SRE Weekly Issue #228
Monday, July 20, 2020
View on sreweekly.com SRE From Home is back! It's happening this Thursday, and I'll be on the Ask an SRE panel answering your questions. And don't miss the talks by lots of great folks,
You Might Also Like
Software Testing Weekly - Issue 261
Wednesday, March 12, 2025
New AI solutions for testing 👀 View on the Web Archives ISSUE 261 March 12th 2025 COMMENT Welcome to the 261st issue! I have nothing more to add to the genuinely great news that came out recently. I
JSK Daily for Mar 11, 2025
Tuesday, March 11, 2025
JSK Daily for Mar 11, 2025 View this email in your browser A community curated daily e-mail of JavaScript news How to Enforce Type Safety in FormData with TypeScript When working with the FormData
Binary Data, Tail Calls, Pickles, and More
Tuesday, March 11, 2025
Bytes Objects: Handling Binary Data in Python #672 – MARCH 11, 2025 VIEW IN BROWSER The PyCoder's Weekly Logo Bytes Objects: Handling Binary Data in Python In this tutorial, you'll learn about
Shaking The Wasp’s Nest 🐝
Tuesday, March 11, 2025
How Gamergate swarmed into our online lives. Here's a version for your browser. Hunting for the end of the long tail • March 11, 2025 Today In Tedium: You probably have noticed, just like me, that
Daily Coding Problem: Problem #1714 [Easy]
Tuesday, March 11, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. You are given an N by N matrix of random letters and a dictionary of words. Find
Mapped | The State of Democracy Around the World 🌐
Tuesday, March 11, 2025
After a historic election year, we show the state of democracy worldwide as it declines to its lowest level in two decades. View Online | Subscribe | Download Our App NEW REPORT: The Age of Data >
Stories, Free Tool & CRM Template
Tuesday, March 11, 2025
Notion stories, smart tools, and a free template to organize your contacts 🔥 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
LW 173 - How to become a Shopify Developer in 2025
Tuesday, March 11, 2025
How to become a Shopify Developer in 2025 Shopify Development news and articles Issue 173 - 03/11
This free AI tool beats Perplexity
Tuesday, March 11, 2025
Ubuntu vs. Debian; The new HR; YouTube randomizer -- ZDNET ZDNET Tech Today - US March 11, 2025 webfeetgettyimages-10141124 DuckDuckGo's AI beats Perplexity in one big way - and it's free to
⚙️ AI bubble bursts (?)
Tuesday, March 11, 2025
Plus: We talk to the CEO of Read AI