SRE Weekly - SRE Weekly Issue #276
Articles
HBO accidentally sent an email to a bunch of people, and they tweeted (jokingly?) blaming their intern. This is a link to a short, thoughtful response thread.
Gergely Orosz
This is the story of the Bunny CDN outage linked below. Great read, thanks folks!
Dejan Grofelnik Pelzel — Bunny
There’s never a bad time to review the fallacies of distributed computing. This article introduces them with examples and discussion of each.
Alex Diaconu — Ably
These aren’t specific tools, but rather 7 classes of tools (with examples). They are:
- Chaos engineering
- Monitoring and alerting
- Observability
- Paging tools
- SLO management
- Infrastructure-as-Code (and everything-as-code)
- Automated incident response
Quentin Rousseau — Rootly
Design is interpretive. We have to find common ground before we can even start to create a design, but finding that common ground is part of the design.
For example, we think of building codes as being precise, but when applied to new situations, they are ambiguous, and the engineers must make a judgment about how to apply them.
Lorin Hochstein
This starts with a really neat moment in which the interviewer asks Yiu to talk about lessons from her jewelry-making hobby that she applies to SRE.
Kurt Andersen
When Gamestop’s stock shot through the roof earlier this year, Reddit’s traffic did too. This is the first article in a short series by Reddit’s SRE team on how they handled the influx.
This article is about the ways that user actions affected their systems in unexpected ways, and how they responded.
Courtney Wang — Reddit
Recently in our Site Reliability Engineering organization in Azure, we established a set of cultural values that we hold ourselves and each other accountable to.
Bill Johnson — Microsoft
Outages
- Western Digital “My Book Live” hard drives
- Amazon Prime Video and Alexa
- PharmOutcomes
- PharmOutcomes is a SaaS used by pharmacies.
- Commonwealth Bank
- medium
- I’ve gotten a few 500s from Medium while trying to review articles last week and this week. Maybe it’s this incident on their status page?
- Bunny (CDN)
- reddit
- This post on their status site says “API errors”, but I saw rumblings that suggested that reddit itself was down.
|
Older messages
SRE Weekly Issue #275
Monday, June 21, 2021
View on sreweekly.com A message from our sponsor, StackHawk: Join ZAP Founder & Project Lead Simon Bennetts on June 30 for a live AMA where he will be answering questions on all things open source
SRE Weekly Issue #274
Monday, June 14, 2021
View on sreweekly.com A message from our sponsor, StackHawk: Join the GraphQL Security Testing Learning Lab on June 29 at 9 AM PT. Learn how to run automated security testing against your GraphQL APIs
SRE Weekly Issue #273
Monday, June 7, 2021
View on sreweekly.com A message from our sponsor, StackHawk: StackHawk is helping One Medical equip developers with automated security testing and self-service remediations. See how: http://sthwk.com/
SRE Weekly Issue #272
Monday, May 31, 2021
View on sreweekly.com A message from our sponsor, StackHawk: See how automated security testing can change how your teams find and fix security vulnerabilities. http://sthwk.com/security-automation
SRE Weekly Issue #271
Monday, May 24, 2021
View on sreweekly.com A message from our sponsor, StackHawk: Join StackHawk on Tuesday, May 25 for a hands-on authenticated security testing workshop. Follow along as we walk through three common
You Might Also Like
New Blogs on ThomasMaurer.ch for 04/23/2024
Tuesday, April 23, 2024
View this email in your browser Thomas Maurer Cloud & Datacenter Update This is the update for blog posts on ThomasMaurer.ch. Cloud operations for Windows Server through Azure Arc By Thomas Maurer
Post-Post 🗨️
Tuesday, April 23, 2024
Assessing the post-Twitter climate amid Post.News' shutdown. Here's a version for your browser. Hunting for the end of the long tail • April 22, 2024 Post-Post The demise of Post, one of the
BetterDev #257 - Building a GPS Receiver
Monday, April 22, 2024
Better Dev #257 Apr 22, 2024 Hi all, We come back with a new issue this week. If you like BetterDev, please help spead word out by refer to your friends. Buy me a coffee would be great too. This week I
Tomorrow's Free Notes Class: How to sign up!
Monday, April 22, 2024
Hi there, Tomorrow we will be hosting a Free Notes App Class. This is your last chance to register for tomorrow's live class and learn how to get the most out of your Notes app. Our experienced
Elon’s ‘balls to the wall’ autonomy push
Monday, April 22, 2024
Plus: Amazon ends California drone deliveries and Rippling's founder has a brand-new bag View this email online in your browser By Christine Hall Monday, April 22, 2024 Image Credits: Toru Hanai/
📱 Your iPhone is Now Discoverable by Others — Tips for Building Your First PC
Monday, April 22, 2024
Also: How to Play Windows Games on Your Mac, and More! How-To Geek Logo April 22, 2024 📩 Get expert reviews, the hottest deals, how-to's, breaking news, and more delivered directly to your inbox by
JSK Daily for Apr 22, 2024
Monday, April 22, 2024
JSK Daily for Apr 22, 2024 View this email in your browser A community curated daily e-mail of JavaScript news It Is so Cool to Develop React Native With Expo 1. What are the benefits of Expo?. "
😺 The social walkie-talkie
Monday, April 22, 2024
Hi, hi! It's Monday and it's Earth Day! Don't miss the Cat Nips section below for innovative products in the... Product Hunt Read in browser This newsletter is brought to you by YOU MIGHT
The Rings of Power
Monday, April 22, 2024
A paid tier for Spyglass: 'The Inner Ring' The Rings of Power By MG Siegler • 22 Apr 2024 View in browser View in browser On January 22, 2024, exactly one quarter ago, I launched Spyglass. Over
Engineering the future
Monday, April 22, 2024
Don't worry -- we'll be diving into the Mars Sample Return news. View this email online in your browser By Aria Alamalhodaei Monday, April 22, 2024 Hello and welcome back to TechCrunch Space.