SRE Weekly Issue #428
This article presents an incident theme that I've lived through many times but never had such a pithy name for.
Geoff Townsend — Blameless
There are risks and downsides inherent in a distributed system, so it's worth thinking about whether you really need one.
Pipitz — Adevinta
And here's a counterpoint to the previous article: deciding whether you need a distributed system isn't just about scale.
Marc Brooker
The effectiveness of memes in availability campaigns.
This short post is a pile of memes, and the video one is top notch.
Ross Brodbeck
Paraphrasing part of this article: either you didn't understand your system fully when you wrote the alert, or there really are sporadic failures.
Chris Siebenmann
If you've ever created an action item from an incident along the lines of "don't take unnecessary risks in the future", you need to read this one.
The rest of you need to read it too.
Lorin Hochstein
A how-to for building anomaly detection alerting in Prometheus with specific config examples.
Karl Stoney
A panicked engineer asks Reddit's r/sre about an incident they caused: how could they have done better? Will they be fired? The comments are spot on, and this conversation is fresh enough that you could jump in too if you're interested.
u/console_fulcrum and others — reddit
Last Monday, Honeycomb had an outage related to a schema migration involving MySQL's ENUM data type, and they posted this incident report.
Bonus content: I wasn't aware of ENUMs at all, so I had to brush up with this article: 8 Reasons Why MySQL's ENUM Data Type Is Evil.
Honeycomb
Full disclosure: Honeycomb is my employer.
An experienced SRE discusses the skills and experiences you might be quizzed about in an interview for an SRE role.
Krishna Vinnakota — DZone