SRE Weekly - SRE Weekly Issue #296
Articles
WOW! This is the longest, most detailed public incident post I’ve ever seen from any company. I’ve linked to their short(er) summary, but be sure to check out the long version for all the juicy details.
If we operate too far from the edge, we lose sight of it and can’t anticipate when corrective work should be emphasized. If we operate too close to it, we are constantly in high-stakes situations and firefighting.
Fred Hebert — Honeycomb
This article goes through the actual math of creating an alert for an SLO, including how to avoid alerting for the entire sliding window even after the problem is fixed.
Ervin Barta
This reddit thread doesn’t have any firm answers, but the discussion is pretty interesting.
u/faidoc and others — reddit
Good advice for writing resumes in general, with some SRE-specific tips. There are also links to example SRE resumes.
Quentin Rousseau — Rootly
This article is published by my sponsor, Rootly, but their sponsorship did not influence its inclusion in this issue.
Turns out they have runbooks too — or I guess you could say we have SOPs.
Hugh Brien — Transposit
What do you do about developers that just don’t want to be on call?
Charity Majors — Honeycomb
Before opening their new API up to the public, Ably walloped it with Locust.
Denis Sellu — Ably
Outages
- Robinhood
- Google Cloud Platform, Gmail, Google Calendar, Chat, Meet, and Groups
- Linked above is their followup report from the perspective of GCP. There’s also a report for the GSuite side here.
|
Older messages
SRE Weekly Issue #295
Monday, November 8, 2021
View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right
SRE Weekly Issue #294
Monday, November 1, 2021
View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right
SRE Weekly Issue #293
Monday, October 25, 2021
View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right
SRE Weekly Issue #292
Monday, October 18, 2021
View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right
SRE Weekly Issue #291
Monday, October 11, 2021
View on sreweekly.com A message from our sponsor, Rootly: Manage incidents directly from Slack with Rootly 🚒. Automate manual admin tasks like creating incident channel, Jira and Zoom, paging the right
You Might Also Like
GCP Newsletter #433
Monday, January 13, 2025
Welcome to issue #433 January 13th, 2025 News Official Blog Vertex AI Introducing Vertex AI RAG Engine: Scale your Vertex AI RAG pipeline with confidence - Vertex AI RAG Engine is a fully managed
Spyglass Dispatch: It's Political & Personal
Monday, January 13, 2025
On Meta's Moderation Changes • Inside DOGE • Zuck Slams Apple (Again) • Apple's Muted 2025 • CES 2025 Recap The Spyglass Dispatch is a newsletter sent on weekdays featuring links and commentary
$200 to invest today... (USA Only)
Monday, January 13, 2025
Join me in investing in blue chip art on Masterworks, and you will receive $200 to invest on the platform. Not kidding. Founder interview coming soon! ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The Sequence Knowledge #468: A New Series About RAG
Monday, January 13, 2025
Exploring key concepts of one of the most popular methods in generative AI solutions. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
How a Kafka-Like Producer Writes to Disk
Monday, January 13, 2025
We take a Kafka client, call the producer, send the message, and boom, expect it to be delivered on the other end. And that's actually how it goes. But wouldn't it be nice to understand better
FAQs: The AI Consultancy Project
Monday, January 13, 2025
This is how we'll help you become an AI Consultant ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
⚡ THN Weekly Recap: Top Cybersecurity Threats, Tools and Tips [13 Jan]
Monday, January 13, 2025
Your one-stop-source for last week's top cybersecurity headlines. The Hacker News Cybersecurity Recap The cyber world's been buzzing this week, and it's all about staying ahead of the bad
My 3 must-buy CES 2025 gadgets
Monday, January 13, 2025
Alarming iPhone bug; Router-based VPN; 90-second vision test -- ZDNET ZDNET Tech Today - US January 13, 2025 Mcon magsafe phone controller Three CES 2025 products I'd buy as soon as they'd take
⚙️ Meta's copyright struggles
Monday, January 13, 2025
Plus: Achieving data center efficiency
Post from Syncfusion Blogs on 01/13/2025
Monday, January 13, 2025
New blogs from Syncfusion Top 5 React Chart Libraries for 2025 By Gowrimathi S Explore the top 5 React chart libraries with a comparison of their features, pros, and cons to boost your data