SRE Weekly Issue #249
I’m having a hard time wrapping my head around the fact that this issue marks five years of SRE Weekly. A massive thank you to everyone who writes the content I feature here every week, and also to all of you who subscribe!
Articles
Every service needs a couple of big hammers that are easy to swing.
Jennifer Mace — O’Reilly and Google
Answer: automation. Lots of automation. And automation of the automation.
Fred Lin, Harish Dattatraya Dixit, and Sriram Sankar — Facebook
Oh, how quaint! This article was written back when people traveled for the holidays.
Ashley Roof — Transposit
Surprise! Fortunately, there are some ways to fix this limitation.
Heidi Howard, Ittai Abraham — Decentralized Thoughts
A common question when a company is implementing incident management is: why do we need this process?
It turns out that the easiest way to answer this question is to look at the world of unsuccessful incident management.
Kintaba
Whether you’re new to Just Culture or an old hand, there’s a lot of great detail in this article.
Tory Thompson — Firehouse
Not sold yet on full service ownership for development teams? This interview may help.
Vivian Chan — PagerDuty
While ostensibly about Jeli.io, this article makes a great case for why incident analysis is important in general and what kind of data we should be trying to gather.
John Allspaw — Adaptive Capacity Labs
A new feature roll-out resulted in impaired service for some customers.
The adaptive universe: where adaptations to challenges feed back and cause more challenges, requiring more adaptations.
Lorin Hochstein
Our first GraphQL release was twice as slow as our old REST API. Here’s how we fixed it.
Another great example of making a duplicate request to a new API in the background to test it before deploying it.
Michael P. Geraci — OkCupid
Outages
- Google Workspace Status Dashboard
  - All Google services that use OAuth were unreachable due to an issue with Google’s User ID service. Click through for their report. This one caused issues for the start of my daughters’ school day since Meet and Classroom were down.
- Google Cloud Status Dashboard
- Gmail
  - Delivery of messages to @gmail.com addresses failed permanently and would not be retried. This report by Google has the details.
- Microsoft Outlook
- Galileo (satellite navigation system)
- Spotify