SRE Weekly - SRE Weekly Issue #451
View on sreweekly.com
Most fascinating air incident report I've seen in awhile! The pilots deviated from the non-normal checklist, and it immediately made me think of runbooks. On the one hand, you want the runbook to be simple and easy to handle in an incident. On the other hand, it can be very useful to tell the operator why they should do something.
Mentour Pilot
With their claimed 14.5% of all websites depending on Cloudflare's DNS, they had to be super careful with this migration. Lots of good stuff in here including:
- replacing direct DB access by multiple services with an API
- keeping the old and new DB in sync
- ensuring both forward and reverse migration were possible in case of rollback
Alex Fattouche and Corey Horton — Cloudflare
I didn't get to experience the value of a good tracing tool until recently in my career, and I didn't understand the hype. If you're in the same boat, this article may help you understand the value of tracing.
Sam Starling — incident.io
About a year ago, Honeycomb git rid of incident severity levels in favor of incident types, which are purposefully not sortable. Here's how their experiment has gone so far.
Fred Hebert — Honeycomb
Full disclosure: Honeycomb is my employer.
Is Service Level Indicator (SLI) the same as Key Performance Indicator (KPI)?
There's a really cool framing in there: KPIs are moonshots, so we aim high and rarely hit all of them, while with SLOs, we under-promise and over-deliver.
Alex Ewerlöf
A fun dive into some unix/linux internals with nine different methods to run a program with timeouts and retries. If you have a soft spot in your heart for signals and system calls, this one's for you.
Philippe Gaultier
Cosmos DB is Azure's answer to Amazon's DynamoDB. This article gives a nice overview and compares it to various other data stores to help you decide whether it's right for your use case.
Adam Gordon Bell — Pulumi
An engineer at Mercari shares their plan for migrating to their new payment system in this five-part article series, all of which are published now. They created their design after reading 80(!) similar articles from folks at other companies.
resotto — Mercari
|
Older messages
SRE Weekly Issue #450
Monday, November 11, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: Practice Makes Prepared: Why Every Minor System Hiccup Is Your Team's Secret Training Ground. https://firehydrant.com/blog/the-hidden-
SRE Weekly Issue #449
Monday, November 4, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: Practice Makes Prepared: Why Every Minor System Hiccup Is Your Team's Secret Training Ground. https://firehydrant.com/blog/the-hidden-
SRE Weekly Issue #448
Monday, October 28, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: Practice Makes Prepared: Why Every Minor System Hiccup Is Your Team's Secret Training Ground. https://firehydrant.com/blog/the-hidden-
SRE Weekly Issue #447
Monday, October 21, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: If the entire team is on a Zoom bridge during an incident – how do you know what really happened and when? We added real-time Zoom/Google
SRE Weekly Issue #446
Sunday, October 20, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: If the entire team is on a Zoom bridge during an incident – how do you know what really happened and when? We added real-time Zoom/Google
You Might Also Like
Christmas On Repeat 🎅
Monday, December 23, 2024
Christmas nostalgia is a hell of a drug. Here's a version for your browser. Hunting for the end of the long tail • December 22, 2024 Hey all, Ernie here with a refresh of a piece from our very
SRE Weekly Issue #456
Monday, December 23, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: On-call during the holidays? Spend more time taking in some R&R and less getting paged. Let alerts make their rounds fairly with our
The Power of an Annual Review & Grammarly acquires Coda
Sunday, December 22, 2024
I am looking for my next role, Zen Browser got a fresh new look, Flipboard introduces Surf, Campsite shuts down, and a lot more in this week's issue of Creativerly. Creativerly The Power of an
Daily Coding Problem: Problem #1645 [Hard]
Sunday, December 22, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Facebook. Implement regular expression matching with the following special characters: .
PD#606 How concurrecy works: A visual guide
Sunday, December 22, 2024
A programmer had a problem. "I'll solve it with threads!". has Now problems. two he ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
RD#486 (React) Things I Regret Not Knowing Earlier
Sunday, December 22, 2024
Keep coding, stay curious, and remember—you've got this
🎶 GIFs Are Neat, but I Want Clips With Sound — Your Own Linux Desktop in the Cloud
Sunday, December 22, 2024
Also: 9 Games That Were Truly Ahead of Their Time, and More! How-To Geek Logo December 22, 2024 Did You Know Dextrose is another name for glucose, so if you see it listed prominently on the ingredients
o3—the new state-of-the-art reasoning model - Sync #498
Sunday, December 22, 2024
Plus: Nvidia's new tiny AI supercomputer; Veo 2 and Imagen 3; Google and Microsoft release reasoning models; Waymo to begin testing in Tokyo; Apptronik partners with DeepMind; and more! ͏ ͏ ͏ ͏ ͏ ͏
Sunday Digest | Featuring 'The World’s 20 Largest Economies, by GDP (PPP)' 📊
Sunday, December 22, 2024
Every visualization published this week, in one place. Dec 22, 2024 | View Online | Subscribe | VC+ | Download Our App Hello, welcome to your Sunday Digest. This week, we visualized public debt by
Android Weekly #654 🤖
Sunday, December 22, 2024
View in web browser 654 December 22nd, 2024 Articles & Tutorials Sponsored Solving ANRs with OpenTelemetry While OpenTelemetry is the new observability standard, it lacks official support for many