SRE Weekly - SRE Weekly Issue #273
Articles
What indeed? It depends on who you ask.
Quentin Rousseau — Rootly
This academic paper explains Google’s efforts toward identifying “mercurial” CPU coores — cores that make erroneous computations.
[…] we observe on the order of a few mercurial cores per several thousand machines […]
This one blew my mind:
A deterministic AES mis-computation, which was “selfinverting”: encrypting and decrypting on the same core yielded the identity function, but decryption elsewhere yielded gibberish.
Peter H. Hochschild, Paul Turner, Jeffrey C. Mogul, Rama Govindaraju, Parthasarathy Ranganathan, David E. Culler, and Amin Vahdat — Google
The decisions, non-decisions, and workarounds that we implement now can have lasting effects on the Internet as a whole.
Mark Nottingham — Fastly
Full disclosure: Fastly is my employer.
A great intro to the topic of resilience engineering. Hint: resilience !=
high availability.
Piet van Dongen — Luminis Arnhem
When you include people in your definition of “the system”, something that looked like a system failure where humans had to “step in” is actually a success in which the system adapted.
Lorin Hochstein
I find the way this author presented this argument especially convincing. My favorite part is the real-world story toward the end.
Rachel by the Bay
Facebook presents their method for finding and dealing with PCIe errors in their infrastructure.
Ashwin Poojary, Bill Holland, Makan Diarra, and Ray Park — Facebook
Overflow of a 32-bit integer primary key caused a security issue.
Scott Sanders — GitHub
This caught my eye. I’ve seldom been in an on-call rotation with shifts that were not a week or two at a time.
The optimal frequency for being on call is about three days a month.
There’s also a good discussion of paying for on-call shifts, which, in my experience, goes a long way toward making on-call more palatable.
Christine Patton — SoundCloud
Outages
- HBO Max
- Apple Card
- Sling TV
- Google Meet
- GitHub
- Discord
- Discord had several outages this week.
|
Older messages
SRE Weekly Issue #272
Monday, May 31, 2021
View on sreweekly.com A message from our sponsor, StackHawk: See how automated security testing can change how your teams find and fix security vulnerabilities. http://sthwk.com/security-automation
SRE Weekly Issue #271
Monday, May 24, 2021
View on sreweekly.com A message from our sponsor, StackHawk: Join StackHawk on Tuesday, May 25 for a hands-on authenticated security testing workshop. Follow along as we walk through three common
SRE Weekly Issue #270
Monday, May 17, 2021
View on sreweekly.com A message from our sponsor, StackHawk: APIs are not only the backbone of modern application architecture, but they are also a key part of security. Discover what API security
SRE Weekly Issue #269
Monday, May 10, 2021
View on sreweekly.com A message from our sponsor, StackHawk: Tune into ZAPCon After Hours this Tuesday at 8 am PT to learn how to include automated security testing in your builds with ZAP http://sthwk
SRE Weekly Issue #268
Monday, May 3, 2021
View on sreweekly.com A message from our sponsor, StackHawk: Join StackHawk Tuesday May 4 at 9 am PT for a hands-on technical workshop! By the end of the session, you will have three types of security
You Might Also Like
Stripe changes its … stripes
Wednesday, April 24, 2024
TikTok on the president's docket and Nvidia acquires Run:ai View this email online in your browser By Christine Hall Wednesday, April 24, 2024 Good afternoon, and welcome to TechCrunch PM! Today
💪 You Can Use Copilot AI as a Personal Trainer — Why Your Laptop Needs a Docking Station
Wednesday, April 24, 2024
Also: Here's How to Make Your Apple ID Recoverable, and More! How-To Geek Logo April 24, 2024 📩 Get expert reviews, the hottest deals, how-to's, breaking news, and more delivered directly to
JSK Daily for Apr 24, 2024
Wednesday, April 24, 2024
JSK Daily for Apr 24, 2024 View this email in your browser A community curated daily e-mail of JavaScript news JSK Weekly - 24th April, 2024 React 19 has introduced many great functionalities and
Daily Coding Problem: Problem #1422 [Hard]
Wednesday, April 24, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Airbnb. Given a list of integers, write a function that returns the largest sum of non-
Charted | Artificial Intelligence Patents, by Country 🤖
Wednesday, April 24, 2024
This visualization shows which countries have been granted the most AI patents each year, from 2012 to 2022. View Online | Subscribe Presented by: New on VC+: Our Visual Briefing on the IMF's World
Save your seat: 1Password’s 2024 Security report insights webinar
Wednesday, April 24, 2024
Join us April 25th. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Top Tech Deals 📱 LG Flex TV, Google Pixel 7, DJI Mini 3, and More
Wednesday, April 24, 2024
Get yourself a discounted DJI drone, save on the Pixel 7, or score some PC and phone accessories. How-To Geek Logo April 24, 2024 Top Tech Deals: LG Flex TV, Google Pixel 7, DJI Mini 3, and More Find
The Protest Song Wakes Up 🎙️
Wednesday, April 24, 2024
Is this song the future of musical protest? Here's a version for your browser. Hunting for the end of the long tail • April 24, 2024 The Protest Song Wakes Up A buzzy protest song about the
JSK Weekly - 24th April, 2024
Wednesday, April 24, 2024
React 19 has introduced many great functionalities and features, among which the useOptimistic hook stands out. The useOptimistic hook offers a seamless way to manage UI states during asynchronous
The clock’s ticking for TikTok
Wednesday, April 24, 2024
The US Senate has passed a bill that would ban TikTok if its US business is not divested by Bytedance View this email online in your browser By Alex Wilhelm Wednesday, April 24, 2024 Good morning, and