Profile avatar
stevemcghee.dev
Former athlete, reliability wonk.
94 posts 465 followers 315 following
Prolific Poster
Conversation Starter

www.justgiving.com/page/carolin...

Speedrunning conferences - still exhausting.

Once more unto the breach, dear friends, once more.

Just googled "how old am i if i was born in october 1979" so that's how things are going today.

Hi it's me, I am in Hawaii and I visited the Volcano but it was during the 2d window where it was not glowy red and exciting.

Grant me the serentiy to accept failure, to know the dependency failures i cannot fix, to mitigate the ones i can, and to know the difference.

I recently read the paper "Towards Joint Activity Design Heuristics: Essentials for Human-Machine Teaming" which I loved so much I wanted to make it easier to share. To that end, I've excerpted the Ten Heuristics from the paper here: human-machine.team with anchors for each heuristic.

Merry Perihelion to those who celebrate.

happy new years friends

"Your year wrapped" or whatever on every app is just an annual reminder of your level of opt-in surveillance. (I have plenty, sigh)

BREAKFAST APPLE PIE 🎄🎅❄️

My one true nemisis.

if you have the means I highly recommend picking up the Charlie and Lola Christmas special.

Cool.

Tis the season for background Tolkien

Today is the last scheduled DORA Community discussion of the year. Join us at 12PM ET. I'll be kicking off the discussion with a look at A Decade of DORA. Then we'll open up the agenda to everyone in attendance. Join the mailing list at dora.community for more details. #DORA #GBGB #Community

When your SLO error budget is breached, what do you do? Immediately, indefinitely halt releases? What if it's a brand new budget that you don't have any confidence in? What if the team is undergoing so much strife that another beating will not improve morale? My point: don't be a zealot.

If your ability to respond quickly to a problem entirely relies on other’s ability to do the same, take another look at your plan.

Here's a good, concrete reliability story: sreally.com/alphabet-sou... 1) Was it an incident? What severity? 2) What was the time to detect / repair? 3) How does this compare to other incidents? Statistical jargon goes here. Answers: None of these make sense in this context, they don't help.

Severity is vibes not science.

Come have a listen to our latest episode with John Allspaw and Casey Rosenthal on Safety, Resilience, and maybe a little Chaos. www.youtube.com/shorts/kMMqg... Check out the whole Season (and previous ones!) here: sre.google/prodcast/

If a member of SRE/staff pushes a bad configuration and you lose 10% of revenue generating traffic for a few minutes, before the engineer fixes it, what do you do as their manager?

Good morning from December in America.

Spotify Wrapped? Don’t even have to look. All interstellar all the time.

I just moved this account handle thingy to a custom domain, what could possibly go wrong.

Corn Chex is best Chex. I will not be taking questions at this time.

New favorite metaphor for emergence: rainbows.

If you want to learn about opsec, watch Patriot. Also it’s great. www.google.com/gasearch?q=p...

Xmas lights are up! Mostly!

Suggestion: this might be a perfect day for a hurkle durkle 🏴󠁧󠁢󠁳󠁣󠁴󠁿 www.scotsman.com/news/scotlan...

Odie would like you to know that it’s time for breakfast.

Maybe this year I’ll get more than 2 days of advent of code.

I keep reading it as bskyb and shuddering.