Ship It Weekly - DevOps, SRE, and Platform Engineering News

Teller's Tech - DevOps SRE Podcast

Details

Ship It Weekly is a short, practical recap of what actually matters in DevOps, SRE, and platform engineering.

Each episode, your host Brian Teller walks through the latest outages, releases, tools, and incident writeups, then translates them into “here’s what this means for your systems” instead of just reading headlines. Expect a couple of main stories with context, a quick hit of tools or releases worth bookmarking, and the occasional segment on on-call, burnout, or team culture.

This isn’t a certification prep show or a lab walkthrough. It’s aimed at people who are already working in the space and want to stay sharp without scrolling status pages and blogs all week. You’ll hear about things like cloud provider incidents, Kubernetes and platform trends, Terraform and infrastructure changes, and real postmortems that are actually worth your time.

Most episodes are 10–25 minutes, so you can catch up on the way to work or between meetings. Every now and then there will be a “special” focused on a big outage or a specific theme, but the default format is simple: what happened, why it matters, and what you might want to do about it in your own environment.

If you’re the person people DM when something is broken in prod, or you’re building the platform everyone else ships on top of, Ship It Weekly is meant to be in your rotation.

Recent Episodes

Ship It Conversations: Yvonne Young on Linux Foundations, Mentorship, and Getting Job Ready in Cloud
MAR 9, 2026
<p>This is a guest conversation episode of <strong>Ship It Weekly</strong> (separate from the weekly news recaps).</p><p>In this Ship It: Conversations episode I talk with <strong>Yvonne Young</strong>, a cloud and Linux mentor active in the CloudWhistler community. We talk about the real path into cloud and DevOps, why Linux still matters as a foundation, what “job ready” actually means, and why focus, consistency, and business thinking matter more than chasing every new tool.</p><p>Highlights</p><ul><li>Linux fundamentals still matter because so much of cloud and infra work sits on top of Linux</li><li>What “job ready” really means: prepare for both technical and behavioral interviews, know the basics, and show how you learn when you don’t know something</li><li>Why so many juniors stall out by trying to learn everything instead of picking a direction</li><li>Why daily reps beat cramming: short, consistent practice keeps skills fresh better than marathon study sessions</li><li>How Yvonne thinks about certifications, including why hands-on certs like RHCSA stand out</li><li>Hands-on practice ideas: break things on purpose, troubleshoot, fix services, inspect ports, and use the help files</li><li>Why tools matter less than the business problem they solve</li><li>Using Vault as an example of solving real issues like secret sprawl, rotation, and centralized access</li><li>How to think about cloud learning: pick one provider, learn the concepts, and map your path to the kinds of companies you want to work for</li><li>Why mentorship and community matter, especially for juniors trying not to waste time or head in the wrong direction</li><li>What seniors can do better: better onboarding, real availability, and giving juniors an actual lifeline when they get stuck</li></ul><p>Yvonne’s links</p><ul><li>LinkedIn: <a target="_blank" rel="noopener noreferrer nofollow" 
href="https://www.linkedin.com/in/yvonne-young">https://www.linkedin.com/in/yvonne-young</a></li></ul><p>Stuff mentioned</p><ul><li>Ali Sohail on LinkedIn: <a target="_blank" rel="noopener noreferrer nofollow" href="https://www.linkedin.com/in/alisohailit/">https://www.linkedin.com/in/alisohailit/</a></li><li>Tech With Engineers on LinkedIn: <a target="_blank" rel="noopener noreferrer nofollow" href="https://uk.linkedin.com/company/tech-with-engineers">https://uk.linkedin.com/company/tech-with-engineers</a></li><li>CloudWhistler community / training: <a target="_blank" rel="noopener noreferrer nofollow" href="http://training.cloudwhistler.com">training.cloudwhistler.com</a></li><li>Vault: <a target="_blank" rel="noopener noreferrer nofollow" href="https://www.hashicorp.com/en/products/vault">https://www.hashicorp.com/en/products/vault</a></li><li>OpenBao: <a target="_blank" rel="noopener noreferrer nofollow" href="https://openbao.org/">https://openbao.org/</a></li></ul><p>More episodes + details: <a target="_blank" rel="noopener noreferrer nofollow" href="https://shipitweekly.fm">https://shipitweekly.fm</a></p>
30 MIN
AWS Bahrain/UAE Data Center Issues Amid Iran Strikes, ArgoCD vs Flux GitOps Failures, GitHub Actions Hackerbot-Claw Attacks (Trivy), RoguePilot Codespaces Prompt Injection, Block “AI Remake” Layoffs, Claude Code Security
MAR 7, 2026
<p>This week on <strong>Ship It Weekly</strong>, Brian looks at how the boundary of ops keeps expanding.</p><p>We cover AWS flagging issues in Bahrain/UAE amid Iran strikes, ArgoCD vs Flux and why ArgoCD can get stuck in failed sync states, GitHub Actions being exploited at scale (plus Trivy’s incident), RoguePilot prompt injection meeting real credentials in Codespaces, Block’s “AI remake” layoffs, and Anthropic’s Claude Code Security for defenders.</p><p>Lightning round: DeepSeek model access geopolitics, Vercel’s agentic security boundaries, a KEV CVE to patch, an MCP-atlassian SSRF-to-RCE chain, and Claude Cowork scheduled tasks.</p><p>Links</p><p>AWS Bahrain/UAE (Reuters) <a target="_blank" rel="noopener noreferrer nofollow" href="https://www.reuters.com/world/middle-east/amazon-cloud-unit-flags-issues-bahrain-uae-data-centers-amid-iran-strikes-2026-03-02/">https://www.reuters.com/world/middle-east/amazon-cloud-unit-flags-issues-bahrain-uae-data-centers-amid-iran-strikes-2026-03-02/</a> </p><p>ArgoCD to Flux <a target="_blank" rel="noopener noreferrer nofollow" href="https://hai.wxs.ro/migrations/argocd-to-flux/">https://hai.wxs.ro/migrations/argocd-to-flux/</a> </p><p>GitHub Actions exploitation <a target="_blank" rel="noopener noreferrer nofollow" href="https://www.stepsecurity.io/blog/hackerbot-claw-github-actions-exploitation">https://www.stepsecurity.io/blog/hackerbot-claw-github-actions-exploitation</a> </p><p>Trivy incident <a target="_blank" rel="noopener noreferrer nofollow" href="https://github.com/aquasecurity/trivy/discussions/10265">https://github.com/aquasecurity/trivy/discussions/10265</a> </p><p>RoguePilot <a target="_blank" rel="noopener noreferrer nofollow" href="https://thehackernews.com/2026/02/roguepilot-flaw-in-github-codespaces.html">https://thehackernews.com/2026/02/roguepilot-flaw-in-github-codespaces.html</a> </p><p>Block layoffs (WSJ) <a target="_blank" rel="noopener noreferrer nofollow" 
href="https://www.wsj.com/business/jack-dorseys-block-to-lay-off-4-000-employees-in-ai-remake-28f0d869">https://www.wsj.com/business/jack-dorseys-block-to-lay-off-4-000-employees-in-ai-remake-28f0d869</a> </p><p>Claude Code Security <a target="_blank" rel="noopener noreferrer nofollow" href="https://www.anthropic.com/news/claude-code-security">https://www.anthropic.com/news/claude-code-security</a> </p><p>DeepSeek (Reuters) <a target="_blank" rel="noopener noreferrer nofollow" href="https://www.reuters.com/world/china/deepseek-withholds-latest-ai-model-us-chipmakers-including-nvidia-sources-say-2026-02-25/">https://www.reuters.com/world/china/deepseek-withholds-latest-ai-model-us-chipmakers-including-nvidia-sources-say-2026-02-25/</a> </p><p>Agentic boundaries <a target="_blank" rel="noopener noreferrer nofollow" href="https://vercel.com/blog/security-boundaries-in-agentic-architectures">https://vercel.com/blog/security-boundaries-in-agentic-architectures</a> </p><p>CISA KEV <a target="_blank" rel="noopener noreferrer nofollow" href="https://www.cisa.gov/news-events/alerts/2026/03/03/cisa-adds-two-known-exploited-vulnerabilities-catalog">https://www.cisa.gov/news-events/alerts/2026/03/03/cisa-adds-two-known-exploited-vulnerabilities-catalog</a> </p><p>mcp-atlassian CVE <a target="_blank" rel="noopener noreferrer nofollow" href="https://arcticwolf.com/resources/blog-uk/cve-2026-27825-critical-unauthenticated-rce-and-ssrf-in-mcp-atlassian/">https://arcticwolf.com/resources/blog-uk/cve-2026-27825-critical-unauthenticated-rce-and-ssrf-in-mcp-atlassian/</a> </p><p>Claude Cowork tasks <a target="_blank" rel="noopener noreferrer nofollow" href="https://support.claude.com/en/articles/13854387-schedule-recurring-tasks-in-cowork">https://support.claude.com/en/articles/13854387-schedule-recurring-tasks-in-cowork</a> </p><p>More: <a target="_blank" rel="noopener noreferrer nofollow" href="https://shipitweekly.fm">https://shipitweekly.fm</a></p>
18 MIN
Cloudflare BYOIP BGP Withdrawals, Clerk’s Postgres Query-Plan Flip Outage, and AWS Kiro Permissions Lessons (Grafana Privesc + runc CVEs)
FEB 27, 2026
<p>This week on <strong>Ship It Weekly,</strong> Brian covers three “automation meets reality” stories that every DevOps, SRE, and platform team can learn from.</p><p>Cloudflare accidentally withdrew customer BYOIP prefixes due to a buggy cleanup task, Clerk got knocked over by a Postgres auto-analyze query plan flip, and AWS responded to reports about its internal Kiro tooling by framing the incident as misconfigured access controls. Plus: a quick EKS node monitoring update, and a tight security lightning round.</p><p><strong>Links</strong></p><p>Cloudflare BYOIP outage postmortem <a target="_blank" rel="noopener noreferrer nofollow" href="https://blog.cloudflare.com/cloudflare-outage-february-20-2026/">https://blog.cloudflare.com/cloudflare-outage-february-20-2026/</a> </p><p>Clerk outage postmortem (Feb 19, 2026) <a target="_blank" rel="noopener noreferrer nofollow" href="https://clerk.com/blog/2026-02-19-system-outage-postmortem">https://clerk.com/blog/2026-02-19-system-outage-postmortem</a> </p><p>AWS outage report (Reuters) <a target="_blank" rel="noopener noreferrer nofollow" href="https://www.reuters.com/business/retail-consumer/amazons-cloud-unit-hit-by-least-two-outages-involving-ai-tools-ft-says-2026-02-20/">https://www.reuters.com/business/retail-consumer/amazons-cloud-unit-hit-by-least-two-outages-involving-ai-tools-ft-says-2026-02-20/</a> </p><p>AWS response on Kiro + access controls <a target="_blank" rel="noopener noreferrer nofollow" href="https://www.aboutamazon.com/news/aws/aws-service-outage-ai-bot-kiro">https://www.aboutamazon.com/news/aws/aws-service-outage-ai-bot-kiro</a></p><p>EKS Node Monitoring Agent (open source) <a target="_blank" rel="noopener noreferrer nofollow" href="https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-eks-node-monitoring-agent-open-source/">https://aws.amazon.com/about-aws/whats-new/2026/02/amazon-eks-node-monitoring-agent-open-source/</a></p><p>Grafana CVE-2026-21721 <a target="_blank" rel="noopener noreferrer 
nofollow" href="https://grafana.com/security/security-advisories/cve-2026-21721/">https://grafana.com/security/security-advisories/cve-2026-21721/</a></p><p>runc CVEs (AWS-2025-024) <a target="_blank" rel="noopener noreferrer nofollow" href="https://aws.amazon.com/security/security-bulletins/rss/aws-2025-024/">https://aws.amazon.com/security/security-bulletins/rss/aws-2025-024/</a> </p><p>GitLab patch releases <a target="_blank" rel="noopener noreferrer nofollow" href="https://about.gitlab.com/releases/2025/11/26/patch-release-gitlab-18-6-1-released/">https://about.gitlab.com/releases/2025/11/26/patch-release-gitlab-18-6-1-released/</a> </p><p>Atlassian Feb 2026 security bulletin <a target="_blank" rel="noopener noreferrer nofollow" href="https://confluence.atlassian.com/security/security-bulletin-february-17-2026-1722256046.html">https://confluence.atlassian.com/security/security-bulletin-february-17-2026-1722256046.html</a></p><p>Human story: SRE Is Anti-Transactional (ACM Queue) <a target="_blank" rel="noopener noreferrer nofollow" href="https://queue.acm.org/detail.cfm?id=3773094">https://queue.acm.org/detail.cfm?id=3773094</a></p><p>More episodes and show notes at <a target="_blank" rel="noopener noreferrer nofollow" href="https://shipitweekly.fm">https://shipitweekly.fm</a></p><p>On Call Briefs at: <a target="_blank" rel="noopener noreferrer nofollow" href="https://oncallbrief.com">https://oncallbrief.com</a></p>
17 MIN
Ship It Conversations: Mike Lady on Day Two Readiness + Guardrails in the AI Era
FEB 24, 2026
<p>This is a guest conversation episode of <strong>Ship It Weekly</strong> (separate from the weekly news recaps).</p><p>In this Ship It: Conversations episode I talk with <strong>Mike Lady</strong> (Senior DevOps Engineer, distributed systems) from <strong>Enterprise Vibe Code</strong> on YouTube. We talk day two readiness, guardrails/quality gates, and why shipping safely matters even more now that AI can generate code fast.</p><p>Highlights</p><ul><li>Day 0 vs Day 1 vs <strong>Day 2</strong> (launching vs operating and evolving safely)</li><li>What teams look like without guardrails (“hope is not a strategy”)</li><li>Why guardrails <strong>speed you up</strong> long-term (less firefighting, more predictable delivery)</li><li>Day-two audit checklist: source control/branches/PRs, branch protection, CI quality gates, secrets/config, staging→prod flow</li><li>AI agents: they’ll “lie, cheat, and steal” to satisfy the goal unless you gate them</li><li>Multi-model reviews (Claude/Gemini/Codex) as different perspectives</li><li>AI in prod: start read-only (logs/traces), then earn trust slowly</li></ul><p>Mike’s links</p><ul><li>YouTube: <a target="_blank" rel="noopener noreferrer nofollow" href="https://www.youtube.com/@EnterpriseVibeCode">https://www.youtube.com/@EnterpriseVibeCode</a></li><li>Site: <a target="_blank" rel="noopener noreferrer nofollow" href="https://www.enterprisevibecode.com/">https://www.enterprisevibecode.com/</a></li><li>LinkedIn: <a target="_blank" rel="noopener noreferrer nofollow" href="https://www.linkedin.com/in/mikelady/">https://www.linkedin.com/in/mikelady/</a></li></ul><p>Stuff mentioned</p><ul><li><em>Vibe Coding</em> (Gene Kim + Steve Yegge): <a target="_blank" rel="noopener noreferrer nofollow" href="https://www.simonandschuster.com/books/Vibe-Coding/Gene-Kim/9781966280026">https://www.simonandschuster.com/books/Vibe-Coding/Gene-Kim/9781966280026</a></li><li>Beads (agent memory/issue tracker): <a target="_blank" rel="noopener noreferrer 
nofollow" href="https://github.com/steveyegge/beads">https://github.com/steveyegge/beads</a></li><li>Gas Town (agent orchestration): <a target="_blank" rel="noopener noreferrer nofollow" href="https://github.com/steveyegge/gastown">https://github.com/steveyegge/gastown</a></li><li><a target="_blank" rel="noopener noreferrer nofollow" href="http://AGENTS.md">AGENTS.md</a> (agent instructions file): <a target="_blank" rel="noopener noreferrer nofollow" href="https://agents.md/">https://agents.md/</a></li><li>OpenAI Codex: <a target="_blank" rel="noopener noreferrer nofollow" href="https://openai.com/codex/">https://openai.com/codex/</a></li></ul><p>More episodes + details: <a target="_blank" rel="noopener noreferrer nofollow" href="https://shipitweekly.fm">https://shipitweekly.fm</a></p>
34 MIN