Episode 71: Durable Agents - How to Build AI Systems That Survive a Crash with Samuel Colvin
FEB 18, 2026 · 51 MIN
Description
<p><em>Our thesis is that </em><strong><em>AI is still just engineering</em></strong><em>… those people who tell us for fun and profit, that somehow AI is so, so profound, so new, so different from anything that’s gone before that it somehow eclipses the need for good engineering practice are wrong. We need that good engineering practice still, and for the most part, most things are not new. But </em><strong><em>there are some things that have become more important with AI</em></strong><em>. </em><strong><em>One of those is durability</em></strong><em>.</em></p><p><strong>Samuel Colvin</strong>, <strong>Creator of Pydantic AI</strong>, joins Hugo to talk about applying battle-tested software engineering principles to build durable and reliable AI agents.</p><p><strong>They Discuss:</strong></p><p>* <strong>Production agents require engineering-grade reliability:</strong> Unlike messy coding agents, production agents need high <strong>constraint</strong>, <strong>reliability</strong>, and the ability to perform <strong>hundreds of tasks without drifting</strong> into unusual behavior;</p><p>* <strong>Agents are the new “quantum” of AI software:</strong> Modern architecture uses discrete <strong>“agentlets”</strong>: small, specialized building blocks stitched together for <strong>sub-tasks</strong> within larger, <strong>durable systems</strong>;</p><p>* <strong>Stop building “chocolate teapot” execution frameworks:</strong> Ditch rudimentary snapshotting; use <strong>battle-tested durable execution engines</strong> like <strong>Temporal</strong> for robust <strong>retry logic</strong> and <strong>state management</strong>;</p><p>* <strong>AI observability will be a native feature:</strong> In five years, <strong>AI observability</strong> will be integrated, with <strong>token counts</strong> and <strong>prompt traces</strong> becoming standard features of all <strong>observability platforms</strong>;</p><p>* <strong>Split agents into deterministic workflows and 
stochastic activities:</strong> Ensure <strong>true durability</strong> by isolating <strong>deterministic workflow logic</strong> from <strong>stochastic activities</strong> (IO, LLM calls) to cache results and prevent <strong>redundant model calls</strong>;</p><p>* <strong>Type safety is essential for enterprise agents:</strong> Sacrificing <strong>type safety</strong> for flexible graphs leads to unmaintainable software; professional AI engineering demands strict <strong>type definitions</strong> for <strong>parallel node execution</strong> and <strong>state recovery</strong>;</p><p>* <strong>Standardize on OpenTelemetry for portability:</strong> Use <strong>OpenTelemetry (OTel)</strong> to ensure <strong>agent traces</strong> and <strong>logs</strong> are portable, preventing <strong>vendor lock-in</strong> and integrating seamlessly into existing enterprise monitoring.</p><p>You can also find the full episode on <a target="_blank" href="https://open.spotify.com/show/3yuz89gqAhcMcdy3SZPe4X?si=AKl2jvIARD2Liw1bBH2Nng&nd=1&dlsi=8dfe7221896c4fc3">Spotify</a>, <a target="_blank" href="https://podcasts.apple.com/us/podcast/vanishing-gradients/id1610318868">Apple Podcasts</a>, and <a target="_blank" href="https://youtu.be/qM9wQxSM1ow">YouTube</a>.</p><p>You can also interact directly with the transcript here in <a target="_blank" href="https://notebooklm.google.com/notebook/f4acc37b-078e-4b00-ba6f-c2caa67e9533">NotebookLM</a>. If you do, let us know what you find in the comments!</p><p>👉 <strong><em>Want to learn more about Building AI-Powered Software? Check out our </em></strong><a target="_blank" href="http://xxx"><strong><em>Building AI Applications course</em></strong></a>. It’s a live cohort with hands-on exercises and office hours. Here is a <a target="_blank" href="https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles?promoCode=vgfs">25% discount code for listeners</a>. 
👈</p><p>LINKS</p><p>* <a target="_blank" href="https://www.linkedin.com/in/samuel-colvin/">Samuel Colvin on LinkedIn</a></p><p>* <a target="_blank" href="https://pydantic.dev/">Pydantic</a></p><p>* <a target="_blank" href="https://github.com/pydantic/pydantic-stack-demo">Pydantic Stack Demo repo</a></p><p>* <a target="_blank" href="https://github.com/pydantic/pydantic-stack-demo/blob/main/durable-exec/deep_research.py">Deep research example code</a></p><p>* <a target="_blank" href="https://temporal.io/">Temporal</a></p><p>* <a target="_blank" href="https://docs.dbos.dev/">DBOS (Postgres-backed alternative to Temporal)</a></p><p>* <a target="_blank" href="https://luma.com/calendar/cal-8ImWFDQ3IEIxNWk">Upcoming Events on Luma</a></p><p>* <a target="_blank" href="https://www.youtube.com/@vanishinggradients">Vanishing Gradients on YouTube</a></p><p>* <a target="_blank" href="https://www.youtube.com/live/Qr4eiLbCfg4">Watch the podcast video on YouTube</a></p><p>👉 Want to learn more about Building AI-Powered Software? Check out our <a target="_blank" href="https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles">Building AI Applications course</a>. It’s a live cohort with hands-on exercises and office hours. Our final cohort starts March 10, 2026. Here is a <a target="_blank" href="https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles?promoCode=vgfs">25% discount code for listeners</a>. 👈</p> <br/><br/>Get full access to Vanishing Gradients at <a href="https://hugobowne.substack.com/subscribe?utm_medium=podcast&utm_campaign=CTA_4">hugobowne.substack.com/subscribe</a>
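<p>The "deterministic workflows and stochastic activities" split mentioned in the episode notes can be illustrated with a small, library-free sketch. This is not Temporal's, DBOS's, or Pydantic AI's API: the <code>Journal</code> class, <code>workflow</code>, <code>demo</code>, and the fake model function are hypothetical names invented for illustration, and the journal lives in memory where a real engine would persist every entry durably. The idea it shows is that each activity result is recorded once, so re-running the deterministic workflow after a crash replays recorded results instead of calling the model again.</p>

```python
import json
from typing import Any, Callable


class Journal:
    """Hypothetical append-only record of completed activity results."""

    def __init__(self, raw: str = "[]") -> None:
        self.entries = json.loads(raw)  # results recorded by earlier runs
        self.cursor = 0                 # position of the next activity

    def run(self, activity: Callable[..., Any], *args: Any) -> Any:
        """Replay the recorded result if this step already ran, else run it."""
        if self.cursor < len(self.entries):
            result = self.entries[self.cursor]  # replay: no real call is made
        else:
            result = activity(*args)            # first execution: real call
            self.entries.append(result)         # a real engine persists here
        self.cursor += 1
        return result

    def dump(self) -> str:
        return json.dumps(self.entries)


def workflow(journal: Journal, model: Callable[[str], str],
             crash_after_first_step: bool = False) -> str:
    """Deterministic logic: all non-deterministic work goes through journal.run."""
    plan = journal.run(model, "plan the research")
    if crash_after_first_step:
        raise RuntimeError("simulated crash")
    return journal.run(model, f"write report from {plan}")


def demo() -> tuple:
    calls = [0]  # counts how often the (fake) model is really invoked

    def fake_model(prompt: str) -> str:
        # Stand-in for an LLM call; assumption for illustration only.
        calls[0] += 1
        return f"answer to: {prompt}"

    journal = Journal()
    try:
        workflow(journal, fake_model, crash_after_first_step=True)
    except RuntimeError:
        pass  # the process "died" after the first activity completed

    # Restart: rebuild the journal from its saved form and re-run the workflow.
    resumed = Journal(journal.dump())
    report = workflow(resumed, fake_model)
    return report, calls[0]


if __name__ == "__main__":
    print(demo())
```

<p>In this sketch the resumed run reuses the recorded "plan" result and only makes the second model call. Replay is only safe because the workflow function itself has no side effects or randomness outside <code>journal.run</code>.</p>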