Vanishing Gradients | Podcast Guru

Recent Episodes

Episode 72: Why Agents Solve the Wrong Problem (and What Data Scientists Do Instead)

MAR 20, 2026

<p><em>I often see what I would consider to be </em><strong><em>b******t evals</em></strong><em>, especially in data, like write this </em><strong><em>dumb SQL</em></strong><em>. Almost every one of these </em><strong><em>dumb SQL</em></strong><em> questions that I’ve seen for benchmarks are just so either obviously easy or overwhelmingly adversarial. They just, they </em><strong><em>don’t feel valuable</em></strong><em> as a </em><strong><em>data scientist</em></strong><em>, it’s something that you probably would never ask a real data scientist to do. So I went </em><strong><em>out of my way to create real ones. Let me read one to you.</em></strong></p><p><strong>Bryan Bischof</strong>, <strong>Head of AI</strong> at <strong>Theory Ventures</strong>, joins Hugo to talk about what happened when <strong>150 people</strong> spent <strong>six hours</strong> using <strong>AI agents</strong> to answer <strong>real data science questions</strong> across <strong>SQL tables</strong>, <strong>log files</strong>, and <strong>750,000 PDFs</strong>.</p><p><strong>They Discuss:</strong></p><p>* <strong>Failure Funnels</strong>, pinpoint where <strong>agent reasoning breaks down</strong> using causal-chain binary evaluations instead of vague 1-5 scales;</p><p>* <strong>Median Score: 23 out of 65</strong>, what happened when world-class engineers turned agents loose on real data work, and why <strong>general-purpose coding agents</strong> with human prodding beat fancy frameworks;</p><p>* <strong>Zero-Cost Submissions Kill Trust</strong>, without a penalty for wrong answers, agents <strong>hill-climb</strong> to correct submissions through brute force instead of building confidence;</p><p>* <strong>Data Science is “Zooming”</strong>, moving beyond binary decisions to iterative <strong>problem framing</strong>, refining “does our inventory suck?” into a tractable hypothesis;</p><p>* <strong>MCP as Semantic Layer</strong>, model your organization’s <strong>proprietary 
knowledge</strong> once and distribute it to whatever LLM interface your team prefers;</p><p>* <strong>The Subagent vs. Tool Debate</strong>, a distinction that adds <strong>cognitive load</strong> without hiding complexity;</p><p>* <strong>Self-Orchestration Gap</strong>, agents don’t yet realize they should trigger specialized extraction frameworks like <strong>DocETL</strong> instead of reading 750K PDFs one by one;</p><p>* <strong>The Future of Evals</strong>, from vibe checks to <strong>objective functions</strong> and continuous user feedback that lets systems converge on reliability.</p><p>You can also find the full episode on <a target="_blank" href="https://open.spotify.com/show/3yuz89gqAhcMcdy3SZPe4X?si=AKl2jvIARD2Liw1bBH2Nng&nd=1&dlsi=8dfe7221896c4fc3">Spotify</a>, <a target="_blank" href="https://podcasts.apple.com/us/podcast/vanishing-gradients/id1610318868">Apple Podcasts</a>, and <a target="_blank" href="https://youtube.com/live/seh9oVngJJQ?feature=share">YouTube</a>.</p><p><a target="_blank" href="https://notebooklm.google.com/notebook/8d091eee-7a65-4212-b04d-cb52f00ea00a">You can also interact directly with the transcript here in NotebookLM</a>: If you do so, let us know anything you find in the comments!</p><p>👉 <strong><em>Want to learn more about Building AI-Powered Software? Check out our </em></strong><a target="_blank" href="https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles"><strong><em>Building AI Applications course</em></strong></a>. It’s a live cohort with hands-on exercises and office hours. <strong>Our final cohort has started</strong>. Registration is still open. <strong>All sessions are recorded</strong> so don’t worry about having missed any. Here is a <a target="_blank" href="https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles?promoCode=vgfs"><strong><em>25% discount code for readers</em></strong></a>. 
👈</p><p><strong>LINKS</strong></p><p>* <a target="_blank" href="https://x.com/BEBischof">Bryan Bischof on Twitter/X</a></p><p>* <a target="_blank" href="https://www.linkedin.com/in/bryan-bischof/">Bryan Bischof on LinkedIn</a></p><p>* <a target="_blank" href="https://theoryvc.com/">Theory Ventures</a></p><p>* <a target="_blank" href="https://theoryvc.com/blog-posts/the-hunt-for-a-trustworthy-data-agent">The Hunt for a Trustworthy Data Agent (blog post)</a></p><p>* <a target="_blank" href="https://github.com/TheoryVentures/antm">America’s Next Top Modeler GitHub repo</a></p><p>* <a target="_blank" href="https://hamel.dev/blog/posts/evals-faq/how-do-i-evaluate-agentic-workflows.html">Hamel’s evals FAQ: How do I evaluate agentic workflows?</a></p><p>* <a target="_blank" href="https://www.docetl.org/">DocETL</a></p><p>* <a target="_blank" href="https://hugobowne.substack.com/p/llm-judges-and-ai-agents-at-scale">LLM Judges and AI Agents at Scale (Hugo’s podcast with Shreya Shankar)</a></p><p>* <a target="_blank" href="https://www.cimolabs.com/blog/metrics-lying">When Your Metrics Are Lying (Cimo Labs)</a></p><p>* <a target="_blank" href="https://youtube.com/live/c0gcsprsFig?feature=share">Lessons from a Year of Building with LLMs (livestream on YouTube)</a></p><p>* <a target="_blank" href="https://www.youtube.com/watch?v=zqjnEptOn4k">Bryan Bischof: The Map is Not the Territory (YouTube)</a></p><p>* <a target="_blank" href="https://luma.com/calendar/cal-8ImWFDQ3IEIxNWk">Upcoming Events on Luma</a></p><p>* <a target="_blank" href="https://www.youtube.com/@vanishinggradients">Vanishing Gradients on YouTube</a></p><p>* <a target="_blank" href="https://youtube.com/live/seh9oVngJJQ">Watch the podcast video on YouTube</a></p> <br/><br/>Get full access to Vanishing Gradients at <a href="https://hugobowne.substack.com/subscribe?utm_medium=podcast&utm_campaign=CTA_4">hugobowne.substack.com/subscribe</a>

93 MIN

Episode 71: Durable Agents - How to Build AI Systems That Survive a Crash with Samuel Colvin

FEB 18, 2026

<p><em>Our thesis is that </em><strong><em>AI is still just engineering</em></strong><em>… those people who tell us for fun and profit, that somehow AI is so, so profound, so new, so different from anything that’s gone before that it somehow eclipses the need for good engineering practice are wrong. We need that good engineering practice still, and for the most part, most things are not new. But </em><strong><em>there are some things that have become more important with AI</em></strong><em>. </em><strong><em>One of those is durability</em></strong><em>.</em></p><p><strong>Samuel Colvin</strong>, <strong>Creator of Pydantic AI</strong>, joins Hugo to talk about applying battle-tested software engineering principles to build durable and reliable AI agents.</p><p><strong>They Discuss:</strong></p><p>* <strong>Production agents require engineering-grade reliability:</strong> Unlike messy coding agents, production agents need high <strong>constraint</strong>, <strong>reliability</strong>, and the ability to perform <strong>hundreds of tasks without drifting</strong> into unusual behavior;</p><p>* <strong>Agents are the new “quantum” of AI software:</strong> Modern architecture uses discrete <strong>“agentlets”</strong>: small, specialized building blocks stitched together for <strong>sub-tasks</strong> within larger, <strong>durable systems</strong>;</p><p>* <strong>Stop building “chocolate teapot” execution frameworks:</strong> Ditch rudimentary snapshotting; use <strong>battle-tested durable execution engines</strong> like <strong>Temporal</strong> for robust <strong>retry logic</strong> and <strong>state management</strong>;</p><p>* <strong>AI observability will be a native feature:</strong> In five years, <strong>AI observability</strong> will be integrated, with <strong>token counts</strong> and <strong>prompt traces</strong> becoming standard features of all <strong>observability platforms</strong>;</p><p>* <strong>Split agents into deterministic workflows and 
stochastic activities:</strong> Ensure <strong>true durability</strong> by isolating <strong>deterministic workflow logic</strong> from <strong>stochastic activities</strong> (IO, LLM calls) to cache results and prevent <strong>redundant model calls</strong>;</p><p>* <strong>Type safety is essential for enterprise agents:</strong> Sacrificing <strong>type safety</strong> for flexible graphs leads to unmaintainable software; professional AI engineering demands strict <strong>type definitions</strong> for <strong>parallel node execution</strong> and <strong>state recovery</strong>;</p><p>* <strong>Standardize on OpenTelemetry for portability:</strong> Use <strong>OpenTelemetry (OTel)</strong> to ensure <strong>agent traces</strong> and <strong>logs</strong> are portable, preventing <strong>vendor lock-in</strong> and integrating seamlessly into existing enterprise monitoring.</p><p>You can also find the full episode on <a target="_blank" href="https://open.spotify.com/show/3yuz89gqAhcMcdy3SZPe4X?si=AKl2jvIARD2Liw1bBH2Nng&nd=1&dlsi=8dfe7221896c4fc3">Spotify</a>, <a target="_blank" href="https://podcasts.apple.com/us/podcast/vanishing-gradients/id1610318868">Apple Podcasts</a>, and <a target="_blank" href="https://youtu.be/qM9wQxSM1ow">YouTube</a>.</p><p>You can also interact directly with the transcript here in <a target="_blank" href="https://notebooklm.google.com/notebook/f4acc37b-078e-4b00-ba6f-c2caa67e9533">NotebookLM</a>: If you do so, let us know anything you find in the comments!</p><p>👉 <strong><em>Want to learn more about Building AI-Powered Software? Check out our </em></strong><a target="_blank" href="https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles"><strong><em>Building AI Applications course</em></strong></a>. It’s a live cohort with hands-on exercises and office hours. Our final cohort starts March 10, 2026. Here is a <a target="_blank" href="https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles?promoCode=vgfs">25% discount code for listeners</a>. 
👈</p><p>LINKS</p><p>* <a target="_blank" href="https://www.linkedin.com/in/samuel-colvin/">Samuel Colvin on LinkedIn</a></p><p>* <a target="_blank" href="https://pydantic.dev/">Pydantic</a></p><p>* <a target="_blank" href="https://github.com/pydantic/pydantic-stack-demo">Pydantic Stack Demo repo</a></p><p>* <a target="_blank" href="https://github.com/pydantic/pydantic-stack-demo/blob/main/durable-exec/deep_research.py">Deep research example code</a></p><p>* <a target="_blank" href="https://temporal.io/">Temporal</a></p><p>* <a target="_blank" href="https://docs.dbos.dev/">DBOS (Postgres-backed alternative to Temporal)</a></p><p>* <a target="_blank" href="https://luma.com/calendar/cal-8ImWFDQ3IEIxNWk">Upcoming Events on Luma</a></p><p>* <a target="_blank" href="https://www.youtube.com/@vanishinggradients">Vanishing Gradients on YouTube</a></p><p>* <a target="_blank" href="https://www.youtube.com/live/Qr4eiLbCfg4">Watch the podcast video on YouTube</a></p>

51 MIN

Episode 70: 1,400 Production AI Deployments

FEB 12, 2026

<p><em>There’s a company who spent almost $50,000 because an agent went into an infinite loop and they forgot about it for a month.</em></p><p><em>It had no failures and I guess no one was monitoring these costs. It’s nice that people do write about that in the database as well. After it happened, they said: watch out for infinite loops. Watch out for cascading tool failures. Watch out for silent failures where the agent reports it has succeeded when it didn’t!</em></p><p><strong>We Discuss:</strong></p><p>* Why the most successful teams are <strong>ripping out and rebuilding their agent systems every few weeks</strong> as models improve, and why over-engineering now creates technical debt you can’t afford later;</p><p>* The <strong>$50,000 infinite loop disaster</strong> and why “silent failures” are the biggest risk in production: agents confidently report success while spiraling into expensive mistakes;</p><p>* How <strong>ELIOS built emergency voice agents</strong> with sub-400ms response times by aggressively throwing away context every few seconds, and why these extreme patterns are becoming standard practice;</p><p>* Why <strong>DoorDash uses a three-tier agent architecture</strong> (manager, progress tracker, and specialists) with a persistent workspace that lets agents collaborate across hours or days;</p><p>* Why simple <strong>text files and markdown</strong> are emerging as the best “continual learning” layer: human-readable memory that persists across sessions without fine-tuning models;</p><p>* The <strong>100-to-1 problem</strong>: for every useful output, tool-calling agents generate 100 tokens of noise, and the three tactics (reduce, offload, isolate) teams use to manage it;</p><p>* Why companies are <strong>choosing Gemini Flash for document processing and Opus for long reasoning chains</strong>, and how to match models to your actual usage patterns;</p><p>* The debate over <strong>vector databases versus simple grep and cat</strong>, and why 
giving agents standard command-line tools often beats complex APIs;</p><p>* What <strong>“re-architect” as a job title</strong> reveals about the shift from 70% scaffolding / 30% model to 90% model / 10% scaffolding, and why knowing when to rip things out may be the most important skill today.</p><p>You can also find the full episode on <a target="_blank" href="https://open.spotify.com/show/3yuz89gqAhcMcdy3SZPe4X?si=AKl2jvIARD2Liw1bBH2Nng&nd=1&dlsi=8dfe7221896c4fc3">Spotify</a>, <a target="_blank" href="https://podcasts.apple.com/us/podcast/vanishing-gradients/id1610318868">Apple Podcasts</a>, and <a target="_blank" href="https://www.youtube.com/live/uf80BfD70Lw?si=RtkR2C5aYqBea2Us">YouTube</a>.</p><p><a target="_blank" href="https://notebooklm.google.com/notebook/ceef53be-ffe8-47d5-8850-07335c434100">You can also interact directly with the transcript here in NotebookLM</a>: If you do so, let us know anything you find in the comments!</p><p>👉 <strong><em>Want to learn more about Building AI-Powered Software? Check out our </em></strong><a target="_blank" href="https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles?promoCode=ss-rav"><strong><em>Building AI Applications course</em></strong></a>. It’s a live cohort with hands-on exercises and office hours. <strong>Our final cohort starts March 10, 2026</strong>. Here is a <a target="_blank" href="https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles?promoCode=vgch">25% discount code</a> for readers. 
👈</p><p>Show Notes Links</p><p>* <a target="_blank" href="https://www.linkedin.com/in/strickvl/">Alex Strick van Linschoten on LinkedIn</a></p><p>* <a target="_blank" href="https://x.com/strickvl">Alex Strick van Linschoten on Twitter/X</a></p><p>* <a target="_blank" href="https://www.zenml.io/llmops-database">LLMOps Database</a></p><p>* <a target="_blank" href="https://huggingface.co/datasets/zenml/llmops-database">LLMOps Database Dataset on Hugging Face</a></p><p>* <a target="_blank" href="https://huggingface.co/spaces/hugobowne/llmops-database-mcp">Hugo’s MCP Server for LLMOps Database</a></p><p>* <a target="_blank" href="https://www.zenml.io/blog/what-1200-production-deployments-reveal-about-llmops-in-2025">Alex’s Blog: What 1,200+ Production Deployments Reveal About LLMOps in 2025</a></p><p>* <a target="_blank" href="https://hugobowne.substack.com/p/practical-lessons-from-750-real-world">Previous Episode: Practical Lessons from 750 Real-World LLM Deployments</a></p><p>* <a target="_blank" href="https://hugobowne.substack.com/p/episode-43-tales-from-400-llm-deployments-f60">Previous Episode: Tales from 400 LLM Deployments</a></p><p>* <a target="_blank" href="https://research.trychroma.com/context-rot">Context Rot Research by Chroma</a></p><p>* <a target="_blank" href="https://hugobowne.substack.com/p/ai-agent-harness-3-principles-for">Hugo’s Post: AI Agent Harness - 3 Principles for Context Engineering</a></p><p>* <a target="_blank" href="https://hugobowne.substack.com/p/the-rise-of-agentic-search">Hugo’s Post: The Rise of Agentic Search</a></p><p>* <a target="_blank" href="https://high-signal.delphina.ai/episode/the-post-coding-era-what-happens-when-ai-writes-the-system">Episode with Nick Moy: The Post-Coding Era</a></p><p>* <a target="_blank" href="https://gist.github.com/hugobowne/959419146f1a8276c78511e801b85e40">Hugo’s Personal Podcast Prep Skill Gist</a></p><p>* <a target="_blank" 
href="https://platform.claude.com/docs/en/agents-and-tools/tool-use/tool-search-tool">Claude Tool Search Documentation</a></p><p>* <a target="_blank" href="https://github.com/steveyegge/gastown">Gastown on GitHub (Steve Yegge)</a></p><p>* <a target="_blank" href="https://steve-yegge.medium.com/welcome-to-gas-town-4f25ee16dd04">Welcome to Gastown by Steve Yegge</a></p><p>* <a target="_blank" href="https://www.zenml.io">ZenML - Open Source MLOps & LLMOps Framework</a></p><p>* <a target="_blank" href="https://luma.com/calendar/cal-8ImWFDQ3IEIxNWk">Upcoming Events on Luma</a></p><p>* <a target="_blank" href="https://www.youtube.com/@vanishinggradients">Vanishing Gradients on YouTube</a></p><p>* <a target="_blank" href="https://www.youtube.com/live/uf80BfD70Lw?si=RtkR2C5aYqBea2Us">Watch the podcast livestream on YouTube</a></p><p>* <a target="_blank" href="https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles?promoCode=vgfs">Join the final cohort of our Building AI Applications course in March, 2026 (25% off for listeners)</a></p>

69 MIN

Episode 69: Python is Dead. Long Live Python! With the Creators of pandas & Parquet

FEB 3, 2026

<p><em>It’s the agent writing the code. And it’s the development loop of writing the code, building testing, write the code, build test and iterating. And so</em><strong><em> I do think we’ll see for many types of software, a shift away from Python towards other programming languages</em></strong><em>. I think Go is probably the best language for those like other types of software projects. And like I said, I haven’t written a line of Go code in my life.</em></p><p><em>– Wes McKinney </em>(<strong>creator</strong> of <strong>pandas</strong>, <strong>Principal Architect</strong> at <strong>Posit</strong>)</p><p><strong>Wes McKinney</strong>, <strong>Marcel Kornacker</strong>, and <strong>Alison Hill</strong> join Hugo to talk about the architectural shift for multimodal AI, the rise of “agent ergonomics,” and the evolving role of developers in an AI-generated future.</p><p><strong>We Discuss:</strong></p><p>* <strong>Agent Ergonomics:</strong> Optimize for <strong>agent iteration speed</strong>, shifting from human coding to fast test environments, potentially favoring languages like <strong>Go</strong>;</p><p>* <strong>Adversarial Code Review:</strong> Deploy <strong>diverse AI models</strong> to <strong>peer-review agent-generated code</strong>, catching subtle bugs humans miss;</p><p>* <strong>Multimodal Data Verbs:</strong> Make operations like <strong>resizing</strong> and <strong>rotating</strong> native to your database to eliminate <strong>data-plumbing bottlenecks</strong>;</p><p>* <strong>Taste as Differentiator:</strong> Value <strong>“taste”</strong> (the ability to <strong>curate</strong> and refine the best output from countless <strong>AI-generated options</strong>) over sheer execution speed;</p><p>* <strong>100x Software Volume:</strong> Embrace <strong>ephemeral, just-in-time software</strong>; prioritize aggressive generation and <strong>adversarial testing</strong> over careful planning for quality.</p><p>You can also find the full episode on <a 
target="_blank" href="https://open.spotify.com/show/3yuz89gqAhcMcdy3SZPe4X?si=AKl2jvIARD2Liw1bBH2Nng&nd=1&dlsi=d536dc4dceca4619">Spotify</a>, <a target="_blank" href="https://podcasts.apple.com/us/podcast/vanishing-gradients/id1610318868">Apple Podcasts</a>, and <a target="_blank" href="https://youtube.com/live/UwdpNxHZDwI?feature=share">YouTube</a>.</p><p><a target="_blank" href="https://notebooklm.google.com/notebook/6225d2fa-6a36-4a48-a0ab-131736c1652e">You can also interact directly with the transcript of the workshop & fireside chat here in NotebookLM</a>: If you do so, let us know anything you find in the comments!</p><p>👉 <strong><em>Want to learn more about Building AI-Powered Software? Check out our </em></strong><a target="_blank" href="https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles?promoCode=vgfs"><strong><em>Building AI Applications course</em></strong></a>. It’s a live cohort with hands-on exercises and office hours. Here is a <a target="_blank" href="https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles?promoCode=vgfs">discount code</a> for readers. 👈</p><p>This was a fireside chat at the end of a livestreamed workshop we did on building multimodal AI systems with Pixeltable. 
Check out the full workshop below (all code here on GitHub):</p><p><strong>Links and Resources</strong></p><p>* <a target="_blank" href="https://www.linkedin.com/in/wesmckinn/">Wes McKinney on LinkedIn</a></p><p>* <a target="_blank" href="https://www.linkedin.com/in/marcelkornacker/">Marcel Kornacker on LinkedIn</a></p><p>* <a target="_blank" href="https://www.linkedin.com/in/apreshill/">Alison Hill on LinkedIn</a></p><p>* <a target="_blank" href="https://spicytakes.org">Spicy Takes</a></p><p>* <a target="_blank" href="https://allisonhorst.github.io/palmerpenguins/">Palmer Penguins</a></p><p>* <a target="_blank" href="https://pixeltable.com/">Pixeltable</a></p><p>* <a target="_blank" href="https://posit.co/">Posit</a></p><p>* <a target="_blank" href="https://positron.posit.co/">Positron</a></p><p>* <a target="_blank" href="https://github.com/apreshill/pxt-primetime">Building Multimodal AI Systems Workshop Repository</a></p><p>* <a target="_blank" href="https://docs.pixeltable.com/howto/cookbooks/agents/llm-tool-calling#using-mcp-servers-as-tools">Pixeltable Docs: LLM Tool Calling with MCP Servers</a></p><p>* <a target="_blank" href="https://docs.pixeltable.com/howto/providers/working-with-pydantic">Pixeltable Docs: Working with Pydantic</a></p><p>* <a target="_blank" href="https://lu.ma/vanishinggradients">Upcoming Events on Luma</a></p><p>* <a target="_blank" href="https://www.youtube.com/@vanishinggradients">Vanishing Gradients on YouTube</a></p><p>* <a target="_blank" href="https://youtube.com/live/UwdpNxHZDwI?feature=share">Watch the podcast video on YouTube</a></p><p>* <a target="_blank" href="https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles?promoCode=vgfs">Join the final cohort of our Building AI Applications course in March, 2026 (25% off for listeners)</a></p><p><strong>What people said during the 
workshop</strong></p><p>“I think the interface looks amazing/simple. Strong work! 🦾” – @goldentribe</p><p>“This is quite amazing. Watching this I felt the same way when I first learnt pandas, NumPy and scikit and how well I was able to manipulate and wrangle data. PixelTable feels seamless and looks as good as those legendary frameworks but for Multimodal Data.” – @vinod7</p><p>“This is all extremely cool to see, I love the API and the approach.” – @steveb4191</p><p>“Thanks so much, Hugo! That was very insightful! Great work Alison and Marcel!” – @vinod7</p><p>“Just wrapped up watching a replay of the Pixeltable workshop. So cool!! Love the notebooks and working examples. The important parts were covered and worked beautifully 🕺” – @therobbrennan</p>

55 MIN

Episode 68: A Builder’s Guide to Agentic Search & Retrieval with Doug Turnbull & John Berryman

JAN 23, 2026

<p><em>The best way to build a horrible search product? Don’t ever measure anything against what a user wants.</em></p><p>Search veterans <strong>Doug Turnbull</strong> (Led Search at Reddit + Shopify; Wrote <em>Relevant Search</em> + <em>AI-Powered Search</em>) and <strong>John Berryman</strong> (Early Engineer on GitHub Copilot; Author of <em>Relevant Search </em>+ <em>Prompt Engineering for LLMs</em>) join Hugo to talk about how to build Agentic Search Applications.</p><p><strong>We Discuss:</strong></p><p>* The evolution of <strong>information retrieval</strong> as it moves from <strong>traditional keyword search</strong> toward “<strong>agentic search</strong>” and what this means for <strong>builders</strong>;</p><p>* John’s <strong>five-level maturity model</strong> (you can prototype today!) for AI adoption, moving from <strong>Trad Search </strong>to<strong> conversational AI</strong> to <strong>asynchronous research assistants</strong> that reason about result quality;</p><p>* The <strong>Agentic Search Builders Playbook</strong>, including why and how <em>you should “hand-roll” your own agentic loops</em> to maintain control;</p><p>* The importance of <strong>“revealed preferences”</strong> that <strong>LLM-judges</strong> often miss (<strong>evaluations</strong> must use real clickstream data to capture “revealed preferences” that semantic relevance alone cannot infer);</p><p>* <strong>Patterns</strong> and <strong>Anti-Patterns</strong> for <strong>Agentic Search</strong> Applications;</p><p>* <strong>Learning</strong> and <strong>teaching Search</strong> in the <em>Age of Agents</em>.</p><p>You can find the full episode on <a target="_blank" href="https://open.spotify.com/show/3yuz89gqAhcMcdy3SZPe4X?si=AKl2jvIARD2Liw1bBH2Nng&nd=1&dlsi=2d2146ac32d54d73">Spotify</a>, <a target="_blank" href="https://podcasts.apple.com/us/podcast/vanishing-gradients/id1610318868">Apple Podcasts</a>, and <a target="_blank" 
href="https://youtube.com/live/H6ua9HjGq60?feature=share">YouTube</a>.</p><p><a target="_blank" href="https://notebooklm.google.com/notebook/a245e43c-3fa0-4f0b-a375-18aa14fd9364">You can also interact directly with the transcript here in NotebookLM</a>: If you do so, let us know anything you find in the comments!</p><p>👉 <strong><em>Want to learn more about Building AI-Powered Software? Check out our </em></strong><a target="_blank" href="https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles?promoCode=vgpod"><strong><em>Building AI Applications course</em></strong></a>. It’s a live cohort with hands-on exercises and office hours. Here is a <a target="_blank" href="https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles?promoCode=vgpod">discount code</a> for readers. 👈</p><p>Doug and Hugo are also doing a free lightning lesson on Feb 20 about <a target="_blank" href="https://maven.com/p/b297f3/how-to-build-your-first-agentic-search-application"><strong><em>How To Build Your First Agentic Search Application</em></strong></a><em>! </em>You’ll walk away with a <strong>framework</strong> & <strong>code</strong> to <strong>build your first agentic search app</strong>. 
<a target="_blank" href="https://maven.com/p/b297f3/how-to-build-your-first-agentic-search-application">Register here to join live or get the recording after</a>.</p><p></p><p></p><p>Links and Resources</p><p>Guests</p><p>* <a target="_blank" href="https://arcturus-labs.com/">Arcturus Labs (John’s website)</a></p><p>* <a target="_blank" href="https://softwaredoug.com/">Software Doug (Doug’s website)</a></p><p>* <a target="_blank" href="https://www.linkedin.com/in/john-berryman-864b1713/">John Berryman on LinkedIn</a></p><p>* <a target="_blank" href="https://www.linkedin.com/in/softwaredoug/">Doug Turnbull on LinkedIn</a></p><p>Books</p><p>* <a target="_blank" href="https://www.manning.com/books/relevant-search">Relevant Search by Doug Turnbull & John Berryman (Manning)</a></p><p>* <a target="_blank" href="https://www.manning.com/books/ai-powered-search">AI-Powered Search by Doug Turnbull (Manning)</a></p><p>* <a target="_blank" href="https://www.oreilly.com/library/view/prompt-engineering-for/9781098156145/">Prompt Engineering for LLMs by John Berryman (O’Reilly)</a></p><p>Blog Posts</p><p>* <a target="_blank" href="https://arcturus-labs.com/blog/2026/01/18/incremental-ai-adoption-for-e-commerce/">Incremental AI Adoption for E-commerce by John Berryman</a></p><p>* <a target="_blank" href="https://arcturus-labs.com/blog/2024/11/21/roaming-rag--rag-without-the-vector-database/">Roaming RAG – RAG without the Vector Database by John Berryman</a></p><p>* <a target="_blank" href="https://softwaredoug.com/blog/2025/09/22/reasoning-agents-need-bad-search">Agents Turn Simple Keyword Search into Compelling Search Experiences by Doug Turnbull</a></p><p>* <a target="_blank" href="https://softwaredoug.com/blog/2025/10/15/a-simple-agentic-loop-with-just-python-functions">A Simple Agentic Loop with Just Python Functions by Doug Turnbull</a></p><p>* <a target="_blank" href="https://softwaredoug.com/blog/2025/10/19/agentic-code-generation-to-optimize-a-search-reranker">Agentic Code 
Generation to Optimize a Search Reranker by Doug Turnbull</a></p><p>* <a target="_blank" href="https://softwaredoug.com/blog/2025/11/02/llm-judges-arent-the-shortcut-you-think">LLM Judges Aren’t the Shortcut You Think by Doug Turnbull (Hugo’s 5-minute video below)</a></p><p>* <a target="_blank" href="https://www.inkandswitch.com/essay/malleable-software/">Malleable Software by Ink & Switch (incl. Geoffrey Litt)</a></p><p>* <a target="_blank" href="https://hugobowne.substack.com/p/patterns-and-anti-patterns-for-building">Patterns and Anti-Patterns for Building with AI by Hugo Bowne-Anderson</a></p><p>Other Resources</p><p>* <a target="_blank" href="https://hugobowne.substack.com/p/the-rise-of-agentic-search">The Rise of Agentic Search, a recent VG Podcast with Jeff Huber</a></p><p>* <a target="_blank" href="https://x.com/karpathy/status/1938626382248149433">Karpathy on Cognitive Core LLMs</a></p><p>* <a target="_blank" href="https://maven.com/softwaredoug/cheat-at-search?utm_campaign=eeeda8&utm_medium=partner&utm_source=instructor">Cheat at Search with Agents course by Doug Turnbull (use code: vanishinggradients for $200 off)</a></p><p>* <a target="_blank" href="https://luma.com/calendar/cal-8ImWFDQ3IEIxNWk">Upcoming Events on Luma</a></p><p>* <a target="_blank" href="https://www.youtube.com/@vanishinggradients">Vanishing Gradients on YouTube</a></p><p>* <a target="_blank" href="https://www.youtube.com/live/H6ua9HjGq60?si=hMP7-BXJtOK7Czbd">Watch the podcast video on YouTube</a></p><p>* <a target="_blank" href="https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles?promoCode=vgpod">Join the final cohort of our Building AI Applications course in Q1, 2026 (25% off for listeners)</a></p><p></p><p></p><p><strong>Timestamps (for YouTube livestream)</strong></p><p>00:00 How to Build Agentic Search & Retrieval Systems</p><p>02:48 Defining Search and AI</p><p>03:26 Evolution of Search Technologies</p><p>08:46 Search in E-commerce and Other Domains</p><p>12:15 
Combining Search and AI: RAG and LLMs</p><p>23:50 User Intent and Search Optimization</p><p>29:47 Levels of AI Integration in Search</p><p>32:25 Exploring the Complexity of Search in Various Domains</p><p>33:49 The Evolution and Impact of Agentic Search</p><p>34:07 Defining Terms: RAG and Agentic Search</p><p>34:52 The Research Loop and Tool Interaction</p><p>35:55 Formal Protocols and Structured Outputs</p><p>38:39 Building Agentic Search Experiences: Tips and Advice</p><p>41:50 The Importance of Empathy in AI and Search Development</p><p>54:30 The Role of UX in Search Applications</p><p>01:01:15 Future of Search: Malleable User Interfaces</p><p>01:02:38 Exploring Malleable Software</p><p>01:04:20 The Coordination Challenge in Software Development</p><p>01:05:23 The Impact of Claude Code & Claude Cowork</p><p>01:06:22 The Future of Knowledge Work with AI</p><p>01:12:39 Evaluating Search Algorithms with AI</p><p>01:15:15 The Role of Agents in Search Optimization</p><p>01:29:55 Teaching AI and Search Techniques</p><p>01:34:25 Final Thoughts and Farewell</p><p>👉 <strong><em>Want to learn more about Building AI-Powered Software? Check out our </em></strong><a target="_blank" href="https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles?promoCode=vgpod"><strong><em>Building AI Applications course</em></strong></a>. It’s a live cohort with hands-on exercises and office hours. Here is a <a target="_blank" href="https://maven.com/hugo-stefan/building-ai-apps-ds-and-swe-from-first-principles?promoCode=vgpod">discount code</a> for readers. 👈</p> <br/><br/>Get full access to Vanishing Gradients at <a href="https://hugobowne.substack.com/subscribe?utm_medium=podcast&utm_campaign=CTA_4">hugobowne.substack.com/subscribe</a>

88 MIN