Dwarkesh Podcast
Dwarkesh Podcast

Dwarkesh Podcast

Dwarkesh Patel

Overview
Episodes

Details

Deeply researched interviews www.dwarkesh.com

Recent Episodes

Ada Palmer – Machiavelli is the most misunderstood thinker of all time
JUN 16, 2026
Ada Palmer – Machiavelli is the most misunderstood thinker of all time
<p>Had Ada Palmer back on – this time to talk about Machiavelli, perhaps the most misunderstood thinker of all time.</p><p>Machiavelli cut his teeth as a high-level diplomat for Florence, a position from which he got to closely observe the most important rulers in Europe at the time, including the ones who were on the path to destroying his dearly beloved Florence.</p><p>In 1513 the Medici retook control of Florence and, wrongly suspecting Machiavelli of participating in a coup attempt, fired, tortured, and exiled him.</p><p>Machiavelli could have left exile and worked for any number of different principalities that would have been eager to make use of his talents.</p><p>Instead, he decided to rot in the countryside and compile his career’s lessons about power, politics, and human nature into a book he dedicated to the very man whose new regime had tortured and exiled him, Lorenzo di Piero de’ Medici.</p><p>But at least the Medici were in a position to use his insights to defend Florence. Machiavelli the patriot did not want any other hands to touch these books, because those hands, armed further with these lessons, might pose an existential danger to Florence.</p><p>The closest modern analogy, at least as Machiavelli would have seen it, would be Szilard’s letter warning FDR about the possibility of a nuclear fission bomb.</p><p>What were those insights? And how were they inspired by Machiavelli’s dangerous diplomatic missions all across Europe, and his extensive reading of antiquity? Watch this episode with Ada Palmer to find out!</p><p>By the way, Ada is launching a new podcast which I’m very excited about. The first season will be about Machiavelli - a perfect way to dive deeper into the topics we discussed in this episode. Subscribe at <a target="_blank" href="https://beforecast-seven.vercel.app/">Beforecast’s</a> website to be notified of the first episode, subscribe on <a target="_blank" href="https://www.youtube.com/@Beforecast">YouTube</a>, follow her on <a target="_blank" href="https://www.patreon.com/adapalmer">Patreon</a>, and if you want even more Ada, check out her <a target="_blank" href="https://savingtheworldpod.transistor.fm/">FixTheNews</a> Podcast episode, and check out her <a target="_blank" href="https://www.adapalmer.com/">books and more</a>.</p><p>Watch on <a target="_blank" href="https://youtu.be/U1FrhkLQnCI">YouTube</a>; read the <a target="_blank" href="https://www.dwarkesh.com/p/ada-palmer-2">transcript</a>.</p><p><strong>Sponsors</strong></p><p>* <a target="_blank" href="https://cursor.com/dwarkesh">Cursor</a> recently saved one of my podcast recordings. When a video file from a shoot came out corrupted, I pointed Cursor at it: it recovered the footage on its own, tracking down the right reference file from the file’s metadata and realigning the out-of-sync audio. My whole team now uses Cursor for everyday tasks, not just coding. Get started at<a target="_blank" href="https://cursor.com/dwarkesh"> </a><a target="_blank" href="http://cursor.com/dwarkesh">cursor.com/dwarkesh</a></p><p>* <a target="_blank" href="https://janestreet.com/dwarkesh">Jane Street</a>’s hiring process has been going viral on Twitter lately. The memes are pretty funny, but I wanted to see what their interviews were actually like. So I had Ricson, one of Jane Street’s ML researchers, walk me through a retired puzzle: he gave me an image dataset where 50% of the files had been corrupted – I had to figure out how to recover them. If you’re interested in these sorts of puzzles, you can find Jane Street’s open roles at<a target="_blank" href="https://janestreet.com/dwarkesh"> janestreet.com/dwarkesh</a></p><p>* <a target="_blank" href="https://crusoe.ai/dwarkesh">Crusoe</a> is turning the AI datacenter buildout into an industrial process. At their massive Colorado factory, they assemble Spark units, modular datacenters with power, cooling, and fire suppression built in. They also manufacture specific components in-house to skip the longest lead times. Crusoe has experience running these Spark units on a range of energy sources, including solar and used EV batteries, ensuring they don’t get bottlenecked by grid availability. Learn more at<a target="_blank" href="https://crusoe.ai/dwarkesh"> </a><a target="_blank" href="http://crusoe.ai/dwarkesh">crusoe.ai/dwarkesh</a></p><p>Timestamps</p><p>(00:00:00) – How Florence bargained with Cesare Borgia for survival</p><p>(00:15:08) – Machiavelli’s analytical innovations</p><p>(00:23:58) – Why popes became warlords</p><p>(00:36:13) – Why the common people demanded nepotism</p><p>(00:47:57) – Cesare Borgia brought terror to rulers and justice to the people</p><p>(00:57:55) – Art as a proxy for war</p><p>(01:06:41) – Florence, a city famous in hell</p><p>(01:15:57) – The Prince was a job application to Machiavelli’s torturers</p><p>(01:41:39) – During the Renaissance, original ideas had to be couched in antiquity</p><p>(01:50:44) – Why copyright began with the Inquisition</p><p>(02:02:12) – Machiavelli wasn’t Machiavellian</p> <br/><br/>Get full access to Dwarkesh Podcast at <a href="https://www.dwarkesh.com/subscribe?utm_medium=podcast&#38;utm_campaign=CTA_4">www.dwarkesh.com/subscribe</a>
play-circle icon
128 MIN
Alex Imas and Phil Trammell – What remains scarce after AGI?
JUN 4, 2026
Alex Imas and Phil Trammell – What remains scarce after AGI?
<p>Economics of AGI episode w <a target="_blank" href="https://www.aleximas.com/">Alex Imas</a> and <a target="_blank" href="https://philiptrammell.com/">Phil Trammell</a>.</p><p>There’s a bunch of important questions about how we deal with AI that only economics can answer.</p><p>What is the optimal way to tax and redistribute the wealth that will be generated? How should countries not in the AI supply chain index into the gains? Is there any world where inequality doesn’t explode?</p><p>It might seem like these questions have obvious answers, but the first thing economics teaches you is that your intuitions can often be entirely wrong.</p><p>It was very helpful to chat through these things with Alex and Phil.</p><p>Watch on <a target="_blank" href="https://youtu.be/Jj-kBHzUohs">YouTube</a>; read the <a target="_blank" href="https://www.dwarkesh.com/p/alex-imas-phil-trammell">transcript</a>.</p><p><strong>Sponsors</strong></p><p><a target="_blank" href="https://janestreet.com/dwarkesh">Jane Street</a> invests heavily in turning smart people into exceptional researchers and engineers. In addition to their apprenticeship model, Jane Street runs lectures and bootcamps in their in-office classrooms -- managers clear their teams’ schedules to encourage attendance. If you’d like to work at a place that takes learning this seriously, Jane Street is hiring. Check out their open roles at <a target="_blank" href="https://janestreet.com/dwarkesh">janestreet.com/dwarkesh</a></p><p><a target="_blank" href="https://gemini.google">Google’s Gemini Omni</a> has incredible video editing capabilities -- you can upload a video and have Omni change the background, adjust lighting, or add specific elements. But Omni is also a preview of how future frontier models will be trained -- fully multimodal on both input and output. You can try it yourself in the Gemini app at <a target="_blank" href="https://gemini.google">gemini.google</a> or in Flow at <a target="_blank" href="https://flow.google">flow.google</a></p><p><a target="_blank" href="https://cursor.com/dwarkesh">Cursor</a> used targeted RL with textual feedback to help train their Composer 2.5 model. One of their researchers, Sasha Rush, gave me an impromptu blackboard lecture to explain how this form of on-policy self-distillation works -- I posted the full thing on X. If you want to try Composer 2.5, go to <a target="_blank" href="https://cursor.com/dwarkesh">cursor.com/dwarkesh</a></p><p>Timestamps</p><p>(00:00:00) – Will capital share increase?</p><p>(00:19:36) – Messy Middle scenario</p><p>(00:25:57) – How to tax and redistribute AI wealth</p><p>(00:30:02) – Why demand collapse is unlikely</p><p>(00:39:26) – Human employees would be hard to integrate into the machine economy</p><p>(00:43:08) – What if some humans (or AIs) value wealth accumulation intrinsically?</p><p>(01:01:28) – What should developing countries do?</p> <br/><br/>Get full access to Dwarkesh Podcast at <a href="https://www.dwarkesh.com/subscribe?utm_medium=podcast&#38;utm_campaign=CTA_4">www.dwarkesh.com/subscribe</a>
play-circle icon
76 MIN
Reiner Pope – Chip design from the bottom up
MAY 22, 2026
Reiner Pope – Chip design from the bottom up
<p>New blackboard lecture with Reiner Pope: how do chips actually work - starting with basic logic gates, and working up to why GPUs, TPUs, FPGAs, and the human brain each look the way they do.</p><p><a target="_blank" href="https://reiner.org/">Reiner</a> is CEO of <a target="_blank" href="https://matx.com/">MatX</a>, a new chip startup (full disclosure - I’m an angel investor). He was previously at Google, where he worked on <a target="_blank" href="https://arxiv.org/abs/2211.05102">software</a> <a target="_blank" href="https://jax-ml.github.io/scaling-book/">efficiency</a>, compilers, and TPU architecture.</p><p>Watch this one on <a target="_blank" href="https://youtu.be/oIk3R-sMX5o">YouTube</a> so you can see the chalkboard. Read the <a target="_blank" href="https://www.dwarkesh.com/p/reiner-pope-2">transcript</a>.</p><p>Sponsors</p><p>* <a target="_blank" href="https://crusoe.ai/dwarkesh">Crusoe</a> was one of only five GPU clouds that made the gold tier in SemiAnalysis' most recent ClusterMAX report. Gold-tier providers like Crusoe delivered 5-15% lower TCO than silver-tier clouds, even with identical GPU pricing. This is because optimizations like early fault detection and rapid node replacement don't necessarily show up in the sticker price, but still matter a ton in the real world. Learn more at <a target="_blank" href="https://crusoe.ai/dwarkesh">crusoe.ai/dwarkesh</a></p><p>* <a target="_blank" href="https://cursor.com/dwarkesh">Cursor</a> is where I do most of my work—from reading research papers to visualizing technical concepts to coding up internal tools for the podcast. Most recently, I used it to build two different review interfaces for my essay contest, one that anonymizes submissions for scoring and another that lets me see applicants' essays next to their resumes and websites. Whatever you're working on, you should try doing it in Cursor. Get started at <a target="_blank" href="https://cursor.com/dwarkesh">cursor.com/dwarkesh</a></p><p>* <a target="_blank" href="https://janestreet.com/dwarkesh">Jane Street</a> let me ask Ron Minsky and Dan Pontecorvo, two senior Jane Streeters, a bunch of questions about how they use AI. We discussed everything from the types of models they're training to how they think about the future of trading to why they're more bullish than ever on hiring technical talent. You can watch the full conversation and learn more about their open positions at <a target="_blank" href="https://janestreet.com/dwarkesh">janestreet.com/dwarkesh</a></p><p>Timestamps</p><p>00:00:00 – Building a multiply-accumulate from logic gates</p><p>00:16:31 – Muxes and the cost of data movement</p><p>00:26:10 – How systolic arrays work</p><p>00:39:11 – Clock cycles and pipeline registers</p><p>00:51:51 – FPGAs vs ASICs</p><p>01:03:25 – Cache vs scratchpad</p><p>01:07:27 – Why CPU cores are much bigger than GPU cores</p><p>01:12:00 – Brains vs chips</p><p>01:15:33 – A GPU is just a bunch of tiny TPUs</p> <br/><br/>Get full access to Dwarkesh Podcast at <a href="https://www.dwarkesh.com/subscribe?utm_medium=podcast&#38;utm_campaign=CTA_4">www.dwarkesh.com/subscribe</a>
play-circle icon
80 MIN
Eric Jang – Building AlphaGo from scratch
MAY 15, 2026
Eric Jang – Building AlphaGo from scratch
<p><a target="_blank" href="https://evjang.com/">Eric Jang</a> walks through how to build AlphaGo from scratch, but with modern AI tools.</p><p>Sometimes you understand the future better by stepping backward. AlphaGo is still the cleanest worked example of the primitives of intelligence: search, learning from experience, and self-play. You have to go back to 2017 to get insight into how the more general AIs of the future might learn.</p><p>Once he explained how AlphaGo works, it gave us the context to have a discussion about how RL works in LLMs and how it could work better – naive policy gradient RL has to figure out which of the 100k+ tokens in your trajectory actually got you the right answer, while AlphaGo’s MCTS suggests a strictly better action every single move, giving you a training target that sidesteps the credit assignment problem. The way humans learn is surely closer to the second.</p><p>Eric also kickstarted an Autoresearch loop on his project. And it was very interesting to discuss which parts of AI research LLMs can already automate pretty well (implementing and running experiments, optimizing hyperparameters) and which they still struggle with (choosing the right question to investigate next, escaping research dead ends). Informative to all the recent discussion about when we should expect an intelligence explosion, and what it would look like from the inside.</p><p>Watch on <a target="_blank" href="https://youtu.be/X_ZVSPcZhtw">YouTube</a>. Read the <a target="_blank" href="https://www.dwarkesh.com/p/eric-jang">transcript</a>.</p><p>And check out the <a target="_blank" href="https://flashcards.dwarkesh.com/eric-jang/">flashcards</a> I wrote to retain the insights.</p><p>Sponsors</p><p>* <a target="_blank" href="https://cursor.com/dwarkesh">Cursor</a>‘s agent SDK let me build a pipeline to generate flashcards for this episode. For each card, I had an agent read the transcript, ingest blackboard screenshots, generate an SVG visual, and run everything through a critic. A durable agent is much better at this kind of work than a chain of LLM calls, and Cursor’s SDK made it easy. Check out the cards at <a target="_blank" href="https://flashcards.dwarkesh.com">flashcards.dwarkesh.com</a> and get started with the SDK at <a target="_blank" href="https://cursor.com/dwarkesh">cursor.com/dwarkesh</a></p><p>* <a target="_blank" href="https://janestreet.com/dwarkesh">Jane Street</a> gave me a real deep-dive tour of one of their datacenters. I got to ask a bunch of questions to Ron Minsky, who co-leads Jane Street’s tech group, and Dan Pontecorvo, who runs Jane Street’s physical engineering team. They were willing to literally pull up the floorboards and take out racks to explain how everything works. Check out the full tour at <a target="_blank" href="https://janestreet.com/dwarkesh">janestreet.com/dwarkesh</a></p><p>Timestamps</p><p>(00:00:00) – Basics of Go</p><p>(00:08:17) – Monte Carlo Tree Search</p><p>(00:32:04) – What the neural network does</p><p>(01:00:33) – Self-play</p><p>(01:25:38) – Alternative RL approaches</p><p>(01:45:47) – Why doesn't MCTS work for LLMs</p><p>(02:01:09) – Off-policy training</p><p>(02:12:02) – RL is even more information inefficient than you thought</p><p>(02:22:16) – Automated AI researchers</p> <br/><br/>Get full access to Dwarkesh Podcast at <a href="https://www.dwarkesh.com/subscribe?utm_medium=podcast&#38;utm_campaign=CTA_4">www.dwarkesh.com/subscribe</a>
play-circle icon
157 MIN