Dwarkesh Podcast

MAR 25, 2025

AMA ft. Sholto & Trenton: New Book, Career Advice Given AGI, How I'd Start From Scratch

I recorded an AMA! I had a blast chatting with my friends Trenton Bricken and Sholto Douglas. We discussed my new book, career advice given AGI, how I pick guests, how I research for the show, and some other nonsense.My book, “<a target="_blank" href="https://press.stripe.com/scaling">The Scaling Era: An Oral History of AI, 2019-2025</a>” is available in <a target="_blank" href="https://www.amazon.com/Scaling-Era-Oral-History-2019-2025-ebook/dp/B0F22SKW5Y/ref=tmm_kin_swatch_0">digital format</a> now. Preorders for the <a target="_blank" href="https://www.amazon.com/Scaling-Era-Oral-History-2019-2025/dp/1953953557/ref=tmm_hrd_swatch_0">print version</a> are also open!Watch on <a target="_blank" href="https://youtu.be/XLaRfZ4AHn8">YouTube</a>; listen on <a target="_blank" href="https://podcasts.apple.com/us/podcast/dwarkesh-podcast/id1516093381">Apple Podcasts</a> or <a target="_blank" href="https://open.spotify.com/episode/4yso3gE93kHV6vGZw2cgtp?si=c1dfbe07b63343f8">Spotify</a>.Timestamps(0:00:00) - Book launch announcement(0:04:57) - AI models not making connections across fields(0:10:52) - Career advice given AGI(0:15:20) - Guest selection criteria(0:17:19) - Choosing to pursue the podcast long-term(0:25:12) - Reading habits(0:31:10) - Beard deepdive(0:33:02) - Who is best suited for running an AI lab?(0:35:16) - Preparing for fast AGI timelines(0:40:50) - Growing the podcast Get full access to Dwarkesh Podcast at <a href="https://www.dwarkesh.com/subscribe?utm_medium=podcast&utm_campaign=CTA_4">www.dwarkesh.com/subscribe</a>

49 MIN

MAR 12, 2025

Joseph Henrich – Why Humans Survived and Smarter Species Didn't

112 MIN

MAR 5, 2025

Notes on China

I’m so excited with how this visualization of <a target="_blank" href="https://www.dwarkesh.com/p/notes-on-china">Notes on China</a> turned out. <a target="_blank" href="https://www.petrsalaba.cz/">Petr</a>, thank you for such beautiful watercolor artwork. More to come!Watch on <a target="_blank" href="https://youtu.be/UU9jbImVsNY">YouTube</a>.----------Timestamps(0:00:00) - Intro(0:00:32) - Scale(0:05:50) - Vibes(0:11:14) - Youngsters(0:14:27) - Tech & AI(0:15:47) - Hearts & Minds(0:17:07) - On Travel Get full access to Dwarkesh Podcast at <a href="https://www.dwarkesh.com/subscribe?utm_medium=podcast&utm_campaign=CTA_4">www.dwarkesh.com/subscribe</a>

19 MIN

FEB 19, 2025

Satya Nadella – Microsoft’s AGI Plan & Quantum Breakthrough

76 MIN

FEB 12, 2025

Jeff Dean & Noam Shazeer – 25 years at Google: from PageRank to AGI

This week I welcome on the show two of the most important technologists ever, in any field.Jeff Dean is Google's Chief Scientist, and through 25 years at the company, has worked on basically the most transformative systems in modern computing: from MapReduce, BigTable, Tensorflow, AlphaChip, to Gemini.Noam Shazeer invented or co-invented all the main architectures and techniques that are used for modern LLMs: from the Transformer itself, to Mixture of Experts, to Mesh Tensorflow, to Gemini and many other things.We talk about their 25 years at Google, going from PageRank to MapReduce to the Transformer to MoEs to AlphaChip – and maybe soon to ASI.My favorite part was Jeff's vision for Pathways, Google’s grand plan for a mutually-reinforcing loop of hardware and algorithmic design and for going past autoregression. That culminates in us imagining *all* of Google-the-company, going through one huge MoE model.And Noam just bites every bullet: 100x world GDP soon; let’s get a million automated researchers running in the Google datacenter; living to see the year 3000.Watch on <a target="_blank" href="https://youtu.be/v0gjI__RyCY">Youtube</a>; listen on <a target="_blank" href="https://podcasts.apple.com/us/podcast/jeff-dean-noam-shazeer-25-years-at-google-from-pagerank/id1516093381?i=1000691556147">Apple Podcasts</a> or <a target="_blank" href="https://open.spotify.com/episode/4atx1POpKIL8WGvdVfdnbb?si=XYxo6SIyRi2qmZ1ZGfl6vw">Spotify</a>.SponsorsScale partners with major AI labs like Meta, Google Deepmind, and OpenAI. Through Scale’s Data Foundry, labs get access to high-quality data to fuel post-training, including advanced reasoning capabilities. If you’re an AI researcher or engineer, learn about how Scale’s Data Foundry and research lab, SEAL, can help you go beyond the current frontier at <a target="_blank" href="https://scale.com/dwarkesh">scale.com/dwarkesh</a>Curious how Jane Street teaches their new traders? They use Figgie, a rapid-fire card game that simulates the most exciting parts of markets and trading. It’s become so popular that Jane Street hosts an inter-office Figgie championship every year. Download from the app store or play on your desktop at <a target="_blank" href="https://www.figgie.com/">figgie.com</a>Meter wants to radically improve the digital world we take for granted. They’re developing a foundation model that automates network management end-to-end. To do this, they just announced a long-term partnership with Microsoft for tens of thousands of GPUs, and they’re recruiting a world class AI research team. To learn more, go to <a target="_blank" href="https://meter.com/dwarkesh">meter.com/dwarkesh</a>To sponsor a future episode, visit <a target="_blank" href="https://www.dwarkeshpatel.com/p/advertise">dwarkeshpatel.com/p/advertise</a>Timestamps00:00:00 - Intro00:02:44 - Joining Google in 199900:05:36 - Future of Moore's Law00:10:21 - Future TPUs00:13:13 - Jeff’s undergrad thesis: parallel backprop00:15:10 - LLMs in 200700:23:07 - “Holy s**t” moments00:29:46 - AI fulfills Google’s original mission00:34:19 - Doing Search in-context00:38:32 - The internal coding model00:39:49 - What will 2027 models do?00:46:00 - A new architecture every day?00:49:21 - Automated chip design and intelligence explosion00:57:31 - Future of inference scaling01:03:56 - Already doing multi-datacenter runs01:22:33 - Debugging at scale01:26:05 - Fast takeoff and superalignment01:34:40 - A million evil Jeff Deans01:38:16 - Fun times at Google01:41:50 - World compute demand in 203001:48:21 - Getting back to modularity01:59:13 - Keeping a giga-MoE in-memory02:04:09 - All of Google in one model02:12:43 - What’s missing from distillation02:18:03 - Open research, pros and cons02:24:54 - Going the distance Get full access to Dwarkesh Podcast at <a href="https://www.dwarkesh.com/subscribe?utm_medium=podcast&utm_campaign=CTA_4">www.dwarkesh.com/subscribe</a>

134 MIN

Sign In

Details

Recent Episodes