Anthropic CEO Dario Amodei, plus Amanda Askell and Chris Olah, on Claude, scaling laws, AGI, AI safety, and interpretability.

Dario Amodei (with Amanda Askell and Chris Olah) — Dario Amodei is co-founder and CEO of Anthropic, the company behind Claude. He is joined by Amanda Askell, a researcher who designs Claude's character and alignment, and Chris Olah, a pioneer of mechanistic interpretability.
Dario Amodei traces the scaling hypothesis from his early speech-recognition work to today's frontier models, arguing that bigger networks, more data, and more compute reliably yield more intelligence and could reach human-level 'powerful AI' by 2026-2027. He details Anthropic's safety framework (the Responsible Scaling Policy and ASL levels), the 'race to the top' theory of change, his views on regulation, and his optimistic essay 'Machines of Loving Grace.' Amanda Askell explains how Claude's character is crafted as an alignment problem, covering sycophancy, prompting, constitutional AI, and the ethics of AI consciousness. Chris Olah closes with a deep dive into mechanistic interpretability: features, circuits, superposition, sparse autoencoders, and the goal of understanding neural networks for both safety and beauty.
Books, products and media the guest or host genuinely endorsed here — with the buy link.
Affiliate link — we may earn a commission at no extra cost to you.
Anysphere (inferred)
“I program but I also love programming and I claw 35 through cursor is what I use to assist me in programming” — Lex Fridman 00:33:51Find it on Amazon
Anthropic
“the following is a conversation with Dario amade CEO of anthropic the company that created Claude” — Lex Fridman 00:01:34Find it on Amazon