Ilya Sutskever explains why deep learning works and walks through OpenAI's research on meta-learning, reinforcement learning, and self-play.

Ilya Sutskever — Co-founder and research director of OpenAI; previously a research scientist at Google Brain and a student of Geoffrey Hinton at Toronto; a foundational figure in modern deep learning.
In this MIT AGI lecture hosted by Lex Fridman, Ilya Sutskever lays out a theoretical foundation for why deep learning works, arguing that backpropagation solves the profound problem of circuit search. He surveys reinforcement learning fundamentals, then dives into OpenAI's work on meta-learning, including hindsight experience replay, sim-to-real transfer via domain randomization, and learning hierarchies of actions. A large portion focuses on self-play, where agents create their own escalating challenges, illustrated by OpenAI's Dota 2 bots and wrestling humanoids. He closes with speculation on societies of agents developing language and social skills, and the technical and political problem of conveying goals safely to systems likely to become smarter than humans.