Berkeley roboticist Anca Dragan on why robots must model messy humans, learn hidden rewards, and treat human-robot interaction as a shared game.

Anca Dragan — A professor at UC Berkeley working on human-robot interaction and reward engineering algorithms, who also consults at Waymo. She studies how robots can generate behavior that accounts for coordinating with people.
Anca Dragan explains her work on human-robot interaction, where the robot's job is to optimize for what people actually want rather than what a programmer literally specified. She argues that humans who look irrational may simply be operating under different assumptions or simpler internal models, and shows how robots can use their own actions to gather information about human intent. The conversation covers inverse reinforcement learning, the difficulty of designing reward functions, autonomous driving as a game-theoretic problem with humans, and semi-autonomous driving's risks. It closes on mortality, the meaning of life, and how finiteness might belong in our reward functions.
Books, products and media the guest or host genuinely endorsed here — with the buy link.
Affiliate link — we may earn a commission at no extra cost to you.
Pixar
“my favorite fictional robot is Wally and I love how amazingly expressive it is some personal things a little bit about expressive motion” — guest 00:07:52Find it on Amazon
Stuart Russell and Peter Norvig
“I got my hands on a PDF copy in Romania of Russell Norvig a I modern approach... it was so captivating” — guest 01:26:44Find it on Amazon