Home Lex Fridman Notes
Lex Fridman · 2019-08-27 · 1h 44m

Jeremy Howard: fast.ai Deep Learning Courses and Research | Lex Fridman Podcast #35

fast.ai founder Jeremy Howard on making deep learning accessible, training fast on a single GPU, and why anyone can do it.

Jeremy Howard: fast.ai Deep Learning Courses and Research | Lex Fridman Podcast #35
The guest

Jeremy Howard — Founder of fast.ai, distinguished research scientist at University of San Francisco, former president and top-ranked competitor at Kaggle, and serial entrepreneur (founded FastMail and Enlitic).

The gist

Jeremy Howard traces his path from programming on a Commodore 64 through esoteric array languages like APL and J to building fast.ai. He argues that most deep learning research is a waste of time and that the real impact comes from empowering domain experts with practical, accessible tools like transfer learning and active learning. He recounts how a handful of his students beat Google and Intel on Stanford's DAWNBench competition by training on cheap single-GPU setups using tricks like progressive resizing and super-convergence learning rates. He shares strong opinions on programming languages (Python is slow and unhackable, Swift is the hope, TensorFlow is a mess), and warns about labor force displacement and ethics in AI. He closes with his self-funded startup philosophy and his use of spaced repetition for learning Chinese.

Big reveals

  • Jeremy's small team of about four students beat Google and Intel on the Stanford DAWNBench competition, topping both the time and cost leaderboards.
  • The key trick was training on tiny 64x64 images first, then fine-tuning briefly on full 224x224 images since the model already knew what a dog looked like.
  • Jeremy claims none of the big deep learning breakthroughs of the last 20 years required multiple GPUs, and all major breakthroughs of the next 20 years will be doable on a single GPU.
  • His team developed GAN-level image results without using GANs at all, and Jason Antic's DeOldify colorizes whole movies on a single home GPU.
  • Jeremy's only paper introduced successful transfer learning to NLP (ULMFiT) and the prototype took a couple of days yet smashed state-of-the-art.
  • Leslie Smith discovered super-convergence (training 10x faster with 10x higher learning rates) but couldn't publish because reviewers demanded an explanation.
  • Across teaching thousands of domain experts, the single differentiator between success and failure is tenacity, not talent.

Things worth remembering

  • The K programming language (from the APL family) is so compact it fits inside the L3 cache of a CPU, wins data-processing benchmarks, and costs about $100,000 per CPU, used by elite hedge funds.
  • Google deliberately made TPUs almost entirely unprogrammable because they feared people would learn their IP secrets.
  • There is roughly a 10x global shortage of doctors, and training enough the traditional way would take about 300 years.
  • Outside South Africa, the entire African continent has only five pediatric radiologists.
  • HIPAA stands for portability, not privacy, and was designed to enable thoughtful data sharing, but hospital lawyers interpret its gray areas conservatively.
  • Pete Worden (now at Google) once spent about a year figuring out how to train ImageNet in a converted granny flat behind his house.
  • Spaced repetition was discovered by psychologist Ebbinghaus roughly 150 years ago by memorizing random sequences of letters on cards.
  • Jeremy uses Anki only for Chinese, and after a three-year medical break the characters were still retained and far faster to relearn.
  • Jeremy built FastMail and wrote everything in Perl, including his own monitoring system and web framework because none existed in the late 90s.

Recommended in this episode

Books, products and media the guest or host genuinely endorsed here — with the buy link.

Affiliate link — we may earn a commission at no extra cost to you.

RecommendedProduct

Anki

AnkiWeb (inferred)

“I used Anki quite a lot myself... I actually don't ever talk to anybody about it... but it works incredibly well for me” — Jeremy Howard 01:32:14
Find it on Amazon
RecommendedProduct

Microsoft Access

Microsoft

“my favorite programming environment almost certainly was Microsoft Access back in like the earliest days... I've never seen anything as good” — Jeremy Howard 00:03:38
Find it on Amazon
RecommendedProduct

Delphi

Borland (inferred)

“Delphi was amazing because it was like a compiled fast language that was as easy to use as Visual Basic” — Jeremy Howard 00:07:18
Find it on Amazon