TensorFlow Tutorial (Sherry Moore, Google Brain)

The guest

Sherry Moore — Software engineer on the Google Brain team who worked on TensorFlow alongside researchers, including Alex Krizhevsky who invented AlexNet.

The gist

Sherry Moore of Google Brain introduces TensorFlow, Google's open-source machine learning library that became the most popular ML project on GitHub. She explains core concepts: tensors as multi-dimensional arrays, computation graphs of connected nodes, and the modular architecture spanning front-end languages, a core execution runtime, and portable device kernels (CPU, GPU, phones, TPU). The bulk of the session is a live coding lab where the audience builds two classic models in Jupyter notebooks: a linear regression to guess a mystery line, and an MNIST handwritten-digit classifier with hidden layers. She teaches practical infrastructure including placeholders, checkpoints, savers, global step, and evaluation. The talk ends with an extended audience Q&A about C++ APIs, Windows/ARM support, TPU availability, serving, and loading custom datasets.

Big reveals

TensorFlow became the most popular machine learning library on GitHub with over 32,000 stars, 14,000 forks, and 8,000 contributions from 400 developers.
00:01:39
TensorFlow's flexible data flow infrastructure makes it suitable for almost any application that can fire asynchronously when data is ready, not just machine learning.
00:02:41
Over 10% of all responses sent on mobile in February were generated by Google's Smart Reply.
00:10:59
When Smart Reply was first trained, its first answer to everything was always 'I love you'.
00:11:02
Training Inception originally took about six days, and even with replicas still took about two and a half days, which is why checkpointing is critical.
00:38:15
Moore cites roughly 78.6% as the state-of-the-art accuracy benchmark she watches for when training Inception.
00:48:21

Things worth remembering

Moore sat right next to Alex Krizhevsky, the inventor of AlexNet, while developing TensorFlow.
00:03:12
In TensorFlow all data is held in a 'tensor', a multi-dimensional array similar to a numpy ND array, that flows through the graph.
00:04:46
The same TensorFlow graph can be dispatched to different device kernels: CPU, GPU, phone, or TPU.
00:08:26
TensorFlow-trained systems not only learn to play games but learn to generate game scenarios for you to play.
00:11:30
MNIST stands for the National Institute of Standards and Technology's collection of handwritten digits.
00:29:50
MNIST pixel values are normalized between 0 and 1, so hand-drawn uploads (often 0-255) must be rescaled to be recognized.
00:44:41
Google's image captioning model would label anything it didn't recognize, like a watermelon on a post, as 'man talking on a cell phone'.
00:45:43
TensorFlow could not support Windows at the time because its build tool Bazel did not yet support Windows.
00:53:05
Moore frames every model around four pieces: data, an inference (forward) graph, a training graph with loss and optimizer, and running the graph.
00:31:23

Topics

TensorFlow machine learning deep learning tutorial Google Brain neural networks MNIST linear regression open source