This document discusses machine learning on source code. It begins by defining machine learning on source code as applying machine learning where the input data is source code. It then discusses some of the challenges of applying machine learning to source code, including data retrieval and analysis. Finally, it provides examples of potential use cases like predicting the next token in code, learning to represent programs with graphs, and building tools to assist with code reviews.