This document is an introduction to statistical machine learning presented by Christfried Webers from NICTA and The Australian National University. It discusses linear basis function models and how to perform maximum likelihood and least squares estimation. Specifically, it shows that maximizing the likelihood is equivalent to minimizing the sum-of-squares error, and that the maximum likelihood solution is given by the pseudo-inverse of the design matrix. It also examines the geometry of least squares and the bias-variance decomposition.