This document discusses evaluating hypotheses and estimating hypothesis accuracy. It provides the following key points:
- The accuracy of a hypothesis estimated from a training set may be different from its true accuracy due to bias and variance. Testing the hypothesis on an independent test set provides an unbiased estimate.
- Given a hypothesis h that makes r errors on a test set of n examples, the sample error r/n provides an unbiased estimate of the true error. The variance of this estimate depends on r and n based on the binomial distribution.
- For large n, the binomial distribution can be approximated by the normal distribution. Confidence intervals for the true error can then be determined based on the sample error and standard deviation