This document describes a machine learning approach to web page filtering that combines content and structural analysis. The proposed approach represents web pages with features extracted from content, such as terms and phrases, and from links. These features are used as input for machine learning algorithms like neural networks and support vector machines to classify pages. An experiment compares this approach to keyword-based and lexicon-based filtering, finding the proposed approach generally performs better, especially with few training examples. The approach could benefit topic-specific search engines and other applications.