This document discusses several key aspects of processing text for information retrieval, including:
1) Converting documents to index terms and why this is important for matching queries to documents.
2) Statistical properties of text, such as Zipf's Law which shows word frequencies follow a skewed distribution.
3) Techniques for processing text including tokenization, stopping, stemming, and identifying phrases.