This document summarizes techniques for predictive analytics using text mining and unstructured text data. It discusses representing text data using bag-of-words models and vector space models. Dimensionality reduction techniques like latent semantic analysis and topic models like latent Dirichlet allocation are described for extracting semantic information from text. Clustering methods like k-means clustering and hierarchical clustering are discussed for grouping similar documents. The document also covers classification techniques like rule-based classifiers, decision trees, and linear classifiers like logistic regression for assigning labels to documents.