Decision trees are a supervised learning technique that can be used for classification problems. They work by recursively splitting a dataset based on the values of predictor variables, with the goal of maximizing purity in the descendant nodes. The document discusses how decision trees are constructed using a greedy approach that selects the predictor variable resulting in the largest information gain at each split. It provides an example of constructing a decision tree on a dataset about factors predicting support for Hillary Clinton.