The document discusses discretization, which is the process of converting continuous numeric attributes in data into discrete intervals. Discretization is important for data mining algorithms that can only handle discrete attributes. The key steps in discretization are sorting values, selecting cut points to split intervals, and stopping the process based on criteria. Different discretization methods vary in their approach, such as being supervised or unsupervised, and splitting versus merging intervals. The document provides examples of discretization methods like K-means and minimum description length, and discusses properties and criteria for evaluating discretization techniques.
Related topics: