- Data mining is the process of discovering interesting patterns and knowledge from large amounts of data. It involves steps like data cleaning, integration, selection, transformation, mining, pattern evaluation and knowledge presentation.
- There are various types of data that can be mined, including database data, data warehouses, transactional data, text data, web data, time-series data, images, audio, video and others. Common data mining techniques include characterization, discrimination, clustering, classification, regression, and outlier detection. The goal is to extract useful patterns from data for tasks like prediction and description.