Big data involves large, complex data sets that are drawn from multiple sources and are growing rapidly across all domains of science and engineering. The paper presents the HACE theorem to characterize big data and proposes a big data processing model from a data mining perspective. This data-driven model covers four areas: aggregating information sources, mining and analyzing data, modeling user interests, and addressing security and privacy. The paper also analyzes the challenges raised by the big data revolution.