Additional themes of data mining for Msc CS
Data mining 
Data mining (the analysis step of the "Knowledge Discovery in 
Databases" process, or KDD) is an interdisciplinary subfield of computer 
science. It is the computational process of discovering patterns in large 
data sets, using methods at the intersection of artificial intelligence, 
machine learning, statistics, and database systems.
Theoretical Foundations of Data Mining
Data Reduction - The basic idea of this theory is to reduce the 
data representation, trading accuracy for speed, in response 
to the need to obtain quick approximate answers to queries on 
very large databases. Some of the data reduction techniques are 
as follows: 
•Singular Value Decomposition 
•Wavelets 
•Regression 
•Log-linear models 
•Histograms 
•Clustering 
•Sampling 
•Construction of Index Trees
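As a minimal sketch of how a histogram trades accuracy for speed, the following Python snippet summarizes a column of values into equal-width buckets and answers a range-count query from the bucket counts alone. The function names, the number of buckets, and the assumption that values are spread uniformly inside each bucket are illustrative choices, not part of the original slides.

```python
def build_histogram(values, num_buckets):
    """Summarize values as equal-width bucket counts (the reduced representation)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / num_buckets or 1.0  # fall back to 1.0 when all values are equal
    counts = [0] * num_buckets
    for v in values:
        # The maximum value is clamped into the last bucket.
        idx = min(int((v - lo) / width), num_buckets - 1)
        counts[idx] += 1
    return lo, width, counts


def estimate_count(hist, start, end):
    """Estimate how many values lie in [start, end), assuming a uniform spread per bucket."""
    lo, width, counts = hist
    total = 0.0
    for i, c in enumerate(counts):
        b_start = lo + i * width
        b_end = b_start + width
        overlap = max(0.0, min(end, b_end) - max(start, b_start))
        total += c * overlap / width
    return total


data = list(range(100))            # hypothetical column with values 0..99
hist = build_histogram(data, 10)   # 10 counts stand in for 100 values
approx = estimate_count(hist, 0, 25)
```

The estimate is close to, but not exactly, the true count of 25; that gap is the accuracy given up in exchange for scanning 10 buckets instead of 100 values.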
Data Compression - The basic idea of this theory is to 
compress the given data by encoding in terms of the following: 
•Bits 
•Association Rules 
•Decision Trees 
•Clusters
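One way to picture "encoding in terms of clusters" is the sketch below: each value is replaced by the index of its nearest cluster centre, and only the centres plus the indices are stored. The centres are assumed to be known already (for example from a prior clustering run), and all names are hypothetical.

```python
def encode(values, centroids):
    """Replace each value by the index of its nearest centroid."""
    return [min(range(len(centroids)), key=lambda i: abs(v - centroids[i]))
            for v in values]


def decode(codes, centroids):
    """Reconstruct approximate values from centroid indices."""
    return [centroids[c] for c in codes]


centroids = [1.5, 10.5]             # hypothetical cluster centres, assumed already found
values = [1.0, 2.0, 10.0, 11.0]
codes = encode(values, centroids)   # small integers instead of full values
restored = decode(codes, centroids) # lossy reconstruction
```

The reconstruction is lossy: every value comes back as the centre of its cluster.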
Pattern Discovery - The basic idea of this theory is to discover 
patterns occurring in the database. The following areas 
contribute to this theory: 
• Machine Learning 
• Neural Networks 
• Association Mining 
• Sequential Pattern Matching 
• Clustering
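To make association mining concrete, here is a small sketch that counts how often pairs of items occur together and keeps the pairs whose support reaches a threshold. The transactions, the threshold, and the function name are made up for illustration; practical miners such as Apriori prune candidates far more cleverly.

```python
from collections import Counter
from itertools import combinations


def frequent_pairs(transactions, min_support):
    """Return item pairs whose support (fraction of transactions) meets min_support."""
    counts = Counter()
    for items in transactions:
        # Count each unordered pair once per transaction.
        for pair in combinations(sorted(set(items)), 2):
            counts[pair] += 1
    n = len(transactions)
    return {pair: c / n for pair, c in counts.items() if c / n >= min_support}


# Hypothetical market-basket data.
transactions = [
    ["bread", "milk"],
    ["bread", "butter", "milk"],
    ["butter", "milk"],
    ["bread", "milk", "eggs"],
]
pairs = frequent_pairs(transactions, min_support=0.5)
```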
Probability Theory - This theory is based on statistical theory. 
The basic idea behind this theory is to discover joint probability 
distributions of random variables. 
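A minimal sketch of this idea for two discrete variables: the joint distribution is estimated from relative frequencies, and a marginal is obtained by summing over the other variable. The variable names and observations are hypothetical.

```python
from collections import Counter


def joint_distribution(records):
    """Estimate P(X = x, Y = y) as the relative frequency of each (x, y) pair."""
    counts = Counter(records)
    total = len(records)
    return {pair: c / total for pair, c in counts.items()}


# Hypothetical observations of two discrete variables (weather, purchase).
records = [("rain", "umbrella"), ("rain", "umbrella"),
           ("sun", "umbrella"), ("sun", "sunscreen")]
joint = joint_distribution(records)
# The marginal P(X = "rain") follows by summing the joint over Y.
p_rain = sum(p for (x, _), p in joint.items() if x == "rain")
```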
Microeconomic View - This theory considers data mining to be the 
task of finding patterns that are interesting only to the extent 
that they can be used in the decision-making process of some 
enterprise. 
Inductive Databases - As per this theory, the database schema 
consists of data and patterns that are stored in the database. 
Therefore, according to this theory, data mining is the task of 
performing induction on databases.
Statistical Data Mining 
Apart from the database-oriented techniques, there are statistical 
techniques also available for data analysis. These techniques can be 
applied to scientific data and to data from the economic and social 
sciences as well. Statistics is the traditional field that deals with the 
quantification, collection, analysis, and interpretation of data, and with 
drawing conclusions from data. 
Some of the statistical data mining techniques are as follows: 
•Regression - Regression methods are used to predict the value of a 
response variable from one or more predictor variables, where the variables 
are numeric. The forms of regression include: 
• Linear 
• Multiple 
• Weighted 
• Polynomial 
• Nonparametric 
• Robust
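The linear form can be sketched in a few lines of Python using the closed-form least-squares estimates for one predictor. The data and the function name are illustrative assumptions.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]             # hypothetical data lying exactly on y = 1 + 2x
intercept, slope = fit_line(xs, ys)
prediction = intercept + slope * 5   # predicted response for x = 5
```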
•Generalized Linear Models - Generalized linear models include: 
• Logistic Regression 
• Poisson Regression 
These models allow a categorical response variable to be 
related to a set of predictor variables in a manner similar to the modelling of 
a numeric response variable using linear regression. 
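As a rough sketch of logistic regression, the snippet below fits a binary response to a single numeric predictor by plain gradient descent on the log-loss. The learning rate, the number of epochs, and the data are assumptions chosen for illustration; statistical packages normally use more robust fitting procedures.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(xs, ys, lr=0.1, epochs=2000):
    """Fit P(y = 1 | x) = sigmoid(b0 + b1 * x) by gradient descent on the log-loss."""
    b0 = b1 = 0.0
    n = len(xs)
    for _ in range(epochs):
        g0 = g1 = 0.0
        for x, y in zip(xs, ys):
            err = sigmoid(b0 + b1 * x) - y   # gradient of the log-loss w.r.t. the linear term
            g0 += err
            g1 += err * x
        b0 -= lr * g0 / n
        b1 -= lr * g1 / n
    return b0, b1


# Hypothetical data: hours studied vs. pass (1) / fail (0).
hours = [1, 2, 3, 4, 5, 6]
passed = [0, 0, 0, 1, 1, 1]
b0, b1 = fit_logistic(hours, passed)
p_low = sigmoid(b0 + b1 * 1)    # predicted pass probability after 1 hour
p_high = sigmoid(b0 + b1 * 6)   # predicted pass probability after 6 hours
```

The linear predictor b0 + b1 * x is the same as in linear regression; the sigmoid link is what turns it into a probability for the categorical response.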
•Analysis of Variance - This technique analyzes experimental data for two or 
more populations described by a numeric response variable and one or more 
categorical variables (factors).
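For a single factor, the analysis reduces to comparing the variation between group means with the variation inside the groups. The sketch below computes that ratio (the F statistic) for hypothetical data; the function name is illustrative.

```python
def one_way_anova_f(groups):
    """Return the F statistic: between-group variance over within-group variance."""
    all_values = [v for g in groups for v in g]
    grand_mean = sum(all_values) / len(all_values)
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    ss_within = sum((v - m) ** 2 for g, m in zip(groups, means) for v in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)


# Hypothetical numeric responses for three levels of one factor.
groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]
f_stat = one_way_anova_f(groups)
```

A large F value suggests that at least one group mean differs from the others; judging significance requires comparing it with the F distribution, which is omitted here.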
•Mixed-effect Models - These models are used for analyzing grouped 
data. They describe the relationship between a response variable 
and some covariates in data grouped according to one or more factors. 
•Factor Analysis - This method is used to determine which variables 
combine to generate a given factor. 
•Discriminant Analysis - This method is used to predict a categorical 
response variable. It assumes that the independent variables follow a 
multivariate normal distribution. 
•Time Series Analysis - The methods for analyzing time-series 
data include: 
• Autoregression Methods 
• Univariate ARIMA (AutoRegressive Integrated Moving Average) Modeling 
• Long-memory time-series modeling
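The simplest autoregression method, a first-order model without intercept, can be sketched as follows. The series and the function name are hypothetical, and real series would also call for checks such as stationarity.

```python
def fit_ar1(series):
    """Least-squares estimate of phi in x[t] = phi * x[t-1] + noise (no intercept)."""
    prev = series[:-1]
    curr = series[1:]
    return sum(p * c for p, c in zip(prev, curr)) / sum(p * p for p in prev)


series = [16.0, 8.0, 4.0, 2.0, 1.0]   # hypothetical series where each value halves
phi = fit_ar1(series)
forecast = phi * series[-1]           # one-step-ahead forecast
```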
Visual Data Mining 
Visual Data Mining uses data and/or knowledge visualization 
techniques to discover implicit knowledge from large data 
sets. Visual Data Mining can be viewed as an integration of the 
following disciplines: 
•Data Visualization 
•Data Mining
Visual Data Mining is closely related to the following: 
• Computer Graphics 
• Multimedia Systems 
• Human-Computer Interaction 
• Pattern Recognition 
• High-Performance Computing
Generally, data visualization and data mining can be integrated in the following ways: 
•Data Visualization - The data in databases or data 
warehouses can be viewed in several visual forms, such as those listed 
below: 
• Boxplots 
• 3-D Cubes 
• Data distribution charts 
• Curves 
• Surfaces 
• Link graphs etc.
•Data Mining Result Visualization - Data Mining Result 
Visualization is the presentation of the results of data mining in 
visual forms, such as scatter plots and boxplots. 
•Data Mining Process Visualization - Data Mining Process 
Visualization presents the several processes of data mining. It 
allows users to see how the data are extracted, and from which 
database or data warehouse the data are cleaned, integrated, 
preprocessed, and mined.
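Boxplots appear above both as a data view and as a result view. As a small sketch, the five numbers that a boxplot draws can be computed with Python's standard library; the data are hypothetical, and the "inclusive" quartile method is one of several conventions.

```python
import statistics


def five_number_summary(values):
    """Minimum, first quartile, median, third quartile, maximum of a numeric column."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return min(values), q1, median, q3, max(values)


data = [1, 2, 3, 4, 5, 6, 7, 8, 9]   # hypothetical attribute values
summary = five_number_summary(data)
```

A plotting library would then draw the box from the first to the third quartile, with whiskers extending toward the minimum and maximum.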
Figure: Rate of cloud in an area
Audio Data Mining 
Audio Data Mining uses audio signals to indicate the patterns of 
data or the features of data mining results. By transforming 
patterns into sound and music instead of watching pictures, we 
can listen to pitches and tunes in order to identify anything 
interesting.
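One simple way to turn patterns into sound is to map each data value to a pitch. The sketch below maps values linearly onto one octave of MIDI note numbers and converts the notes to frequencies; the note range, the data, and the function names are assumptions made for illustration, and actual playback is left to an audio library.

```python
def value_to_midi(value, lo, hi, note_lo=60, note_hi=72):
    """Linearly map a data value in [lo, hi] to a MIDI note number."""
    if hi == lo:
        return note_lo
    frac = (value - lo) / (hi - lo)
    return round(note_lo + frac * (note_hi - note_lo))


def midi_to_hz(note):
    """Equal-temperament frequency with A4 (MIDI note 69) tuned to 440 Hz."""
    return 440.0 * 2 ** ((note - 69) / 12)


data = [0.0, 5.0, 10.0]   # hypothetical mining result, e.g. cluster densities
notes = [value_to_midi(v, min(data), max(data)) for v in data]
pitches = [midi_to_hz(n) for n in notes]
```

Higher values are heard as higher pitches, so an outlier stands out as an unusually high or low note.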
Data Mining and Collaborative Filtering 
Today consumers are faced with a large variety of goods and 
services while shopping. During live customer transactions, a 
recommender system helps the consumer by making product 
recommendations. The collaborative filtering approach is 
generally used for recommending products to customers. These 
recommendations are based on the opinions of other customers.
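A minimal sketch of user-based collaborative filtering: the rating a customer would give an unseen product is predicted as the similarity-weighted average of other customers' ratings. The ratings, the names, and the choice of cosine similarity are illustrative assumptions; production recommenders use many refinements.

```python
import math


def cosine(a, b):
    """Cosine similarity over the items both users have rated."""
    common = set(a) & set(b)
    if not common:
        return 0.0
    dot = sum(a[i] * b[i] for i in common)
    norm_a = math.sqrt(sum(a[i] ** 2 for i in common))
    norm_b = math.sqrt(sum(b[i] ** 2 for i in common))
    return dot / (norm_a * norm_b)


def predict(ratings, user, item):
    """Similarity-weighted average of other users' ratings for the item."""
    num = den = 0.0
    for other, their in ratings.items():
        if other == user or item not in their:
            continue
        sim = cosine(ratings[user], their)
        num += sim * their[item]
        den += sim
    return num / den if den else None


# Hypothetical ratings on a 1-5 scale.
ratings = {
    "ann": {"book": 5, "lamp": 1},
    "bob": {"book": 5, "lamp": 1, "pen": 4},
    "cat": {"book": 1, "lamp": 5, "pen": 2},
}
score = predict(ratings, "ann", "pen")
```

Because ann's tastes match bob's far more closely than cat's, the predicted score lands nearer to bob's rating of 4 than to cat's rating of 2.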
Openness 
Individuals have the right to know what information is collected about 
them, who has access to the data, and how the data are being used. One 
social concern of data mining is the issue of privacy and information 
security. Opt-out policies, which allow consumers to specify limitations 
on the use of their personal data, are one approach toward data privacy 
protection, while data security-enhancing techniques can 
anonymize information for security and privacy.
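As one hedged illustration of anonymizing a record before mining, the sketch below drops the direct identifier and coarsens two quasi-identifiers. The field names, the band width, and the masking rule are hypothetical, and generalization of this kind does not by itself guarantee that individuals cannot be re-identified.

```python
def anonymize(record, age_band=10):
    """Drop the direct identifier and coarsen quasi-identifiers."""
    low = (record["age"] // age_band) * age_band
    return {
        "age_range": f"{low}-{low + age_band - 1}",
        "zip_prefix": record["zip"][:3] + "**",
        "purchase": record["purchase"],   # the attribute kept for mining
    }


# Hypothetical customer record; the field names are illustrative.
record = {"name": "A. Customer", "age": 34, "zip": "12345", "purchase": "laptop"}
safe = anonymize(record)
```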
thanvi
