PRESENTATION
BY
- M Awais :))
- Taha
- Ibrahim
- M Ahmed
Data mining
Definition:
Data mining in databases is the
process of analyzing large sets of data stored
in a database to find patterns, trends, and
useful information.
Why Important?
Without data mining, large
amounts of data stored in databases might go
underused
•.
Key Concepts in Data Mining
Data Warehousing:
Centralized data storage from different sources for analysis.
Database Queries:
Using SQL to retrieve specific data for mining.
Data Preprocessing:
Cleaning data to ensure it is usable for mining (removing errors or irrelevant
info).
Mining Algorithms:
Tools used to analyze data, like classification, clustering, and association rules.
Data Mining Techniques in Databases
1. Classification: Organizing data into predefined
categories.
Example: Categorizing customer data into different
segments (e.g., age groups).
2. Clustering: Grouping data into clusters based on
similarities.
Example: Grouping customers by purchasing behavior
without predefined labels.
3. Association Rule Mining: Discovering relationships
between data.
Example: If a customer buys a laptop, they might also
buy a mouse.
4. Regression: Predicting values based on existing data.
Example: Predicting the future sales of a product
based on past data.
Data Mining Process in Databases
Data Selection: Choosing relevant data from the
database.
Data Cleaning: Removing errors or irrelevant data from
the database.
Data Transformation: Converting data into a suitable
format for mining.
Data Mining: Applying mining techniques to discover
patterns.
Pattern Evaluation: Assessing the usefulness of
discovered patterns.
Knowledge Representation: Presenting the results of
data mining in an understandable way.
Challenges of Data Mining
Data Quality Issues:
Incomplete, noisy, or inconsistent data can lead to wrong results.
Scalability:
Mining large databases with millions of records can be resource-intensive.
Privacy Concerns:
Mining personal data requires ethical handling and user consent.
Complexity:
Finding meaningful patterns in huge datasets can be difficult.
Data warehouse
What is Data Warehousing?
■ A data warehouse is a system that collects and
stores large amounts of data from different sources
in one place. It's organized to make it easy to
analyze the data and create reports, helping
businesses make better decisions.
Key characteristics
■ Organized for Fast Answers
Data is structured to make searching and analyzing quick and easy.
■ Handles Large Data
Can store and process a huge amount of data from many sources
■ Consistent Data
Ensures all data is cleaned and formatted the same way, so everyone gets reliable
results.
■ Non-Volatile
Once data is entered into the warehouse, it is not altered or deleted.
Components of data warehouse
■ Data Sources
■ ETL/ELT Process
 Extract:
 Transform
 Load
■ Data Storage
■ Metadata
■ Access Tools
Architectures of data warehouse
■ Single-Tier Architecture
Everything is in one place
■ Two-Tier Architecture
One layer stores and manages the data.
Another layer is for users to access and analyze the data.
■ Three-Tier Architecture
Bottom Tier: Data storage
Middle Tier: Processing layer
Top Tier: User interface

More Related Content

PPT
Data Warehouse and Data Mining
PPT
Data Mining and Data Warehousing
PPT
Dwdmunit1 a
PPT
Data Mining
PPT
Data mining techniques unit 1
PDF
Data mining and data warehousing notes
PPTX
Data mining
PPT
Data mining & data warehousing
Data Warehouse and Data Mining
Data Mining and Data Warehousing
Dwdmunit1 a
Data Mining
Data mining techniques unit 1
Data mining and data warehousing notes
Data mining
Data mining & data warehousing

Similar to Data mining and Data Warehousing in Databases.pptx (20)

PDF
Chapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdf
PPTX
Data warehousing and Data mining
PPT
Data mining slides
 
PPTX
9. Data Warehousing & Mining.pptx
PPTX
Business Intelligence and Analytics Unit-2 part-A .pptx
PPTX
INTRODUCTION to datawarehouse IN DATA.pptx
PPT
dwdm unit 1.ppt
DOC
An introduction to data mining
PDF
What Is Data Mining How It Works, Benefits, Techniques.pdf
PPTX
Data mining
PPTX
Aggahsbsbsbsbsbsbsbsbsbwbshhwhwhwgwhwhwh
PPTX
Chapter 1 - Introduction to Data Mining Concepts and Techniques.pptx
PDF
2 introductory slides
PPT
Data Mining: Concepts and Techniques.ppt
PPTX
Introduction to dm and dw
PPTX
Data warehouse and data mining
PDF
An introduction to Data Mining by Kurt Thearling
PDF
A Survey on Data Mining
PPT
Introduction.ppt
Chapter 1 Handoutfffffffffffffffffffffffffffffffffffff.pdf
Data warehousing and Data mining
Data mining slides
 
9. Data Warehousing & Mining.pptx
Business Intelligence and Analytics Unit-2 part-A .pptx
INTRODUCTION to datawarehouse IN DATA.pptx
dwdm unit 1.ppt
An introduction to data mining
What Is Data Mining How It Works, Benefits, Techniques.pdf
Data mining
Aggahsbsbsbsbsbsbsbsbsbwbshhwhwhwgwhwhwh
Chapter 1 - Introduction to Data Mining Concepts and Techniques.pptx
2 introductory slides
Data Mining: Concepts and Techniques.ppt
Introduction to dm and dw
Data warehouse and data mining
An introduction to Data Mining by Kurt Thearling
A Survey on Data Mining
Introduction.ppt
Ad

Recently uploaded (20)

PPTX
AI AND ML PROPOSAL PRESENTATION MUST.pptx
PPTX
Sheep Seg. Marketing Plan_C2 2025 (1).pptx
PPTX
PPT for Diseases (1)-2, types of diseases.pptx
PPTX
PPT for Diseases.pptx, there are 3 types of diseases
PPTX
GPS sensor used agriculture land for automation
PPTX
inbound2857676998455010149.pptxmmmmmmmmm
PPTX
9 Bioterrorism.pptxnsbhsjdgdhdvkdbebrkndbd
PPT
expt-design-lecture-12 hghhgfggjhjd (1).ppt
PPTX
AI_Agriculture_Presentation_Enhanced.pptx
PPTX
machinelearningoverview-250809184828-927201d2.pptx
PPTX
ch20 Database System Architecture by Rizvee
PPTX
DATA MODELING, data model concepts, types of data concepts
PDF
©️ 01_Algorithm for Microsoft New Product Launch - handling web site - by Ale...
PPTX
Stats annual compiled ipd opd ot br 2024
PDF
CS3352FOUNDATION OF DATA SCIENCE _1_MAterial.pdf
PPTX
Chapter security of computer_8_v8.1.pptx
PPT
Classification methods in data analytics.ppt
PDF
©️ 02_SKU Automatic SW Robotics for Microsoft PC.pdf
PPTX
OJT-Narrative-Presentation-Entrep-group.pptx_20250808_102837_0000.pptx
PPTX
inbound6529290805104538764.pptxmmmmmmmmm
AI AND ML PROPOSAL PRESENTATION MUST.pptx
Sheep Seg. Marketing Plan_C2 2025 (1).pptx
PPT for Diseases (1)-2, types of diseases.pptx
PPT for Diseases.pptx, there are 3 types of diseases
GPS sensor used agriculture land for automation
inbound2857676998455010149.pptxmmmmmmmmm
9 Bioterrorism.pptxnsbhsjdgdhdvkdbebrkndbd
expt-design-lecture-12 hghhgfggjhjd (1).ppt
AI_Agriculture_Presentation_Enhanced.pptx
machinelearningoverview-250809184828-927201d2.pptx
ch20 Database System Architecture by Rizvee
DATA MODELING, data model concepts, types of data concepts
©️ 01_Algorithm for Microsoft New Product Launch - handling web site - by Ale...
Stats annual compiled ipd opd ot br 2024
CS3352FOUNDATION OF DATA SCIENCE _1_MAterial.pdf
Chapter security of computer_8_v8.1.pptx
Classification methods in data analytics.ppt
©️ 02_SKU Automatic SW Robotics for Microsoft PC.pdf
OJT-Narrative-Presentation-Entrep-group.pptx_20250808_102837_0000.pptx
inbound6529290805104538764.pptxmmmmmmmmm
Ad

Data mining and Data Warehousing in Databases.pptx

  • 1. PRESENTATION BY - M Awais :)) - Taha - Ibrahim - M Ahmed
  • 2. Data mining Definition: Data mining in databases is the process of analyzing large sets of data stored in a database to find patterns, trends, and useful information. Why Important? Without data mining, large amounts of data stored in databases might go underused •.
  • 3. Key Concepts in Data Mining Data Warehousing: Centralized data storage from different sources for analysis. Database Queries: Using SQL to retrieve specific data for mining. Data Preprocessing: Cleaning data to ensure it is usable for mining (removing errors or irrelevant info). Mining Algorithms: Tools used to analyze data, like classification, clustering, and association rules.
  • 4. Data Mining Techniques in Databases 1. Classification: Organizing data into predefined categories. Example: Categorizing customer data into different segments (e.g., age groups). 2. Clustering: Grouping data into clusters based on similarities. Example: Grouping customers by purchasing behavior without predefined labels. 3. Association Rule Mining: Discovering relationships between data. Example: If a customer buys a laptop, they might also buy a mouse. 4. Regression: Predicting values based on existing data. Example: Predicting the future sales of a product based on past data.
  • 5. Data Mining Process in Databases Data Selection: Choosing relevant data from the database. Data Cleaning: Removing errors or irrelevant data from the database. Data Transformation: Converting data into a suitable format for mining. Data Mining: Applying mining techniques to discover patterns. Pattern Evaluation: Assessing the usefulness of discovered patterns. Knowledge Representation: Presenting the results of data mining in an understandable way.
  • 6. Challenges of Data Mining Data Quality Issues: Incomplete, noisy, or inconsistent data can lead to wrong results. Scalability: Mining large databases with millions of records can be resource-intensive. Privacy Concerns: Mining personal data requires ethical handling and user consent. Complexity: Finding meaningful patterns in huge datasets can be difficult.
  • 8. What is Data Warehousing? ■ A data warehouse is a system that collects and stores large amounts of data from different sources in one place. It's organized to make it easy to analyze the data and create reports, helping businesses make better decisions.
  • 9. Key characteristics ■ Organized for Fast Answers Data is structured to make searching and analyzing quick and easy. ■ Handles Large Data Can store and process a huge amount of data from many sources ■ Consistent Data Ensures all data is cleaned and formatted the same way, so everyone gets reliable results. ■ Non-Volatile Once data is entered into the warehouse, it is not altered or deleted.
  • 10. Components of data warehouse ■ Data Sources ■ ETL/ELT Process  Extract:  Transform  Load ■ Data Storage ■ Metadata ■ Access Tools
  • 11. Architectures of data warehouse ■ Single-Tier Architecture Everything is in one place ■ Two-Tier Architecture One layer stores and manages the data. Another layer is for users to access and analyze the data. ■ Three-Tier Architecture Bottom Tier: Data storage Middle Tier: Processing layer Top Tier: User interface