PRESENTED BY,
S .Nandhini
I- Msc(CS&IT)
NADAR SARASWATHI ARTS
AND SCIENCE COLLEGE,
THENI.
DATA MINING:
Data mining refers to extracting or
“mining” knowledge from large amounts of
data. Also referred as knowledge discovery in
databases.
ARCHITECTURE OF DATA MINING
SYSTEM:
*The architecture and design of a
data mining system is critically important.
*A good system architecture will
facilitate the system to make best use of the
software environment,accomplish data mining
tasks in an efficient and timely manner.
ARCHITECTURE OF DATA MINING DIAGRAM
Data Selection
Data integration
database
Data
warehouse
World
wide web
Other
informatio
n repostior
Database/data warehouse server
Database engine
Data/pattern evaluation
Graphical user interface
Knowledge
base
Data mining system can be integrated with a
DB/DW system using the following coupling
schemes:
 NO COUPLING
 LOOSE COUPLING
 SEMI-TIGHT COUPLING
 TIGHT COUPLING
NO COUPLING:
No coupling means that a DM system
will not utilize any function of a DB or DW
system.
It may fetch data from a particular
source process data using some data mining
algorthims, and then store the mining result in
another file.
DM system may spend a substantial
amount of time finding,collecting,cleaning, and
transforming data.
No coupling represents a poor design.
LOOSE COUPLING:
* Loose coupling means that a DM
system will use some facilities of a DB or DW
system.
*Fetching data from a data repository
managed by these system,performing data mining
and then storing the mining results either in file or
in a designated place in a database or data
warehouse.
Advantages of the Flexibility, efficiency,
and other features provided by such systems.
loose coupling to achieve High Scalability
and good performance with large data sets.
SEMI-TIGHT COUPLING:
Semitight coupling means that
besides linking a DM system to a DB/DM
system.
These primitives can include
sorting,indexing,aggregation,histogram
analysis,multiway join,and precomputation of
some essential statistical measures.
such as sum,count,max,min,standard
deviation
TIGHT COUPLING:
 Tight coupling means that a DM system is smoothly
integrated into DB/DM system.
 Data mining queries and functions are optimized
based on mining query analysis,data
structures,indexing schemes,and query processing
methods of a DB or Dw system
 Efficient implementations of data mining
functions,high system performance, and an
integrated information processing environment.
Loose coupling though not efficient is
better than no coupling since it makes use of
both data and system facilities of a DB/DW
system.
Tight coupling is highly desirable but
its implementation is nontrivial and more
research is needed in this area.
Semitight coupling is a compromise
between loose and tight coupling.
THANK YOU

Architecture of data mining system

  • 1.
    PRESENTED BY, S .Nandhini I-Msc(CS&IT) NADAR SARASWATHI ARTS AND SCIENCE COLLEGE, THENI.
  • 2.
    DATA MINING: Data miningrefers to extracting or “mining” knowledge from large amounts of data. Also referred as knowledge discovery in databases. ARCHITECTURE OF DATA MINING SYSTEM: *The architecture and design of a data mining system is critically important. *A good system architecture will facilitate the system to make best use of the software environment,accomplish data mining tasks in an efficient and timely manner.
  • 3.
    ARCHITECTURE OF DATAMINING DIAGRAM Data Selection Data integration database Data warehouse World wide web Other informatio n repostior Database/data warehouse server Database engine Data/pattern evaluation Graphical user interface Knowledge base
  • 4.
    Data mining systemcan be integrated with a DB/DW system using the following coupling schemes:  NO COUPLING  LOOSE COUPLING  SEMI-TIGHT COUPLING  TIGHT COUPLING
  • 5.
    NO COUPLING: No couplingmeans that a DM system will not utilize any function of a DB or DW system. It may fetch data from a particular source process data using some data mining algorthims, and then store the mining result in another file. DM system may spend a substantial amount of time finding,collecting,cleaning, and transforming data. No coupling represents a poor design.
  • 6.
    LOOSE COUPLING: * Loosecoupling means that a DM system will use some facilities of a DB or DW system. *Fetching data from a data repository managed by these system,performing data mining and then storing the mining results either in file or in a designated place in a database or data warehouse. Advantages of the Flexibility, efficiency, and other features provided by such systems. loose coupling to achieve High Scalability and good performance with large data sets.
  • 7.
    SEMI-TIGHT COUPLING: Semitight couplingmeans that besides linking a DM system to a DB/DM system. These primitives can include sorting,indexing,aggregation,histogram analysis,multiway join,and precomputation of some essential statistical measures. such as sum,count,max,min,standard deviation
  • 8.
    TIGHT COUPLING:  Tightcoupling means that a DM system is smoothly integrated into DB/DM system.  Data mining queries and functions are optimized based on mining query analysis,data structures,indexing schemes,and query processing methods of a DB or Dw system  Efficient implementations of data mining functions,high system performance, and an integrated information processing environment.
  • 9.
    Loose coupling thoughnot efficient is better than no coupling since it makes use of both data and system facilities of a DB/DW system. Tight coupling is highly desirable but its implementation is nontrivial and more research is needed in this area. Semitight coupling is a compromise between loose and tight coupling.
  • 10.