DISTRIBUTED DATA BASE :
Database:-
Logical interrelated collection of shared data, along with description of
data, physically distributed over a computer network
1
A distributed database (DDB) is a collection of
multiple, logically interrelated databases
distributed over a computer network.
A distributed database management system
(DDBMS) is the software that manages the DDB
and provides an access mechanism that makes this
distribution transparent to the users
2
3
A DDBMS mainly classified into two types:
◦ Homogeneous Distributed database management
systems
◦ Heterogeneous Distributed database management
systems
4
⚫ All sites are interconnected.
⚫ Fragments can be replicated.
⚫ Logically related shared data can be collected.
⚫ Data at each and every site is controlled by the DBMS.
⚫ Each Distributed Database Management System takes part
in at least one global application.
CHARACTERISTICS
5
⚫ Security
⚫ Keeping track of data
⚫ Replicated data management
⚫ System catalog management
⚫ Distributed transaction management
⚫ Distributed database recovery
FUNCTIONALITY:
6
⚫ In a homogeneous distributed database all sites
have identical software and are aware of each
other and agree to cooperate in processing user
requests.
⚫ The homogeneous system is much easier to
design and manage
⚫ The operating system used, at each
location must be same or compatible.
⚫ The database application (or DBMS) used at
each location must be same or compatible.
HOMOGENEOUS DDMS::
7
⚫ In a heterogeneous distributed database
different sites may use different schema
and software.
⚫ In heterogeneous systems, different nodes
may have different hardware & software
and data structures at various nodes or
locations are also incompatible.
⚫ Different computers and operating
systems, database applications or data
models may be used at each of the
locations
HETEROGENEOUS DDBMS:
8
⚫ On heterogeneous system, translations are
required to allow communication between
different sites (or DBMS).
⚫ The heterogeneous system is often not
technically or economically feasible. In
this system, a user at one location may be
able to read but not update the data at
another location.
9
⚫ Less danger of a single-point failure.
When one of the computers fails, the
workload is picked up by other
workstations.
⚫ Data are also distributed at multiple
sites.
⚫ The end user is able to access any
available copy of the data, and an end
user's request is processed by any
processor at the data location
ADVANTAGES:
10
ADVANTAGES(Cont..,)
⚫ Less danger of a single-point failure. When
one of the computers fails, the workload is
picked up by other workstations.
⚫ Data are also distributed at multiple sites.
⚫ The end user is able to access any available
copy of the data, and an end user's request is
processed by any processor at the data
location
11
⚫ Complexity of management and control.
⚫ Applications must recognize data location,
and they must be able to stitch together data
from various sites.
⚫ Security.
DISADVANTAGES
12
⚫ Increased storage and infrastructure
requirements.
⚫ Multiple copies of data has to be at different
sites, thus an additional disk
storage space will be required.
⚫ The probability of security lapses
increases when data are located at
multiple sites.
DISADVANTAGES(Cont..,)
13
LOCKS CONCEPTS:
⚫ A lock is used when multiple users need to access a
database concurrently. This prevents data from being
corrupted or invalidated when multiple users try to write to
the database.
⚫ Any single user can only modify those database records
(that is, items in the database) to which they have applied a
lock that gives them exclusive access to the record until the
lock is released. Locking not only provides exclusivity to
write but also prevents (or controls) reading of unfinished
modifications.
14
⚫ A parallel database system is to improve performance through
parallelization of various operations, such as loading data,
building indexes and evaluating queries.
⚫ The distribution is solely done on the bases of performance.
⚫ Parallel databases improve processing and input/output
speeds by using multiple CPUs and disks in parallel.
⚫ Many operations are performed
simultaneously
⚫ Data may be stored in a distributed fashion.
PARALLEL DATABASE
15
Change B/W Distributed
Database and Parallel Database
Characteristics Parallel Database Distributed database
Definition It is a software system
where multiple
processors or machines are
used to
execute and run queries in
parallel.
It is a software system that
manages multiple logically
interrelated databases
distributed over a computer
network.
Geographical
Location
The nodes are located at
geographically
same location.
The nodes are usually
located at geographically
different locations.
Execution
Speed
Quicker Slower
Overhead Less More
16
Node types Compulsorily Homogeneous Need not be homogeneous
Performance Lower reliability &
availability.
Higher reliability &
availability.
Scope of
Expansion
Difficult to expand Easier to expand
Backup Backup at one site only Backup at multiple sites
Consistency Maintaining consistency is
easier
Maintaining consistency is
difficult.
17
⚫ Fragmentation is a process of division or the
mapping of the tables based on the columns and
rows of data into the smallest unit of data.
⚫ Data that has broken down is still possible to be
combined again with the intention to complete the
data collection using fragmentation.
⚫ Fragmentation is a database server feature that allows
you to control where data is stored at the table level.
⚫ Fragmentation enables you to define groups of rows
or index keys within a table.
Data Fragmentation
18
⚫ Replication is that we store several copies of a relation
or relation fragment. An entire relation can be
replicated at one or more sites.
⚫ Similarly, one or more fragments of a relation can
be replicated at other sites.
⚫ For example, if a relation R is fragmented into R1,R2,
and R3, there might be just one copy of R1, whereas
R2 is replicated at two other sites and R3 is replicated
at all sites.
REPLICATION:
19
The motivation for replication is twofold:
1. Increased Availability of Data: If a site that
contains a replica goes down, we can find
the same data at other sites. Similarly, if
local copies of remote relations are
available, we are less vulnerable to failure
of communication links.
2. Faster Query Evaluation: Queries can
execute faster by using a local copy of a
relation instead of going to a remote site
TWO FOLD REPLICATION
20
⚫ In a distributed DBMS,a given transaction is
submitted at some one site, but it can access data
at other sites as well.
⚫ When a transaction is submitted at some site, the
transaction manager at that site breaks it up into
a collection of one or more sub-transactions that
execute at different sites, submits them to
transaction managers at the other sites, and
coordinates their activity.
DISTRIBUTED TRANSACTIONS:
21
⚫ Distributed Concurrency Control: Procedure in
which required for controlling concurrent execution
of operation that takes
⚫ Distributed Recovery: Transaction atomicity must
be ensured when a transaction commits, all its
actions, across all the sites at which it executes,
must persist. Similarly, when a transaction aborts,
none of its actions must be allowed to persist
22
⚫ The choice of technique determines which objects
are to be locked.
⚫ When locks are obtained and released is determined
by the concurrency control protocol.
⚫ We now consider how lock and unlock requests are
implemented in a distributed environment. Lock
management can be distributed across sites in many
ways:
23
⚫ Centralized : A single site is in charge of handling lock
and unlock requests for all objects.
⚫ Primary Copy: One copy of each object is designated the
primary copy.All requests to lock or unlock a copy of this
object are handled by the lock manager at the site where the
primary copy is stored, regardless of where the copy itself is
stored.
⚫ Fully Distributed : Requests to lock or unlock a copy of an
object stored at a site are handled by the lock manager at the
site where the copy is stored.
24
⚫ Recovery in a distributed DBMS is more complicated than in a
centralized DBMS for the following reasons:
◦ New kinds of failure can arise : Failure of communication links and
failure of a remote site at which a sub-transaction is executing.
◦ Either all sub-transactions of a given transaction must commit or none
must commit, and this property must be guaranteed despite any
combination of site and link failures. This guarantee is achieved using a
commit protocol.
DISTRIBUTED RECOVERY
25
26

More Related Content

PPTX
Distributed database
PPTX
Distributed database
PPTX
DDBMS.pptx
PPTX
Distributed database
PPTX
lecture-13.pptx
PPTX
Distributed database management system
PPT
Distributed Database Management System - Introduction
Distributed database
Distributed database
DDBMS.pptx
Distributed database
lecture-13.pptx
Distributed database management system
Distributed Database Management System - Introduction

Similar to nnnn.pptx (20)

PPT
distributed database management system.ppt
PPTX
Santosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptx
PPTX
Introduction to distributed database
PPT
Distributed databases
PPTX
Distributed database detailed version by jh
PPTX
Distributed Database system in Computer Science.pptx
PPTX
Types of Distributed data base system_49724.pptx
DOCX
distributed database system" (DDBS)(distributed DBMS)
PPT
Distributed D B
PPTX
PPTX
DBMS - Distributed Databases
PPTX
Pmit 6102-14-lec1-intro
PPT
Distributed databases,types of database
PPT
Normalisation having DBMS architecture h
PPTX
Lec 8 (distributed database)
PPTX
Distributed Data Base.pptx
PPT
1 ddbms jan 2011_u
PPTX
Distributed Storage in advanced database.pptx
PPTX
Distributed dbms
PPT
01-Description of the Transport Layer.ppt
distributed database management system.ppt
Santosh Kumar Meher(2105040008) DISTRIBUTED DATABASE.pptx
Introduction to distributed database
Distributed databases
Distributed database detailed version by jh
Distributed Database system in Computer Science.pptx
Types of Distributed data base system_49724.pptx
distributed database system" (DDBS)(distributed DBMS)
Distributed D B
DBMS - Distributed Databases
Pmit 6102-14-lec1-intro
Distributed databases,types of database
Normalisation having DBMS architecture h
Lec 8 (distributed database)
Distributed Data Base.pptx
1 ddbms jan 2011_u
Distributed Storage in advanced database.pptx
Distributed dbms
01-Description of the Transport Layer.ppt

More from NANDHINIS900805 (10)

PPTX
Multiple choice questions related to data structures
PPTX
3b318431-df9f-4a2c-9909-61ecb6af8444.pptx
PPTX
wepik-breaking-down-spam-detection-a-deep-learning-approach-with-tensorflow-a...
PPTX
Alligation OR mixture.pptx
PPTX
AP&GP.pptx
PPTX
PERMUTATION AND COMBINATION.pptx
PPTX
ARCHITECTURE.pptx
PPTX
after 10th (1).pptx
PPTX
DBMS.pptx
PPTX
Multiple choice questions related to data structures
3b318431-df9f-4a2c-9909-61ecb6af8444.pptx
wepik-breaking-down-spam-detection-a-deep-learning-approach-with-tensorflow-a...
Alligation OR mixture.pptx
AP&GP.pptx
PERMUTATION AND COMBINATION.pptx
ARCHITECTURE.pptx
after 10th (1).pptx
DBMS.pptx

Recently uploaded (20)

PPTX
Thinking Routines and Learning Engagements.pptx
PDF
Health aspects of bilberry: A review on its general benefits
PDF
Farming Based Livelihood Systems English Notes
PDF
Fun with Grammar (Communicative Activities for the Azar Grammar Series)
PPTX
Reproductive system-Human anatomy and physiology
PDF
anganwadi services for the b.sc nursing and GNM
PDF
Nurlina - Urban Planner Portfolio (english ver)
PDF
Journal of Dental Science - UDMY (2021).pdf
PDF
fundamentals-of-heat-and-mass-transfer-6th-edition_incropera.pdf
PPTX
Case Study on mbsa education to learn ok
PDF
Journal of Dental Science - UDMY (2022).pdf
PPTX
BSCE 2 NIGHT (CHAPTER 2) just cases.pptx
PDF
Hospital Case Study .architecture design
PDF
Myanmar Dental Journal, The Journal of the Myanmar Dental Association (2013).pdf
PDF
FYJC - Chemistry textbook - standard 11.
PPTX
Power Point PR B.Inggris 12 Ed. 2019.pptx
PDF
Diabetes Mellitus , types , clinical picture, investigation and managment
PDF
Compact First Student's Book Cambridge Official
PPTX
Key-Features-of-the-SHS-Program-v4-Slides (3) PPT2.pptx
PDF
LIFE & LIVING TRILOGY - PART - (2) THE PURPOSE OF LIFE.pdf
Thinking Routines and Learning Engagements.pptx
Health aspects of bilberry: A review on its general benefits
Farming Based Livelihood Systems English Notes
Fun with Grammar (Communicative Activities for the Azar Grammar Series)
Reproductive system-Human anatomy and physiology
anganwadi services for the b.sc nursing and GNM
Nurlina - Urban Planner Portfolio (english ver)
Journal of Dental Science - UDMY (2021).pdf
fundamentals-of-heat-and-mass-transfer-6th-edition_incropera.pdf
Case Study on mbsa education to learn ok
Journal of Dental Science - UDMY (2022).pdf
BSCE 2 NIGHT (CHAPTER 2) just cases.pptx
Hospital Case Study .architecture design
Myanmar Dental Journal, The Journal of the Myanmar Dental Association (2013).pdf
FYJC - Chemistry textbook - standard 11.
Power Point PR B.Inggris 12 Ed. 2019.pptx
Diabetes Mellitus , types , clinical picture, investigation and managment
Compact First Student's Book Cambridge Official
Key-Features-of-the-SHS-Program-v4-Slides (3) PPT2.pptx
LIFE & LIVING TRILOGY - PART - (2) THE PURPOSE OF LIFE.pdf

nnnn.pptx

  • 1. DISTRIBUTED DATA BASE : Database:- Logical interrelated collection of shared data, along with description of data, physically distributed over a computer network 1
  • 2. A distributed database (DDB) is a collection of multiple, logically interrelated databases distributed over a computer network. A distributed database management system (DDBMS) is the software that manages the DDB and provides an access mechanism that makes this distribution transparent to the users 2
  • 3. 3
  • 4. A DDBMS mainly classified into two types: ◦ Homogeneous Distributed database management systems ◦ Heterogeneous Distributed database management systems 4
  • 5. ⚫ All sites are interconnected. ⚫ Fragments can be replicated. ⚫ Logically related shared data can be collected. ⚫ Data at each and every site is controlled by the DBMS. ⚫ Each Distributed Database Management System takes part in at least one global application. CHARACTERISTICS 5
  • 6. ⚫ Security ⚫ Keeping track of data ⚫ Replicated data management ⚫ System catalog management ⚫ Distributed transaction management ⚫ Distributed database recovery FUNCTIONALITY: 6
  • 7. ⚫ In a homogeneous distributed database all sites have identical software and are aware of each other and agree to cooperate in processing user requests. ⚫ The homogeneous system is much easier to design and manage ⚫ The operating system used, at each location must be same or compatible. ⚫ The database application (or DBMS) used at each location must be same or compatible. HOMOGENEOUS DDMS:: 7
  • 8. ⚫ In a heterogeneous distributed database different sites may use different schema and software. ⚫ In heterogeneous systems, different nodes may have different hardware & software and data structures at various nodes or locations are also incompatible. ⚫ Different computers and operating systems, database applications or data models may be used at each of the locations HETEROGENEOUS DDBMS: 8
  • 9. ⚫ On heterogeneous system, translations are required to allow communication between different sites (or DBMS). ⚫ The heterogeneous system is often not technically or economically feasible. In this system, a user at one location may be able to read but not update the data at another location. 9
  • 10. ⚫ Less danger of a single-point failure. When one of the computers fails, the workload is picked up by other workstations. ⚫ Data are also distributed at multiple sites. ⚫ The end user is able to access any available copy of the data, and an end user's request is processed by any processor at the data location ADVANTAGES: 10
  • 11. ADVANTAGES(Cont..,) ⚫ Less danger of a single-point failure. When one of the computers fails, the workload is picked up by other workstations. ⚫ Data are also distributed at multiple sites. ⚫ The end user is able to access any available copy of the data, and an end user's request is processed by any processor at the data location 11
  • 12. ⚫ Complexity of management and control. ⚫ Applications must recognize data location, and they must be able to stitch together data from various sites. ⚫ Security. DISADVANTAGES 12
  • 13. ⚫ Increased storage and infrastructure requirements. ⚫ Multiple copies of data has to be at different sites, thus an additional disk storage space will be required. ⚫ The probability of security lapses increases when data are located at multiple sites. DISADVANTAGES(Cont..,) 13
  • 14. LOCKS CONCEPTS: ⚫ A lock is used when multiple users need to access a database concurrently. This prevents data from being corrupted or invalidated when multiple users try to write to the database. ⚫ Any single user can only modify those database records (that is, items in the database) to which they have applied a lock that gives them exclusive access to the record until the lock is released. Locking not only provides exclusivity to write but also prevents (or controls) reading of unfinished modifications. 14
  • 15. ⚫ A parallel database system is to improve performance through parallelization of various operations, such as loading data, building indexes and evaluating queries. ⚫ The distribution is solely done on the bases of performance. ⚫ Parallel databases improve processing and input/output speeds by using multiple CPUs and disks in parallel. ⚫ Many operations are performed simultaneously ⚫ Data may be stored in a distributed fashion. PARALLEL DATABASE 15
  • 16. Change B/W Distributed Database and Parallel Database Characteristics Parallel Database Distributed database Definition It is a software system where multiple processors or machines are used to execute and run queries in parallel. It is a software system that manages multiple logically interrelated databases distributed over a computer network. Geographical Location The nodes are located at geographically same location. The nodes are usually located at geographically different locations. Execution Speed Quicker Slower Overhead Less More 16
  • 17. Node types Compulsorily Homogeneous Need not be homogeneous Performance Lower reliability & availability. Higher reliability & availability. Scope of Expansion Difficult to expand Easier to expand Backup Backup at one site only Backup at multiple sites Consistency Maintaining consistency is easier Maintaining consistency is difficult. 17
  • 18. ⚫ Fragmentation is a process of division or the mapping of the tables based on the columns and rows of data into the smallest unit of data. ⚫ Data that has broken down is still possible to be combined again with the intention to complete the data collection using fragmentation. ⚫ Fragmentation is a database server feature that allows you to control where data is stored at the table level. ⚫ Fragmentation enables you to define groups of rows or index keys within a table. Data Fragmentation 18
  • 19. ⚫ Replication is that we store several copies of a relation or relation fragment. An entire relation can be replicated at one or more sites. ⚫ Similarly, one or more fragments of a relation can be replicated at other sites. ⚫ For example, if a relation R is fragmented into R1,R2, and R3, there might be just one copy of R1, whereas R2 is replicated at two other sites and R3 is replicated at all sites. REPLICATION: 19
  • 20. The motivation for replication is twofold: 1. Increased Availability of Data: If a site that contains a replica goes down, we can find the same data at other sites. Similarly, if local copies of remote relations are available, we are less vulnerable to failure of communication links. 2. Faster Query Evaluation: Queries can execute faster by using a local copy of a relation instead of going to a remote site TWO FOLD REPLICATION 20
  • 21. ⚫ In a distributed DBMS,a given transaction is submitted at some one site, but it can access data at other sites as well. ⚫ When a transaction is submitted at some site, the transaction manager at that site breaks it up into a collection of one or more sub-transactions that execute at different sites, submits them to transaction managers at the other sites, and coordinates their activity. DISTRIBUTED TRANSACTIONS: 21
  • 22. ⚫ Distributed Concurrency Control: Procedure in which required for controlling concurrent execution of operation that takes ⚫ Distributed Recovery: Transaction atomicity must be ensured when a transaction commits, all its actions, across all the sites at which it executes, must persist. Similarly, when a transaction aborts, none of its actions must be allowed to persist 22
  • 23. ⚫ The choice of technique determines which objects are to be locked. ⚫ When locks are obtained and released is determined by the concurrency control protocol. ⚫ We now consider how lock and unlock requests are implemented in a distributed environment. Lock management can be distributed across sites in many ways: 23
  • 24. ⚫ Centralized : A single site is in charge of handling lock and unlock requests for all objects. ⚫ Primary Copy: One copy of each object is designated the primary copy.All requests to lock or unlock a copy of this object are handled by the lock manager at the site where the primary copy is stored, regardless of where the copy itself is stored. ⚫ Fully Distributed : Requests to lock or unlock a copy of an object stored at a site are handled by the lock manager at the site where the copy is stored. 24
  • 25. ⚫ Recovery in a distributed DBMS is more complicated than in a centralized DBMS for the following reasons: ◦ New kinds of failure can arise : Failure of communication links and failure of a remote site at which a sub-transaction is executing. ◦ Either all sub-transactions of a given transaction must commit or none must commit, and this property must be guaranteed despite any combination of site and link failures. This guarantee is achieved using a commit protocol. DISTRIBUTED RECOVERY 25
  • 26. 26