SlideShare a Scribd company logo
By-
Neha Kulkarni
ME Computer
Pune Insitute of Computer Technology
 What is spatial data?
 Types of spatial data
 Types of queries
 Applications
 Indexing Techniques
 Comparison of Indexing techniques
 GiST
 Indexing High-dimensional data
 Conclusion
• Spatial data represent the location ,size
and shape of an object on earth
• Ex. Building, lake
Point data: Line data: polygon data:
 Point Data:
 Simplest form of representing spatial data
 No space and has no associated area or volume
 Consists of collection of points
 Ex. Raster data
 Region data:
 Has spatial extend with location and boundaries
 Represented using of points, line, polygons
 Ex. roads, rivers: line data
Spatial databases
1) Spatial range queries:
Related with region data
Ex. “Find all cities within 50 miles of Pune”
2)Nearest Neighbor queries:
Related with point data
Ex. “Find 10 cities nearest to Pune”
In ordered cities
Use in multimedia database
3) Spatial Join Queries:
- use both point and region data.
- Ex. “Find pairs of cities within 200 miles of each other
AND “ Find all cities near a lake”
- More complex
- Expensive to evaluate
1) Geographic Information System(GIS)
Ex. MAP
2) Computer Aided Design/Manufacturing(CAD/CAM)
Ex. Surface of design object
Range and Spatial join queries used
3) Multimedia Database
video, audio, image, text also required spatial data
Nearest neighbor queries and point data
Point Data: Grid files, ḥE trees, Kdtrees, point quad
trees
Region data: Quad trees, R trees, SKD trees,
-Yet no best indexing technique
- R trees are commonly used :
due to simplicity, ability to handle both data
performance to complex queries
Three main indexing techniques :
 Region Quad-Trees and Z-Ordering – handle
both point and region data
 Grid Files – only point data
 R-Trees – handle both point and region data
 Z-ordering gives us a way to group points
according to spatial proximity.
 Consider X-01 and Y-11
 Z-value is 0111 by interleaving
X and Y values.
This gives us the value for the
point 7.
Space filling curves
The Region Quad tree structure corresponds directly to the
recursive decomposition of the data space.
Each node in the tree corresponds to a square-shaped region of
the data space.
 Grid files rely upon a grid directory to identify the
data page containing a desired point.
 The Grid file partitions space into rectangular
regions using lines that are parallel to the axes.
 If the X axis is cut into i segments and the Y
axis is cut into j segments, we have a total of i
x j partitions. The grid directory is an i by j
array with one entry per partition.
 This description is maintained in an array
called a linear scale; there is one linear scale
per axis.
Searching for a point in a grid file
Inserting points in a Grid File
 Adaptation of B+ Tree
 Height-balanced data structure
 Search key values are referred to as Bounding
Boxes
 A data entry consists of a pair (n-dimensional
box, Rid)
 Rid – object Identifier
 N-dimensional box is the smallest box that
contains the object
An example R-Tree
 Search for Objects Overlapping Box Q
Start at root.
1. If current node is non-leaf, for each
entry <E, ptr>, if box E overlaps Q,
search subtree identified by ptr.
2. If current node is leaf, for each entry
<E, rid>, if E overlaps Q, rid identifies
an object that might overlap Q.
Insert Entry <B, ptr>
 Start at root and go down to “best-fit” leaf L.
 Go to child whose box needs least enlargement to
cover B; resolve ties by going to smallest area child.
 If best-fit leaf L has space, insert entry and stop.
Otherwise, split L into L1 and L2.
 Adjust entry for L in its parent so that the box now
covers (only) L1.
 Add an entry (in the parent node of L) for L2. (This
could cause the parent node to recursively split.)
Region Quad
Trees
Grid Files(point
data)
R-Trees
Range Queries Easily handled Easily handled
for point data.
Handled by
calculating
bounding box
Nearest
Neighbour
Queries
Can be
handled.
Sometimes
tricky due to
long diagonal
jumps
Easily handled
for point data.
Handled well
by traversing
for the point
or region
Spatial Joins Can be
handled with
some
extension to
range queries
Easily handled
for point data.
Handled very
well
 The Generalized Search Tree (GiST) abstracts the
“tree” nature of a class of indexes including B+ trees
and R-tree variants.
 Striking similarities in insert/delete/search and even
concurrency control algorithms make it possible to
provide “templates” for these algorithms that can be
customized to obtain the many different tree index
structures.
 GiST provides an alternative for implementing other
tree indexes in an ORDBMS.
 Typically, high-dimensional datasets are collections
of points, not regions.
 E.g., Feature vectors in multimedia applications.
 Very sparse
 Nearest neighbor queries are common.
 R-tree becomes worse than sequential scan for most
datasets with more than a dozen dimensions.
 As dimensionality increases contrast (ratio of
distances between nearest and farthest points)
usually decreases; “nearest neighbor” is not
meaningful.
 Spatial data management has many
applications, including GIS, CAD/CAM,
multimedia indexing, Point and region data
 R-tree approach is widely used in GIS
systems
 Used in spatial data mining approaches.
 Popular SDBMS : MySQL(geometry datatype),
Neo4j, AllegroGraph, SpaceBase, CouchDB,
PostGreSQL, SpatialDB
 “Database Management Systems” by Raghu
Ramakrishnan, 3rd Edition
 www.techopedia.com/definition
 dna.fernuni-hagen.de/IntroSpatialDBMS
 www.geol-amu.org/notes
Spatial databases
Spatial databases

More Related Content

What's hot (20)

PPTX
Spatial Databases
Pratibha Chaudhary
 
PPTX
Spatial databases
Seraphic Nazir
 
PPTX
Spatial Data Models
RajalakshmiS34
 
PPTX
TYBSC IT PGIS Unit I Chapter I- Introduction to Geographic Information Systems
Arti Parab Academics
 
PPT
Spatial Database Systems
Asifuzzaman Hridoy
 
PPTX
SPATIAL DATABASES.pptx
AmanSingla57
 
PPTX
Gis unit 3
sridevi5983
 
PPTX
TYBSC IT PGIS Unit III Chapter I Spatial Referencing and Positioning
Arti Parab Academics
 
PPT
4.2 spatial data mining
Krish_ver2
 
PDF
Principles of GIS unit 2
SanjanaKhemka1
 
PDF
Four data models in GIS
Prof. A.Balasubramanian
 
PPTX
Visualizing Data with Geographic Information Systems (GIS)
Kate Dougherty
 
PPTX
DATA in GIS and DATA Query
KU Leuven
 
PPTX
Conversion of Existing Data
Tilok Chetri
 
PPT
Geodatabases
John Reiser
 
PPTX
TYBSC IT PGIS Unit I Chapter II Geographic Information and Spacial Database
Arti Parab Academics
 
PPTX
Geographical information system
Bipin Karki
 
PPT
215 spatial db
surekamurali
 
PPTX
Spatial Database and Database Management System
Lal Mohammad
 
PPTX
GIS - Project Planning and Implementation
Malla Reddy University
 
Spatial Databases
Pratibha Chaudhary
 
Spatial databases
Seraphic Nazir
 
Spatial Data Models
RajalakshmiS34
 
TYBSC IT PGIS Unit I Chapter I- Introduction to Geographic Information Systems
Arti Parab Academics
 
Spatial Database Systems
Asifuzzaman Hridoy
 
SPATIAL DATABASES.pptx
AmanSingla57
 
Gis unit 3
sridevi5983
 
TYBSC IT PGIS Unit III Chapter I Spatial Referencing and Positioning
Arti Parab Academics
 
4.2 spatial data mining
Krish_ver2
 
Principles of GIS unit 2
SanjanaKhemka1
 
Four data models in GIS
Prof. A.Balasubramanian
 
Visualizing Data with Geographic Information Systems (GIS)
Kate Dougherty
 
DATA in GIS and DATA Query
KU Leuven
 
Conversion of Existing Data
Tilok Chetri
 
Geodatabases
John Reiser
 
TYBSC IT PGIS Unit I Chapter II Geographic Information and Spacial Database
Arti Parab Academics
 
Geographical information system
Bipin Karki
 
215 spatial db
surekamurali
 
Spatial Database and Database Management System
Lal Mohammad
 
GIS - Project Planning and Implementation
Malla Reddy University
 

Similar to Spatial databases (20)

PPTX
spatial databases ADBMS ppt
RitaThakkar1
 
PPTX
Optimizing spatial database
Ishraq Al Fataftah
 
PPTX
Geographic Phenomena and their Representations
NAXA-Developers
 
PPT
23. Advanced Datatypes and New Application in DBMS
koolkampus
 
PPTX
Spot db consistency checking and optimization in spatial database
Pratik Udapure
 
PPT
UNIT - III GIS DATA STRUCTURES (2).ppt
RamMishra65
 
PPT
UNIT - III GIS DATA STRUCTURES (1).ppt
RamMishra65
 
PPTX
Algorithms for Query Processing and Optimization of Spatial Operations
Natasha Mandal
 
PPTX
Data models in geographical information system(GIS)
PRAMODA G
 
PDF
Chapter7.Revised
Achmad Solichin
 
PPTX
Mujungi Davis
Said Mujungi
 
PPTX
Mujungi Davis
Said Mujungi
 
PPTX
Geoinformatics.pptx
jatinbagale2610
 
PPTX
Lect 7 &amp; 8 types of vector data model-gis
Rehana Jamal
 
PPT
3D Analyst
Hartanto Sanjaya
 
PDF
Geospatial Data Models, Vector And Raster Data Model
FarihaMunia
 
PPTX
Lect 5 data models-gis
Rehana Jamal
 
PDF
Scattered gis handbook
Waleed Liaqat
 
PDF
Trees Information
Sriram Raj
 
PPTX
Spatial database managment.pptx
bitcivilmeerut
 
spatial databases ADBMS ppt
RitaThakkar1
 
Optimizing spatial database
Ishraq Al Fataftah
 
Geographic Phenomena and their Representations
NAXA-Developers
 
23. Advanced Datatypes and New Application in DBMS
koolkampus
 
Spot db consistency checking and optimization in spatial database
Pratik Udapure
 
UNIT - III GIS DATA STRUCTURES (2).ppt
RamMishra65
 
UNIT - III GIS DATA STRUCTURES (1).ppt
RamMishra65
 
Algorithms for Query Processing and Optimization of Spatial Operations
Natasha Mandal
 
Data models in geographical information system(GIS)
PRAMODA G
 
Chapter7.Revised
Achmad Solichin
 
Mujungi Davis
Said Mujungi
 
Mujungi Davis
Said Mujungi
 
Geoinformatics.pptx
jatinbagale2610
 
Lect 7 &amp; 8 types of vector data model-gis
Rehana Jamal
 
3D Analyst
Hartanto Sanjaya
 
Geospatial Data Models, Vector And Raster Data Model
FarihaMunia
 
Lect 5 data models-gis
Rehana Jamal
 
Scattered gis handbook
Waleed Liaqat
 
Trees Information
Sriram Raj
 
Spatial database managment.pptx
bitcivilmeerut
 
Ad

Recently uploaded (20)

PPTX
Introduction to Neural Networks and Perceptron Learning Algorithm.pptx
Kayalvizhi A
 
DOC
MRRS Strength and Durability of Concrete
CivilMythili
 
PDF
POWER PLANT ENGINEERING (R17A0326).pdf..
haneefachosa123
 
PPTX
Green Building & Energy Conservation ppt
Sagar Sarangi
 
PPT
Oxygen Co2 Transport in the Lungs(Exchange og gases)
SUNDERLINSHIBUD
 
PPTX
Break Statement in Programming with 6 Real Examples
manojpoojary2004
 
PDF
Unified_Cloud_Comm_Presentation anil singh ppt
anilsingh298751
 
PDF
Water Design_Manual_2005. KENYA FOR WASTER SUPPLY AND SEWERAGE
DancanNgutuku
 
PPTX
Day2 B2 Best.pptx
helenjenefa1
 
PDF
IoT - Unit 2 (Internet of Things-Concepts) - PPT.pdf
dipakraut82
 
PPTX
Pharmaceuticals and fine chemicals.pptxx
jaypa242004
 
PPTX
site survey architecture student B.arch.
sri02032006
 
PPTX
UNIT DAA PPT cover all topics 2021 regulation
archu26
 
PPTX
Types of Bearing_Specifications_PPT.pptx
PranjulAgrahariAkash
 
PPTX
ISO/IEC JTC 1/WG 9 (MAR) Convenor Report
Kurata Takeshi
 
PPTX
MobileComputingMANET2023 MobileComputingMANET2023.pptx
masterfake98765
 
PPTX
265587293-NFPA 101 Life safety code-PPT-1.pptx
chandermwason
 
PPTX
Hashing Introduction , hash functions and techniques
sailajam21
 
PPTX
Destructive Tests corrosion engineer (1).pptx
zeidali3
 
PPTX
Solar Thermal Energy System Seminar.pptx
Gpc Purapuza
 
Introduction to Neural Networks and Perceptron Learning Algorithm.pptx
Kayalvizhi A
 
MRRS Strength and Durability of Concrete
CivilMythili
 
POWER PLANT ENGINEERING (R17A0326).pdf..
haneefachosa123
 
Green Building & Energy Conservation ppt
Sagar Sarangi
 
Oxygen Co2 Transport in the Lungs(Exchange og gases)
SUNDERLINSHIBUD
 
Break Statement in Programming with 6 Real Examples
manojpoojary2004
 
Unified_Cloud_Comm_Presentation anil singh ppt
anilsingh298751
 
Water Design_Manual_2005. KENYA FOR WASTER SUPPLY AND SEWERAGE
DancanNgutuku
 
Day2 B2 Best.pptx
helenjenefa1
 
IoT - Unit 2 (Internet of Things-Concepts) - PPT.pdf
dipakraut82
 
Pharmaceuticals and fine chemicals.pptxx
jaypa242004
 
site survey architecture student B.arch.
sri02032006
 
UNIT DAA PPT cover all topics 2021 regulation
archu26
 
Types of Bearing_Specifications_PPT.pptx
PranjulAgrahariAkash
 
ISO/IEC JTC 1/WG 9 (MAR) Convenor Report
Kurata Takeshi
 
MobileComputingMANET2023 MobileComputingMANET2023.pptx
masterfake98765
 
265587293-NFPA 101 Life safety code-PPT-1.pptx
chandermwason
 
Hashing Introduction , hash functions and techniques
sailajam21
 
Destructive Tests corrosion engineer (1).pptx
zeidali3
 
Solar Thermal Energy System Seminar.pptx
Gpc Purapuza
 
Ad

Spatial databases

  • 1. By- Neha Kulkarni ME Computer Pune Insitute of Computer Technology
  • 2.  What is spatial data?  Types of spatial data  Types of queries  Applications  Indexing Techniques  Comparison of Indexing techniques  GiST  Indexing High-dimensional data  Conclusion
  • 3. • Spatial data represent the location ,size and shape of an object on earth • Ex. Building, lake
  • 4. Point data: Line data: polygon data:
  • 5.  Point Data:  Simplest form of representing spatial data  No space and has no associated area or volume  Consists of collection of points  Ex. Raster data
  • 6.  Region data:  Has spatial extend with location and boundaries  Represented using of points, line, polygons  Ex. roads, rivers: line data
  • 8. 1) Spatial range queries: Related with region data Ex. “Find all cities within 50 miles of Pune” 2)Nearest Neighbor queries: Related with point data Ex. “Find 10 cities nearest to Pune” In ordered cities Use in multimedia database
  • 9. 3) Spatial Join Queries: - use both point and region data. - Ex. “Find pairs of cities within 200 miles of each other AND “ Find all cities near a lake” - More complex - Expensive to evaluate
  • 10. 1) Geographic Information System(GIS) Ex. MAP 2) Computer Aided Design/Manufacturing(CAD/CAM) Ex. Surface of design object Range and Spatial join queries used 3) Multimedia Database video, audio, image, text also required spatial data Nearest neighbor queries and point data
  • 11. Point Data: Grid files, ḥE trees, Kdtrees, point quad trees Region data: Quad trees, R trees, SKD trees, -Yet no best indexing technique - R trees are commonly used : due to simplicity, ability to handle both data performance to complex queries
  • 12. Three main indexing techniques :  Region Quad-Trees and Z-Ordering – handle both point and region data  Grid Files – only point data  R-Trees – handle both point and region data
  • 13.  Z-ordering gives us a way to group points according to spatial proximity.  Consider X-01 and Y-11  Z-value is 0111 by interleaving X and Y values. This gives us the value for the point 7. Space filling curves
  • 14. The Region Quad tree structure corresponds directly to the recursive decomposition of the data space. Each node in the tree corresponds to a square-shaped region of the data space.
  • 15.  Grid files rely upon a grid directory to identify the data page containing a desired point.  The Grid file partitions space into rectangular regions using lines that are parallel to the axes.  If the X axis is cut into i segments and the Y axis is cut into j segments, we have a total of i x j partitions. The grid directory is an i by j array with one entry per partition.  This description is maintained in an array called a linear scale; there is one linear scale per axis.
  • 16. Searching for a point in a grid file
  • 17. Inserting points in a Grid File
  • 18.  Adaptation of B+ Tree  Height-balanced data structure  Search key values are referred to as Bounding Boxes  A data entry consists of a pair (n-dimensional box, Rid)  Rid – object Identifier  N-dimensional box is the smallest box that contains the object
  • 20.  Search for Objects Overlapping Box Q Start at root. 1. If current node is non-leaf, for each entry <E, ptr>, if box E overlaps Q, search subtree identified by ptr. 2. If current node is leaf, for each entry <E, rid>, if E overlaps Q, rid identifies an object that might overlap Q.
  • 21. Insert Entry <B, ptr>  Start at root and go down to “best-fit” leaf L.  Go to child whose box needs least enlargement to cover B; resolve ties by going to smallest area child.  If best-fit leaf L has space, insert entry and stop. Otherwise, split L into L1 and L2.  Adjust entry for L in its parent so that the box now covers (only) L1.  Add an entry (in the parent node of L) for L2. (This could cause the parent node to recursively split.)
  • 22. Region Quad Trees Grid Files(point data) R-Trees Range Queries Easily handled Easily handled for point data. Handled by calculating bounding box Nearest Neighbour Queries Can be handled. Sometimes tricky due to long diagonal jumps Easily handled for point data. Handled well by traversing for the point or region Spatial Joins Can be handled with some extension to range queries Easily handled for point data. Handled very well
  • 23.  The Generalized Search Tree (GiST) abstracts the “tree” nature of a class of indexes including B+ trees and R-tree variants.  Striking similarities in insert/delete/search and even concurrency control algorithms make it possible to provide “templates” for these algorithms that can be customized to obtain the many different tree index structures.  GiST provides an alternative for implementing other tree indexes in an ORDBMS.
  • 24.  Typically, high-dimensional datasets are collections of points, not regions.  E.g., Feature vectors in multimedia applications.  Very sparse  Nearest neighbor queries are common.  R-tree becomes worse than sequential scan for most datasets with more than a dozen dimensions.  As dimensionality increases contrast (ratio of distances between nearest and farthest points) usually decreases; “nearest neighbor” is not meaningful.
  • 25.  Spatial data management has many applications, including GIS, CAD/CAM, multimedia indexing, Point and region data  R-tree approach is widely used in GIS systems  Used in spatial data mining approaches.  Popular SDBMS : MySQL(geometry datatype), Neo4j, AllegroGraph, SpaceBase, CouchDB, PostGreSQL, SpatialDB
  • 26.  “Database Management Systems” by Raghu Ramakrishnan, 3rd Edition  www.techopedia.com/definition  dna.fernuni-hagen.de/IntroSpatialDBMS  www.geol-amu.org/notes