SlideShare a Scribd company logo
4
Most read
6
Most read
10
Most read
OPTICS: Ordering Points To
Identify the Clustering Structure
Presented By:
Rohit Paul
 Disadvantages of DBSCAN
 Requires two user inputs(Eps and MinPts)
 Unable to determine variable density cluster
 OPTICS:
 Able to get variable density cluster
 Mainly requires one inputs (i.e, MinPts)
 Eps can be considered as ‘infinite’
Idea
 Creates an augmented ordering of the database
representing its density-based clustering structure
 Help us gain a high level understanding of the way data
is structured
Observation
 For a constant MinPts value, density-based
clusters with higher density are completely
contained in density-connected sets with respect
to a lower density.
 Extend the DBSCAN algorithm such that several
distance parameters are processed at the same
time
OPTICS
 An infinite number of distance parameters eps’
which are smaller than a “generating distance”
eps (i.e. 0 <= eps’ <= eps).
 Order is stored in which the objects are
processed and the information which would be
used by an extended DBSCAN algorithm to
assign cluster memberships
 This information consists of only two values for
each object: the core-distance and a reachability-
distance
Terminology
 Core distance of an object p:
 The core-distance of an object p is simply the smallest
distance eps’ between p and an object in its e-
neighborhood such that p would be a core object with
respect to eps’ if this neighbor is contained in Ne(p).
 Reachability-distance object p w.r.t. object o:
 Reachability-distance of an object p with respect to
another object o is the smallest
distance such that p is directly
density-reachable from o if o is
a core object
Algorithm
 FOR i FROM 1 TO SetOfObjects.size DO
 IF NOT Object.Processed THEN
1. neighbors := SetOfObjects.neighbors(Object, e);
2. Object.Processed := TRUE;
3. Object.reachability_distance := UNDEFINED;
4. Object.setCoreDistance(neighbors, e, MinPts);
5. OrderedFile.write(Object);
6. IF Object.core_distance <> UNDEFINED THEN
 OrderSeeds.update(neighbors, Object);
 WHILE NOT orderSeeds.empty() DO
 Repeat Step 1, 2, 4, 5 and 6
 If reachability-distance of the current object
Object is larger than the clustering-distance eps’
 Object is not density-reachable from any of the objects
which are located before the current object in the
cluster-ordering.
 We look at the core-distance of Object and start a new
cluster if Object is a core object with respect to eps’
and MinPts; otherwise, Object is assigned to NOISE
 If reachability-distance of the current object is
smaller than eps’
 Can simply assign this object to the current cluster
because then it is density-reachable from a preceding
core object in the cluster-ordering.
Reachability plot insensitive to input
parameter
• The smaller the Eps
value, the more
objects have an
UNDEFINED
reachability-distance
• Lower values MinPts
reachability-plot looks
more jagged and
higher value
smoothen the curve.
Thank You

More Related Content

PDF
Optics ordering points to identify the clustering structure
Rajesh Piryani
 
PPTX
Graph clustering
ssusered887b
 
PPTX
Connectivity of graph
Shameer P Hamsa
 
PPTX
Dbscan
RohitPaul52
 
PPTX
Support vector machine
zekeLabs Technologies
 
PPT
3.3 hierarchical methods
Krish_ver2
 
PPT
Greedy Algorithm
Waqar Akram
 
PPTX
Greedy Algorithms
Amrinder Arora
 
Optics ordering points to identify the clustering structure
Rajesh Piryani
 
Graph clustering
ssusered887b
 
Connectivity of graph
Shameer P Hamsa
 
Dbscan
RohitPaul52
 
Support vector machine
zekeLabs Technologies
 
3.3 hierarchical methods
Krish_ver2
 
Greedy Algorithm
Waqar Akram
 
Greedy Algorithms
Amrinder Arora
 

What's hot (20)

PPT
2.5 backpropagation
Krish_ver2
 
PPTX
Grid based method & model based clustering method
rajshreemuthiah
 
PPT
K mean-clustering algorithm
parry prabhu
 
PPTX
Introduction to Clustering algorithm
hadifar
 
PDF
Classification Based Machine Learning Algorithms
Md. Main Uddin Rony
 
PPTX
Hierarchical clustering.pptx
NTUConcepts1
 
PDF
Understanding Bagging and Boosting
Mohit Rajput
 
PPT
Covering (Rules-based) Algorithm
ZHAO Sam
 
PPTX
Graphs in data structure
hamza javed
 
PDF
Data preprocessing using Machine Learning
Gopal Sakarkar
 
PPTX
Dimensionality Reduction and feature extraction.pptx
Sivam Chinna
 
PPTX
Suffix Tree and Suffix Array
Harshit Agarwal
 
PPTX
K-Folds Cross Validation Method
SHUBHAM GUPTA
 
PPTX
Bellman ford algorithm
MdSajjadulislamBappi
 
PPTX
AI Agents, Agents in Artificial Intelligence
Kirti Verma
 
PPTX
0 1 knapsack using branch and bound
Abhishek Singh
 
PPTX
Instance based learning
swapnac12
 
PDF
Lecture 5 - Agent communication
Antonio Moreno
 
PPT
Machine Learning
Dhananjay Birmole
 
PDF
Algorithms Lecture 2: Analysis of Algorithms I
Mohamed Loey
 
2.5 backpropagation
Krish_ver2
 
Grid based method & model based clustering method
rajshreemuthiah
 
K mean-clustering algorithm
parry prabhu
 
Introduction to Clustering algorithm
hadifar
 
Classification Based Machine Learning Algorithms
Md. Main Uddin Rony
 
Hierarchical clustering.pptx
NTUConcepts1
 
Understanding Bagging and Boosting
Mohit Rajput
 
Covering (Rules-based) Algorithm
ZHAO Sam
 
Graphs in data structure
hamza javed
 
Data preprocessing using Machine Learning
Gopal Sakarkar
 
Dimensionality Reduction and feature extraction.pptx
Sivam Chinna
 
Suffix Tree and Suffix Array
Harshit Agarwal
 
K-Folds Cross Validation Method
SHUBHAM GUPTA
 
Bellman ford algorithm
MdSajjadulislamBappi
 
AI Agents, Agents in Artificial Intelligence
Kirti Verma
 
0 1 knapsack using branch and bound
Abhishek Singh
 
Instance based learning
swapnac12
 
Lecture 5 - Agent communication
Antonio Moreno
 
Machine Learning
Dhananjay Birmole
 
Algorithms Lecture 2: Analysis of Algorithms I
Mohamed Loey
 
Ad

Similar to Optics (20)

PPTX
density based method and expectation maximization
Siva Priya
 
PPTX
Could a Data Science Program use Data Science Insights?
Zachary Thomas
 
PPTX
Density based clustering
YaswanthHariKumarVud
 
PPT
instance bases k nearest neighbor algorithm.ppt
Johny139575
 
PPT
3.4 density and grid methods
Krish_ver2
 
PPT
[PPT]
butest
 
PDF
A framework for outlier detection in
ijfcstjournal
 
PDF
A study on rough set theory based
ijaia
 
PDF
Data Mining: Cluster Analysis
Suman Mia
 
PDF
Computer Vision: Visual Extent of an Object
IOSR Journals
 
PDF
Ijartes v1-i2-006
IJARTES
 
PDF
Clustering Algorithms for Data Stream
IRJET Journal
 
PPTX
Clique and sting
Subramanyam Natarajan
 
DOCX
Lalal
Himanshu Sharma
 
PDF
UNIT V.pdf
MerbinJose
 
PDF
Convolution neural networks
Fares Hasan
 
PDF
Reduct generation for the incremental data using rough set theory
csandit
 
PDF
ML_Unit_IV_Clustering in Machine Learning.pdf
rameshwarchintamani
 
PPTX
Dbscan algorithom
Mahbubur Rahman Shimul
 
density based method and expectation maximization
Siva Priya
 
Could a Data Science Program use Data Science Insights?
Zachary Thomas
 
Density based clustering
YaswanthHariKumarVud
 
instance bases k nearest neighbor algorithm.ppt
Johny139575
 
3.4 density and grid methods
Krish_ver2
 
[PPT]
butest
 
A framework for outlier detection in
ijfcstjournal
 
A study on rough set theory based
ijaia
 
Data Mining: Cluster Analysis
Suman Mia
 
Computer Vision: Visual Extent of an Object
IOSR Journals
 
Ijartes v1-i2-006
IJARTES
 
Clustering Algorithms for Data Stream
IRJET Journal
 
Clique and sting
Subramanyam Natarajan
 
UNIT V.pdf
MerbinJose
 
Convolution neural networks
Fares Hasan
 
Reduct generation for the incremental data using rough set theory
csandit
 
ML_Unit_IV_Clustering in Machine Learning.pdf
rameshwarchintamani
 
Dbscan algorithom
Mahbubur Rahman Shimul
 
Ad

Recently uploaded (20)

PPTX
Introduction to Data Analytics and Data Science
KavithaCIT
 
PPTX
Introduction to Biostatistics Presentation.pptx
AtemJoshua
 
PPTX
Fuzzy_Membership_Functions_Presentation.pptx
pythoncrazy2024
 
PDF
Chad Readey - An Independent Thinker
Chad Readey
 
PPTX
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
PPTX
Pipeline Automatic Leak Detection for Water Distribution Systems
Sione Palu
 
PDF
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
PPTX
IP_Journal_Articles_2025IP_Journal_Articles_2025
mishell212144
 
PPTX
The whitetiger novel review for collegeassignment.pptx
DhruvPatel754154
 
PPTX
short term internship project on Data visualization
JMJCollegeComputerde
 
PPTX
INFO8116 -Big data architecture and analytics
guddipatel10
 
PDF
D9110.pdfdsfvsdfvsdfvsdfvfvfsvfsvffsdfvsdfvsd
minhn6673
 
PPTX
White Blue Simple Modern Enhancing Sales Strategy Presentation_20250724_21093...
RamNeymarjr
 
PPTX
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
PDF
SUMMER INTERNSHIP REPORT[1] (AutoRecovered) (6) (1).pdf
pandeydiksha814
 
PPTX
INFO8116 - Week 10 - Slides.pptx data analutics
guddipatel10
 
PPTX
INFO8116 - Week 10 - Slides.pptx big data architecture
guddipatel10
 
PDF
Key_Statistical_Techniques_in_Analytics_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PDF
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
PDF
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
Introduction to Data Analytics and Data Science
KavithaCIT
 
Introduction to Biostatistics Presentation.pptx
AtemJoshua
 
Fuzzy_Membership_Functions_Presentation.pptx
pythoncrazy2024
 
Chad Readey - An Independent Thinker
Chad Readey
 
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
Pipeline Automatic Leak Detection for Water Distribution Systems
Sione Palu
 
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
IP_Journal_Articles_2025IP_Journal_Articles_2025
mishell212144
 
The whitetiger novel review for collegeassignment.pptx
DhruvPatel754154
 
short term internship project on Data visualization
JMJCollegeComputerde
 
INFO8116 -Big data architecture and analytics
guddipatel10
 
D9110.pdfdsfvsdfvsdfvsdfvfvfsvfsvffsdfvsdfvsd
minhn6673
 
White Blue Simple Modern Enhancing Sales Strategy Presentation_20250724_21093...
RamNeymarjr
 
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
SUMMER INTERNSHIP REPORT[1] (AutoRecovered) (6) (1).pdf
pandeydiksha814
 
INFO8116 - Week 10 - Slides.pptx data analutics
guddipatel10
 
INFO8116 - Week 10 - Slides.pptx big data architecture
guddipatel10
 
Key_Statistical_Techniques_in_Analytics_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 

Optics

  • 1. OPTICS: Ordering Points To Identify the Clustering Structure Presented By: Rohit Paul
  • 2.  Disadvantages of DBSCAN  Requires two user inputs(Eps and MinPts)  Unable to determine variable density cluster  OPTICS:  Able to get variable density cluster  Mainly requires one inputs (i.e, MinPts)  Eps can be considered as ‘infinite’
  • 3. Idea  Creates an augmented ordering of the database representing its density-based clustering structure  Help us gain a high level understanding of the way data is structured
  • 4. Observation  For a constant MinPts value, density-based clusters with higher density are completely contained in density-connected sets with respect to a lower density.  Extend the DBSCAN algorithm such that several distance parameters are processed at the same time
  • 5. OPTICS  An infinite number of distance parameters eps’ which are smaller than a “generating distance” eps (i.e. 0 <= eps’ <= eps).  Order is stored in which the objects are processed and the information which would be used by an extended DBSCAN algorithm to assign cluster memberships  This information consists of only two values for each object: the core-distance and a reachability- distance
  • 6. Terminology  Core distance of an object p:  The core-distance of an object p is simply the smallest distance eps’ between p and an object in its e- neighborhood such that p would be a core object with respect to eps’ if this neighbor is contained in Ne(p).  Reachability-distance object p w.r.t. object o:  Reachability-distance of an object p with respect to another object o is the smallest distance such that p is directly density-reachable from o if o is a core object
  • 7. Algorithm  FOR i FROM 1 TO SetOfObjects.size DO  IF NOT Object.Processed THEN 1. neighbors := SetOfObjects.neighbors(Object, e); 2. Object.Processed := TRUE; 3. Object.reachability_distance := UNDEFINED; 4. Object.setCoreDistance(neighbors, e, MinPts); 5. OrderedFile.write(Object); 6. IF Object.core_distance <> UNDEFINED THEN  OrderSeeds.update(neighbors, Object);  WHILE NOT orderSeeds.empty() DO  Repeat Step 1, 2, 4, 5 and 6
  • 8.  If reachability-distance of the current object Object is larger than the clustering-distance eps’  Object is not density-reachable from any of the objects which are located before the current object in the cluster-ordering.  We look at the core-distance of Object and start a new cluster if Object is a core object with respect to eps’ and MinPts; otherwise, Object is assigned to NOISE  If reachability-distance of the current object is smaller than eps’  Can simply assign this object to the current cluster because then it is density-reachable from a preceding core object in the cluster-ordering.
  • 9. Reachability plot insensitive to input parameter • The smaller the Eps value, the more objects have an UNDEFINED reachability-distance • Lower values MinPts reachability-plot looks more jagged and higher value smoothen the curve.