SlideShare a Scribd company logo
Hierarchical
Clustering
4/30/17
1
Two type of hierarchical clustering
• Divisive hierarchical clustering
• Top – down approach
• Agglomerative hierarchical
clustering
• Bottom – up approach
4/30/17
2
Algorithm:
agglomerative
hierarchical clustering
4/30/17
3
Cluster similarity or dissimilarity
• Distance metric
• Euclidean distance
• Manhattan distance
• Jaccard index*
• Linkage criteria
• Single linkage
• Complete linkage
• Average linkage
* https://en.wikipedia.org/wiki/Jaccard_index
4/30/17
4
Single linkage
4/30/17
5
Complete linkage
4/30/17
6
Average linkage
4/30/17
7
Example: Complete linkage
Final set of clusters is (1,3,5),(2),(4)
4/30/17
8
Impact of metrics
Distance metric
• In a 2-dimensional space, the distance
between the point (1,1) and the origin
(0,0) can be 2 under Manhattan
distance, √2 under Euclidean distance.
01
Linkage criteria
• Distance between to clusters can be
different based on linkage criteria used
02
4/30/17
9
Linkage criteria
• Complete linkage is the most popular metric used for hierarchical clustering. It
is less sensitive to outliers.
• Single linkage can handle non-elliptical shapes. But, single linkage can lead to
clusters that are quite heterogeneous internally and it more sensitive to outliers
and noise.
4/30/17
10
Pros and cons: Hierarchical Clustering
• Pros
• No assumption of particular number of clusters
• Cons
• Too slow for large data sets, O(n2 log(n) )
• Once a decision is made to combine two clusters, it can’t be undone
4/30/17
11
References
• http://www.saedsayad.com/clustering_hierarchical.htm
• https://onlinecourses.science.psu.edu/stat555/node/86
• https://www.slideshare.net/xuyangela?utm_campaign=profiletracking&utm_me
dium=sssite&utm_source=ssslideview
4/30/17
12

More Related Content

What's hot (20)

PPTX
Privacy, security and ethics in data science
Nikolaos Vasiloglou
 
PPTX
Unsupervised learning
amalalhait
 
PDF
Data science presentation
MSDEVMTL
 
PPTX
Kdd process
Rajesh Chandra
 
PPTX
Introduction to ML (Machine Learning)
SwatiTripathi44
 
PPTX
Clustering in Data Mining
Archana Swaminathan
 
PDF
Methods of Optimization in Machine Learning
Knoldus Inc.
 
PPTX
ID3 ALGORITHM
HARDIK SINGH
 
PPT
Capter10 cluster basic : Han & Kamber
Houw Liong The
 
PPT
2. visualization in data mining
Azad public school
 
PPT
3.5 model based clustering
Krish_ver2
 
PPTX
Introduction to Clustering algorithm
hadifar
 
PPT
2.2 decision tree
Krish_ver2
 
PPT
Data Preprocessing
Object-Frontier Software Pvt. Ltd
 
PPT
Cluster analysis
Kamalakshi Deshmukh-Samag
 
PDF
Exploratory data analysis data visualization
Dr. Hamdan Al-Sabri
 
PPTX
Data preprocessing in Machine learning
pyingkodi maran
 
PPT
Data mining: Concepts and Techniques, Chapter12 outlier Analysis
Salah Amean
 
PDF
Classification and Clustering
Eng Teong Cheah
 
PPT
2.1 Data Mining-classification Basic concepts
Krish_ver2
 
Privacy, security and ethics in data science
Nikolaos Vasiloglou
 
Unsupervised learning
amalalhait
 
Data science presentation
MSDEVMTL
 
Kdd process
Rajesh Chandra
 
Introduction to ML (Machine Learning)
SwatiTripathi44
 
Clustering in Data Mining
Archana Swaminathan
 
Methods of Optimization in Machine Learning
Knoldus Inc.
 
ID3 ALGORITHM
HARDIK SINGH
 
Capter10 cluster basic : Han & Kamber
Houw Liong The
 
2. visualization in data mining
Azad public school
 
3.5 model based clustering
Krish_ver2
 
Introduction to Clustering algorithm
hadifar
 
2.2 decision tree
Krish_ver2
 
Cluster analysis
Kamalakshi Deshmukh-Samag
 
Exploratory data analysis data visualization
Dr. Hamdan Al-Sabri
 
Data preprocessing in Machine learning
pyingkodi maran
 
Data mining: Concepts and Techniques, Chapter12 outlier Analysis
Salah Amean
 
Classification and Clustering
Eng Teong Cheah
 
2.1 Data Mining-classification Basic concepts
Krish_ver2
 

Similar to Hierarchical clustering (20)

PDF
Hierarchical clustering.pdf
MostafaMenna
 
PDF
Hierarchical clustering
Learnbay Datascience
 
PDF
Mastering Hierarchical Clustering: A Comprehensive Guide
CyberPro Magazine
 
PDF
Mean shift and Hierarchical clustering
Yan Xu
 
PDF
Hierarchical Clustering
Carlos Castillo (ChaTo)
 
PPTX
log6kntt4i4dgwfwbpxw-signature-75c4ed0a4b22d2fef90396cdcdae85b38911f9dce0924a...
ABINASHPADHY6
 
ODP
Hierarchical Clustering With KSAI
Knoldus Inc.
 
PPTX
Hierarchical methods navdeep kaur newww.pptx
dhaliwalharsh055
 
PPTX
Unsupervised Learning-Clustering Algorithms.pptx
jasontseng19
 
PDF
12. Clustering.pdf for the students of aktu.
tanyasingh3130
 
PPTX
9 Hierarchical Clustering
Vishal Dutt
 
PPT
Slide-TIF311-DM-10-11.ppt
ImXaib
 
PPT
Slide-TIF311-DM-10-11.ppt
SandinoBerutu1
 
PPTX
Hierarchical clustering machine learning by arpit_sharma
Er. Arpit Sharma
 
PPTX
TYPES OF CLUSTERING.pptx
Incrediblev Vishnu
 
PPT
clustering and their types explanation of data mining
vandanasharma862095
 
PDF
cluster-Notes.pdf
adasdas13
 
PPTX
Hierarchical clustering.pptx
NTUConcepts1
 
PPTX
12 types of clustering
Vishal Dutt
 
PPTX
ML basic & clustering
monalisa Das
 
Hierarchical clustering.pdf
MostafaMenna
 
Hierarchical clustering
Learnbay Datascience
 
Mastering Hierarchical Clustering: A Comprehensive Guide
CyberPro Magazine
 
Mean shift and Hierarchical clustering
Yan Xu
 
Hierarchical Clustering
Carlos Castillo (ChaTo)
 
log6kntt4i4dgwfwbpxw-signature-75c4ed0a4b22d2fef90396cdcdae85b38911f9dce0924a...
ABINASHPADHY6
 
Hierarchical Clustering With KSAI
Knoldus Inc.
 
Hierarchical methods navdeep kaur newww.pptx
dhaliwalharsh055
 
Unsupervised Learning-Clustering Algorithms.pptx
jasontseng19
 
12. Clustering.pdf for the students of aktu.
tanyasingh3130
 
9 Hierarchical Clustering
Vishal Dutt
 
Slide-TIF311-DM-10-11.ppt
ImXaib
 
Slide-TIF311-DM-10-11.ppt
SandinoBerutu1
 
Hierarchical clustering machine learning by arpit_sharma
Er. Arpit Sharma
 
TYPES OF CLUSTERING.pptx
Incrediblev Vishnu
 
clustering and their types explanation of data mining
vandanasharma862095
 
cluster-Notes.pdf
adasdas13
 
Hierarchical clustering.pptx
NTUConcepts1
 
12 types of clustering
Vishal Dutt
 
ML basic & clustering
monalisa Das
 
Ad

More from Chakrit Phain (20)

PDF
LLM_PairProgramming.pdf
Chakrit Phain
 
PPTX
Web scraping with php
Chakrit Phain
 
PPTX
ChatGPT_Prompts.pptx
Chakrit Phain
 
PDF
Sentence-BERT
Chakrit Phain
 
PDF
AI_ML_Softnix.pdf
Chakrit Phain
 
PPTX
Web Scraping with Python
Chakrit Phain
 
PPTX
เปรียบเทียบ RPA Opensource
Chakrit Phain
 
PPTX
PHP Bandwidth Shaping script
Chakrit Phain
 
PPTX
PHP Explode & Preg_split Test
Chakrit Phain
 
PPTX
Types of Big Data Analytics
Chakrit Phain
 
PDF
Genetic Algorithm
Chakrit Phain
 
PDF
Machine Learning Algorithm & Anomaly detection 2021
Chakrit Phain
 
PDF
Text classification With Rapid Miner
Chakrit Phain
 
PPTX
Ai optimization Example
Chakrit Phain
 
PPTX
Zabbix aws
Chakrit Phain
 
PPTX
Anomaly Detection Technique
Chakrit Phain
 
PPTX
Softnix Anomaly Detection Methods
Chakrit Phain
 
PDF
Neo4j Graph Database และการประยุกตร์ใช้
Chakrit Phain
 
PDF
Softnix how ml_work_0.1draft
Chakrit Phain
 
PPTX
Shell Shock
Chakrit Phain
 
LLM_PairProgramming.pdf
Chakrit Phain
 
Web scraping with php
Chakrit Phain
 
ChatGPT_Prompts.pptx
Chakrit Phain
 
Sentence-BERT
Chakrit Phain
 
AI_ML_Softnix.pdf
Chakrit Phain
 
Web Scraping with Python
Chakrit Phain
 
เปรียบเทียบ RPA Opensource
Chakrit Phain
 
PHP Bandwidth Shaping script
Chakrit Phain
 
PHP Explode & Preg_split Test
Chakrit Phain
 
Types of Big Data Analytics
Chakrit Phain
 
Genetic Algorithm
Chakrit Phain
 
Machine Learning Algorithm & Anomaly detection 2021
Chakrit Phain
 
Text classification With Rapid Miner
Chakrit Phain
 
Ai optimization Example
Chakrit Phain
 
Zabbix aws
Chakrit Phain
 
Anomaly Detection Technique
Chakrit Phain
 
Softnix Anomaly Detection Methods
Chakrit Phain
 
Neo4j Graph Database และการประยุกตร์ใช้
Chakrit Phain
 
Softnix how ml_work_0.1draft
Chakrit Phain
 
Shell Shock
Chakrit Phain
 
Ad

Recently uploaded (20)

PPTX
Introduction to computer chapter one 2017.pptx
mensunmarley
 
PDF
McKinsey - Global Energy Perspective 2023_11.pdf
niyudha
 
PPTX
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
PPTX
Introduction to Data Analytics and Data Science
KavithaCIT
 
PPTX
Insurance-Analytics-Branch-Dashboard (1).pptx
trivenisapate02
 
PPTX
Data-Driven Machine Learning for Rail Infrastructure Health Monitoring
Sione Palu
 
PPTX
UVA-Ortho-PPT-Final-1.pptx Data analytics relevant to the top
chinnusindhu1
 
PDF
apidays Munich 2025 - The Physics of Requirement Sciences Through Application...
apidays
 
PDF
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
PPTX
MR and reffffffvvvvvvvfversal_083605.pptx
manjeshjain
 
PDF
Blitz Campinas - Dia 24 de maio - Piettro.pdf
fabigreek
 
PDF
blockchain123456789012345678901234567890
tanvikhunt1003
 
PPTX
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
PDF
apidays Munich 2025 - Making Sense of AI-Ready APIs in a Buzzword World, Andr...
apidays
 
PDF
apidays Munich 2025 - The Double Life of the API Product Manager, Emmanuel Pa...
apidays
 
PPTX
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
PPTX
short term internship project on Data visualization
JMJCollegeComputerde
 
PPTX
HSE WEEKLY REPORT for dummies and lazzzzy.pptx
ahmedibrahim691723
 
PDF
Blue Futuristic Cyber Security Presentation.pdf
tanvikhunt1003
 
PPTX
7 Easy Ways to Improve Clarity in Your BI Reports
sophiegracewriter
 
Introduction to computer chapter one 2017.pptx
mensunmarley
 
McKinsey - Global Energy Perspective 2023_11.pdf
niyudha
 
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
Introduction to Data Analytics and Data Science
KavithaCIT
 
Insurance-Analytics-Branch-Dashboard (1).pptx
trivenisapate02
 
Data-Driven Machine Learning for Rail Infrastructure Health Monitoring
Sione Palu
 
UVA-Ortho-PPT-Final-1.pptx Data analytics relevant to the top
chinnusindhu1
 
apidays Munich 2025 - The Physics of Requirement Sciences Through Application...
apidays
 
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
MR and reffffffvvvvvvvfversal_083605.pptx
manjeshjain
 
Blitz Campinas - Dia 24 de maio - Piettro.pdf
fabigreek
 
blockchain123456789012345678901234567890
tanvikhunt1003
 
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
apidays Munich 2025 - Making Sense of AI-Ready APIs in a Buzzword World, Andr...
apidays
 
apidays Munich 2025 - The Double Life of the API Product Manager, Emmanuel Pa...
apidays
 
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
short term internship project on Data visualization
JMJCollegeComputerde
 
HSE WEEKLY REPORT for dummies and lazzzzy.pptx
ahmedibrahim691723
 
Blue Futuristic Cyber Security Presentation.pdf
tanvikhunt1003
 
7 Easy Ways to Improve Clarity in Your BI Reports
sophiegracewriter
 

Hierarchical clustering