Principal
Component
Analysis
Ricardo Wendell
Aug 2013
2
Agenda
Feature Engineering
(Our motivation)
Introduction to Principal Component
Analysis
(And some statistical concepts)
Agile Analytics and PCA
(Helping visualization…)
3
Feature
Engineering
4
Given a
classification
problem…
How do we choose
the right features?
5
Intuition
fails in high
dimensions
Building a classifier in two or three
dimensions is relatively easy…
It’s usually possible to find a
reasonable frontier between
examples of different
classes just by visual inspection.
6
Feature
engineering
Intuitively, one might
think that gathering
more features never
hurts, right?
At worst they provide
no new information
about the domain…
7
The curse of
dimensionality
Many algorithms that work fine in low
dimensions become intractable
when the input is high-dimensional.
Bellman, 1961
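One symptom of this can be sketched numerically. The example below is an illustration chosen for this write-up, not something from the original deck: it assumes points drawn uniformly from the unit cube and uses a hypothetical helper, `relative_contrast`, to compare the nearest and farthest distances from a query point. As the number of dimensions grows, the two distances tend to become almost the same, which hurts any method that relies on "nearness".

```python
import numpy as np


def relative_contrast(n_points: int, n_dims: int, seed: int = 0) -> float:
    """Return (max - min) / min over distances from a random query point
    to n_points random points in the unit cube (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    points = rng.random((n_points, n_dims))   # uniform samples in [0, 1)^n_dims
    query = rng.random(n_dims)                # one more uniform sample
    dists = np.linalg.norm(points - query, axis=1)
    return float((dists.max() - dists.min()) / dists.min())


contrast_low = relative_contrast(1000, 2)      # two dimensions
contrast_high = relative_contrast(1000, 1000)  # one thousand dimensions

print(f"relative contrast in 2 dimensions:    {contrast_low:.2f}")
print(f"relative contrast in 1000 dimensions: {contrast_high:.2f}")
```

With this setup the contrast in two dimensions should come out far larger than in one thousand dimensions, where the farthest point is only slightly farther away than the nearest one.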
8
How do we
solve it?
Feature Selection
Feature Extraction
9
Feature
extraction
“In most applications
examples are not spread
uniformly throughout the
examples space, but are
concentrated on or near
a lower-dimensional
subspace.”
10
Introduction to
PCA
11
Objective of
PCA
To perform dimensionality
reduction while preserving
as much of the variance
in the high-dimensional
space as possible
12
Principal
Component
Analysis
It takes your cloud of data
points, and rotates it such
that the maximum variability
is visible.
PCA is mainly concerned
with identifying correlations
in the data.
13
Measuring
Correlation
Degree and type of relationship
between any two or more quantities
(variables) in which they vary together
over a period
Correlation ranges from -1 to +1.
Values close to +1 indicate a high
degree of positive correlation, and
values close to -1 indicate a high
degree of negative correlation.
Values close to zero indicate weak
linear correlation of either kind, and 0
indicates no linear correlation at all.
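As a minimal sketch of how such a coefficient can be computed, assuming the slide refers to the Pearson correlation coefficient (the deck does not name it), the hypothetical function `pearson` below divides the sum of products of the centered values by the product of their magnitudes. The sample arrays are made up for illustration.

```python
import numpy as np


def pearson(x, y) -> float:
    """Pearson correlation coefficient of two equal-length sequences."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    xc = x - x.mean()  # center each variable on its mean
    yc = y - y.mean()
    return float((xc @ yc) / np.sqrt((xc @ xc) * (yc @ yc)))


x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])

r_pos = pearson(x, 2 * x + 1)      # y rises with x along a straight line
r_neg = pearson(x, -3 * x)         # y falls as x rises along a straight line
r_zero = pearson(x, (x - 3) ** 2)  # symmetric U-shape around x = 3

print(r_pos, r_neg, r_zero)
```

The last case is worth noting: the U-shaped data has a coefficient of zero even though y is completely determined by x, so a value of zero rules out a linear relationship only.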
14
Measuring
Correlation
15
Beware: Correlation does not
imply causation
16
Correlation
matrix
It shows at a glance how
variables correlate with
each other
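A small sketch of building such a matrix, using NumPy's `np.corrcoef` on synthetic data. The three variables (`height`, `weight`, `noise`) and their parameters are invented for this example and do not come from the deck.

```python
import numpy as np

rng = np.random.default_rng(42)
n = 500

# Synthetic, made-up variables: weight depends on height, noise depends on nothing.
height = rng.normal(170, 10, n)
weight = 0.9 * height + rng.normal(0, 5, n)
noise = rng.normal(0, 1, n)

data = np.column_stack([height, weight, noise])  # shape (n, 3), one column per variable

# rowvar=False tells NumPy that columns, not rows, are the variables.
corr = np.corrcoef(data, rowvar=False)

print(np.round(corr, 2))
```

The result is a symmetric 3 x 3 matrix with ones on the diagonal. The height/weight entry should be strongly positive, while the entries involving `noise` should sit near zero.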
17
Eigenvalues
and
eigenvectors
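For reference, an eigenvector of a matrix is a direction that the matrix only stretches or shrinks, and the eigenvalue is the stretch factor. The sketch below checks that property with `np.linalg.eigh`, which is meant for symmetric matrices such as covariance matrices. The 2 x 2 matrix is a made-up example.

```python
import numpy as np

# A small symmetric matrix standing in for a covariance matrix (made-up values).
cov = np.array([[2.0, 1.0],
                [1.0, 2.0]])

# eigh returns eigenvalues in ascending order and eigenvectors as columns.
eigenvalues, eigenvectors = np.linalg.eigh(cov)

for lam, v in zip(eigenvalues, eigenvectors.T):
    # Multiplying by the matrix only rescales an eigenvector by its eigenvalue.
    holds = np.allclose(cov @ v, lam * v)
    print(f"eigenvalue {lam:.1f}, eigenvector {np.round(v, 3)}, cov @ v == lam * v: {holds}")
```

For this matrix the eigenvalues are 1 and 3, and the eigenvectors lie along the two diagonals of the plane. In PCA the eigenvector with the largest eigenvalue gives the direction of greatest variance.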
18
Steps for PCA
1. Standardize the data
2. Calculate the covariance matrix
3. Find the eigenvalues and
eigenvectors of the covariance
matrix
4. Plot the eigenvectors / principal
components over the scaled data
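The steps above can be sketched in a few lines. The original demo used R and its code is not part of this text, so the version below is an independent sketch in Python with NumPy. The function name `pca` and the synthetic data are assumptions made for illustration. In place of the plot in step 4, the sketch projects the scaled data onto the components, which gives the coordinates such a plot would show.

```python
import numpy as np


def pca(data: np.ndarray):
    """Hypothetical helper following the four steps listed above.

    data has shape (n_samples, n_features).
    Returns (eigenvalues, components, scores), sorted by decreasing eigenvalue.
    """
    # 1. Standardize: zero mean and unit sample variance per feature.
    scaled = (data - data.mean(axis=0)) / data.std(axis=0, ddof=1)

    # 2. Covariance matrix of the standardized data (columns are variables).
    cov = np.cov(scaled, rowvar=False)

    # 3. Eigen-decomposition; eigh suits symmetric matrices, ascending order.
    eigenvalues, eigenvectors = np.linalg.eigh(cov)
    order = np.argsort(eigenvalues)[::-1]  # largest variance first
    eigenvalues = eigenvalues[order]
    components = eigenvectors[:, order]    # one principal component per column

    # 4. Instead of plotting, project the scaled data onto the components.
    scores = scaled @ components
    return eigenvalues, components, scores


# Synthetic data: the second feature is nearly a multiple of the first,
# the third is independent noise.
rng = np.random.default_rng(0)
x = rng.normal(size=300)
data = np.column_stack([
    x,
    2 * x + rng.normal(scale=0.5, size=300),
    rng.normal(size=300),
])

eigenvalues, components, scores = pca(data)
explained = eigenvalues / eigenvalues.sum()  # share of variance per component

print("explained variance ratio:", np.round(explained, 3))
```

Because the first two features move together, the first component should carry well over half of the total variance. Keeping only the first one or two columns of `scores` is the dimensionality reduction step.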
19
Demo
with R
Let’s check the products
of PCA…
20
Agile analytics
and PCA
21
Agile
Analytics
Machine learning and data
mining tools and techniques
+
Knowledge of the
domain at hand
+
Short feedback cycles
22
Agile
Analytics
We could use PCA as a tool to
quickly identify correlation
between features, helping
feature extraction and
selection.
Reducing dimensionality using
PCA or another similar technique
can help us achieve better and
quicker results.
23
Q&A and Next Steps



