2
Most read
3
Most read
4
Most read
Concept of principal
component analysis
By
Sangeetha M S
DFK1302
PCA -Overview
• It is a simple, nonparametric method of extracting
relevant information from confusing datasets.
• Statistical procedure that uses an orthogonal
transformation to convert a set of observations of
possibly correlated variables into a set of values of
linearly uncorrelated variable called principal
components.
• PCA was invented in 1901 by Karl Pearson
• Independently developed (and named) by Harold
Hotelling in the 1930s.
• First applied in ecology by Goodall (1954) under the
name “factor analysis” (“principal factor analysis” is a
synonym of PCA).
Overview
• It is a way of identifying patterns in data, and
expressing the data in such a way as to highlight
their similarities and differences.
• Since patterns in data can be hard to find in data
of high dimension, where the luxury of graphical
representation is not available, PCA is a powerful
tool for analysing data.
• The other main advantage of PCA is that once you
have found these patterns in the data, and you
compress the data, ie. by reducing the number of
dimensions, without much loss of information.
• PCA is considered an exploratory technique that
can be used to gain a better understanding of the
interrelationships between variables.
• PCA is performed on a set of data with the hope
of simplifying the description of a set of
interrelated variables.
• Variables are treated equally and they are not
separated into dependent and independent
variables.
• In simplest terms, PCA transforms the original
interrelated variables into a new set of
uncorrelated variables call Principal Components.
• An advantage of principal components to
researchers is that the complexity in
interpretation that can be caused by having
a large number of interrelated variables
can be reduced by utilizing only the first
few principal components that explain a
large proportion of the total variation.
• PCA can be used to test for normality. If
the principal components are not normally
distributed, then the original data weren’t
either.
• An important concept of PCA is to reduce
the number of variables or reduce
dimensionality.
• An important decision that the researcher
must make when using PCA is to determine
the number of principal components to use.
• This decision has no hard-set rules, and
the decision may seem subjective at times.
ROLE OF PCA
• PCA captures major (principal) variability present
in the data set and ignores smaller variability.
• It helps in finding Eigen Values and Eigen
Vectors.
• Significant Eigen Values and Eigen Vectors are
taken for deciding PCs.
• PCA forms new coordinate system defined by the
significant Eigen vectors. (new coordinates will
have lower dimensions )
• Mapping data to the new space.
PRINCIPLE OF PCA
• It works on Linear projection method to
reduce the number of variables.
• Transfer a set of correlated variables
into a new set of uncorrelated variables.
• Map the data into a space of lower
dimensionality.
• PCA rotates existing axes to new
positions in the space defined by original
variable.
IMPORTANCE
• With data of high dimensions, where
graphical representation is difficult, PCA is
a powerful multivariate statistical tool for
analyzing data and finding patterns in it.
• Mapping of data compression is also
possible using PCA.
INTERPRETATION
 PCA plots should be interpreted by looking
at points relative to the origin.
– Points that are in similar directions or
closed positions are positively correlated.
– Points that are on opposite sides of the
origin are negatively correlated.
– Points that are far from the origin or
more perpendicular less correlated.
APPLICATIONS
1. Sensory evaluation and quality control in
post harvest technologies.
2. Applicable to Stock assessments in fisheries
resources.
3. Applicable in genetics (Genomics) and
related experiments.
4. To assess the nutritional status of fishes
and shellfishes with respect to biochemical
compositions, growth parameters and
dietary aspects (Nutrients, 2011).
5. Monitoring environmental studies and other
parameters in aquaculture.
ADVANTAGES
Both objective and subjective attributes can be used.
• It can be done accurately (only) with the help of
Statistical software.
• Direct inputs from treatments.
• There is flexibility in naming and using dimensions.
• PCA is useful for finding new, more informative
and uncorrelated components.
• It reduces dimensionality by rejecting lower
variance components.
DISADVANTAGES
• Usefulness depends on the researchers' ability
to develop a complete and accurate set of
attributes - If important attributes are
missed, precision of the procedure is reduced
accordingly.
• Naming of the factors (independent variables)
can be difficult - multiple attributes can be
highly correlated with no apparent reason.
• If the observed variables are completely
unrelated, PCA analysis is unable to produce a
meaningful pattern.
Pca analysis

More Related Content

PDF
Principal Component Analysis
PPTX
Principal component analysis
PPTX
Lect4 principal component analysis-I
PPTX
Lect5 principal component analysis
PPTX
PPTX
PPTX
Pca(principal components analysis)
PPTX
Principal component analysis
Principal Component Analysis
Principal component analysis
Lect4 principal component analysis-I
Lect5 principal component analysis
Pca(principal components analysis)
Principal component analysis

What's hot (20)

PPTX
Introduction to principal component analysis (pca)
PPT
Multivariate Analysis Techniques
PDF
Principal component analysis and lda
PPTX
Principal Component Analysis (PCA) and LDA PPT Slides
PPTX
Introduction to Maximum Likelihood Estimator
PPTX
Cluster Analysis
PDF
PCA (Principal component analysis)
PPTX
Hierarchical clustering.pptx
PPTX
Multivariate data analysis
PPTX
Probability Theory
PPTX
Cluster analysis
PPTX
Logistic regression
PDF
Exploratory data analysis data visualization
PPT
Principles of experimental design
PPTX
Polynomial regression
PPTX
Random forest
PPTX
Principal Component Analysis PCA
PPTX
Dimension Reduction: What? Why? and How?
ODP
Introduction to Principle Component Analysis
PPTX
Exploratory Data Analysis
Introduction to principal component analysis (pca)
Multivariate Analysis Techniques
Principal component analysis and lda
Principal Component Analysis (PCA) and LDA PPT Slides
Introduction to Maximum Likelihood Estimator
Cluster Analysis
PCA (Principal component analysis)
Hierarchical clustering.pptx
Multivariate data analysis
Probability Theory
Cluster analysis
Logistic regression
Exploratory data analysis data visualization
Principles of experimental design
Polynomial regression
Random forest
Principal Component Analysis PCA
Dimension Reduction: What? Why? and How?
Introduction to Principle Component Analysis
Exploratory Data Analysis
Ad

Viewers also liked (7)

PPSX
PPTX
Application of Principal Components Analysis in Quality Control Problem
PPTX
PCA (Principal component analysis) Theory and Toolkits
PDF
Principal component analysis - application in finance
PDF
Principal Component Analysis(PCA) understanding document
PDF
PCA (Principal Component Analysis)
PPTX
Steps for Principal Component Analysis (pca) using ERDAS software
Application of Principal Components Analysis in Quality Control Problem
PCA (Principal component analysis) Theory and Toolkits
Principal component analysis - application in finance
Principal Component Analysis(PCA) understanding document
PCA (Principal Component Analysis)
Steps for Principal Component Analysis (pca) using ERDAS software
Ad

Similar to Pca analysis (20)

PPTX
Principal Component Analysis in Machine learning.pptx
PPTX
Principal component analysis.pptx
PPTX
Principal component analysis.pptx
PPTX
PRINCIPLE COMPONENT ANALYSIS.pptx
PPTX
Principal component analysis.pptx
PPTX
Easy_PCA_Presentation multivariate .pptx
DOCX
Principal component analysis (PCA) to analyze data
PPTX
Feature selection using PCA.pptx
PPTX
Principal Component Analysis (PCA).pptx
PPTX
Principal component analysis in machine L
PPTX
PCA Final.pptx
PPTX
Dimensionality Reduction and feature extraction.pptx
PPTX
PCACONFUSIONMATRIX.pptx
PDF
When to use Principal Component Analysis (PCA) ?
PDF
Principal Component Analysis in Machine Learning.pdf
PPTX
Lecture4_machine learning Principle .pptx
PPTX
Principal Component Analysis (PCA) machine Learning.
PDF
Mathematical Introduction to Principal Components Analysis
PPTX
Unit3_1.pptx
Principal Component Analysis in Machine learning.pptx
Principal component analysis.pptx
Principal component analysis.pptx
PRINCIPLE COMPONENT ANALYSIS.pptx
Principal component analysis.pptx
Easy_PCA_Presentation multivariate .pptx
Principal component analysis (PCA) to analyze data
Feature selection using PCA.pptx
Principal Component Analysis (PCA).pptx
Principal component analysis in machine L
PCA Final.pptx
Dimensionality Reduction and feature extraction.pptx
PCACONFUSIONMATRIX.pptx
When to use Principal Component Analysis (PCA) ?
Principal Component Analysis in Machine Learning.pdf
Lecture4_machine learning Principle .pptx
Principal Component Analysis (PCA) machine Learning.
Mathematical Introduction to Principal Components Analysis
Unit3_1.pptx

More from College of Fisheries, KVAFSU, Mangalore, Karnataka (20)

PPTX
PPTX
Types of coral reefs and its distribution
PPTX
PPTX
PPTX
Stunted seed production & culture practices
PPTX
PPTX
PPTX
Plants and animals associates of living reef corals
PPTX
PPT
Detail accounts of different comme rci al carp egg hatching devices (2)
PPTX
Conservation and management of coral reefs
PPTX
Types of coral reefs and its distribution
Stunted seed production & culture practices
Plants and animals associates of living reef corals
Detail accounts of different comme rci al carp egg hatching devices (2)
Conservation and management of coral reefs

Recently uploaded (20)

PDF
LIFE & LIVING TRILOGY - PART (3) REALITY & MYSTERY.pdf
PDF
PUBH1000 - Module 6: Global Health Tute Slides
PPTX
PLASMA AND ITS CONSTITUENTS 123.pptx
PDF
Chevening Scholarship Application and Interview Preparation Guide
PDF
Nurlina - Urban Planner Portfolio (english ver)
PDF
African Communication Research: A review
PDF
Journal of Dental Science - UDMY (2020).pdf
PPTX
Macbeth play - analysis .pptx english lit
PDF
LIFE & LIVING TRILOGY- PART (1) WHO ARE WE.pdf
PPTX
2025 High Blood Pressure Guideline Slide Set.pptx
PPTX
Diploma pharmaceutics notes..helps diploma students
PDF
Laparoscopic Colorectal Surgery at WLH Hospital
PPTX
ACFE CERTIFICATION TRAINING ON LAW.pptx
PDF
LIFE & LIVING TRILOGY - PART - (2) THE PURPOSE OF LIFE.pdf
PDF
Physical education and sports and CWSN notes
PDF
0520_Scheme_of_Work_(for_examination_from_2021).pdf
PDF
Horaris_Grups_25-26_Definitiu_15_07_25.pdf
PDF
faiz-khans about Radiotherapy Physics-02.pdf
PDF
Farming Based Livelihood Systems English Notes
PDF
Journal of Dental Science - UDMY (2022).pdf
LIFE & LIVING TRILOGY - PART (3) REALITY & MYSTERY.pdf
PUBH1000 - Module 6: Global Health Tute Slides
PLASMA AND ITS CONSTITUENTS 123.pptx
Chevening Scholarship Application and Interview Preparation Guide
Nurlina - Urban Planner Portfolio (english ver)
African Communication Research: A review
Journal of Dental Science - UDMY (2020).pdf
Macbeth play - analysis .pptx english lit
LIFE & LIVING TRILOGY- PART (1) WHO ARE WE.pdf
2025 High Blood Pressure Guideline Slide Set.pptx
Diploma pharmaceutics notes..helps diploma students
Laparoscopic Colorectal Surgery at WLH Hospital
ACFE CERTIFICATION TRAINING ON LAW.pptx
LIFE & LIVING TRILOGY - PART - (2) THE PURPOSE OF LIFE.pdf
Physical education and sports and CWSN notes
0520_Scheme_of_Work_(for_examination_from_2021).pdf
Horaris_Grups_25-26_Definitiu_15_07_25.pdf
faiz-khans about Radiotherapy Physics-02.pdf
Farming Based Livelihood Systems English Notes
Journal of Dental Science - UDMY (2022).pdf

Pca analysis

  • 1. Concept of principal component analysis By Sangeetha M S DFK1302
  • 2. PCA -Overview • It is a simple, nonparametric method of extracting relevant information from confusing datasets. • Statistical procedure that uses an orthogonal transformation to convert a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variable called principal components. • PCA was invented in 1901 by Karl Pearson • Independently developed (and named) by Harold Hotelling in the 1930s. • First applied in ecology by Goodall (1954) under the name “factor analysis” (“principal factor analysis” is a synonym of PCA). Overview
  • 3. • It is a way of identifying patterns in data, and expressing the data in such a way as to highlight their similarities and differences. • Since patterns in data can be hard to find in data of high dimension, where the luxury of graphical representation is not available, PCA is a powerful tool for analysing data. • The other main advantage of PCA is that once you have found these patterns in the data, and you compress the data, ie. by reducing the number of dimensions, without much loss of information.
  • 4. • PCA is considered an exploratory technique that can be used to gain a better understanding of the interrelationships between variables. • PCA is performed on a set of data with the hope of simplifying the description of a set of interrelated variables. • Variables are treated equally and they are not separated into dependent and independent variables. • In simplest terms, PCA transforms the original interrelated variables into a new set of uncorrelated variables call Principal Components.
  • 5. • An advantage of principal components to researchers is that the complexity in interpretation that can be caused by having a large number of interrelated variables can be reduced by utilizing only the first few principal components that explain a large proportion of the total variation. • PCA can be used to test for normality. If the principal components are not normally distributed, then the original data weren’t either.
  • 6. • An important concept of PCA is to reduce the number of variables or reduce dimensionality. • An important decision that the researcher must make when using PCA is to determine the number of principal components to use. • This decision has no hard-set rules, and the decision may seem subjective at times.
  • 7. ROLE OF PCA • PCA captures major (principal) variability present in the data set and ignores smaller variability. • It helps in finding Eigen Values and Eigen Vectors. • Significant Eigen Values and Eigen Vectors are taken for deciding PCs. • PCA forms new coordinate system defined by the significant Eigen vectors. (new coordinates will have lower dimensions ) • Mapping data to the new space.
  • 8. PRINCIPLE OF PCA • It works on Linear projection method to reduce the number of variables. • Transfer a set of correlated variables into a new set of uncorrelated variables. • Map the data into a space of lower dimensionality. • PCA rotates existing axes to new positions in the space defined by original variable.
  • 9. IMPORTANCE • With data of high dimensions, where graphical representation is difficult, PCA is a powerful multivariate statistical tool for analyzing data and finding patterns in it. • Mapping of data compression is also possible using PCA.
  • 10. INTERPRETATION  PCA plots should be interpreted by looking at points relative to the origin. – Points that are in similar directions or closed positions are positively correlated. – Points that are on opposite sides of the origin are negatively correlated. – Points that are far from the origin or more perpendicular less correlated.
  • 11. APPLICATIONS 1. Sensory evaluation and quality control in post harvest technologies. 2. Applicable to Stock assessments in fisheries resources. 3. Applicable in genetics (Genomics) and related experiments. 4. To assess the nutritional status of fishes and shellfishes with respect to biochemical compositions, growth parameters and dietary aspects (Nutrients, 2011). 5. Monitoring environmental studies and other parameters in aquaculture.
  • 12. ADVANTAGES Both objective and subjective attributes can be used. • It can be done accurately (only) with the help of Statistical software. • Direct inputs from treatments. • There is flexibility in naming and using dimensions. • PCA is useful for finding new, more informative and uncorrelated components. • It reduces dimensionality by rejecting lower variance components.
  • 13. DISADVANTAGES • Usefulness depends on the researchers' ability to develop a complete and accurate set of attributes - If important attributes are missed, precision of the procedure is reduced accordingly. • Naming of the factors (independent variables) can be difficult - multiple attributes can be highly correlated with no apparent reason. • If the observed variables are completely unrelated, PCA analysis is unable to produce a meaningful pattern.