Welcome to Scribd!

0% found this document useful (0 votes)

23 views

Pca 1692550768

Uploaded by

PCA is an unsupervised machine learning algorithm used for dimensionality reduction. It works by identifying principal components as linear combinations of the original variables that explain the maximum variance in the data. The algorithm standardizes data, computes the covariance matrix, calculates eigenvectors and eigenvalues to identify principal components, and projects the data onto these components to reduce dimensionality while retaining variation. PCA is useful for data visualization, compression, and preprocessing for machine learning models.

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Pca 1692550768

Uploaded by

kan luc N'guessan

0% found this document useful (0 votes)

23 views13 pages

Original Title

PCA_1692550768

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Download as pdf or txt

0% found this document useful (0 votes)

23 views13 pages

Pca 1692550768

Uploaded by

kan luc N'guessan

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Download as pdf or txt

Jump to Page

You are on page 1of 13

Search inside document

Principal

Component
Analysis
What is Principal
Component Analysis?

Principal Component Analysis (PCA) is an

unsupervised learning algorithm that is used for
dimensionality reduction in machine learning.

PCA is used to reduce the dimensionality of a

large dataset while retaining as much of the
original variation as possible.
How PCA works?

PCA works by identifying a set of new variables,

called principal components, that are linear
combinations of the original variables, and then
projecting the data onto these new variables.

The first principal component is chosen to explain

the maximum amount of variation in the data,
and each subsequent component is chosen to
explain the remaining variation in order of
decreasing importance. The principal
components are chosen such that they are
orthogonal (i.e., uncorrelated) to each other.
PCA Algorithm Steps

1. Standardize the data: PCA assumes that the

data is standardized (i.e., centered at zero and
scaled to have unit variance). This step involves
subtracting the mean of each variable from each
observation and dividing it by the standard
deviation of each variable.

2. Compute the covariance matrix: The

covariance matrix measures the pairwise
covariances between all pairs of variables in the
data. This matrix is used to compute the principal
components.
PCA Algorithm Steps

3. Compute the eigenvectors and eigenvalues

of the covariance matrix: The eigenvectors of
the covariance matrix represent the directions in
which the data varies the most, while the
eigenvalues represent the amount of variance
explained by each eigenvector.
PCA Algorithm Steps

4. Select the principal components: The

eigenvectors are sorted in descending order of
their corresponding eigenvalues, and the top k
eigenvectors are selected as the principal
components. The number of principal
components to select depends on the amount of
variance that needs to be explained and the
desired level of dimensionality reduction.
PCA Algorithm Steps

5. Project the data onto the principal

components: The original data is then projected
onto the new set of k principal components. This
involves computing the dot product of the original
data matrix with the matrix of selected
eigenvectors.

6. Analyze the results: The resulting matrix of

projected data can be analyzed using standard
statistical techniques. The principal components
themselves can also be examined to gain insight
into the underlying structure of the data.
Why it is useful?

PCA can be used for a variety of purposes,

including data visualization, data compression,
and data pre-processing for machine learning
algorithms.
By reducing the dimensionality of the data, PCA
can make it easier to analyze and interpret large
datasets, and can also help to remove noise and
redundancy in the data.
Advantages

1. Easy to calculate and compute.

2. Speeds up machine learning computing
processes and algorithms.
3. Prevents predictive algorithms from data
overfitting issues.
4. Increases performance of ML algorithms by
eliminating unnecessary correlated variables.
5. Principal Component Analysis results in high
variance and increases visualization.
6. Helps reduce noise that cannot be ignored
automatically.
Disadvantages

1. Sometimes, PCA is difficult to interpret. In rare

cases, you may feel difficult to identify the most
important features even after computing the
principal components.

2. You may face some difficulties in calculating

the covariances and covariance matrices.

3. Sometimes, the computed principal

components can be more difficult to read rather
than the original set of components.
Follow #DataRanch on LinkedIn
for more...
Follow #DataRanch on LinkedIn
for more...
info@dataranch.org

linkedin.com/company/dataranch

A Project Report On Cost Analysis
Document65 pages
A Project Report On Cost Analysis
Royal Projects
83% (41)
Unit 3
Document31 pages
Unit 3
wansejalm527
No ratings yet
The Intuition Behind PCA: Machine Learning Assignment
Document11 pages
The Intuition Behind PCA: Machine Learning Assignment
Palash Ghosh
No ratings yet
Chapter Five Principal Comonent Analysis (PCA)
Document33 pages
Chapter Five Principal Comonent Analysis (PCA)
Ruun Mohamed
No ratings yet
3.2 Pca
Document27 pages
3.2 Pca
Javada Javada
No ratings yet
Dimensionality Reduction
Document30 pages
Dimensionality Reduction
suryafootball01
No ratings yet
Principal Component Analysis
Document11 pages
Principal Component Analysis
Sravan Kumar Thota
No ratings yet
U4 - PCA - 5th Sem - DS
Document14 pages
U4 - PCA - 5th Sem - DS
subbumail051
No ratings yet
Need of PCA
Document6 pages
Need of PCA
Simi Jain
100% (1)
Principal Component Analysis
Document3 pages
Principal Component Analysis
ecobalas7
No ratings yet
Love Report
Document7 pages
Love Report
vinayakjivtode84
No ratings yet
Advantages and Disadvantage of PCA
Document2 pages
Advantages and Disadvantage of PCA
Shobha Kumari Choudhary
No ratings yet
Data Analytics
Document28 pages
Data Analytics
researcherniaz
No ratings yet
Principal Component Analysis (PCA)
Document3 pages
Principal Component Analysis (PCA)
Kk
No ratings yet
Principal Component Analysis
Document10 pages
Principal Component Analysis
Deeksha Manoj
No ratings yet
Principal Component Analysis
Document2 pages
Principal Component Analysis
Shobha Kumari Choudhary
No ratings yet
Principal Component Analysis
Document10 pages
Principal Component Analysis
parvathamhymavathi
No ratings yet
Unit - IV - DIMENSIONALITY REDUCTION AND GRAPHICAL MODELS
Document59 pages
Unit - IV - DIMENSIONALITY REDUCTION AND GRAPHICAL MODELS
Indumathy Paranthaman
No ratings yet
Mloa Exp2 C121
Document20 pages
Mloa Exp2 C121
Devanshu Maheshwari
No ratings yet
Principal Component Analysis
Document1 page
Principal Component Analysis
RANJIT PANIGRAHI
No ratings yet
Ai & ML Week-9
Document30 pages
Ai & ML Week-9
ಹರಿ ಶಂ
No ratings yet
Unit 4 Part 2
Document17 pages
Unit 4 Part 2
Prince Rathore
No ratings yet
Assignment
Document24 pages
Assignment
Santhi Palanisamy
No ratings yet
Dimensionality Reduction
Document47 pages
Dimensionality Reduction
bka212407
No ratings yet
Pca
Document18 pages
Pca
gerry
No ratings yet
1694601214-Unit 3.4 Principal Component Analysis CU 2.0
Document36 pages
1694601214-Unit 3.4 Principal Component Analysis CU 2.0
prime9316586191
No ratings yet
Data Mining Project 11
Document18 pages
Data Mining Project 11
Abraham Zeleke
No ratings yet
Week 4
Document3 pages
Week 4
MANISH P
No ratings yet
Data Science Technical Interview Questions
Document24 pages
Data Science Technical Interview Questions
pablo.villegas.mills
No ratings yet
Remote Sensing Assignment
Document10 pages
Remote Sensing Assignment
Kio Meow
No ratings yet
Principal Component Analysis
Document13 pages
Principal Component Analysis
Shil Shambharkar
No ratings yet
Presentation1
Document15 pages
Presentation1
anirbandutta541
No ratings yet
EDAB Module 5 Singular Value Decomposition (SVD)
Document58 pages
EDAB Module 5 Singular Value Decomposition (SVD)
nagarajan
No ratings yet
PCA and LDA Assignment
Document5 pages
PCA and LDA Assignment
nwaytk520
No ratings yet
UNIT-4
Document79 pages
UNIT-4
21311a1962
No ratings yet
ML 6
Document7 pages
ML 6
ananyahc12
No ratings yet
DR Pca
Document22 pages
DR Pca
adarsh.tripathi
No ratings yet
Unit 5 - Machine Learning - Www.a2softech - Xyz - A2kash
Document12 pages
Unit 5 - Machine Learning - Www.a2softech - Xyz - A2kash
Aakash Kumar Pawar
No ratings yet
Dimensionality reduction
Document7 pages
Dimensionality reduction
shaikarimulla830
No ratings yet
Unit 1
Document8 pages
Unit 1
binokad912
No ratings yet
Unit 5 Pattern Recognition
Document10 pages
Unit 5 Pattern Recognition
Shayar Chauhan
No ratings yet
ML Module 6
Document6 pages
ML Module 6
Viman
No ratings yet
Principal Component Analysis - Intro - Towards Data Science
Document4 pages
Principal Component Analysis - Intro - Towards Data Science
Alan Picard
No ratings yet
Data Preprocessing in Machine Learning
Document5 pages
Data Preprocessing in Machine Learning
Musto
No ratings yet
PCA - Principal Component Analysis: Step by Step Computation of PCA
Document2 pages
PCA - Principal Component Analysis: Step by Step Computation of PCA
Sohini Dey
No ratings yet
PRACTICAL5
Document23 pages
PRACTICAL5
thundergamerz403
No ratings yet
AI Unit-5
Document53 pages
AI Unit-5
Jyoti Mishra
No ratings yet
Module 3 ML
Document19 pages
Module 3 ML
neha1831sewani
No ratings yet
PCA - Ensemble Classifiers
Document9 pages
PCA - Ensemble Classifiers
ritikagupta.3k
No ratings yet
Research Citation Notes
Document35 pages
Research Citation Notes
Web Best Wabii
No ratings yet
Machine Learning Qs
Document10 pages
Machine Learning Qs
onkarxo
No ratings yet
SML Updated UNIT-2
Document43 pages
SML Updated UNIT-2
22416
No ratings yet
ML UNIT IV PART I
Document11 pages
ML UNIT IV PART I
T.Ramakrishna JITS
No ratings yet
Why Use PCA
Document85 pages
Why Use PCA
Professor
No ratings yet
20 Questions On Feature Engineering and Eda
Document9 pages
20 Questions On Feature Engineering and Eda
rahul.guptaoct31
No ratings yet
Principal Component Analysis
Document6 pages
Principal Component Analysis
mrinmoyee.bhattacharya
No ratings yet
Principal Component Analysis
Document14 pages
Principal Component Analysis
amansonnii
No ratings yet
Summary PCA by Atta Mohammad 26040
Document2 pages
Summary PCA by Atta Mohammad 26040
Atta Mohammad
No ratings yet
Module-2 C3-C4
Document66 pages
Module-2 C3-C4
abhishek365ngp
No ratings yet
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION, DISCRIMINANT ANALYSIS and DECISION TREES: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: SUPPORT VECTOR MACHINE, LOGISTIC REGRESSION, DISCRIMINANT ANALYSIS and DECISION TREES: Examples with MATLAB
César Pérez López
No ratings yet
Pengaruh Motivasi Kerja Rizal
Document7 pages
Pengaruh Motivasi Kerja Rizal
Rizal Fadli
No ratings yet
Regression Analysis
Document2 pages
Regression Analysis
Jasmin Eisma
No ratings yet
Chda Exam Content Outline Final 11 2023
Document2 pages
Chda Exam Content Outline Final 11 2023
Narendra Bendi
No ratings yet
Master-of-Business-analytics Monash University
Document2 pages
Master-of-Business-analytics Monash University
Harshit Jain
No ratings yet
Forecast Project
Document17 pages
Forecast Project
rjhajharia1997
No ratings yet
Fast Publication Journals
Document10 pages
Fast Publication Journals
rikaseo rika
No ratings yet
8 Hypothesis Testing
Document103 pages
8 Hypothesis Testing
Ja9 Alyssa
No ratings yet
Making Sense of the Social World: Methods of Investigation Daniel F. Chambliss all chapter instant download
Document55 pages
Making Sense of the Social World: Methods of Investigation Daniel F. Chambliss all chapter instant download
cejassturnb5
100% (1)
AI in HRM
Document30 pages
AI in HRM
preethi
No ratings yet
w17 Presentation, Analysis and Interpretation of Data
Document5 pages
w17 Presentation, Analysis and Interpretation of Data
Sweet Potato
No ratings yet
Module 5 - Forecasting
Document13 pages
Module 5 - Forecasting
Ashima Aggarwal
No ratings yet
66 Job Interview Questions For Data Scientists
Document10 pages
66 Job Interview Questions For Data Scientists
Ravi Ranjan
No ratings yet
Unit 4 Clustering - K-Means and Hierarchical
Document40 pages
Unit 4 Clustering - K-Means and Hierarchical
animeshrajak649
No ratings yet
Data Visulization Report
Document21 pages
Data Visulization Report
Rushikesh Gaikhe
No ratings yet
Arima Slide Share
Document65 pages
Arima Slide Share
shrasti gupta
No ratings yet
Global Marketing Strategies For Indian Aluminium Products - A Study
Document16 pages
Global Marketing Strategies For Indian Aluminium Products - A Study
nitesh tekriwal
No ratings yet
The Success and Failures of Sari-Sari Stores: Exploring The Minds of Women Micro-Entrepreneurs
Document27 pages
The Success and Failures of Sari-Sari Stores: Exploring The Minds of Women Micro-Entrepreneurs
Jughead Jones
No ratings yet
ANIS MGT648 Ind Asgmnt c4
Document16 pages
ANIS MGT648 Ind Asgmnt c4
Ahliya Iman
No ratings yet
MFX Module 3 Properties of Time Series
Document76 pages
MFX Module 3 Properties of Time Series
Thanh Nguyen
No ratings yet
Practical Geostatistics For Resource Estimation: Last Update: July 2014
Document31 pages
Practical Geostatistics For Resource Estimation: Last Update: July 2014
EduardoHO
No ratings yet
Session 7-8 - Data Cleaning and Logistic Regression For Classification
Document30 pages
Session 7-8 - Data Cleaning and Logistic Regression For Classification
Shishir Gupta
No ratings yet
Big Data
Document6 pages
Big Data
Moona Awan
No ratings yet
Effect of Retrenchment Practices On Performance of Surviving Employees in State Corporations of Nakuru County
Document12 pages
Effect of Retrenchment Practices On Performance of Surviving Employees in State Corporations of Nakuru County
Manish Singh
No ratings yet
Week 3 - The SLRM (2) - Updated PDF
Document49 pages
Week 3 - The SLRM (2) - Updated PDF
Windyee Tan
No ratings yet
Anemia Code
Document33 pages
Anemia Code
sksharini67
No ratings yet
Scientific Wri. Midterm Exam Group 1 Topic A
Document14 pages
Scientific Wri. Midterm Exam Group 1 Topic A
Mai Quỳnh
No ratings yet
Binary Logistic Regression Lecture 9
Document33 pages
Binary Logistic Regression Lecture 9
Trongtin Lee
No ratings yet
Aksum University College of Social Sciences and Humanity Department of Civic and Ethical Studies
Document18 pages
Aksum University College of Social Sciences and Humanity Department of Civic and Ethical Studies
Yonatan Zelie
No ratings yet
MTE3105 Pengujian Hipotesis Khi Kuasa Dua 2
Document32 pages
MTE3105 Pengujian Hipotesis Khi Kuasa Dua 2
Sitherrai Paramananthan
No ratings yet