Principal component analysis

Feb 14, 2023

—

in Introduction to Multivariate Analysis

Principal component analysis (PCA) is a data reduction method. Technically, we take a vector of random variables , and we transform it to another vector , by a linear transformation represented by a square matrix . In more detail we have

These equations should not be confused with regression equations. The transformed Z_i variables are not observed and used in a fitting procedure; indeed, there is no error term. They are just transformations of the original variables, which are not classified as dependent or independent. Hence, PCA is an interdependence technique, aimed at metric data, and used for exploratory purposes. In Section 17.2 we show that, by taking suitable combinations, we may find a small subset of Z_i variables, the principal components, that explain most of the variability in the original variables X_i. By disregarding the less relevant components, we reduce data dimensionality without losing a significant portion of information.

Principal component analysis

Comments

Leave a Reply Cancel reply