Hello all,

I have a matrix X that contains industrial data, where each column is a variable.

Since PCA is done by the correlation matrix, should I use:

. original X;

. X after Centering (subtracting each column of X by the mean of the respective column);

.X after Standardizing? (subtracting each column of X by the mean and dividing by the standard deviation);

I think autoscaling is better, but I dont know if it is a must or just a good practice

Thanks for your time

I have a matrix X that contains industrial data, where each column is a variable.

Since PCA is done by the correlation matrix, should I use:

. original X;

. X after Centering (subtracting each column of X by the mean of the respective column);

.X after Standardizing? (subtracting each column of X by the mean and dividing by the standard deviation);

I think autoscaling is better, but I dont know if it is a must or just a good practice

Thanks for your time

Last edited: