Why does filtering data before PCA improve results?

Question

Morgan Facchin on 2 Aug 2022

0
Link

Direct link to this question

https://au.mathworks.com/matlabcentral/answers/1772560-why-does-filtering-data-before-pca-improve-results

Edited: Bruno Luong on 2 Aug 2022

I have a set of images that I want to discriminate using PCA. I noticed that applying a low-pass filtering (using filter2) to the images before feeding them into PCA greatly improves the results (it increases the relative amount of variance in the first PCs and corresponds more to what I expect). I then have the following more general question: why does filtering improve the results? I have two conflicting intuitions on this:

On the one hand, the performance is better simply because filtering reduces the noise in the images
On the other hand, filtering is only a linear transformation of the data, and the principal axes found by PCA should be "dragged" by this linear transformation and give the exact same results.

Would you have any clues to help me clarify this?

7 Comments
Show 5 older commentsHide 5 older comments

Matt J on 2 Aug 2022

Edited: Matt J on 2 Aug 2022

Open in MATLAB Online

The spatial filtering is linear, but I don't know why you think PCA is invariant to linear transformations of the observations. The following simplified example shows that it is not.

X=rand(7,5); X=X-mean(X);
[U,S,V]=svd(X,0); PCA1=U*S
PCA1 = 7×5
   -0.5159    0.3722   -0.3550    0.0126   -0.0581
    0.6617    0.6046    0.1841    0.0151   -0.0022
    0.2549   -0.3709   -0.0693    0.2474    0.0109
    0.2677   -0.2852   -0.2614    0.1047   -0.0047
   -0.4143   -0.1693    0.5058    0.0784   -0.0438
    0.1698   -0.2888   -0.0159   -0.4318   -0.0113
   -0.4240    0.1374    0.0118   -0.0264    0.1091
[U,S,V]=svd(X*rand(5),0); PCA2=U*S
PCA2 = 7×5
   -0.5649    0.4691   -0.1119    0.0754   -0.0108
    1.1273    0.1412   -0.2793   -0.0462   -0.0124
    0.2503   -0.4315   -0.0264    0.0420    0.0115
   -0.2608   -0.3301   -0.1545    0.0349    0.0043
    0.5185    0.0112    0.4618    0.0151   -0.0156
   -1.0065   -0.1389    0.0153   -0.0824   -0.0166
   -0.0639    0.2791    0.0950   -0.0387    0.0396

Bruno Luong on 2 Aug 2022

Convolution f*g is linear wrt f and wrt g.

Bruno Luong on 2 Aug 2022

Edited: Bruno Luong on 2 Aug 2022

Open in MATLAB Online

@Morgan Facchin

Let me try too understand your question, because I do this extremey simple code to feel how filtering improve PCA, and my conclusion is quite the opposite:

M=diag([1,100]);
x=randn(2,1e6);
y=M*x;
% PCA of Non filtered data
[U,S,V]=svd(y',0);
PCA=V(:,1);
if PCA(2)<0
    PCA=-PCA;
end
nfiltererror = norm(PCA-[0;1])
nfiltererror = 1.8027e-05
% PCA of filtered data
xf = mean(x,2);
yf = M*xf;
[Uf,Sf,Vf]=svd(yf',0);
PCAf=Vf(:,1);
if PCAf(2)<0
    PCAf=-PCAf;
end
filtererror = norm(PCAf-[0;1])
filtererror = 0.0279
if filtererror < nfiltererror
    fprintf('filter is better\n');
else
    fprintf('non-filter is better\n');
end
non-filter is better

So what do you observe? Can you make a MWE (example with 2 pixels?) to show it?

Sign in to comment.

Sign in to answer this question.

Answer 1

Matt J on 2 Aug 2022

0
Link

Direct link to this answer

https://au.mathworks.com/matlabcentral/answers/1772560-why-does-filtering-data-before-pca-improve-results#answer_1019725

Edited: Matt J on 2 Aug 2022

Open in MATLAB Online

PCA applied to the transformed cluster should find PC1 close to L', and therefore the projections of the images on L' should be the same as they were on L (withing a scaling factor)

That is true for a rotation, but for arbitrary linear transformations, it is not true when the dimension of L is greater than 1. We can recraft my example above to examine how the singular values change under an arbitrary transformation when L and L' are 2D:

X=rand(7,2); X=[X,X]; X=X-mean(X);
S1=svd(X,0)
S1 = 4×1
    1.4299
    1.0318
    0.0000
    0.0000
S2=svd(X*rand(4),0)
S2 = 4×1
    2.0355
    0.2737
    0.0000
    0.0000

Clearly also the change is more than just a global scaling,

S1./S2.*[1 1 0 0]'
ans = 4×1
    0.7025
    3.7701
         0
         0

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Why does filtering data before PCA improve results?

7 Comments
Show 5 older commentsHide 5 older comments

Answers (1)

0 Comments
Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

Why does filtering data before PCA improve results?

7 Comments Show 5 older commentsHide 5 older comments

Answers (1)

0 Comments Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

7 Comments
Show 5 older commentsHide 5 older comments

0 Comments
Show -2 older commentsHide -2 older comments