Terminology:
Component: PC
loading-score[i,j]
: the j
feature in PC[i]
Question:
I know the question regarding feature selection is asked several times here at StackOverflow (SO) and on other tech-pages, and it proposes different answers/discussion. That is why I want to open a discussion for the different solutions, and not post it as a general question since that has been done.
Different methods are proposed for feature selection using PCA: For instance using the dot product between the original features and the components (here) to get their correlation, a discussion at SO here suggests that you can only talk about important features as loading-scores in a component (and not use that importance in the input space), and another discussion at SO (which I cannot find at the moment) suggest that the importance for feature[j]
would be abs(sum(loading_score[:,j])
i.e the sum of the absolute value of loading_score[i,j]
for all i
components.
I personally would think that a way to get the importance of a feature would be an absolute sum where each loading_score[i,j]
is weighted by the explained variance of component i
i.e
imp_feature[j]=sum_i (abs(loading_score[i,j])*explained_variance[i]
.