I am trying to do PCA for dimension reduction in WEKA (Classification Problem).
I have 200 attributes in my data and close to 2100 rows.
Here are the steps that i follow
Import csv file in WEKA explorer
In preprocess tab, apply, Normalize data (To bring entire data in range of [0,1]
Then implement PCA.
- In options for PCA, there is an option for centerData which if set to False, would calculate using correlation matrix after standardizing data (Correct me if i am wrong) and if set to true would using covariance matrix.
My doubt is
- Should i be normalizing data before implementing PCA or not? I tried doing it before and after normalizing i am getting different results. So i am confused.
- Should i Standardize data (bring mean to 0) and then apply PCA.
What is the option that i should select in PCA WEKA for centerData option in either case?