I have a dataset containing 8 Parameters (4 Continuous 4 Categorical) and I am trying to eliminate features as per RFEC class in Scikit.
This is the formula I am using:
svc = SVC(kernel="linear")
rfecv = RFECV(estimator=svc, step=1, cv=StratifiedKFold(y, 2),
scoring='accuracy')
rfecv.fit(X, y)
As I have categorical data also, I changed it to the Dummy Variable using dmatrics (Patsy).
I want to try different Classification models on the data after feature selection to improve model along with SVC.
I ran RFE after transforming data and I think I am doing wrong.
Do we run the RFECV before transforming the Categorical data or after?
I can't find any clear indication in any document.