I have 2 functions to standardize my data such as:
def standartChanger(dataFrame):
stdSc = preprocessing.StandardScaler()
cols = dataFrame.columns
dfscaled = stdSc.fit_transform(dataFrame)
dfscaled = pd.DataFrame(dfscaled, columns=cols)
return dfscaled, stdSc
def standartChangerwithMeanVar(dataFrame,stdSc):
cols = dataFrame.columns
dataFrame = stdSc.transform(dataFrame)
dfscaled = pd.DataFrame(dataFrame, columns=cols)
return dfscaled
One to standardize train and other for test. I have some dummy variables in my data-frame which I don't want to standardize. Now I need to modify these functions to not touch the dummy variables which are 0-1. How can I do that?
Also, in linear regression I have an issue that my dummy variables' coefficients are too large which creates senseless points in predictions. Do you have any idea for that?