How to output p-values for sci-kit learn regression models? Any other Python options?

Asked Jul 06 '16 at 07:56

Active Jul 06 '16 at 08:03

Viewed 5,535 times

I have a dataframe with 5 columns. I delete one column and define this as y, the dependent variable. I will use the other 4 columns x1, x2, x3, x4 or matrix X to predict this variable with some sort of regression modeling.

For example:

from sklearn import linear_model
clf = linear_model.LinearRegression()
clf.fit(X,y)

clf.coef_ will have the regression coefficients, i.e. clf.coef_ gives me the coefficient for each variable. My question is how do I find the P value for each variable?

Also, in comparing multiple sklearn models (here skearn.linear_model), how do I plot these to see the effects of linear regression vs lasso vs. ridge regression, etc.?

edited Jul 06 '16 at 08:03

asked Jul 06 '16 at 07:56

ShanZhengYang

16,511
49
132
234

3

Possible duplicate of [Find p-value (significance) in scikit-learn LinearRegression](http://stackoverflow.com/questions/27928275/find-p-value-significance-in-scikit-learn-linearregression) – piman314 Jul 06 '16 at 12:57

How to output p-values for sci-kit learn regression models? Any other Python options?

0 Answers0