Questions tagged [multicollinearity]
62 questions
5
votes
1 answer
Remove linear dependent variables while using the bife package
Some pre-programmed models automatically remove linear dependent variables in their regression output (e.g. lm()) in R. With the bife package, this does not seem to be possible. As stated in the package description in CRAN on page 5:
If bife does…

John Doe
- 212
- 1
- 9
- 28
3
votes
2 answers
Ordinal Logistic Regression in Python with rpy2 (Python interface for R): issue with collinear predictors
I am trying to perform an Ordinal Logistic Regression in Python calling R's mass.polr function with rpy2 (Python interface for the R language). However, I run into trouble when there are some collinear or almost collinear columns in my predictors:…

Elena Fortina
- 152
- 1
- 1
- 11
2
votes
0 answers
Multicollinearity on multiple dependent variables
Consider the following:
library(tidyverse)
library(car)
a <- c( 2, 3, 4, 5, 6, 7, 8, 9, 100, 11)
b <- c(5, 6, 7, 80, 9, 10, 11, 12, 13, 14)
c <- c(15, 16, 175, 18, 19, 20, 21, 22, 23, 24)
x <- c(17,18,50,15,64,15,3,5,6,9)
y <-…

Kgosi
- 33
- 5
2
votes
0 answers
How to overcome infinite value to perform multicollinearity test in environmental variables?
Here's the dataset which consists of X = 45 columns collected the data from bioclimate database.
The multicollinearity test model -
from statsmodels.stats.outliers_influence import variance_inflation_factor
vif_data =…

perth
- 21
- 3
2
votes
2 answers
Multicollinearity test with car::vif
I am trying to run a car::vif() test in R, to test for multicollinearity. However, when I run the code
reg.model1 <- log(Price2) ~ Detached.house + Semi.detached.house +
Attached.houses + Apartment +
Stock.apartment +…

whoknows
- 53
- 6
2
votes
2 answers
Convergence issue with multi linear regression problems
I'm try to solve a multi-linear-regression problem with a very simple linear network. The network only consists of a single dense layer as its output layer and the activation function is set to linear. I synthesize the output data Y by multiplying…

Kami
- 23
- 4
2
votes
1 answer
Calculating VIF for ordinal logistic regression & multicollinearity in R
I am running an ordinal regression model. I have 8 explanatory variables, 4 of them categorical ('0' or '1') , 4 of them continuous. Beforehand I want to be sure there's no multicollinearity, so I use the variance inflation factor (vif function…

Hugo Perrichon
- 43
- 1
- 6
1
vote
0 answers
Feature selection based on (automatic) multicollinearity detection
as part of an automatic feature selection pipeline, I am wanting to automatically remove features if they are highly correlated, based on its Variable Inflation Factor (VIF) result. Below is an example of df (although the data I use often have >200…

Danny Smith
- 13
- 2
1
vote
1 answer
How can I solve multicollinearity?
I constructed a linear model and tried to calculate the VIF of the variables but I get the following error:
vif(lm_model3101)
Error in vif.default(lm_model3101) :
there are aliased coefficients in the model
To check which numeric variables are…

Lucasjansens
- 11
- 3
1
vote
1 answer
How to interpret multicollinearity in a correlation plot?
I created a correlation plot for my dataset in R but I am not sure how to choose which of the following pairs of variables express multicollinearity? An explanation with examples would be really helpful!

Feverish123
- 105
- 1
- 7
1
vote
1 answer
Checking for multicollinearity using fixed effects model in R
Guten Tag from Germany community :)
I'm working with panel data and fixed effects (= FE) for both, time and firm.
I wanted to check my model for multicollinearity by using the variance inflation factor (= VIF), but R is giving me a warning message…

wrangjangler
- 53
- 6
1
vote
1 answer
How to simulate multi-collinearity using Sklearn?
I want to see what effect multi-collinearity has on a linear regression model but I need to be able to generate multi collinear data where I can vary the number of features and the collinearity between these features.
I've had a look at Sklearn's…

Dwo
- 79
- 7
1
vote
0 answers
Splitting a data into 70:30 but all the outliers only in the training samples using R
I am simulating data in R to check which models perform better when outliers and multicollinearity present simultaneously. For this, I split the data into 70:30 random split, but I need to introduce outliers and multicollinearity only in the 70…

jeza
- 299
- 1
- 4
- 21
1
vote
1 answer
VIF Test for Multiple Multivariate Regression
I would like to test for the Multicollinearity by using the VIF function. I have a model with 13 measures and 4 predictors, created like so:
M <- lm(cbind(S1, S2, S3, S4, S5, S6, S7, S8, S9, S10, S11, S12, S13) ~ x1 + x2 + x3 + x4, data =…

Christina
- 73
- 6
1
vote
2 answers
How to use VIF in r?
I am new in R and learning ml using caret. I was working on UCI bank marketing response data but used iris data here for reproducibility.
Issue is that I am getting error on running vif from car package on classification…

ViSa
- 1,563
- 8
- 30