I'm importing several datasets into R from a CSV file. The dataset contains 2500 observations in 16 columns. I'm trying to fit a regression with lm in R, but when I add a dummy variable for year effects or industry effects, the regression won't work.
This is how I create the dummy variable:
CNAME <- factor(Combined.data[6], levels = 1:20,
                labels = c("AUSTRIA", "BELGIUM", "DENMARK", "FINLAND", "FRANCE",
                           "GERMANY", "IRELAND", "ISLE OF MAN", "ITALY", "LUXEMBOURG",
                           "NETHERLANDS", "NORWAY", "POLAND", "PORTUGAL", "SPAIN",
                           "SWEDEN", "SWITZERLAND", "TURKEY", "UNITED KINGDOM",
                           "UNITED STATES"))
And this is what the regression call looks like:
results <- lm(Tax_Avoidance ~ ENVSCORE + CGVSCORE + SOCSCORE + ECNSCORE + Size +
                Leverage + ROA + MTB + RND + AUD + PPE + Intang + CDP +
                CHS + NET + CNAME,
              data = finalresults)
summary(results)
I can't see what I'm doing wrong. I'd appreciate any help.
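For anyone who wants to reproduce this without my data, here is a minimal sketch on made-up toy data. The data frame `toy`, its column `country_code`, and the three-country subset are assumptions for illustration only, not my real dataset. It compares building the factor from a single-bracket subset, as in my code, with building it from the column vector and storing it in the same data frame that is passed to lm. I'm not certain this is the cause of my problem, but it may be worth checking.

```r
# Toy data standing in for the real dataset; all names and values are made up.
set.seed(1)
toy <- data.frame(
  Tax_Avoidance = rnorm(60),
  Size          = rnorm(60),
  country_code  = rep(1:3, times = 20)
)
country_labels <- c("AUSTRIA", "BELGIUM", "DENMARK")

# Single brackets return a one-column data frame rather than a vector,
# so factor() does not get one value per row.
bad <- factor(toy[3], levels = 1:3, labels = country_labels)
length(bad)    # compare with nrow(toy)

# Double brackets (or toy$country_code) return the column itself.
# Storing the factor in the data frame keeps it aligned with the other variables.
toy$CNAME <- factor(toy[[3]], levels = 1:3, labels = country_labels)
length(toy$CNAME)

# lm() expands the factor into dummy variables, using the first level as baseline.
fit <- lm(Tax_Avoidance ~ Size + CNAME, data = toy)
names(coef(fit))
```

If the two lengths differ on the real data as well, the factor built with single brackets cannot line up with the rows used in the regression.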