R - select only factor columns of dataframe

Question

I am trying to select only factor columns from my data frame. Example is below:

bank[,apply(bank[,names(bank)!="id"], is.factor)]

But the code behaves strangely. Step by step:

sapply(bank[,names(bank)!="id"], is.factor)

I get:

age         sex      region      income     married    children         car 
      FALSE        TRUE        TRUE       FALSE        TRUE       FALSE        TRUE 
   save_act current_act    mortgage         pep      ageBin 
       TRUE        TRUE        TRUE        TRUE        TRUE

Looks OK. Now, I assume that I just pass this matrix of TRUE/FALSE to the next step and get only the columns I need:

bank[,sapply(bank[,names(bank)!="id"], is.factor)]

But as result I get all the same columns as in original bank dataframe. Nothing is filtered out. I tried it one way or another but can't find a solution. Any advice on what I am doing wrong?

You should provide a [reproducible example](http://stackoverflow.com/questions/5963269/how-to-make-a-great-r-reproducible-example). That will make it easier to help you. The TUE/FALSE values should match up to all your columns. By excluding the "id" column, you are shifting everything. — MrFlick, Mar 31 '17 at 19:20

score 7 · Accepted Answer · answered Mar 31 '17 at 19:27

7

#DATA
df = mtcars
colnames(df) = gsub("mpg","id",colnames(df))
df$am = as.factor(df$am)
df$gear = as.factor(df$gear)
df$id = as.factor(df$id)

#Filter out 'id' after selecting factors
df[,sapply(df, is.factor) & colnames(df) != "id"]

answered Mar 31 '17 at 19:27

d.b

32,245
6
36
77

1

Thank you, makes sense. Didn't think that I need to apply 'not id' separately. – Maksim Khaitovich Mar 31 '17 at 19:32

score 4 · Answer 2 · answered Apr 01 '17 at 19:40

4

df = mtcars
colnames(df) = gsub("mpg","id",colnames(df))
df$am = as.factor(df$am)
df$gear = as.factor(df$gear)
df$id = as.factor(df$id)

library(dplyr)
df %>%  select_if(is.factor) %>% select(-id)

answered Apr 01 '17 at 19:40

MLEN

2,162
2
20
36

R - select only factor columns of dataframe

2 Answers2