Subset with [ ] or subset( ) in R

Question

I was working with a dataframe similar to this one, called DB_reduced:

I was expected to get a similar result with this two codes:

DB_reduced[DB_reduced$sex == "f", 2] # first line
# or
subset(DB_reduced, DB_reduced$Sexo == "f", select = 2, drop = TRUE) # second line

but rather than just finish with the same dataframe, the first returns:

sex  BLT
f    NA
f    3
f    4
NA   3.4
f    3.4
NA   3.5

and the second:

sex  BLT
f    NA
f    3
f    4
f    3.4

Why the difference? I thought that both codes worked in tha same way. How can I modify the first line to obtain the same result as the second?

Thanks all!

You can change the first line to `DB_reduced[DB_reduced$sex == "f" & !is.na(DB_reduced$sex), 2]` — MrFlick, Jun 21 '22 at 20:59

score 1 · Answer 1 · answered Jun 21 '22 at 20:56

1

The documentation for ?subset specifies the following:

subset  logical expression indicating elements or rows to keep: missing values are taken as false.

so it drops NAs by default. You can get the same result using [ by adding & !is.na(DB_reduced$sex) as noted in the comments.

answered Jun 21 '22 at 20:56

joran

Yes, but what can I do with the first line (DB_reduced[DB_reduced$sex == "f", 2] ) to also drop NAs? – MJRC Jun 21 '22 at 21:00
@MJRC See my edit and MrFlick's comment under your question. – joran Jun 21 '22 at 21:01

1 Answers1