Equal(?) functions not returning same values

Question

I was reading this post when I came across with a question.

Why (in the post's dataframe) this function doesn't return the same value

>df[df$X3==c(1,2),]

   X1    X2 X3
1  s1 45.11  1
4  s1 51.41  2
10 s1 43.12  2
17 s5 25.40  1

as this function?

>df[df$X3 %in% c(1,2),] 

   X1    X2 X3
1  s1 45.11  1
2  s1 45.13  1
3  s1 53.42  2
4  s1 51.41  2
9  s3 43.58  2
10 s1 43.12  2
17 s5 25.40  1
18 s5 25.50  1

I used to believe that both are kind of equal. What's the difference between them?

Thanks in advance.

It was solved, thank you very much – Cris Oct 11 '16 at 12:21 — Cris, Oct 11 '16 at 12:21

Zheyuan Li · Accepted Answer · 2016-10-11T04:04:08.033

3

df$X3 == c(1,2) is not doing what you think. c(1,2) is first recycled to have the same length as length(df$X3), then element-wise == is performed. Let's take a small example:

1:4 == 2:3  ## which is doing `1:4 == c(2,3,2,3)`
# [1] FALSE FALSE FALSE FALSE

and we get all FALSE. On the other hand, if we do

1:4 %in% 2:3
# [1] FALSE  TRUE  TRUE FALSE

we get two TRUE.

edited Oct 11 '16 at 04:04

answered Oct 11 '16 at 02:19

Zheyuan Li

71,365
17
180
248

Should that comment in the first code block be "which is doing `1:4 == ..." not 1:3? – mathematical.coffee Oct 11 '16 at 03:31

Equal(?) functions not returning same values

1 Answers1