R program questions

Question

I am trying to get some unique combinations of two variables.
For each value of x, I would like to have this unique y value, and drop those have several y values. But several x values could share same y value.
For example,
a=data.frame(x=c(1,1,2,4,5,5),y=c(2,3,3,3,6,6)),
and I would like to get the output like:

b=data.frame(x=c(2,4,5),y=c(3,3,6))

I have tried unique(), but it does not help this situation. Thank you!

Gregor Thomas · Accepted Answer · 2018-11-14T19:29:51.137

First we use unique to omit repeated rows with the same x and y values (keeping only one copy of each). Any repeated x values that are left have different y values, so we want to get rid of them. We use the standard way to remove all copies of any duplicated values as in this R-FAQ.

a=data.frame(x=c(1,1,2,4,5,5),y=c(2,3,3,3,6,6))
b = unique(a) 
b = b[!duplicated(b$x) & !duplicated(b$x, fromLast = TRUE), ]
b
#   x y
# 3 2 3
# 4 4 3
# 5 5 6

Fans of dplyr would probably do it like this, producing the same result.

library(dplyr)
a %>% 
  group_by(x) %>%
  filter(n_distinct(y) == 1) %>%
  distinct

amanda · Answer 2 · 2018-11-14T20:34:02.373

1

Using dplyr:

library(dplyr)

a <- data.frame(x=c(1,1,2,4,5,5),y=c(2,3,3,3,6,6))

a %>% 
  distinct() %>% 
  add_count(x) %>%   # adds an implicit group_by(x)
  filter(n == 1) %>% 
  select(-n)
#> # A tibble: 3 x 2
#> # Groups:   x [3]
#>       x     y
#>   <dbl> <dbl>
#> 1     2     3
#> 2     4     3
#> 3     5     6

^{Created on 2018-11-14 by the reprex package (v0.2.1)}

edited Nov 14 '18 at 20:34

answered Nov 14 '18 at 19:28

amanda

321
5
12

R program questions

2 Answers2