Filtering out duplicate combinations in a data frame

Asked Feb 26 '16 at 00:15

I have a data frame that looks as follows:

df <- data.frame(x = c('a', 'b', 'c', 'd'), 
                 y = c('b', 'a', 'c', 'c'), 
                 z = c(1, 1, 1, 4))

I would like to identify duplicate cases/rows. It seemed like a simple task that I should be able to tackle via duplicated() or unique(). Not so much, unless I'm missing something obvious here.

I would like to return a data frame where case 1 (a-b) and case 2 (b-a) are recognized as the same. In other words, the result should be:

x y z
a b 1
c c 1
d c 4

I don't care which case, 1 or 2, gets returned, so long as there is only one.

Any help would be much appreciated!
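
For what it's worth, here is a minimal sketch of one way this could be done in base R. It assumes that only x and y decide whether two rows count as the same combination (z is ignored), and that x and y are character columns rather than factors, which is why stringsAsFactors = FALSE is set explicitly. The names key and result are just illustrative.

```r
df <- data.frame(x = c('a', 'b', 'c', 'd'),
                 y = c('b', 'a', 'c', 'c'),
                 z = c(1, 1, 1, 4),
                 stringsAsFactors = FALSE)

# Build an order-independent key per row: the smaller of (x, y) first,
# then the larger, so a-b and b-a both become ("a", "b").
key <- data.frame(lo = pmin(df$x, df$y),
                  hi = pmax(df$x, df$y))

# duplicated() marks rows whose key has already appeared further up;
# dropping those keeps the first row of each combination.
result <- df[!duplicated(key), ]
print(result)
```

On the example data this keeps rows 1, 3 and 4 (a-b, c-c, d-c). Since the first occurrence wins, it is a-b rather than b-a that survives, which should be fine given that either one is acceptable.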

asked Feb 26 '16 at 00:15

user3179350

Plenty of results via google when searching for "R a-b b-a duplicate" – thelatemail Feb 26 '16 at 00:18

Maybe - http://stackoverflow.com/questions/25297812/pair-wise-duplicate-removal-from-dataframe might be a better duplicate if this doesn't suit you. – thelatemail Feb 26 '16 at 00:20
