R Counting duplicate values and adding them to separate vectors

Question

x <- c(1,1,1,2,3,3,4,4,4,5,6,6,6,6,6,7,7,8,8,8,8)
y <- c('A','A','C','A','B','B','A','C','C','B','A','A','C','C','B','A','C','A','A','A','B')
X <- data.frame(x,y)

Above I have a data frame where I want to identify the duplicates in vector x, while counting the number of duplicate instances for both (x,y).... For example I have found that ddply and this post here is similar to what I am looking for (Find how many times duplicated rows repeat in R data frame).

library(ddply)
ddply(X,.(x,y), nrow)

This counts the number of instances 1 - A occurs which is 2 times... However I am looking for R to return the unique identifier in vector x with the counted number of times that x matches in column y (getting rid of vector y if necessary), like below..

Any help will be appreciated, thanks

score 7 · Answer 1 · answered Mar 20 '14 at 16:00

7

You just need the table function :)

answered Mar 20 '14 at 16:00

Julien Navarre

7,653
3
42
69

Arun · Accepted Answer · 2014-03-20T16:05:38.277

3

This is fairly straightforward by casting your data.frame.

require(reshape2)
dcast(X, x ~ y, fun.aggregate=length)

Or if you'd want things to be faster (say working on large data), then you can use the newly implemented dcast.data.table function from data.table package:

require(data.table) ## >= 1.9.0
setDT(X)            ## convert data.frame to data.table by reference
dcast.data.table(X, x ~ y, fun.aggregate=length)

Both result in:

edited Mar 20 '14 at 16:05

answered Mar 20 '14 at 16:00

Arun

116,683
26
284
387

Thanks for your help! Both answers work! – boothtp Mar 20 '14 at 16:21

R Counting duplicate values and adding them to separate vectors

2 Answers2