How to extract information of a vector in a data frame corresponding to each unique value of another vector in the same data frame?

Question

Suppose I have the following data frame data-

Now I want to extract information of each level, i.e. (A,B,C,D & E) of V1. As an example, if I choose to see the sum of different levels in V2 for each level of V1, what should be the code? The output I want is-

I tried lapply and sapply but they are not giving the information I want. Of course I tried sapply(data,unique) which made no sense.

Also, in advance (may be a bit trickier), if I want to see the values in V2 which are unique in all the levels of V1,how to do it? Thanks !!

Can you show the expected output as it is not clear. Do you need `library(data.table);setDT(data)[, if(uniqueN(V1)>1) .SD ,.(V2)]` — akrun, Jul 08 '16 at 04:27
Do you just want `unique(data)` ? Or maybe this is helpful - http://stackoverflow.com/questions/18201074/find-how-many-times-duplicated-rows-repeat-in-r-data-frame/18201245 ? — thelatemail, Jul 08 '16 at 04:52
@thelatemail, actually the link you gave is not exactly what I want. I want how many values each of A,B & C has and what values are common in them. — madmathguy, Jul 08 '16 at 05:14

score 3 · Answer 1 · answered Jul 08 '16 at 05:18

3

I think this is what you want, in that it will find unique values which are common across different groups:

Common V2 values in each level of V1

Reduce(intersect, split(dat$V2, dat$V1))
#[1] 3 2

Common V1 values in each level of V2

Reduce(intersect, split(dat$V1, dat$V2))
#[1] "C"

answered Jul 08 '16 at 05:18

thelatemail

91,185
12
128
188

I am sorry I am late. But it worked ! Thanks @akrun for helping as well.. – madmathguy Jul 09 '16 at 18:21
Also thanks @thelatemail – madmathguy Jul 09 '16 at 18:21

score 1 · Answer 2 · answered Jul 08 '16 at 05:32

Using data.table, we can find the unique values in 'V2' that are common across 'V1'.

library(data.table)
setDT(data)[,uniqueN(V1)==uniqueN(data$V1) , by = V2][(V1)]$V2
#[1] 3 2

and the common 'V1' in each unique element of 'V2'

setDT(data)[, if(uniqueN(V1)==1) .SD , by = V2]$V1
#[1] "C"

user2100721 · Answer 3 · 2016-07-08T06:38:30.343

0

Maybe this is helpful

output <- aggregate(data=df,V2~.,FUN=paste)

For extraction of common values in V2 presented all the levels of V1 use this

Reduce(intersect,output$V2)

edited Jul 08 '16 at 06:38

answered Jul 08 '16 at 04:42

user2100721

3,557
2
20
29

Thanks @user2100721 , it is almost what I want. The only thing is it is giving output in list when I am trying to extract the `V2` values and using the `unlist` function is producing one single vector. – madmathguy Jul 08 '16 at 05:05
Yes, that is perfectly fine. For the 2nd part, I want to extract the unique values which are present in all levels. In this case A:3,2,1,B:2,3,C:4,3,1,2. So the unique values will be 2 & 3. How to get them? – madmathguy Jul 08 '16 at 05:25
Okay, let me illustrate with the example I provided. Form the first part of my query, I want to show A takes values 3,2,1. B takes values 2,3 and C takes values 4,3,1,2. For the 2nd part, we can see that the values 2 & 3 are unique in A,B & C. I want the output as 2 & 3. – madmathguy Jul 08 '16 at 05:44

How to extract information of a vector in a data frame corresponding to each unique value of another vector in the same data frame?

3 Answers3