DataFrame transformation in R

Question

I have next DataFrame in R:

col1 col2 col3
a    W    1
a    Q    1
b    T    2
b    W    3
b    Q    1
b    Z    2
c    T    3
c    Z    1
....

I want to transform it in the next Data Frame

col1 T W Q Z
a    0 1 1 0
b    2 3 1 2
c    3 0 0 1
...

What is the most efficient way to do it in R?

bgoldst · Accepted Answer · 2016-07-03T08:53:00.077

2

reshape(df,dir='w',idvar='col1',timevar='col2');
##   col1 col3.W col3.Q col3.T col3.Z
## 1    a      1      1     NA     NA
## 3    b      3      1      2      2
## 7    c     NA     NA      3      1

If we want to match the expected output exactly (except for column order which doesn't appear to have a pattern AFAICT):

res <- reshape(df,dir='w',idvar='col1',timevar='col2');
names(res)[-1L] <- sub('.*\\.','',names(res)[-1L]);
res[is.na(res)] <- 0L;
rownames(res) <- NULL;
res;
##   col1 W Q T Z
## 1    a 1 1 0 0
## 2    b 3 1 2 2
## 3    c 0 0 3 1

edited Jul 03 '16 at 08:53

answered Jul 03 '16 at 08:49

bgoldst

34,190
6
38
64

@ bgoldist, thank you very much. – Guforu Jul 03 '16 at 08:51

score 2 · Answer 2 · answered Jul 03 '16 at 09:07

2

We can use dcast from data.table to convert to 'wide' format.

library(data.table)
dcast(setDT(df1), col1~col2, value.var='col3', fill = 0)
#   col1 Q T W Z
#1:    a 1 0 1 0
#2:    b 1 2 3 2
#3:    c 0 3 0 1

Or another option is spread

library(tidyr)
spread(df1, col2, col3, fill=0)    
#  col1 Q T W Z
#1    a 1 0 1 0
#2    b 1 2 3 2
#3    c 0 3 0 1

answered Jul 03 '16 at 09:07

akrun

874,273
37
540
662

why did you reopen? it is a clear dupe imo – Jaap Jul 03 '16 at 09:14
the question was about efficient transform so i thought if the other answer get selected it may confuse readers – akrun Jul 03 '16 at 09:15
5

just leaving a comment telling OP to use `data.table` or `tidyr` would have been enough, still no need to reopen imo – Jaap Jul 03 '16 at 09:19
that is a good option, didn't thought about it – akrun Jul 03 '16 at 09:20
could you close it again then? – Jaap Jul 03 '16 at 09:21

DataFrame transformation in R

2 Answers2