Map entries between two data.tables

Question

Suppose I have a data table A containing a set of entries and an index column that assigns a unique number to each row. I also have a data table B that contains entries of A, like so:

library(data.table)
set.seed(1)
A <- do.call(CJ, list(seq(3), seq(2), seq(2)))
A[,index := seq(nrow(A))]
B <- data.table(sample(3,3,replace=TRUE), sample(2,3,replace=TRUE),
                sample(2,3,replace=TRUE))

I want to define an index column for B that assigns each row to the corresponding index in A. What is the most efficient way to do this with data.table?

Thanks.

@akrun In my opinion, that's not an appropriate dupe. OP presumably wants to add a column to `B` by reference like `B[A, on=names(B), index := i.index ]`, not materialize an entirely new table. That's one of the big reasons folks use data.table. — Frank, Jan 18 '17 at 16:30
Frank's answer is exactly what I was looking for. I would accept it as an answer if it wasn't a comment. — user3294195, Jan 18 '17 at 16:35

score 3 · Accepted Answer · edited Sep 23 '17 at 06:17

3

To add a column from A to B based on their matching rows:

B[A, on=names(B), index := i.index ]

The main docs are at ?data.table

edited Sep 23 '17 at 06:17

Graham

7,431
18
59
84

answered Jan 18 '17 at 16:37

Frank

66,179
8
96
180

score 1 · Answer 2 · answered Jan 18 '17 at 16:23

1

I think you need a join:

A[B, on = c("V1", "V2", "V3")]

#   V1 V2 V3 index
#1:  1  2  2     4
#2:  2  1  2     6
#3:  2  2  2     8

answered Jan 18 '17 at 16:23

Psidom

209,562
33
339
356

Map entries between two data.tables

2 Answers2