Get column index from data frame that matches numeric vector?

Question

Very similar questions have been asked here, here, and here. However, they all seem to rely on knowing the column names of the data.

I am trying to get the column index of a data frame that matches a numeric vector. For example, if I have some data and a vector like so:

dat <- data.frame(
  x = c(1,2,3,4,5),
  y = c(10,9,8,7,6),
  z = c(2,4,6,8,10)
)

testVec <- c(2,4,6,8,10)

I would like to return the index of the column of dat that matches testVec. Here dat$z matches testVec, so in this case the result should be 3.

Any suggestions as to how I could do this?

No, in this case, there would be no match. The order of my `testVec` will always match the order of some column in `dat` — Electrino, Feb 17 '22 at 14:14

benson23 · Accepted Answer · 2022-02-17T14:15:53.350

4

Here's a base R approach: compare every column of dat with testVec using identical(), then use which() to return the index of the column that matches.

which(sapply(seq_len(ncol(dat)), function(x) identical(dat[[x]], testVec)))
[1] 3

UPDATE: @nicola suggested a cleaner syntax than my original code (see the comment under this answer):

which(sapply(dat, identical, y = testVec))
z 
3
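One caveat worth keeping in mind: identical() also compares storage type, so an integer column will not match a double vector even when the values are equal. The sketch below illustrates this with a hypothetical data frame dat_int, and shows one possible tolerant alternative built on isTRUE(all.equal(...)). The names dat_int and matches_loosely are made up for illustration and are not part of the original answer.

```r
# Hypothetical data: column z is stored as integer, not double
dat_int <- data.frame(
  x = c(1, 2, 3, 4, 5),
  z = c(2L, 4L, 6L, 8L, 10L)
)
testVec <- c(2, 4, 6, 8, 10)

# identical() is strict about type, so no column matches here
strict <- which(sapply(dat_int, identical, y = testVec))

# all.equal() compares numeric values with a tolerance and does not
# distinguish integer from double storage
matches_loosely <- function(col, target) isTRUE(all.equal(col, target))
loose <- which(sapply(dat_int, matches_loosely, target = testVec))

print(strict)  # empty result
print(loose)   # position of z within dat_int
```

Whether the strict or the tolerant comparison is appropriate depends on how the data were created; for columns read from a file, the tolerant version may be the safer default.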

edited Feb 17 '22 at 14:15

answered Feb 17 '22 at 14:07


4

Just `which(sapply(dat, identical, y = testVec))` is cleaner I guess. – nicola Feb 17 '22 at 14:11
You're right! I'm too used to using custom function, I forget we can do this. Great tips! – benson23 Feb 17 '22 at 14:14

ThomasIsCoding · Answer 2 · 2022-02-17T14:30:20.343

2

You could perhaps try this:

which(colSums(dat == testVec) == nrow(dat))
z
3
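To see why this works, it can help to look at the intermediate results. The sketch below reuses dat and testVec from the question and assumes that testVec has exactly nrow(dat) elements and that the columns contain no NA values; the variable names eq, n_match and idx are introduced here only for illustration.

```r
dat <- data.frame(
  x = c(1, 2, 3, 4, 5),
  y = c(10, 9, 8, 7, 6),
  z = c(2, 4, 6, 8, 10)
)
testVec <- c(2, 4, 6, 8, 10)

# Element-wise comparison: testVec is compared against each column in turn
# (this relies on length(testVec) == nrow(dat))
eq <- dat == testVec

# Number of matching elements per column
n_match <- colSums(eq)

# A column matches fully when every one of its rows agrees
idx <- which(n_match == nrow(dat))

print(n_match)
print(idx)
```

If the columns may contain NA values, colSums() returns NA for those columns, so an explicit NA policy would be needed before relying on this approach.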

edited Feb 17 '22 at 14:30

answered Feb 17 '22 at 14:09


score 1 · Answer 3 · answered Feb 17 '22 at 15:47

An option with select from dplyr

library(dplyr)
dat %>%
   select(where(~ all(testVec == .x))) %>% 
   names %>% 
   match(names(dat))
[1] 3

akrun


score 1 · Answer 4 · answered Feb 17 '22 at 16:49

Subtract testVec from each column and keep the column whose absolute differences sum to zero.

which(colSums(abs(dat - testVec)) == 0)
# z 
# 3

Without name:

unname(which(colSums(abs(dat - testVec)) == 0))
# [1] 3
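A signed sum of differences can be zero even when a column differs from testVec, because positive and negative differences cancel each other out. The sketch below illustrates this with a hypothetical data frame dat_rev, whose column w holds the values of testVec in reverse order, and compares the signed sum with a sum of absolute differences. The names dat_rev, signed and absolute are assumptions made for this example.

```r
testVec <- c(2, 4, 6, 8, 10)

# Hypothetical data: w holds the same values as testVec in reverse order
dat_rev <- data.frame(
  w = c(10, 8, 6, 4, 2),
  z = c(2, 4, 6, 8, 10)
)

# Signed differences cancel for w, so both columns are reported
signed <- which(colSums(dat_rev - testVec) == 0)

# Absolute differences sum to zero only for an exact match
absolute <- which(colSums(abs(dat_rev - testVec)) == 0)

print(signed)
print(absolute)
```

Since the question states that the order of testVec always matches the order of some column, a reordered column such as w should count as a non-match, which is what the absolute-difference version reports.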

Data:

dat <- structure(list(x = c(1, 2, 3, 4, 5), y = c(10, 9, 8, 7, 6), z = c(2, 
4, 6, 8, 10)), class = "data.frame", row.names = c(NA, -5L))
testVec <- c(2, 4, 6, 8, 10)
