How to extract certain columns from a list of data frames

Question

I have a list 'l' of data frames. These data frames in itself are 2-dimensional matrices. For my work, I'm required to create another list which has data frames which are a subset of the data frames from the original list.

Eg: List l1 has two data frames D1 and D2, having 10 and 12 different columns of data respectively. Now I want to create a new list l2 which also has two data frames but these data frames are columns picked out from the earlier data frames D1 and D2. Please consider that the position of the same column in D1 and D2 could be different, therefore I would have to access it through column name and not index

Could someone please suggest how I could go about implementing this?

`lapply(l, )`. If you want more specific code, you need to provide a more specific description of D3 and D4 than "basically subsets of D1 and D2". — Gregor Thomas, Nov 22 '17 at 20:32
If you want the rows 1:5 and the columns 2 and 3, you could do `lapply(l, "[", 1:5, 2:3)`, but if you have conditions or something an example would go a long way. — Gregor Thomas, Nov 22 '17 at 20:34
Put it in your question! Make your question "How would I extract columns named `"X"` and `"MyFavoriteColumn"`?" or "How would I extract the 2nd, 4th, and 321st column?" or something like that. — Gregor Thomas, Nov 22 '17 at 20:35
Its easier to help you if you provide a [reproducible example](https://stackoverflow.com/questions/5963269/how-to-make-a-great-r-reproducible-example) with sample input data and the desired output data. That way possible solutions can be tested and verified. — MrFlick, Nov 22 '17 at 20:44

score 26 · Answer 1 · answered Nov 22 '17 at 20:52

Here's an example (this is the kind of thing you should have put in your question. You will get near-instantaneous help if you can structure your question with a clear, copy/pasteable, reproducible example like this.)

Problem:

# list of data frames:
l = list(mtcars, mtcars)

# vector of column names I would like to extract
my_names = c("mpg", "wt", "am")
# these columns might be at different positions in the data frames

Solution:

result = lapply(l, "[", , my_names)

# look at the top 6 rows of each to verify that it worked:
lapply(result, head)
# [[1]]
#                    mpg    wt am
# Mazda RX4         21.0 2.620  1
# Mazda RX4 Wag     21.0 2.875  1
# Datsun 710        22.8 2.320  1
# Hornet 4 Drive    21.4 3.215  0
# Hornet Sportabout 18.7 3.440  0
# Valiant           18.1 3.460  0
#
# [[2]]
#                    mpg    wt am
# Mazda RX4         21.0 2.620  1
# Mazda RX4 Wag     21.0 2.875  1
# Datsun 710        22.8 2.320  1
# Hornet 4 Drive    21.4 3.215  0
# Hornet Sportabout 18.7 3.440  0
# Valiant           18.1 3.460  0

Explanation: You essentially want to do l[[1]][, my_names], l[[2]][, my_names], ... lapply applies a function to every list element. In this case, the function is [, which takes rows as its first argument (we leave it blank to indicate all rows), columns as its second argument (we give it my_names). It returns the results in a list.

I don't quite understand the logic behind `[` but I am really thankful for this answer. — Martin, Jan 19 '22 at 14:58

score 5 · Answer 2 · answered Nov 22 '17 at 21:56

5

You can use dplyr, it is nice, easy and the syntax is clear:

    library(dplyr)
    l <- list(mtcars, mtcars) # the list of 2 df
    new_list <- lapply(l, function(x) x%>% select(mpg,wt,am))

Ciao!

answered Nov 22 '17 at 21:56

theLudo

127
4

4

I get " Error: `select()` doesn't handle lists." ? – Sebastian Hesse Jan 07 '21 at 14:23

score 1 · Answer 3 · answered Mar 06 '23 at 14:21

1

A purrr solution:

library(purrr)
library(dplyr)
map(l, ~ .x |> select(all_of(my_names)))

answered Mar 06 '23 at 14:21

Julian

6,586
2
9
33

score 0 · Answer 4 · answered Jul 04 '19 at 23:56

0

I had a list of 21 columns and out of which I wanted to extact and create a separate list with columns from 1 to 7, 11 and 21. This is what worked for me.

mydata <- read.csv("data.csv")
newdatalist <- data[c(1:7, 11, 21)]

answered Jul 04 '19 at 23:56

Sujoy

802
11
22

How to extract certain columns from a list of data frames

4 Answers4

Linked

Related