read multiple ENVI files and combine them in one csv

Question

I'm fairly new to R but trying to get this done. I have dozens of ENVI spectral datasets stored in a directory. Each dataset is separated into two files. They all follow the same naming convention, i.e.:

ID_YYYYMMDD_350-2500nm.esl
ID_YYYYMMDD_350-2500nm.hdr

The task is to read the dataset, add two columns (ID and date from filename), and store the results in a *.csv-file. I got this to work for a single file (hardcoded).

library(caTools)

setwd("D:/some/path/software_scripts")

### filename without extension
name <- "011a_20100509_350-2500nm"

### split filename in area-id and date
flaeche<-substr(name, 0, 4)
date <- as.Date((substr(name,6,13)),"%Y%m%d")

### get values from ENVI-file in a matrix
spectrum <- read.ENVI(paste(name,".esl", sep = ""), headerfile=paste(name,".hdr", sep=""))

### add columns
spectrum <- cbind(Flaeche=flaeche,Datum=as.character(date),spectrum)


### CSV-Dataset with all values
write.csv(spectrum, file = paste(name, ".csv", sep = ""))

I want to combine all available files into one *.csv file. I know that I have to use list.files, but I have no idea how to apply the read.ENVI function to each file and append the resulting matrices to a single CSV.

Update:

library(caTools)

setwd("D:/some/path/mean")

files <- list.files() # change or leave totally empty if setwd() put you in the right spot

all_names <- sub("^([^.]*).*", "\\1", files) # strip off extensions

name <- unique(all_names) # get rid of duplicates from .esl and .hdr

# wrap your existing code in a function
mungeENVI <- function(name) {

  # split filename in area-id and date
  flaeche<-substr(name, 0, 4)
  date <- as.Date((substr(name,6,13)),"%Y%m%d")

  # get values from ENVI-file in a matrix
  spectrum <- read.ENVI(paste(name,".esl", sep = ""), headerfile=paste(name,".hdr", sep=""))

  # add columns
  spectrum <- cbind(Flaeche=flaeche,Datum=as.character(date),spectrum)
  return(spectrum)
}

# use lapply to 'loop' over each name
list_of_ENVIs <- lapply(name, mungeENVI) # returns a list

# use do.call(rbind, x) to turn it into a big data.frame
final_df <- do.call(rbind, list_of_ENVIs)

# now write output
write.csv(final_df, "all_results.csv")

You can find a sample dataset here: Sample dataset

You need to get all of your files into one large data frame, something like this: `lapply(list.files(dir), read.ENVI) %>% do.call(rbind,.)` – Nate Oct 06 '16 at 15:02
Thanks for your answer, but it's still a bit too cryptic to me. – dan_ke Oct 06 '16 at 19:18
No worries mate, give me a couple of minutes and I'll upload a more detailed answer for you. – Nate Oct 06 '16 at 20:31

Nate · Accepted Answer · 2016-10-10T15:36:15.820

I work with a lot of lab data where I can count on the output files having a consistent format (same column order, column names, header format, etc.). So this assumes that your ENVI files are similarly consistent. If they are not, I'm happy to help with that too; I'd just need to see a dummy file or two.

Anyway, here's the idea:

library(caTools)
library(lubridate)
library(magrittr)

setwd("~/Binfo/TST/Stack/") # adjust as needed

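# list only the ENVI data/header files, so a previously written CSV is not picked up
files <- list.files("data/", pattern = "\\.(esl|hdr)$", full.names = TRUE) # adjust as needed
all_names <- gsub("\\.\\D{3}$", "", files) # strip off extensions
names1 <- unique(all_names) # get rid of duplicates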

# wrap your existing code in a function
mungeENVI <- function(name) {
    # split filename in area-id and date
    f <- gsub(".*\\/(\\d{3}\\D)_.*", "\\1", name)
    d <- gsub(".*_(\\d+)_.*", "\\1", name) %>% ymd()
    # get values from ENVI-file in a matrix
    spectrum <- read.ENVI(paste(name,".esl", sep = ""), headerfile=paste(name,".hdr", sep=""))
    # add columns
    spectrum <- cbind(Flaeche=f,Datum= as.character(d),spectrum)
    return(spectrum)
}
# use lapply to 'loop' over each name
list_of_ENVIs <- lapply(names1, mungeENVI) # returns a list

# use do.call(rbind, x) to stack them into one big matrix (cbind() above returns a matrix, not a data.frame)
final_df <- do.call(rbind, list_of_ENVIs)
# now write output
write.csv(final_df, "data/all_results.csv")

Let me know if you have any problems and we can go from there. Cheers.
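To make the lapply plus do.call(rbind, ...) pattern easier to see in isolation, here is a minimal sketch that runs without any ENVI files. `fake_read` is a hypothetical stand-in for read.ENVI and its values are invented; the two dataset names are made up to follow the naming convention from the question. It also builds each piece with data.frame() instead of cbind(), which is a swapped-in choice on my part: cbind() with a character vector coerces the whole matrix to character, while data.frame() keeps the spectral columns numeric.

```r
# Hypothetical stand-in for read.ENVI(): returns a small numeric matrix
fake_read <- function(name) {
  matrix(seq_len(6), nrow = 2, dimnames = list(NULL, c("b1", "b2", "b3")))
}

# Same idea as mungeENVI(), but returning a data.frame per dataset
munge <- function(name) {
  spectrum <- fake_read(name)
  data.frame(Flaeche = substr(name, 1, 4),
             Datum = as.Date(substr(name, 6, 13), "%Y%m%d"),
             spectrum)
}

# Made-up dataset names following the ID_YYYYMMDD_... convention
names1 <- c("011a_20100509_350-2500nm", "012b_20100510_350-2500nm")

# lapply() returns one data.frame per name; rbind stacks them row-wise
combined <- do.call(rbind, lapply(names1, munge))

str(combined)
```

With real data, swapping `fake_read` back to the read.ENVI call should be the only change needed, provided every dataset has the same number of bands.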

I edited my answer a bit. I think the problem you were hitting is in list.files(): it should have had the argument full.names = TRUE, so that the returned paths include the directory. I also adjusted your parsing method to be a little more defensive and to use regex capture groups. I tested the code with your two example datasets (four files, really), and it builds a large matrix (66743 elements). I also used lubridate, which I think is a better way to work with dates and times.
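As a rough illustration of those two points, the sketch below uses only base R and a throwaway directory of empty files, so nothing in it touches real ENVI data. The directory name `envi_demo` and the file names are assumptions made up for the demo, and as.Date() is used in place of lubridate so that no extra package is needed. It lists the .esl files with full paths, checks that each one has a matching .hdr (a missing header is what produces the "Could not open input header file" error), and pulls the area ID and date out of the bare file name with capture groups.

```r
# Throwaway directory with hypothetical, empty files
demo_dir <- file.path(tempdir(), "envi_demo")
dir.create(demo_dir, showWarnings = FALSE)
stems <- c("011a_20100509_350-2500nm", "012b_20100510_350-2500nm")
invisible(file.create(file.path(demo_dir, paste0(stems, ".esl"))))
# Only the first dataset gets a header, to show the check below
invisible(file.create(file.path(demo_dir, paste0(stems[1], ".hdr"))))

# full.names = TRUE returns paths that include the directory
esl <- list.files(demo_dir, pattern = "\\.esl$", full.names = TRUE)
hdr <- sub("\\.esl$", ".hdr", esl)
has_header <- file.exists(hdr)

# Parse the bare file name so the directory part cannot interfere
stem <- sub("\\.esl$", "", basename(esl))
area_id <- sub("^(\\d{3}\\D)_.*$", "\\1", stem)                 # group 1: area ID
date <- as.Date(sub("^[^_]+_(\\d{8})_.*$", "\\1", stem),        # group 1: date
                format = "%Y%m%d")

print(data.frame(area_id, date, has_header))
```

Filtering `esl` by `has_header` before calling read.ENVI would skip incomplete datasets instead of stopping on the first one.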

edited Oct 10 '16 at 15:36

answered Oct 06 '16 at 22:39

Hi, thank you for coming back on this! I had to do some minor changes in the extension-strip-off and duplicates removal, which works fantastic now. Unfortunately there is an error:`Error in read.ENVI(paste(name, ".esl", sep = ""), headerfile = paste(name, : read.ENVI: Could not open input header file: .hdr Called from: read.ENVI(paste(name, ".esl", sep = ""), headerfile = paste(name, ".hdr", sep = ""))` – dan_ke Oct 07 '16 at 07:52
changes: `all_names <- sub("^([^.]*).*", "\\1", files) # strip off extensions name <- unique(all_names) # get rid of duplicates from .esl and .hdr` – dan_ke Oct 07 '16 at 13:55
Can you check the names variable and see if there are any blank strings in there? – Nate Oct 07 '16 at 16:32
Unfortunately I couldn't find any blank string, but have updated the question with the current code (incl. minor changes) and sample dataset. – dan_ke Oct 07 '16 at 21:25
Edited my answer to a working solution with your example files, let me know if it works for you... – Nate Oct 09 '16 at 18:33
Hi Nathan, thanks for your efforts. I really appreciate this. I got the following error when I tried to run your code: `> # use lapply to 'loop' over each name > list_of_ENVIs <- lapply(names1, mungeENVI) # returns a list Error in FUN(X[[i]], ...) : could not find function "%>%" Called from: FUN(X[[i]], ...)` – dan_ke Oct 10 '16 at 12:56
Whoops, my bad: `%>%` is from `library(magrittr)`. Hopefully that's the last piece :) – Nate Oct 10 '16 at 15:35
