I'm trying to import a huge dataset (to do analysis later), but I only want 4 of the rows (they are categorized by utility, and I'm only interested in a few of them).
Issue is: the formatting is bizarre AND huge, and I keep getting errors about not having enough column names.
Formatting is as follows: Each row is a utility region, and each column is every hour from Summer 2015 to present along with its electricity usage, so there are a LOT of columns (the number of hours from 2015 to present x 2).
I've tried the following:
> data<-read.table("C:\\Users\\EBA.txt",header=true,nrows=150)
Error in read.table("C:\\Users\\EBA.txt", :
object 'true' not found
> data<-read.table("C:\\Users\\EBA.txt",header=TRUE,nrows=150)
Error in read.table("C:\\Users\\EBA.txt", :
more columns than column names
> data<-read.table("C:\\Users\\EBA.txt",header=TRUE,sep=",")
#cancelled here because it froze my computer
> data<-read.table("C:\\Users\\EBA.txt",header=TRUE,sep=",]")
Error in scan(file, what = "", sep = sep, quote = quote, nlines = 1, quiet = TRUE, :
invalid 'sep' value: must be one byte
> data<-read.table("C:\\Users\\EBA.txt",header=TRUE,sep=",")
Error in read.table("C:\\Users\\EBA.txt", :
more columns than column names
The dataset I'm looking at lives here ("US Electric System Operating Data", https://www.eia.gov/opendata/bulkfiles.php"
Ultimately, I'm going to want to use the dates and electricity usage, so reading it in a usage form and converting from ISO time would be also be great - thanks!