32

I have a time series dataset called "Data" which contains water elevation data for several wells over many years of sampling dates. The head of the data.frame looks like this:

           Date            Well   Elev
1    2002-05-23            MW-3 929.04
2    2002-05-29            MW-3 929.39
3    2002-05-31            MW-3 929.37
4    2002-06-05            MW-3 929.36
5    2002-06-12            MW-3     NA
6    2002-06-13            MW-3 929.47
7    2002-06-19            MW-3 929.42
8    2002-06-26            MW-3 930.02
9    2002-07-05            MW-3 930.00

I am trying to use ggplot to plot water elevation over time for each well, such that my x-axis is "Date," my y-axis is "Elev" and each well is plotted in a different color. I have created this plot with the code below, and it is to my satisfaction.

My problem is that I am trying to overlay gray rectangles with geom_rect to show the periods in which a well pump was on. I think I am very close, but I must be doing something wrong with date formatting (?), because I keep getting the following error:

Error: Invalid input: date_trans works with objects of class Date only

Any help? Thanks in advance!

Here is my code:

#Import and fix up data
Data = read.csv("water_elevation_for_R_date.csv", stringsAsFactors=FALSE)
colnames(Data)[1] <- "Date"
Data$Date = as.Date(Data$Date, format = "%m/%d/%Y")
Data$Well <- as.factor(Data$Well)
Data$Elev <- as.numeric(Data$Elev)

#Load ggplot and scales
library(ggplot2)
library(scales)

#Create graph
ggplot(data= Data, aes(x = Date, y = Elev, group = Well, colour = Well)) +
geom_line(size = 0.75) +
xlab("") + ylab("Elevation (ft.)") +
scale_color_brewer(palette = "Spectral") +
scale_x_date(breaks = date_breaks("2 year"),
             labels = date_format("%Y")) +
theme_bw()+
theme(plot.background = element_blank(), 
      panel.grid.major = element_blank(), 
      panel.grid.minor = element_blank(), 
      panel.border = element_blank(), 
      axis.line.x = element_line(color = "black"),
      axis.line.y = element_line(color = "black")) +
geom_rect(data = Data, 
          aes(xmin = "2004-04-29", 
              xmax = "2004-12-20",
              ymin = -Inf, 
              ymax = Inf),
          fill = "gray", 
          alpha = 0.5)
Axeman
  • 32,068
  • 8
  • 81
  • 94
snalven
  • 411
  • 1
  • 6
  • 12
  • Can you provide the output of `dput(Data)`? – J.Con Jan 24 '17 at 01:50
  • Hi J. Con, Sorry-- I am new to posting here, and the output of 'dput(Data)' is too long for both the comment and the original post. Any suggestions on how to get you the output? – snalven Jan 25 '17 at 15:08
  • Read this for some tips. https://www.r-bloggers.com/three-tips-for-posting-good-questions-to-r-help-and-stack-overflow/ – J.Con Jan 25 '17 at 21:34
  • 1
    Try just using the first three observations in your data, then run `dput(data)` – YourEconProf Jan 25 '17 at 22:04

1 Answers1

43

The problem seems to be in your geom_rect area (it plots without this). Other "date_trans" errors on this site point to needed to set dates with as.Date. So yes, you were in the right debug area. This works:

Wrap your minimum and maximum in xmin and xmax call in geom_rect section:

aes(xmin = as.Date("2004-04-29", "%Y-%m-%d"), 
    xmax = as.Date("2004-12-20",  "%Y-%m-%d"),

CODE BELOW FOR OTHERS TO USE

I created three data lines only as @YourEconProf suggested.

#Import and fix up data
#Data = read.csv("water_elevation_for_R_date.csv", stringsAsFactors=FALSE)
#Date            Well   Elev
#1    2002-05-23            MW-3 929.04
#2    2002-05-29            MW-3 929.39
#3    2002-05-31            MW-3 929.37
# etc...
Data = data.frame(Date = c(as.Date("2002-05-23", "%Y-%m-%d"),
                           as.Date("2002-05-29", "%Y-%m-%d"),
                           as.Date("2002-05-31", "%Y-%m-%d")),
                  Well = c("MW-3","MW-3","MW-3"),
                  Elev = c(929.04, 929.39, 929.37))

colnames(Data)[1] <- "Date"
Data$Date = as.Date(Data$Date, format = "%m/%d/%Y")
Data$Well <- as.factor(Data$Well)
Data$Elev <- as.numeric(Data$Elev)

#Load ggplot and scales
library(ggplot2)
library(scales)

#Create graph
ggplot(data= Data, aes(x = Date, y = Elev, group = Well, colour = Well)) +
  geom_line(size = 0.75) +
  xlab("") + ylab("Elevation (ft.)") +
  scale_color_brewer(palette = "Spectral") +
  scale_x_date(breaks = date_breaks("2 year"),
               labels = date_format("%Y")) +
  theme_bw()+
  theme(plot.background = element_blank(), 
        panel.grid.major = element_blank(), 
        panel.grid.minor = element_blank(), 
        panel.border = element_blank(), 
        axis.line.x = element_line(color = "black"),
        axis.line.y = element_line(color = "black")) +
  geom_rect(data = Data, 
            aes(xmin = as.Date("2004-04-29", "%Y-%m-%d"), 
                xmax = as.Date("2004-12-20",  "%Y-%m-%d"),
                ymin = -Inf, 
                ymax = Inf),
            fill = "gray", 
            alpha = 0.5)

Which gets you this:

3lines of data chart

Community
  • 1
  • 1
micstr
  • 5,080
  • 8
  • 48
  • 76
  • 2
    Thanks so much micstr! I had tried using as.Date but didn't realize I had to specify the format. – snalven Jan 27 '17 at 14:24