Split Date-Time column (containing a character) into two separate columns in R

Question

I have a dataset with a combined date-time column that I would like to split into separate year, month, day and time columns. I usually use the lubridate library with the appropriate arguments, but in this column every value also contains the character T between the date and the time.

How can I split this column by dropping the character T from each row of this column?

Date_Time
2020-01-01T00:48:00  
2020-01-01T00:46:00
2020-01-02T15:07:00
2020-01-02T15:07:00

score 3 · Accepted Answer · answered Aug 01 '21 at 02:22

You can use tidyr::separate, splitting on either the T or a hyphen:

tidyr::separate(df, Date_Time, c('Year', 'Month', 'Day', 'Time'), sep = '[T-]')

#  Year Month Day     Time
#1 2020    01  01 00:48:00
#2 2020    01  01 00:46:00
#3 2020    01  02 15:07:00
#4 2020    01  02 15:07:00
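To see why the pattern '[T-]' produces exactly four pieces, here is a small base R sketch that applies the same character class to a single value with strsplit. The variable names x and parts are only illustrative. The class matches a hyphen or the letter T, and the time part contains neither, so it stays in one piece.

```r
# One value copied from the question's sample data
x <- "2020-01-01T00:48:00"

# Split on every hyphen or "T"; the colons in the time are left alone
parts <- strsplit(x, "[T-]")[[1]]

print(parts)
```

The same logic is what separate() applies row by row before assigning the pieces to the Year, Month, Day and Time columns.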

Or convert Date_Time to POSIXct first (ymd_hms parses the T separator directly), then extract the date parts and the time:

library(dplyr)
library(lubridate)

df %>%
  # Parse the string into a date-time (POSIXct) value
  mutate(Date_Time  = ymd_hms(Date_Time), 
         Year = year(Date_Time), 
         Month = month(Date_Time), 
         Day = day(Date_Time),
         # '%T' is shorthand for '%H:%M:%S'
         Time = format(Date_Time, '%T'))
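The same idea can be sketched without any packages. This is a minimal base R version, assuming the data frame is called df as in the answer and that the timestamps are in UTC (the question does not state a time zone, so tz = "UTC" is an assumption). The literal T in the format string matches the separator in each value.

```r
# Sample data copied from the question
df <- data.frame(
  Date_Time = c("2020-01-01T00:48:00", "2020-01-01T00:46:00",
                "2020-01-02T15:07:00", "2020-01-02T15:07:00"),
  stringsAsFactors = FALSE
)

# Parse the strings; the "T" in the format matches the literal separator.
# tz = "UTC" is an assumption, not something given in the question.
parsed <- as.POSIXct(df$Date_Time, format = "%Y-%m-%dT%H:%M:%S", tz = "UTC")

# Pull each component back out of the parsed date-time
df$Year  <- as.integer(format(parsed, "%Y"))
df$Month <- as.integer(format(parsed, "%m"))
df$Day   <- as.integer(format(parsed, "%d"))
df$Time  <- format(parsed, "%H:%M:%S")

print(df)
```

Unlike the separate() approach, this validates that each value is a real date-time: strings that do not match the format become NA rather than being split blindly.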

score 2 · Answer 2 · answered Aug 01 '21 at 03:08

Base R solution:

cbind(
  df, 
  strcapture(
    pattern = "^(\\d{4})-(\\d{2})-(\\d{2})T(.*)$",
    x = df$Date_Time,
    proto = list(
      year = integer(), 
      month = integer(), 
      day = integer(),
      time = character()
    )
  )
)

answered by hello_friend