so im trying to make a calendar (dataframe) with the soccer matches coming. Im webscraping the columns one by one because i dont need them all. When scraping the column with the timedate (HORA) i get a values that are incorrect, dont know why... i dont think it has to be with timezone because its just text.
library(rvest)
url <- "https://www.cruzados.cl/competitions/campeonato-nacional"
page <- read_html(url)
hora_inicio <- page %>% html_nodes("td.team-schedule__time") %>% html_text()
> hora_inicio
[1] "21:00" "22:30" "23:15" "22:30" "00:30" "00:00" "02:00" "02:00" "19:00" "22:00" "19:00" "22:15" "19:00" "02:00"
[15] "19:00" "19:00" "19:00" "19:00" "19:00" "19:00" "19:00" "19:00" "19:00" "19:00" "19:00" "19:00" "19:00" "19:00"
[29] "19:00" "19:00" "19:00" "19:00" "19:00" "20:00" "20:00" "20:00" "20:00" "20:00" "20:00"
the right ones are: 18:00, 19:30, 19:15, 18:30, 20:30, 20:00, 18:00 , ...