I'm working on learning to clean datasets and am encountering multiple date formats. There appears to be only two formats datetimes were stored as. The below code works fine, but feels clunky. Is there a better (or faster) way to handle multiple formats in a DataFrame? If it were only one format, it would be doable in just one line and I like avoiding try/except statements if I can.
For reference, the data goes from 1963 to 2010, so declaring the date_parsed column as 2023 seemed safe to me. Thanks!
data['date_parsed'] = pd.to_datetime('01/01/2023', format='%m/%d/%Y')
for i in range(len(data.Date)):
try:
data['date_parsed'][i] = pd.to_datetime(data.Date[i], format='%m/%d/%Y')
except:
data['date_parsed'][i] = pd.to_datetime(data.Date[i], format='%Y-%m-%dT%H:%M:%S.%fZ')