We have two columns in a data frame: start_time, end_time (both type object)
data = {
"passenger": [913383, 442365, 983560, 163350],
"start_time": [0:00:00, 0:01:17, 0:00:24, 0:00:26],
"end_time": [0:00:17, 0:01:32, 0:03:20, 0:01:38]
}
#load data into a DataFrame object:
df = pd.DataFrame(data)
We are looking to create a third column with the time difference (end_time-start_time) in hh:mm:ss format:
df[time_difference]=df[end_time]-df[start_time]
If we convert the object type to_datetime it gives the full date too which is not what we want.
Any assistance would be appreciated.