I've got data called 'Planned Leave' which includes 'Start Date', 'End Date', 'User ID' and 'Leave Type'.
I want to be able to create a new data-frame which shows all days between Start and End Date, per 'User ID'.
So far, I've only been able to create a date_list which supplies a range of dates between start and end date, but I cannot find a way to include this for each 'User ID' and 'Leave Type'.
Here is my current function:
def datesplit(data):
x = pd.DataFrame(columns=['Date'])
for i in plannedleave.iterrows():
start = data['Start Date'][i]
end = data['End Date'][i]
date_list = [start + dt.timedelta(days=x) for x in range((end-start).days)]
x.append(date_list)
return x
>>> datesplit(plannedleave)
>>> Value Error: Can only Tuple-index with a MultiIndex
Here's what the data looks like:
>>> plannedleave.dtypes
>>>
Employee ID int64
First Name object
Last Name object
Leave Type object
Start Date datetime64[ns]
End Date datetime64[ns]
dtype: object
I'd be forever grateful if you could find a solution here! :-)