I have production data, that has 38 different statuses ranging from 0 - 5000, i.e 3000, 3100, 3500. Along with each one of the status, is a status date, model number, and a ship to location.
I need to figure out, on average, how long it takes an item to go from status 3000 to 5000 (delivered), given that it is item M going to destination X. Not just for status 3000 to 4000, but for all 37 different statuses (that are before delivery) so that I can set up a predictive model to forecast future delivery times.
Data Set:
ID | Status | Status Date "%m/%d/%Y" | Model | Ship to Location |
---|---|---|---|---|
ABC123 | 3000 | 1/1/2023 | M | X |
ABC123 | 5000 | 1/5/2023 | M | X |
ABC124 | 2000 | 5/10/2022 | N | Y |
ABC124 | 5000 | 5/15/2022 | N | Y |
ABC124 2500 12/10/2023 M X
This is the result I am looking for:
Status | Model | Ship to Location | Days needed to reach status 5000 |
---|---|---|---|
3000 | M | X | 20 |
3100 | M | X | 15 |
3300 | N | Y | 10 |
3400 | N | Y | 5 |