I'm getting started with arulesSequences with an aim to perform Frequent Sequence Mining on some data I have. The data for a store A looks like below:
CUSTOMER_ID seq_num Size bought_items
1 17399 1 2 {100,100}
2 17399 2 1 {800}
3 17399 3 2 {900,900}
4 17399 4 1 {405}
5 17399 5 4 {200,505,200,505}
What this means is that this customer #17399 shopped with this store A on multiple occasions. During his/her first shopping trip, this person bought items with item codes 100 and 100 (2 items). During his/her second shopping trip, this customer bought just the item 800. And so on.
Now i want to use cSPADE on this customer, where order doesnt matter within a "basket" but does matter across shopping trips. So eventually my record for customer 17399 would be:
CUSTOMER_ID bought_items
17399 {(100,100),800,(900,900),405,(200,505,200,505)}
Where {} contains the full sequence and () represents each shopping trip.
I understand in general this is a possibility.However, I haven't seen any examples (a few hours of searching) or notes explicitly talking about arulesSequences supporting this. Any thoughts out there? Would appreciate the help.
Thanks for your time.