I have website visitor data that resembles the example below:
id | pages |
---|---|
001 | /ice-cream, /bagels, /bagels/flavors |
002 | /pizza, /pizza/flavors, /pizza/recipe |
I would like to transform to below, where I can count the amount of times they have visited a part of my site that deals with specific content. A general count of all pageviews, delimited by comma, would be great as well.
id | bagel_count |
---|---|
001 | 2 |
002 | 0 |
id | pizza_count |
---|---|
001 | 0 |
002 | 3 |
id | total_pages_count |
---|---|
001 | 3 |
002 | 3 |
I have the option to perform in SQL or Python but I am not sure what is easier, hence why I am asking the question.
I have referenced following questions but they are not serving my purpose:
Count the number of occurrences of a character in a string (this was close but I am not sure how to apply to a dataframe)