Using apply to populate a dictionary

Question

I have a dataframe column that contains python lists of tags. I need to create a dictionary that counts how many times a tag was used. I did it this way:

tags_use_count = {}

def count_tags(tag_list):
    
    for tag in tag_list:
        if tag in tags_use_count:
            tags_use_count[tag] += 1
        else:
            tags_use_count[tag] = 1

q2019['Tags'].apply(count_tags)

It works just fine, but I wonder if this is a good way of doing it. Somehow, using apply that way seems like a crappy workaround that seasoned coders would frown upon. (It's not what apply was built for, I guess.) The dataset is small, so I guess I could use iterrows to loop through the column, but I understand it's not a good idea for larger datasets and I wonder if my approach would be the go-to in that case or if there's a a better way.

score 0 · Accepted Answer · answered Dec 13 '21 at 09:58

0

IIUC, you just want to count across every list in every row. So you can just explode 'Tags'-column and count values and convert to dictionary:

q2019['Tags'].explode().value_counts().to_dict()

answered Dec 13 '21 at 09:58

Mustafa Shujaie · Answer 2 · 2021-12-13T10:05:28.943

0

You can use collections.Counter to do exactly this:

>>> from collections import Counter
>>> tag_list = ['tag_a', 'tag_b', 'tag_b', 'tag_c']
>>> dict(Counter(tag_list))
{ 'tag_a': 1, 'tag_b': 2, 'tag_c': 1}

edited Dec 13 '21 at 10:05

answered Dec 13 '21 at 10:00

Mustafa Shujaie

1,447
1
16
30

Using apply to populate a dictionary

2 Answers2