I have a csv that looks like this:
CompanyName High Priority QualityIssue
Customer1 Yes User
Customer1 Yes User
Customer2 No User
Customer3 No Equipment
Customer1 No Neither
Customer3 No User
Customer3 Yes User
Customer3 Yes Equipment
Customer4 No User
I want to count how many time each instance in CompanyName appears in the entire file and sort by the number of appearance descend, but only print the CompanyName once:
For example, by using this code:
df['count'] = df.groupby('CompanyName'['CompanyName'].transform(pd.Series.value_counts)
df.sort('count', ascending=False)
I get:
Out:
CompanyName HighPriority QualityIssue count
5 Customer3 No User 4
3 Customer3 No Equipment 4
7 Customer3 Yes Equipment 4
6 Customer3 Yes User 4
0 Customer1 Yes User 3
4 Customer1 No Neither 3
1 Customer1 Yes User 3
8 Customer4 No User 1
2 Customer2 No User 1
What I want is:
CompanyName count
Customer3 4
Customer1 3
Customer4 1
Customer2 1
Any idea?
Problem 2: With empty rows:
CompanyName High Priority QualityIssue
Customer1 Yes User
Customer1 Yes User
No User
Customer3 No Equipment
Customer1 No Neither
No User
Customer3 Yes User
Customer3 Yes Equipment
Customer4 No User
Expected output:
CompanyName count
Customer3 3
Customer1 3
2
Customer4 1