- Snapshot of the referenced data given below: reference data
I want to create two new columns from this data set.
I need to create a scatter plot to display how each firm performed in the year 2017 based on their average score (x-axis) per their market penetration (y-axis) [# of unique respondents that rated the company / total unique respondents] based on the data (a portion of data is in snapshot)
New column ‘mtkp’. 'mktp' is the #unique respondents per company per year / total unique respondents in 2017
New column 'mean'. mean is the average score given to a company that year.
How do I do it? Any suggestion? I am using Pandas and python 3