I have:

import pandas as pd

df = pd.DataFrame({
  "ID": [55218, 55218, 55218, 55222],
  "Cluster": [0, 0, 1, 1],
  "Rating": [-1, 2, 0, 2]})

I want to sum the Rating of each Cluster (e.g. Cluster 0) for every ID.

My expected output is:

df_new = pd.DataFrame({
  "ID": [55218, 55218, 55222],
  "Cluster": [0, 1, 1],
  "Sum": [1, 0, 2]})
IoaTzimas
Mars

1 Answer

Use groupby on both ID and Cluster, then sum Rating:

df.groupby(["ID", "Cluster"])["Rating"].sum().reset_index(name="Sum")
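
To check this against the data in the question, here is a minimal self-contained sketch. It assumes the goal is one row per (ID, Cluster) pair, with the summed Rating in a column named Sum as in the expected output. It groups on both keys and sums Rating, which is my reading of the question rather than something the asker confirmed.

```python
import pandas as pd

df = pd.DataFrame({
    "ID": [55218, 55218, 55218, 55222],
    "Cluster": [0, 0, 1, 1],
    "Rating": [-1, 2, 0, 2],
})

# Group by both keys so each (ID, Cluster) pair gets its own total,
# then rename the summed column to "Sum" to match the expected output.
df_new = (
    df.groupby(["ID", "Cluster"], as_index=False)["Rating"]
      .sum()
      .rename(columns={"Rating": "Sum"})
)

print(df_new)
#       ID  Cluster  Sum
# 0  55218        0    1
# 1  55218        1    0
# 2  55222        1    2
```

If you only need a single cluster, filtering first (for example `df[df["Cluster"] == 0]`) and then grouping by ID alone should give the per-ID totals for that cluster.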
Wasif
    Questions as basic as this one have usually been asked and answered. Please look for duplicates before posting simple, one-line answers. – Pranav Hosangadi Nov 03 '20 at 16:16