I have this PySpark DataFrame
df = pd.DataFrame(np.array([
["aa@gmail.com",2,3], ["aa@gmail.com",5,5],
["bb@gmail.com",8,2], ["cc@gmail.com",9,3]
]), columns=['user','movie','rating'])
sparkdf = sqlContext.createDataFrame(df, samplingRatio=0.1)
user movie rating
aa@gmail.com 2 3
aa@gmail.com 5 5
bb@gmail.com 8 2
cc@gmail.com 9 3
I need to add a new column with a Rank by User
I want have this output
user movie rating Rank
aa@gmail.com 2 3 1
aa@gmail.com 5 5 1
bb@gmail.com 8 2 2
cc@gmail.com 9 3 3
How can I do that?