I have a Dataframe with many computers of different producers, released in different Years, their Salenumbers...
Now, my goal is to find all new Computers released in 2015 which didn't exist in any years before. That means I have to control, if the Computernames are listed in any Years bevor 2015 and if they are, I want to delete these computernames of my 2015 list. Furthermore, there are also computers released in 2016,2017,... which shouldn't be forgotten. I want to have the number of these new computers.
Well, I have so many values, that I don't know if there are duplicates of the names, only with different Years, but this was my first idea.
df_noduplicates=df[df.Year<2016](subset=['Name'], keep='first')
df_Year2013 = df[df.Year==2015]
print(df_Year2015.shape(0))
But I only get the Error 'DataFrame' object is not callable after running. It should be because of the first line, but I don't know, what I did wrong.
Another problem is, that I should use 'set' to solve this exercise, but I don't know how it could be used in this context.
Thank you for your help in advance. :)