I have a dataset where some individuals are listed twice because they have received one or more degrees. I am wanting to only get the latest graduation date for each individual. Below are examples of what is currently happening and the desired output.
People | g_date |
PersonA|2019-01-01|
PersonA|2020-01-01|
PersonB|2010-05-01|
PersonB|2015-01-01|
PersonC|2010-06-01|
People | g_date |
PersonA|2020-01-01|
PersonB|2015-01-01|
Personc|2010-06-01|