What I have:
df
Name |Vehicle
Dave |Car
Mark |Bike
Steve|Car
Dave |
Steve|
I want to drop duplicates from the Name column, but only if the corresponding value in the Vehicle column is null. I know I can use
df.drop_duplicates(subset=['Name'])
with either keep='first' or keep='last',
but what I am looking for is a way to drop duplicates from the Name column where the corresponding value of the Vehicle column is null. So basically, keep the row for a Name if its Vehicle value is NOT null and drop the rest. If a name does not have a duplicate, then keep that row even if the corresponding value in Vehicle is null.
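To make the expected result concrete, here is a minimal sketch of the behaviour described above. It assumes the empty Vehicle cells are real missing values (None/NaN) rather than empty strings, and the names is_dup and result are just illustrative. It is one possible approach using a boolean mask, not necessarily the best one.

```python
import pandas as pd

# Sample data from the question; empty Vehicle cells are assumed to be None/NaN
df = pd.DataFrame({
    "Name": ["Dave", "Mark", "Steve", "Dave", "Steve"],
    "Vehicle": ["Car", "Bike", "Car", None, None],
})

# True for every row whose Name appears more than once in the frame
is_dup = df["Name"].duplicated(keep=False)

# Drop a row only when its Name is duplicated AND its Vehicle is null
result = df[~(is_dup & df["Vehicle"].isna())]

print(result)
```

For the sample data this should leave the Dave/Car, Mark/Bike and Steve/Car rows. Note that with this mask, a name that is duplicated with a null Vehicle in every one of its rows would be removed entirely, which may or may not be what is wanted.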
Many Thanks