I am using this dataset and reading it into a pandas DataFrame. I only need to work with the paperAbstract
column, which has some missing data.
import pandas as pd
filename = "sample-S2-records"
df = pd.read_json(filename, lines=True)
abstract = df['paperAbstract']
Because some values in the abstract Series are missing, I want to remove the rows that are empty.
Following the documentation, I tried:
abstract.dropna(how='all')
But this doesn't remove the empty rows; they are still there in abstract.
What am I missing?
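
For reference, here is a small self-contained sketch that reproduces what I see, using made-up data rather than the real file. It assumes the missing abstracts may be either NaN or empty strings, which I have not verified for this dataset. The `cleaned` name and the sample values are only for illustration.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for the real records; the actual file is not read here.
df = pd.DataFrame({"paperAbstract": ["Some text", "", np.nan, "More text"]})
abstract = df["paperAbstract"]

# dropna returns a new Series; without assignment the original is unchanged.
abstract.dropna()
print(len(abstract))  # still 4

# Empty strings are not NaN, so dropna alone would keep them.
# Keep only rows that are neither NaN nor an empty string, and store the result.
cleaned = abstract[abstract.notna() & (abstract != "")]
print(cleaned.tolist())
```

If this matches the real data, the two possible causes would be that the result of dropna is never assigned, and that some "empty" rows hold empty strings rather than NaN.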