I have a few categorical variables which I binary encoded.
The problem is there are a lot of Nan values, I know I can just do df.fillna(0)
for replacing the nan values..but will that be meaningful for machine learning?
Some columns have data and some columns are filled with Nans, and this varies row by row.
How to make the data useful? What specific operation is required?