I have a database that contains 37000 rows and 100 features.
The DB has a lot of Nan values.
But the big problem is that if feature A occurs with 600 rows and feature B occurs with 700 rows, the common rows between them may be only 10 rows.
I wanna find the max number of features that occurs with N rows at least
Example: Input Database
out: where N=5