Jupyter notebook taking forever to iterate over a for loop

Question

I an using the following for loop to iterate over df (droping a row if condition match) :

for index, row in df.iterrows():
   if(pd.isnull(row["CR Date"])):
     df.drop(index, inplace=True)

the df shape 240,000*30 It is working BUT It is taking more than 1.5 hours. Is there any faster way? I am using Anaconda JupyterLab

This does not answer your question. Correct me If I am wrong, after dropping a row, wont index of next row change. for e.g. after deleting row 5, wont row 6 become new row 5? — ksholla20, Jul 21 '19 at 03:52

score 0 · Accepted Answer · answered Jul 21 '19 at 03:53

0

replace:

for index, row in df.iterrows():
   if(pd.isnull(row["CR Date"])):
     df.drop(index, inplace=True)

by this to drop rows with missing value:

df.dropna(subset=['CR Date'], inplace=True)

answered Jul 21 '19 at 03:53

Cao Minh Vu

You are welcome, please mark it as the answer if it works – Cao Minh Vu Jul 21 '19 at 04:40

1 Answers1