Python Pandas data frame setting copy of slice working sometimes but not always, despite nearly identical code

Question

I have one data frame called patient_df that is made like this:

PATIENT_COLS = ['Origin', 'Status', 'Team', 'Bed', 'Admit_Time', 'First_Consult', 'Decant_Time', 'Ward_Time', 'Discharge_Order', 'Discharged'] # data to track for each patient
patient_df = pd.DataFrame(columns=PATIENT_COLS)

Then, at multiple points in my code I will access a row of this data frame and update fields associated with it (the row at patient_ID doesn't exist prior to me creating it in the first line):

patient_df.loc[patient_ID] = [None for i in range(NUM_PATIENT_COLS)]
record = patient_df.loc[patient_ID]
record.Origin = ORIGIN()
record.Admit_Time = sim_time

This code runs perfectly with no errors or warnings and the output is as expected (the actual data frame is updated).

However, I have another data frame called ip_df:

ip_df = pd.read_csv(PATH + 'Clean_IP.csv')

Now, when I try to access the rows in the same way (this time the rows already exist):

for patient in ALC_patients:
    record = ip_df.loc[patient]
    orig_end = record.IP_Discharge_DT
    record.IP_LOS = MAX_STAY
    record.IP_Discharge_DT = record.N_Left_DT + timedelta(days=MAX_STAY)

I get

SettingWithCopyWarning: A value is trying to be set on a copy of a slice from a DataFrame

Now, I realize what's happening is I'm actually accessing a copy of the data frame and thus not updating the actual one, and I can fix this by using

ip_df[patient, 'IP_LOS'] = MAX_STAY

However, I find the first code much cleaner, plus I don't have to make the data frame search for the row again every time. Why is this working with patient_df but not for ip_df, and is there anything I can change to use code more like what I am for patient_df?

score 0 · Answer 1 · answered Jun 15 '17 at 18:07

0

pd.options.mode.chained_assignment = None # default='warn'

According to this link setting this in your code will turn off the warn flag

answered Jun 15 '17 at 18:07

Matthew Barlowe

2,229
1
14
24

2

Yeah but it still doesn't update the actual DataFrame, that just makes it not tell me that it's not updating. In patient_df it does actually update though. – blair Jun 15 '17 at 18:11

Python Pandas data frame setting copy of slice working sometimes but not always, despite nearly identical code

1 Answers1