I have several csv files one for blood pressure for patients and others for heart rate, WBc,etc for the same patients and the same hour measure , please see the following example
First csv:
subject_id hour_id value label
1 1 96 blood pressure
1 1 94 blood pressure
1 1 93 blood pressure
2 2 99 blood pressure
Second csv:
subject_id hour_id value label
1 1 80 Heart rate
2 2 89 Heart rate
2 2 81 Heart rate
third csv:
subject_id hour_id value label
1 1 1 WBC
2 2 10 WBC
2 2 12 WBC
Fourth csv:
subject_id hour_id value label
1 1 123 glucose
2 2 111 glucose
2 2 113 glucose
Desired output:
subject_id hour_id blood_pressure heart rate WBC gloucose
1 1 96 80 1 123
2 2 99 89 10 120
I tried:
df = pd.read_csv('D:\\....', low_memory=False, error_bad_lines=False)
df2 = pd.read_csv('D:\\Users', low_memory=False, error_bad_lines=False)
merged = pd.concat([df, df2,df3,df4], axis=1, keys=['subject_id', 'hour_mesaure'])
print(merged)
But it gives me:
subject_id hour_id blood_pressure
1 1 96
2 2 99
subject_id hour_id value label
1 1 80 Heart rate
2 2 89 Heart rate
and complete the files sequentially
any help will be appreciated