I have a dataset with IDs listed in column 1 and then diagnosis in column 2, cause in column 3 and so on. For IDs with several diagnosis these are listed as one row per diagnosis so the same ID with have several rows. I would like to have everything as one row per ID and the several diagnosis as columns. Is this possible?
My data look somethign like this:
ID diagnosis cause_of_diagnosis
1 A A
1 B B
1 C C
2 A A
3 A A
3 B B
3 C C
I would like the data to end up looking something like:
ID diagnosis_1 diagnosis_2 diagnosis_3 cause_of_diagnosis_1 cause_of_diagnosis_2 cause_of_diagnosis_3
1 A B C A B C
2 A - - A - -
3 A B C A B C