Hi,
I have an input file with one column of gene IDs and one column of GO terms, with multiple rows per gene (anywhere from 1 to more than 20). The format I need to generate has a single row for each unique gene ID, with its GO terms in a second column, separated by semicolons.
My data:
GeneID GO
am1001 190909
am1001 600510
am1002 500050
am1002 432323
am1002 100209
The desired output:
GeneID GO_list
am1001 190909; 600510
am1002 500050; 432323; 100209
I have tried approaches similar to the one in "How to create new columns in a data.frame based on row values in R?", but without success.
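To make the target concrete, here is a minimal sketch of the collapse in base R using `aggregate()` and `paste()`. It is only one possible approach, not a tested solution for the full file. The data frame name `go` is hypothetical, the sample rows are typed in by hand rather than read from my file, and I am assuming the GO terms should be treated as text.

```r
# Sample rows from the post, entered by hand. `go` is a hypothetical name;
# the real data would come from something like read.table(..., header = TRUE).
go <- data.frame(
  GeneID = c("am1001", "am1001", "am1002", "am1002", "am1002"),
  GO     = c("190909", "600510", "500050", "432323", "100209"),
  stringsAsFactors = FALSE
)

# For each GeneID, join all of its GO terms into one "; "-separated string.
go_list <- aggregate(
  GO ~ GeneID,
  data = go,
  FUN  = function(x) paste(x, collapse = "; ")
)

# Rename the collapsed column to match the desired header.
names(go_list)[2] <- "GO_list"

print(go_list)
```

The result could then be written out with `write.table(go_list, sep = "\t", quote = FALSE, row.names = FALSE)`, assuming a tab-separated file is wanted.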
Thanks in advance for your advice! :)