I want to transpose and group my data. The input looks like this:
APOC2 GO:0006629
APOC2 GO:0006869
APOC2 GO:0008047
APOC2 GO:0042627
APOC2 GO:0043085
CRYAB GO:0005212
SERPINA1 GO:0005615
DMD GO:0001954
DMD GO:0002162
DMD GO:0003779
DMD GO:0005200
DMD GO:0005886
But I need the data in this simple tab-delimited format: each record in $1 should appear only once, followed on the same row by all of its GO values (taken from $2 of the input file). For the records above, the output would be:
APOC2 GO:0006629 GO:0006869 GO:0008047 GO:0042627 GO:0043085
CRYAB GO:0005212
SERPINA1 GO:0005615
DMD GO:0001954 GO:0002162 GO:0003779 GO:0005200 GO:0005886
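To make the transformation concrete, here is a minimal Python sketch of the grouping I have in mind. It assumes each input line holds exactly two whitespace-separated fields and that everything fits in memory; the function name `group_by_first_column` is just a hypothetical name for illustration, not something from an existing tool.

```python
def group_by_first_column(lines):
    """Collect the second field of each line under its first field.

    Keys keep the order in which they first appear in the input
    (plain dicts preserve insertion order in Python 3.7+).
    """
    groups = {}
    for line in lines:
        fields = line.split()
        if len(fields) < 2:
            # Skip blank or malformed lines (an assumption about the data).
            continue
        key, value = fields[0], fields[1]
        groups.setdefault(key, []).append(value)
    # One tab-delimited row per key: key, then all of its values.
    return ["\t".join([key] + values) for key, values in groups.items()]
```

Since an open file can be iterated line by line, passing a file object to this function should avoid the row limit I hit in Excel, although I have not tried it on the full data set.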
A solution is given in questions/17853218
on this forum, but my data file is too large for MS Excel to handle. How can I do the same task in Linux or in R?
Thanks.