I have a strange error on some of the data frames I'm working on, shifting all my rows by one cell so they're no more corresponding to my data:
Instead of having this:
> head(xaa.small)
AGCATTCGAAACATCGAGGCTAACATCCAGTACGCAAGTGGCC AGCATTCGAAACATCGCCAGTTCAATCCATCTTCACAGTGGCC
hg19_ENSG00000000003 0 0
hg19_ENSG00000000419 0 0
hg19_ENSG00000000457 0 0
It looks like this:
> head (xab.small)
AGCATTCGAAACATCGAGGCTAACATCCAGTACGCAAGTGGCC AGCATTCGAAACATCGCCAGTTCAATCCATCTTCACAGTGGCC
0 0
hg19_ENSG00000103160 0 0
hg19_ENSG00000103168 0 0
There's that empty space that appears on some of the dataframes.
How could I erase it and "repair" my dataframes in R? Or import it more efficiently? fill = True
when importing the dataframe works but blocks me for the further analysis that I have to do.
The data is comming from a huge tsv file that was cut in multiple parts. Maybe during the cutting process there was an error (split
function was used in order to cut the initial tsv file in several 200 Mb files)