for the input
ATTTGGC
TGCCTTA
CGGTATC
GAAAATT
I want an output of 3-mers from each line forming a final list composed of all 3-mers the output should be like
[ATT, TTT, TTG, TGG, GGC, TGC, GCC...]
not the GC\n
for first line or TA\n
for second-line
def getKmersFromDna(Dna,k):
kmer_list = []
for i in range(len(Dna)-k+1):
kmer_list.append(Dna[i:i+k])
return list(kmer_list)
giving
output like ['CC\n', 'C\nG', '\nGT']
which I do not want.