I'm trying to find occurrences of several pairs of words in strings which are in a list in a tsv file. A list in a tsv file is below.
0 ILDIGCGRGRHARALVRRGWQVTGLDLSEDAVAAARSRVADDDLDV...
1 AELETLQAKINPHFLYNSLNSIASLVYTDPEKAEKMVLMLSKLFRV...
2 AQLSSLKEQLNPHFLFNTFNTLYGISLKYPERVPDLIMHTSQLMRY...
3 TEIKALQSQIKPHFLFNTLNAIRCTIINNNNDKAADLVYKLAMLLR...
4 SEMSRLNAQINPHFLFNTLNFFYSEVRTLHPKISESILLLSDIMRY... ...
...1000 SELSFLKAQINPHFFFNTLNNIYALTMMDVASAQEALHRLSRMMRY...
1001 ILEPGCGTGRLMLALAEHGHHVAGVDASATALEFCRERLTQHGLTG...
1002 IADLGAGEGTISQLMAQRAKRVIAIDNSEKMVEFGAELARKHGIAN...
1003 AELRALRAQISPHFIYNALAAIASFVRTDPERARELLLEFADFSRY...
1004 VVDLGCGSGASTDALVNSMGHRGETYAAIGIDASAGMLTEAHSKPW...
[1005 rows x 1 columns]
then, I'd like to get occurrences of AA, AB, AC, ...ZY, ZZ for each row. An example is below.
If there is a string "AEAETLQAKIN" in a row, then I'd like to get the result below.
(the definition of strings must be acid. ex)acid='AEAETLQAKIN')
IN[]......(I'd like to know how to describe codes which can get occurrences here. )
OUT[] AA: 0, AC: 0, AD: 0, AE: 2, ... AK: 1, ... EA: 1, ...