I have two groups A and B of strings of the letters "AGTE" and I'd like to find some way of comparing these to see whether they are statistically similar. The first group A are real world observations, B are predictions. There are 400 or so in each group Eg
**A**
GTAATEGTTTEAAA
TTEAGE
...
**B**
AGTEAAAAGT
TAT
GGATEAATGGGTEAATG
....
I'd also like to be up to visualise these in some way really for presentation purposes. Do you have any ideas how I might be able to do that?