I have a string and I want to extract a sub-string comprising of the following characters [A,T,C,G,\n] only. This characters can appear in the sub-string in any order and number without a specific pattern. I also don't have any constant delimiter before and after this sub-string that I can use. Example of a full string and the sub-string I would like to extract in BOLD.
-
AC068547.7 Homo sapiens BAC clone RP11-458J7 from 2, complete sequence GAATTCAACTTTCTAGACCAATGATTTTTGGACTAATGATGTTTGGAGGGCCCAACAACCCAGAAAGTTGAATTCCAGTC\nTCCTTTAGTGAAAATAAA\n
-
AC1284347.7 Homo sapiens XXX clone RP11-1238J7 from 3,CDSTAGGGCTGAGATCGGCGTAAG\nGAGATCGGAGAGCTGAAT