I used tokenizer = RegexpTokenizer(r'\w+')
which retains alphanumeric characters
But how do I combine a regular expression to remove every other element retaining just characters greater than length 2
Below is one row in the dataframe which contains random text
0 [ANOTHER 2'' F/P SAMPLE 01:52 ...A13232 / AS OUTPUT MSG...