What is the best way to filter text in Python so that I am only including numbers, upper/lower case letters, all punctuation and characters for new line, tab, etc.
For example I might have the text below and want to get rid of the pictures, but the links, punctuation, letters, numbers are fine:
Episode 19 is OUT NOW! Pasta Go Go Food Review Candle Light Dinner in the Car! PASTA LA VISTA Click Link B…
I have looked at regex expressions, but not sure how that would work. I was trying re.match.
It looks like translation tables might be the way to go, but they don't seem to work by exclusion. I would like to define the set of characters I want and remove anything else.