I'm parsing through a pdf file that I converted its content to strings and there are many occurrences of \*** (* meaning any symbol)happening inside words. For example:
transaction, a middle ground has seemed workable\xe2\x80\x94norms explicitly articulated, backed by sanctions of the relevant professional associations
Using text.replace("\\***","")
obviously does not work and so I was looking into using re.sub()
.
I'm having trouble with the syntax (reg expressions) to put into the arguements and was hoping for some help with it.