I have used the strip_tags function. It removes tags like "<p>", "<b>", etc., but things like " “" and other HTML encodings remain. How can I remove them?
Abdul Aziz Barkat

Alpha
https://stackoverflow.com/a/2087433/5386938 – Jun 05 '21 at 18:19
1 Answer
Pass the string to the `unquote` function:
from urllib.parse import unquote
s = unquote(s)

Mohamed ElKalioby
This function decodes URL request escapes (the ones you find in URLs, of the form `%xx` - like "%20" for a space character), and not HTML escapes of the form `&...`. See [documentation](https://docs.python.org/3/library/urllib.parse.html#urllib.parse.unquote) – Anis R. Jun 05 '21 at 18:28
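To illustrate the distinction made in the comment above, here is a minimal sketch that assumes Python 3 and uses the standard library's `html.unescape`, which decodes HTML character references. The sample string `s` is hypothetical and stands in for text that has already had its tags stripped.

```python
import html
from urllib.parse import unquote

# Hypothetical sample: text left over after the tags have been stripped.
s = "&ldquo;Hello&nbsp;world&rdquo; &amp; more"

# unquote only decodes %xx URL escapes, so HTML entities pass through unchanged.
url_decoded = unquote(s)

# html.unescape converts named and numeric character references to Unicode.
decoded = html.unescape(s)

# &nbsp; becomes a non-breaking space (U+00A0); swap it for a plain space if desired.
cleaned = decoded.replace("\xa0", " ")

print(url_decoded)
print(cleaned)
```

Whether to normalize the non-breaking space afterwards depends on how the text will be used; the `replace` call here is just one option.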