Is there a way to search for non-english letters in file names?

Question

I am trying to write a script that will replace all non-english alphabet letters in file names with their English counterparts, is this possible?

score 0 · Answer 1 · answered Aug 09 '20 at 19:54

If you mean to "deburr" strings, there's a nice, simple-ish recipe for it (for many accented characters anyway) that uses the Unicode NFKD normalization form, then strips everything non-ascii out of it:

>>> import unicodedata
>>> unicodedata.normalize("NFKD", "törkylempijävongahdus").encode("ascii", "ignore").decode()
'torkylempijavongahdus'

For more complex use cases, maybe https://pypi.org/project/transliterate/ is your thing.

Is there a way to search for non-english letters in file names?

1 Answers1