I have a spatial database (Esri geodatabase format) containing Arabic characters. The problem is that when I summarize the values, some cells that logically should be identical are treated as different. For example, "اسدی" and "اسدی" should be (and appear to be) identical, but are not.
There is no whitespace at the beginning or end of the names, but when I check the lengths of the strings, some are 4 and some are 6. The ones with length 6 seem to contain 2 hidden characters, but I can't see them or find them.
How can I remove these characters with Python? I tried rstrip() and lstrip(), but the length remains 6.
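For reference, here is a minimal sketch of how such hidden characters could be inspected and filtered. It assumes Python 3 and assumes the extra characters are invisible Unicode format characters (category Cf, for example direction marks); the actual characters in my data are unknown. The sample string and the helper names describe_chars and strip_format_chars are hypothetical, made up for illustration.

```python
import unicodedata

def describe_chars(text):
    """Return (code point, Unicode name) pairs for every character in text."""
    return [("U+%04X" % ord(ch), unicodedata.name(ch, "UNKNOWN")) for ch in text]

def strip_format_chars(text):
    """Drop characters whose Unicode category is Cf (invisible format characters)."""
    return "".join(ch for ch in text if unicodedata.category(ch) != "Cf")

# Hypothetical sample: the 4-letter name wrapped in two invisible direction marks.
# The real hidden characters may be different; inspect them first.
sample = "\u200f\u0627\u0633\u062f\u06cc\u200e"

for code_point, name in describe_chars(sample):
    print(code_point, name)

cleaned = strip_format_chars(sample)
print(len(sample), len(cleaned))
```

Plain strip(), rstrip() and lstrip() only remove whitespace by default, and marks like these are not whitespace, which would explain why the length stays at 6. Note that filtering out every Cf character also removes the zero-width non-joiner (U+200C), which can be meaningful inside Persian words, so it may be safer to remove only the specific code points that the inspection step reveals.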