I have a list of ids which are like:
A5ukur+de2.008x١٥١١٠٦١١٥٠٥٢٤٦٢
and I have written the following commands that is using the group name,
>>> RE_SID = re.compile(ur'(?P<sid>(?<=sid:)([A-Za-z0-9+.\u0627-\u064a]+))',re.UNICODE)
>>> x = RE_SID.search('sid:A5ukur+de2.008x١٥١١٠٦١١٥٠٥٢٤٦٢">>')
>>> x.group('sid')
'A5ukur+de2.008x'
However this is not working when Persian/Arabic alphabet is combining with the latin characters and this is returning me only A5ukur+de2.008x
.
I will be appreciated for any help.