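I used Python with the third-party regex module:
import regex

# \X matches one extended grapheme cluster
for m in regex.findall(r"\X", 'ल्लील्ली', regex.UNICODE):
    # print each code point in the cluster with its escape sequence
    for i in m:
        print(i, i.encode('unicode-escape'))
    print('--------')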
The results show ल्ली being split into 2 Hindi characters:
ल b'\\u0932'
् b'\\u094d'
--------
ल b'\\u0932'
ी b'\\u0940'
--------
This is wrong: ल्ली is actually one Hindi character. How can I split a string into Hindi characters (such as ल्ली), regardless of how many Unicode code points each one is composed of?
In short, I want to split 'कृपयाल्ली' into 'कृ', 'प', 'या', 'ल्ली'.
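
For illustration, here is a rough sketch of the kind of splitting I have in mind, using only the standard library. It is not a complete Devanagari segmenter: the function name split_aksharas is made up, and it assumes that a new character starts at every letter unless the previous cluster ends in the virama (U+094D), in which case the consonant is joined to form a conjunct. It ignores ZWJ/ZWNJ and other edge cases.

```python
import unicodedata

VIRAMA = "\u094d"  # DEVANAGARI SIGN VIRAMA (halant)

def split_aksharas(text):
    """Split text into clusters, keeping virama-joined conjuncts together.

    Hypothetical helper: a simplification, not a full Unicode algorithm.
    """
    clusters = []
    for ch in text:
        category = unicodedata.category(ch)
        # Combining marks (vowel signs, virama, ...) attach to the previous cluster.
        is_mark = category.startswith("M")
        # A letter right after a virama continues the same conjunct.
        joins_conjunct = (
            bool(clusters)
            and clusters[-1].endswith(VIRAMA)
            and category == "Lo"
        )
        if clusters and (is_mark or joins_conjunct):
            clusters[-1] += ch
        else:
            clusters.append(ch)
    return clusters

print(split_aksharas('कृपयाल्ली'))  # ['कृ', 'प', 'या', 'ल्ली']
```

This gives the split I want for my example, but I do not know whether it is correct in general or whether regex can do this directly.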