For example:
x = u'\U0001f622'
[c for c in x]
returns [u'\ud83d', u'\ude22']
instead of the expected [u'\U0001f622'].
u'\U0001f622' is an emoji. I have a dict of emojis that I'm using to detect emojis in text, so I need the loop to yield u'\U0001f622' as a single element and not the pair u'\ud83d', u'\ude22'.
I'm using Python 2.7. According to this, u'\ud83d', u'\ude22' is the C/C++/Java source code form of that emoji, i.e. its UTF-16 surrogate pair.
How do I iterate over the string so that each emoji comes out as one element? Thanks.
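To make the goal concrete, here is a minimal sketch of the behaviour I'm after. It assumes the split happens because this is a narrow Python 2.7 build (sys.maxunicode == 0xFFFF), where characters above U+FFFF are stored as two UTF-16 surrogates. The name iter_code_points is hypothetical, just for illustration, and this may not be the best way to do it.

```python
def iter_code_points(text):
    """Yield the characters of `text`, keeping each UTF-16 surrogate pair
    together as one element (hypothetical helper, not a library function)."""
    i = 0
    n = len(text)
    while i < n:
        ch = text[i]
        # A high surrogate followed by a low surrogate forms one character.
        if (u'\ud800' <= ch <= u'\udbff' and i + 1 < n
                and u'\udc00' <= text[i + 1] <= u'\udfff'):
            yield text[i:i + 2]
            i += 2
        else:
            yield ch
            i += 1


x = u'\U0001f622'
chars = list(iter_code_points(x))  # one element: the whole emoji
```

With something like this, each element of chars could be looked up in the emoji dict directly. On builds where the emoji is already a single code point, the loop just yields it unchanged.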