Emoji to unicode

Question

I was looking at https://r12a.github.io/app-conversion/ and I see that they have a "JS/Java/C" section. I was wondering if anyone had the code for that in python. I can't seem to find it. Thanks!

Edit: code

b = ''
txt = b.encode('utf-8')

Check the C-style checkbox in the JS/Java/C section and then print this code. For example you might use something like this: print(u'\U0001f600'). — George, Jul 26 '21 at 14:36
Hi, thanks for your reply but I'm trying to get Javascript Escape — oofer, Jul 26 '21 at 14:37
Did you try using Python's decode function? E.g. if you have a variable x = '' then you can do something like this x.decode('utf-8'). — George, Jul 26 '21 at 14:42
Yes, but the result is b'\xf0\x9f\x98\x80' and I'm looking for \uD83D\uDE00 — oofer, Jul 26 '21 at 14:52
@George AttributeError: 'str' object has no attribute 'decode' — oofer, Jul 26 '21 at 15:19
@oofer you need to `.encode()` (to bytes) before you `.decode(_)` (to str) — ti7, Jul 26 '21 at 15:20
`\uD83D\uDE00` is UTF-16, not UTF-8, and representing surrogate pairs in UTF-8 like that is invalid — phuclv, Jul 26 '21 at 15:54
What if you try something like this? `x = ''` and then `unicode(x, 'utf-8')`. — George, Jul 26 '21 at 18:30

ti7 · Answer 1 · 2021-07-26T15:32:32.263

From How to work with surrogate pairs in Python? (linked from duplicate Escaped Unicode to Emoji in Python )

If you see '\ud83d\ude4f' Python string (2 characters) then there is a bug upstream. Normally, you shouldn't get such string. If you get one and you can't fix upstream that generates it; you could fix it using surrogatepass error handler:
>>> "\uD83D\uDE00".encode('utf-16', 'surrogatepass').decode('utf-16')
''

Original Answer

Perhaps you're looking for ord()?

Given a string representing one Unicode character, return an integer representing the Unicode code point of that character. For example, ord('a') returns the integer 97 and ord('€') (Euro sign) returns 8364. This is the inverse of chr().

>>> hex(ord(""))
'0x1f600'

Emoji to unicode

1 Answers1