I wrote a script that extracts all emojis from a given dataset:
import emoji

for message in df['Message']:
    for char in message:
        if char in emoji.UNICODE_EMOJI:
            print(char)
It kinda works and correctly identifies which characters are emojis. However, some of the emojis are not printed correctly and simply show up as a brown square:
Why is this happening? Is there any way of solving this? Most emojis show up just fine but there are a few that just won't.
Edit: After looking into it again, it seems like the brown squares accompany certain emojis to indicate the skin tone used.
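To illustrate the skin tone observation, here is a minimal sketch that uses only the standard library. The sample string and the `describe` helper are made up for illustration and are not part of the script above. It shows that a skin-toned emoji is stored as two code points, so a per-character loop prints the tone modifier on its own:

```python
import unicodedata

# Illustrative sample: thumbs-up followed by a medium skin tone modifier.
thumbs_up = "\U0001F44D\U0001F3FD"

def describe(text):
    """Return (hex code point, Unicode name) for each code point in text."""
    return [(f"U+{ord(ch):04X}", unicodedata.name(ch, "<unnamed>")) for ch in text]

for code_point, name in describe(thumbs_up):
    print(code_point, name)
# U+1F44D THUMBS UP SIGN
# U+1F3FD EMOJI MODIFIER FITZPATRICK TYPE-4
```

The second code point is most likely what renders as the coloured square when it is printed by itself.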
However, there are still some issues with certain emojis. The usual heart emoji, for example, does show up as a heart character but not in the emoji style. Screenshot because pasting it here ends up displaying it correctly:
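A possible explanation for the heart, sketched under the assumption that the messages store it as U+2764 followed by U+FE0F (VARIATION SELECTOR-16, which requests emoji-style rendering): printing character by character drops the selector, so the bare U+2764 falls back to text style. The `attach_modifiers` helper below is hypothetical. It only re-attaches variation selectors and skin tone modifiers to the preceding character and does not handle zero-width-joiner sequences or flags:

```python
VS16 = "\uFE0F"  # VARIATION SELECTOR-16
SKIN_TONES = {chr(cp) for cp in range(0x1F3FB, 0x1F400)}  # U+1F3FB..U+1F3FF

def attach_modifiers(chars):
    """Group each VS16 or skin tone modifier with the character before it."""
    groups = []
    for ch in chars:
        if groups and (ch == VS16 or ch in SKIN_TONES):
            groups[-1] += ch  # keep the modifier attached to its base character
        else:
            groups.append(ch)
    return groups

# Assumed sample: red heart with VS16, then thumbs-up with a skin tone.
sample = "\u2764\uFE0F" + "\U0001F44D\U0001F3FD"
groups = attach_modifiers(sample)
print([[f"U+{ord(c):04X}" for c in g] for g in groups])
# [['U+2764', 'U+FE0F'], ['U+1F44D', 'U+1F3FD']]
```

Whether the grouped sequence then renders in emoji style still depends on the terminal or notebook font.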