I have a .json
file with over 70,000 tweets, with each tweet containing emojis. However, I am unsure how to convert the Unicode into the actual emojis, so that it can be used for sentiment analysis.
This is a sample of 5 tweets in my .json
file:
{"text":"The morning is going so fast Part 2 of #DiscoveryDay is in full swing \ud83d\ude01\n\nGreat Atmosphere in the room \n\n#BIGSocial\u2026 https:\/\/t.co\/P08qBoH6tv"}
{"text":"Double kill! #XiuKai lives! I died. \ud83d\ude0c https:\/\/t.co\/QCyk3r2JCb"}
{"text":"ALLTY \ud83d\udc94"}
{"text":"Shouldn\u2019t be normal for a 24 year old to be this tiered \ud83d\udca4"}
{"text":"@TheNames_BrieX Trust me! \ud83d\udcaf"}
Now, how would I convert the unicode for all the tweets into the actual emoji? For instance, how would \ud83d\ude0c be converted into the actual emoji?
What methods can be used to convert the unicode into the actual emojis?