I have a dataset of tweets where it contains at least one occurrence of emoji. But sometimes there are more. Emojis can be in the middle of the sentence, or it could be at the start or at the end. Hence for each tweet the case is different. I am having difficulties trying to split only the emojis in the sentence. If I loop through each word, the multiple emojis are also considered as one word.
She is too hot for Congress. Vote her out! #sarcasm
Expected output: She is too hot for Congress. Vote her out! #sarcasm
The Struggle is Real #struggle #struggleisreal #struggles #funny #humor #saying #sarcasm #lifestruggles #sarcastic #funnysaying #sayings #thestruggleisreal
Expected output: The Struggle is Real #struggle #struggleisreal #struggles #funny #humor #saying #sarcasm #lifestruggles #sarcastic #funnysaying #sayings #thestruggleisreal
For More Funny Post Follow
Expected output: For More Funny Post Follow
Answer from the above post gives me a list and toknized words for each tweet in the dataset which I don't want, it also does not solve my problem. I do not get space between the emojis.