I have a file in which apostrophes and double quotes are not displayed properly. I tried changing the encoding to UTF-8, but that didn't help. The problem is that the garbling is not consistent throughout, so I cannot simply replace the bad characters with an apostrophe or a double quote. I want to read this text in Java and do some further processing for an NLP application. When I read these files in Java with the encoding explicitly set to UTF-8, I still get junk characters, though different ones from what I see in the file. Any help would be appreciated.
Here are two sample texts:
It<92>s easy enough, however, to define oneself in whatever way one wants especially when no one in the media challenges you on it. The real test of moral courage is how one acts<97>not just talks<97>in real-life situations. And in the one concrete instance when the Illinois senator was called upon to stand up for justice, he was nowhere to be seen.
Another sample text:
I would have researched everything beforehand and known exactly what kind of tests to expect at each appointment and what the normal range is supposed to be for those tests. It?~@~Ys not that I don?~@~Yt worry that something will happen or that one or more of the tests will come back abnormal. I do. I thought that with all these good appointments I have had in the last few months, I would start feeling less fearful of something going wrong. But my fear level stays about the same.
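One possible reading of the two samples: the `<92>` and `<97>` markers look like single bytes 0x92 and 0x97, which in windows-1252 are the right single quotation mark and the em dash, while `?~@~Y` looks like the way some viewers display the three UTF-8 bytes E2 80 99 (the same quotation mark). If that is right, the files are in two different encodings, which would explain why forcing UTF-8 everywhere does not work. The sketch below tries a strict UTF-8 decode first and falls back to windows-1252 when the bytes are not valid UTF-8. It assumes each file uses one encoding throughout and that the fallback really is windows-1252; the class name `EncodingFallback` and its methods are made up for illustration.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

class EncodingFallback {

    // Decode as UTF-8 if the bytes are valid UTF-8; otherwise assume windows-1252.
    static String decode(byte[] raw) {
        try {
            return StandardCharsets.UTF_8.newDecoder()
                    // REPORT makes the decoder throw instead of silently
                    // substituting U+FFFD for invalid byte sequences.
                    .onMalformedInput(CodingErrorAction.REPORT)
                    .onUnmappableCharacter(CodingErrorAction.REPORT)
                    .decode(ByteBuffer.wrap(raw))
                    .toString();
        } catch (CharacterCodingException e) {
            // Assumption: anything that is not valid UTF-8 is windows-1252.
            return new String(raw, Charset.forName("windows-1252"));
        }
    }

    // Optional: map curly quotes to plain ASCII quotes for downstream NLP tools.
    static String normalizeQuotes(String s) {
        return s.replace('\u2018', '\'')
                .replace('\u2019', '\'')
                .replace('\u201C', '"')
                .replace('\u201D', '"');
    }

    public static void main(String[] args) throws IOException {
        if (args.length < 1) {
            System.err.println("usage: java EncodingFallback <file>");
            return;
        }
        // Read raw bytes so that no charset is applied before we decide on one.
        byte[] raw = Files.readAllBytes(Paths.get(args[0]));
        System.out.println(normalizeQuotes(decode(raw)));
    }
}
```

The important design choice is reading the file as bytes and deciding on the charset afterwards. A `Reader` opened with UTF-8 replaces invalid bytes by default, so the original 0x92 is already lost by the time the text reaches the program. This approach would not help with a file that mixes both encodings internally; that case would need a per-line or per-sequence decision.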