I am trying to find a quick way to decode html numbers (not just special characters) and convert them to unicode in Java. For example, I would like to be able to convert "В ;н ;у ;т ;р ;е ;н ;н ;и ;й ;" to "Внутренний." I just can't seem to find a way to do it.
Asked
Active
Viewed 60 times
1
-
this is not a duplicate I had already tried that answer and if you do System.out.println(StringEscapeUtils.unescapeHtml("Внутренний")); you get ??????? which is not correct – Megaminx Jul 31 '14 at 19:25
-
??????? is correct, the problem is that you cannot see the characters it represents, because the font does not have glyphs for those characters. (so then the font uses ?'s instead) – user2097804 Jul 31 '14 at 19:51