Assuming that the characters that you have copy-and-pasted into your question are correct ...
The characters in special
are characters in non-Latin alphabets that (typically) look like Latin characters.
- 'Ρ' - is the greek uppercase rho character
- 'E' - is the greek uppercase epsilon character
- 'Ν' - is the greek uppercase nu character
- 'Α' - is the greek uppercase alpha character
There is no correct conversion of these characters to Latin letters. (They are homoglyphs for the Latin letters.)
... how can these Greek capital letters be converted into normal English letters?
They can't. The characters are not equivalent. They mean something different. If you encounter those characters in a string, they should not be converted to Latin characters.
(But if you insist, here is a library that purports to do the job: https://github.com/codebox/homoglyph. Use at your own risk!)
The characters in em
are Unicode full width characters. For example 'E' is U+FF25 which is described in the Unicode code charts as "FULLWIDTH LATIN CAPITAL LETTER E".
You can convert any full width characters in a Java string to regular characters using java.text.Normalizer
using the NFKC
form.