I have a Japanese content which is being converted to MS help with a certain tool. The problem is that the third party tool isn't using utf-8 encoding and is creating a .xml with garbage characters:
<param name="Name" value="ÉAÉvÉäÉPÅÉVÉáÉìdžÇ'ÇËÇØÅǵÇÃ'ÇÃ']">
<param name="Name" value="Test File">
<param name="Local" value="applications.htm#Xau1044547">
I tried playing around with the encoding and it now produces:
<param name="Name" value="ÉAÉvÉäÉPÅ">
<param name="Name" value="Test">
<param name="Local" value="applications.htm#Xau1044547">
But with utf-8 encoding (another tool) and the correct output should be:
<param name="Name" value="アプリケーション">
<param name="Name" value="Small Business アプリケーションの起動 ">
<param name="Local" value="applications1.html#wp1044548">
Is there any java API I can use to decode and encode the files to have the correct output. I am not sure what the tool is using but I am guessing its "ISO-8859-1".
Thanks.