0

I have this &#2361 &#2379 and also this \u0936\u093e\u0902\u0924\u093f But I dont know to what encoding they belong to.

The hindi font gets stored as the 1st encoding in the database.

So please tell me what type encoding it is? And also how to get my hindi font characters in the 2nd encoding type(\u0924\u093f).

Santino 'Sonny' Corleone
  • 1,735
  • 5
  • 25
  • 52
  • What do you mean by "what encoding they belong to"? These Unicode characters (\u0936\u093e\u0902\u0924\u093f, that is शांति) are not present in ISO-8859-1 if that's what you mean. – Joni Jan 08 '14 at 10:51
  • [What is the difference between UTF-8 and ISO-8859-1?](http://stackoverflow.com/q/7048745/1031945) – Aniket Kulkarni Jan 08 '14 at 10:54

1 Answers1

0

Both ह and \u0939 encode the Unicode Character Devanagari letter HA :

ह is the HTML entity that represents the unicode character 2361 in decimal, which is equivalent to hexadecimal 0939

\u0939 is javascript escape sequence that represents the unicode character 0939 in hexadecimal, which is the equivalent to decimal 2361.

ISO 8859 do not include those characters.

PA.
  • 28,486
  • 9
  • 71
  • 95
  • One more thing is when i end this with semicolon ह ो on stackoverflow I get the hindi charaters.But when I retrieve from database to pdf(xmlworker) I get the the same thing. – Santino 'Sonny' Corleone Jan 08 '14 at 10:53
  • Ok thnx for the explaination..But stil I dont understand,when I create the pdf,\u0939 converts to hindi character but ् gives the same thing back i.e ्(not a hindi character) – Santino 'Sonny' Corleone Jan 09 '14 at 04:22
  • `्` is an HTML entity. It requires a browser or some other HTML interpreter to be rendered properly into the U+0939 unicode character. Probably the program you use to "create the pdf" does not process HTML entities. – PA. Jan 09 '14 at 08:34