Why is ñ changing to Ã±?

Question

I don't understand why, whenever I save any string that contains ñ, it changes to Ã±. Even in the database, the ñ is stored as Ã±.

Examples:

ñ becomes Ã±.
Niño becomes NiÃ±o.

I don't have any clue what causes this problem or where the problem is coming from. Please help. Thanks in advance.

I suspect it's a database issue, due to "even in the database..."; inspecting the *actual value* inserted would likely confirm/disprove this. — , May 29 '12 at 00:52
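One way to inspect the actual value, as suggested above, is to print the code points of the string just before it is saved and again after it is read back. This is only a sketch: the class and method names are made up for illustration, and it assumes the value is available as a Java `String`.

```java
import java.util.stream.Collectors;

public class InspectChars {
    // Render every code point of the string as U+XXXX, separated by spaces.
    static String codePoints(String s) {
        return s.codePoints()
                .mapToObj(cp -> String.format("U+%04X", cp))
                .collect(Collectors.joining(" "));
    }

    public static void main(String[] args) {
        // Unicode escapes keep this source file independent of its own encoding.
        System.out.println(codePoints("Ni\u00F1o"));        // intact value
        System.out.println(codePoints("Ni\u00C3\u00B1o"));  // already garbled value
    }
}
```

If U+00C3 U+00B1 already appears before the insert, the damage happened upstream of the database; if it only appears after reading back, the database or its connection is the more likely culprit.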

score 12 · Answer 1 · answered May 29 '12 at 12:12

Character ñ (U+00F1) is encoded using UTF-8 as the two bytes 11000011 10110001 (0xC3 0xB1).

These two bytes are decoded using ISO 8859-1 as the two characters Ã±.

So, you are most likely using UTF-8 to encode the character as bytes, and ISO 8859-1 (Latin-1, as guessed by Sajmon) to decode the bytes as characters.
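The mismatch described here can be reproduced in a few lines. The following is a minimal sketch, not the asker's actual code; the class and method names are hypothetical.

```java
import java.nio.charset.StandardCharsets;

public class MojibakeDemo {
    // Encode the text as UTF-8, then (wrongly) decode those bytes as ISO 8859-1.
    static String garble(String text) {
        byte[] utf8 = text.getBytes(StandardCharsets.UTF_8);
        return new String(utf8, StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        // Unicode escapes avoid depending on the encoding of this source file.
        String garbled = garble("Ni\u00F1o");
        // The single character U+00F1 has become the two characters U+00C3 U+00B1.
        System.out.println(garbled.length());
        System.out.println(garbled.equals("Ni\u00C3\u00B1o"));
    }
}
```

The garbled string is one character longer than the original because each of the two UTF-8 bytes is read as a separate Latin-1 character.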

answered May 29 '12 at 12:12

Nathan Ryan

Out of curiosity, how did you work out the binary encoding, and which source did you use to get it? – Philippe May 30 '12 at 01:14
@Philippe: I used the standard definition of UTF-8. Wikipedia has a nice page http://en.wikipedia.org/wiki/UTF-8#Description – Nathan Ryan May 30 '12 at 07:06
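For code points from U+0080 to U+07FF, the UTF-8 definition uses the two-byte layout 110xxxxx 10xxxxxx, where the x bits are the code point's 11 bits. The sketch below applies that layout by hand to U+00F1; the class and method names are illustrative only, and it deliberately handles just the two-byte case.

```java
public class Utf8TwoByte {
    // Encode a code point in the range U+0080..U+07FF as two UTF-8 bytes.
    static int[] encodeTwoByte(int codePoint) {
        if (codePoint < 0x80 || codePoint > 0x7FF) {
            throw new IllegalArgumentException("not a two-byte code point");
        }
        int lead = 0xC0 | (codePoint >> 6);           // 110xxxxx: upper 5 bits
        int continuation = 0x80 | (codePoint & 0x3F); // 10xxxxxx: lower 6 bits
        return new int[] { lead, continuation };
    }

    public static void main(String[] args) {
        int[] bytes = encodeTwoByte(0xF1);
        System.out.println(Integer.toBinaryString(bytes[0])); // 11000011
        System.out.println(Integer.toBinaryString(bytes[1])); // 10110001
    }
}
```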

score 6 · Answer 2 · answered May 29 '12 at 00:08

Character encoding problems, for sure. Make sure that the database, the web pages, the content charset, the Java source files, string encoding, etc. all use exactly the same encoding, for instance UTF-8.
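On the Java side, the simplest way to keep things consistent is to name the charset explicitly at every point where text becomes bytes or bytes become text, rather than relying on the platform default. A minimal sketch, using a temporary file as a stand-in for whatever storage is actually involved (the class and method names are hypothetical):

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

public class ExplicitCharsetRoundTrip {
    // Write the text as UTF-8 and read it back as UTF-8, never using the default charset.
    static String roundTrip(String text) {
        try {
            Path tmp = Files.createTempFile("charset-demo", ".txt");
            try {
                Files.write(tmp, text.getBytes(StandardCharsets.UTF_8));
                return new String(Files.readAllBytes(tmp), StandardCharsets.UTF_8);
            } finally {
                Files.deleteIfExists(tmp);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Prints true: the text survives because both sides agree on UTF-8.
        System.out.println(roundTrip("Ni\u00F1o").equals("Ni\u00F1o"));
    }
}
```

The same principle applies to the other layers mentioned above (database, connection, page charset), each of which has its own setting that is not shown here.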

answered May 29 '12 at 00:08

Óscar López

score 5 · Accepted Answer · edited May 23 '17 at 12:15

Your string is being handled with the wrong encoding. The bytes are UTF-8, but something is reading them as another charset, probably Latin-1. You need to decode them with the right one.

Check this

Hope it helps.
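If some values have already been stored in garbled form, the decoding step can be sketched as follows. This assumes the damage was exactly one round of "UTF-8 bytes read as ISO 8859-1" and nothing else; the class and method names are hypothetical, and it is a stopgap for existing data rather than a substitute for aligning the encodings.

```java
import java.nio.charset.StandardCharsets;

public class MojibakeRepair {
    // Turn the garbled characters back into the original bytes, then decode as UTF-8.
    static String repair(String garbled) {
        byte[] originalBytes = garbled.getBytes(StandardCharsets.ISO_8859_1);
        return new String(originalBytes, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // U+00C3 U+00B1 is the garbled form of U+00F1.
        String repaired = repair("Ni\u00C3\u00B1o");
        System.out.println(repaired.equals("Ni\u00F1o")); // true
    }
}
```

Applying this to a string that was never garbled will corrupt it, so it should only be run on values known to be affected.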

edited May 23 '17 at 12:15

Community

answered May 29 '12 at 00:07

Simon Dorociak

Yes, it's UTF-8. So you're saying that's where the problem is coming from? – NinjaBoy May 29 '12 at 00:08

score 3 · Answer 4 · edited May 23 '17 at 12:08

It is a character encoding issue; you need to check whether your whole stack, from writer to reader, is set to UTF-8.

Check out this discussion, it might contain some info to help you:

edited May 23 '17 at 12:08

Community

answered May 29 '12 at 00:07

Philippe
