HTML Coding - Removing Â character

Question

I run a very simple html site; the charset is UTF-8 but I don't necessary know what that means but it seems relevant as per other related questions. I looked up another similar question but it dealt in php and also dealt with character substitution whereas mine is more random addition albeit always in the same places. I wrote the text for the page in Word and this causes problems in HTML as I have found out before but I've converted it to plain text and the character reappears in the same places. I copied the HTML text, deleted the character in a new plain text copy, and the character still reappears in those same places when I paste it and re-upload the page. What could be the problem?

Maybe related: http://stackoverflow.com/questions/5634823/character-encoding-utf8-issue — Jared Farrish, Dec 28 '11 at 01:19
[What Every Programmer Absolutely, Positively Needs To Know About Encodings And Character Sets To Work With Text](http://kunststube.net/encoding/) — deceze, Dec 28 '11 at 01:39
Perhaps, but I've learned what I need to in order to accomplish what I need; the Internet is here to help me fill in the gaps in my knowledge. I don't think learning exactly what UTF-8 is will be anything more than 'interesting.' — Raj, Dec 28 '11 at 02:32
You don't need to study UTF-8 in detail, but you should have some knowledge of what exactly **encodings** are. Then your question would pretty much answer itself. — deceze, Dec 28 '11 at 04:01

score 1 · Accepted Answer · answered Dec 28 '11 at 01:37

1

Some quick advises:

Do not use Word for html creating/editing;
UTF-8 ^Wikipedia is encoding for the characters that are not present in ASCII. It is very important on sites which use a language different from English;
The problem could be caused by BOM ^Wikipedia. Use Notepad++, you can set the encoding to UTF-8 without BOM.

answered Dec 28 '11 at 01:37

Bakudan

19,134
9
53
73

I thought BOM at first, but how is a BOM causing this? – Jared Farrish Dec 28 '11 at 01:47
On php pages with BOM on some browsers I saw a strange way of interpreting html. I think it is good not to have it on a page. – Bakudan Dec 28 '11 at 01:52
I'm on Mac so I tried TextWrangler (apparently best equivalent for NP++); problem solved. Thank you very much! – Raj Dec 28 '11 at 02:23

HTML Coding - Removing Â character

1 Answers1