0

I run a very simple html site; the charset is UTF-8 but I don't necessary know what that means but it seems relevant as per other related questions. I looked up another similar question but it dealt in php and also dealt with character substitution whereas mine is more random addition albeit always in the same places. I wrote the text for the page in Word and this causes problems in HTML as I have found out before but I've converted it to plain text and the character reappears in the same places. I copied the HTML text, deleted the character in a new plain text copy, and the character still reappears in those same places when I paste it and re-upload the page. What could be the problem?

Raj
  • 3
  • 2
  • Maybe related: http://stackoverflow.com/questions/5634823/character-encoding-utf8-issue – Jared Farrish Dec 28 '11 at 01:19
  • 3
    Perhaps you should go learn what UTF-8 is? – bmargulies Dec 28 '11 at 01:30
  • 4
    [What Every Programmer Absolutely, Positively Needs To Know About Encodings And Character Sets To Work With Text](http://kunststube.net/encoding/) – deceze Dec 28 '11 at 01:39
  • Perhaps, but I've learned what I need to in order to accomplish what I need; the Internet is here to help me fill in the gaps in my knowledge. I don't think learning exactly what UTF-8 is will be anything more than 'interesting.' – Raj Dec 28 '11 at 02:32
  • 1
    You don't need to study UTF-8 in detail, but you should have some knowledge of what exactly **encodings** are. Then your question would pretty much answer itself. – deceze Dec 28 '11 at 04:01

1 Answers1

1

Some quick advises:

  1. Do not use Word for html creating/editing;
  2. UTF-8 Wikipedia is encoding for the characters that are not present in ASCII. It is very important on sites which use a language different from English;
  3. The problem could be caused by BOM Wikipedia. Use Notepad++, you can set the encoding to UTF-8 without BOM.
Bakudan
  • 19,134
  • 9
  • 53
  • 73