I'm developing a web scraper for a soccer site. I run into a problem when pulling in names from various countries, since they contain a wide range of non-ASCII characters. I've worked out a method that corrects some of them, but it doesn't catch Turkish characters or anything else outside the few replacements I listed. Here is what I have so far:
private String formatMe(String sF)
{
String myString = Encoding.UTF8.GetString(Encoding.UTF8.GetBytes(sF))
.Replace("é", "é")
.Replace("á", "á")
.Replace("Ã", "í")
.Replace("ó", "ó");
return myString;
}//END FORMAT
Here's an example of a site I would pull from.
Is there any way I can just fix the encoding from the site in one fell swoop?
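
For reference, the garbled pairs above (such as "Ã©" for "é") look like UTF-8 bytes that were decoded with a single-byte Western encoding. If that is what is happening, a minimal sketch of a one-step fix could look like the following. The class and method names are hypothetical, and the assumption that the page is served as UTF-8 and was mis-decoded as ISO-8859-1 is mine, not something confirmed by the site.

```csharp
using System;
using System.Text;

// Hypothetical helper class, for illustration only.
public static class EncodingSketch
{
    // Preferred route (assumes the site really serves UTF-8):
    // decode the raw response bytes as UTF-8 instead of the platform default.
    public static string DecodePage(byte[] rawBytes)
    {
        return Encoding.UTF8.GetString(rawBytes);
    }

    // Fallback route (assumes the text was already mis-decoded as ISO-8859-1):
    // map each character back to its original byte, then decode as UTF-8.
    public static string RepairMisdecoded(string garbled)
    {
        Encoding latin1 = Encoding.GetEncoding("ISO-8859-1");
        byte[] rawBytes = latin1.GetBytes(garbled);
        return Encoding.UTF8.GetString(rawBytes);
    }
}
```

If the download goes through `WebClient`, setting `client.Encoding = Encoding.UTF8` before calling `DownloadString`, or calling `DownloadData` and passing the bytes to `DecodePage`, should avoid the problem at the source. If the text was mis-decoded as Windows-1252 rather than ISO-8859-1, `RepairMisdecoded` would need code page 1252 instead, which on .NET Core may require registering `CodePagesEncodingProvider` first.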