I'm developing a web scraper for a soccer site. Names from various countries contain accented and other non-ASCII characters, and these come through garbled when I pull them in. I've worked out a method that corrects some of them, but it doesn't catch Turkish characters or anything else I haven't listed explicitly. Here is what I have so far:

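private String formatMe(String sF)
{
    // Map each garbled sequence seen so far back to the intended character.
    String myString = Encoding.UTF8.GetString(Encoding.UTF8.GetBytes(sF))
        .Replace("Ã©", "é")
        .Replace("Ã¡", "á")
        .Replace("Ã\u00AD", "í") // second character is a soft hyphen (U+00AD)
        .Replace("Ã³", "ó");

    return myString;
}//END FORMAT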

Here's an example of a site I would pull from.

Is there any way I can just fix the encoding from the site in one fell swoop?
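For reference, here is a rough sketch of what "one fell swoop" could look like, assuming the site actually serves UTF-8 and the garbling comes from the bytes being decoded with a single-byte encoding somewhere along the way. The class and method names (`EncodingFixer`, `FetchAsUtf8Async`, `RepairMojibake`) are made up for illustration and are not from any library. The first method avoids the problem by decoding the raw response bytes as UTF-8 directly; the second reverses the damage on a string that was already mis-decoded, given a guess at which encoding was wrongly applied.

```csharp
using System;
using System.Net.Http;
using System.Text;
using System.Threading.Tasks;

// Hypothetical helper; names are illustrative only.
public static class EncodingFixer
{
    private static readonly HttpClient Client = new HttpClient();

    // Download the raw bytes and decode them as UTF-8 ourselves,
    // instead of trusting whatever charset the response was decoded with.
    // Assumption: the page really is UTF-8.
    public static async Task<string> FetchAsUtf8Async(string url)
    {
        byte[] raw = await Client.GetByteArrayAsync(url);
        return Encoding.UTF8.GetString(raw);
    }

    // Undo a wrong decode: turn the garbled string back into the original
    // bytes using the encoding that was wrongly applied, then decode those
    // bytes as UTF-8. Assumption: wrongEncoding is the one that caused the damage.
    public static string RepairMojibake(string garbled, Encoding wrongEncoding)
    {
        byte[] raw = wrongEncoding.GetBytes(garbled);
        return Encoding.UTF8.GetString(raw);
    }
}

public static class Demo
{
    public static void Main()
    {
        // ISO-8859-1 (code page 28591) is used here as the guessed wrong encoding.
        Encoding latin1 = Encoding.GetEncoding(28591);

        // "Ã©" is what "é" looks like after a UTF-8 -> Latin-1 mix-up.
        Console.WriteLine(EncodingFixer.RepairMojibake("Jos\u00C3\u00A9", latin1)); // José
    }
}
```

If the wrong decode was Windows-1252 rather than ISO-8859-1, `Encoding.GetEncoding(1252)` would be the one to pass in; on .NET Core / .NET 5+ that likely needs `Encoding.RegisterProvider(CodePagesEncodingProvider.Instance)` to be called first. Either way this handles every character at once, including Turkish ones, rather than one `Replace` per letter.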

stuartd
jDave1984
