I am processing strings encoded in utf-8, in Perl. One task is that I need a way to know that words starting with a letter with a diacritic, such as "écrit", begin with the same letter as "elephant", and also "England". I need a general solution, since I will be working across several languages. I need to know this because I am creating letter headings, for an index. Each of the words I just mentioned would be stored under "E".
Is there a straightforward way to do this?