I am looking for a way to transliterate Unicode letter characters from any language into accented Latin letters. The intent is to allow foreigners to gain insight into the pronunciation of names and words written in any non-Latin script.
Examples:
Greek:Romanize("Αλφαβητικός")
returns "Alphabētikós"
(or "Alfavi̱tikós")
Japanese:Romanize("しんばし")
returns "shimbashi"
(or "sinbasi")
Russian:Romanize("яйца Фаберже")
returns "yaytsa Faberzhe"
(or "jajca Faberže")
It should ideally support characters in the following scripts: CJK, Indic, Cyrillic, Semitic, and Greek. It should to be data driven and extensible, using data from either the Unicode Consortium, the USA, the EU or the UN. The code should be open source written in .NET or Java.
Does such a library exist?