8

Does .NET Transliteration library exists ? Note that this is not translation, something like this Perl lib :

http://www.lingua-systems.com/transliteration/Lingua-Translit-Perl-module/

I just find :

http://transliterator.codeplex.com/

Antonio Bakula
  • 20,445
  • 6
  • 75
  • 102

1 Answers1

13

Check my UnidecodeSharpFork.

It's based on great Python Unidecode transliteration tables, support many languages.

Example usage:

Assert.AreEqual("CZSczs", "ČŽŠčžš".Unidecode());
Assert.AreEqual("Hello, World!", "Hello, World!".Unidecode());
Assert.AreEqual("Rabota s kirillitsey", "Работа с кириллицей".Unidecode());

Simple, fast and powerful. And it's easy to extend/modify transliteration table if you want to.

Siewers
  • 22,626
  • 3
  • 20
  • 31
Dima Stefantsov
  • 943
  • 1
  • 14
  • 20
  • Ive been checking Georgian transliteration nad ქ will translate to k`, other sources will translate it to k (http://ge.translit.cc/, http://www.translitteration.com/transliteration/en/georgian/national/) – watbywbarif Feb 18 '15 at 14:11
  • 1
    I am very satisfied with this library. Tested it with a number of languages, and the result is quite appealing. The only problem I encountered was with the transliteration of Japanese names: UnidecodeSharpFork read them as Chinese names, instead of Japanese: for "橋本" you get "Qiao Ben" instead of the expected "Hashimoto" (at least according to https://surnames.behindthename.com/names/usage/japanese, I myself do not know either language). Hiragana was processed correctly. – Alex Konnen Feb 11 '19 at 18:15
  • 1
    Unfortunately, it does not deliver any satisfactory results with Hebrew and, I suppose, Arabic. The problem is certainly, that of the vowels. In many cases, a transliterated Hebrew word contains gibberish like the '@' character; the Hebrew Vav is always transliterated as 'v' which is nonsense. I am still looking for a C# tool to satisfactorily transliterate Hebrew and Arabic into Latin script. – Alex Konnen Feb 15 '19 at 09:47