I'm using NMeCab, a .NET port of MeCab, to try to convert Japanese text (hiragana, katakana, and kanji) to romaji.
Here's my code:
using NMeCab;

MeCabTagger _tagger;

public string Parse(string input)
{
    _tagger = MeCabTagger.Create();
    _tagger.OutPutFormatType = "lattice";
    _tagger.LatticeLevel = MeCabLatticeLevel.Two;
    var output = _tagger.Parse(input);
    return output;
}
When I call Parse(input) with the Japanese text "ども", I get this output:
"ども助詞,接続助詞,,,,,ども,ドモ,ドモ EOS"
I'm looking for the romaji of "ども", which would be "domo".
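As far as I can tell, the closest thing in that output is the katakana reading "ドモ", not romaji. To make that concrete, here is a minimal sketch of how I read the output line. It rests on two assumptions I haven't confirmed: that the surface form and the feature list are separated by a tab (not visible in the output pasted above), and that the features follow the IPADIC layout, where the eighth comma-separated field is the reading. MeCabOutput and ExtractReading are hypothetical names of my own, not part of NMeCab.

```csharp
using System;

public static class MeCabOutput
{
    // Hypothetical helper (not part of NMeCab): returns the reading field
    // from one line of lattice output, or "" if the line has no reading.
    public static string ExtractReading(string line)
    {
        // Assumption: surface form and feature list are separated by a tab.
        int tab = line.IndexOf('\t');
        if (tab < 0)
        {
            return ""; // e.g. the "EOS" line, which carries no features
        }

        string[] features = line.Substring(tab + 1).Split(',');

        // Assumption: IPADIC layout, where index 7 holds the katakana reading.
        return features.Length > 7 ? features[7] : "";
    }
}
```

Calling MeCabOutput.ExtractReading("ども\t助詞,接続助詞,,,,,ども,ドモ,ドモ") gives me "ドモ", so what I'm still missing is the step from that katakana reading to "domo".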
I've also tried using MeCab directly, as discussed in this SO answer, but I get the same output.