I want to do basic Chinese tokenization, simply breaking the string into individual characters. How can I do that in Java?
String str = "这是一个测试";
I want the result to be:
["这", "是", "一", "个", "测", "试"]
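One possible approach, as a minimal sketch: iterate over the string by Unicode code point rather than by `char`, so that rarer characters outside the Basic Multilingual Plane (stored as surrogate pairs in Java) are not split in half. The class name `CharTokenizer` and the method name `tokenize` are made up for illustration, and the sketch assumes Java 8 or later for `codePoints()`.

```java
import java.util.Arrays;

public class CharTokenizer {
    // Hypothetical helper: returns one token per Unicode code point.
    public static String[] tokenize(String text) {
        return text.codePoints()
                   // Convert each code point back into a (possibly 2-char) String.
                   .mapToObj(cp -> new String(Character.toChars(cp)))
                   .toArray(String[]::new);
    }

    public static void main(String[] args) {
        String str = "这是一个测试";
        // prints [这, 是, 一, 个, 测, 试]
        System.out.println(Arrays.toString(tokenize(str)));
    }
}
```

For text that only contains common characters, `str.split("")` would likely give the same result on Java 8+, but it splits on `char` boundaries, so the code point version is the safer default.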