I am processing text corpus. It contains several characters belonging to different languages, symbols, numbers, etc.
-> All I need to do is to skip the symbols like arrow mark, heart symbol, etc.
-> I should not be spoiling any characters of different languages.
Any leads?
----UPDATE----
Character.isLetter('\unicode') is working for most of them, if not some. I have checked my regional languages, it seems it's working for some but not each and every.
Thanks.