One approach is using the info we know about language ranges. For example, unicode range 30A0–30FF represents Japanese Katakana characters, so if your string consists of characters within that range, you could make an educated guess that it's Japanese and work accordingly.
This requires knowing how to handle unicode language pages in java
Any help please