Text in Thai is written with no spaces between the words. Instead, spaces indicate a break, like a comma or the end of a sentence. For example, the string พูดไปสองไพเบี้ย นิ่งเสียตำลึงทอง
means "Speech is cheap; silence is golden" with the space acting like my semi-colon.
I'm working on an algorithm to detect the word boundaries in Thai text for a Chrome extension. Google Chrome is able to split Thai text along word boundaries at the end of lines. This article indicates that Chrome uses the ICU4C library to achieve this.
Is there a way to access Chrome's Thai word-segmentation feature from JavaScript?