I am developing a small library automation software and I need to determine a word is in English or Turkish. An example scenario is like this:
- User enters a book title.
- Determine it's Turkish or English.
- Set the languge combobox to the respective language to help user fill the form.
A friend of mine suggested me "connect to Google Translate and use it" which seems reasonable but an algorithm without connecting an external service or database will be more appropriate for me. (I also search the Turkish/English specific characters like ç,ş,İ/w,x to decide) Therefore I am searching an algorithm to do this job maybe based on letter frequencies or something like it. Anything available in literature? Thanks, in advance. (I use php, mysql if it's important)