Questions tagged [indic]

Indic scripts are the writing systems that are used by languages spoken by about two billion people in India and its neighboring countries, as well is Southeast Asian countries such as Thailand, Burma, Laos, Cambodia and Indonesia. These scripts developed from the ancient Brahmic script and share similar features, which make them challenging for use in computers: many ligatures, complex letter ordering rules and partial support in common operating systems.

66 questions
29
votes
6 answers

Combining Devanagari characters

I have something like a = "बिक्रम मेरो नाम हो" I want to achieve something like a[0] = बि a[1] = क्र a[3] = म but as म takes 4 bytes while बि takes 8 bytes I am not able to get to that straight. So what could be done to achieve that? In Python.
meadhikari
  • 971
  • 3
  • 13
  • 27
10
votes
4 answers

is there is any stemmer available for indian language

is there is any implementation of stemmers for indian languages like(hindi,telugu) are available ....
rajesh
  • 1,773
  • 3
  • 12
  • 6
8
votes
2 answers

OCR for Devanagari (Hindi / Marathi / Sanskrit)

Does anybody have any idea about any recent work being done on optical character recognition for Indian scripts using modern Machine Learning techniques ? I know of some research being done at ISI, calcutta, but nothing new has come up in the last…
Egon
  • 3,718
  • 3
  • 34
  • 48
7
votes
1 answer

How to convert array of tamil unicode values into tamil string in python with whitespaces?

Here is the list of Tamil unicode codepoints [u'\u0b9a', u'\u0b9f', u'\u0bcd', u'\u0b9f', u'\u0b9a', u'\u0baa', u'\u0bc8', u'\u0baf', u'\u0bbf', u'\u0bb2', u'\u0bcd', u'\u0ba8', u'\u0bc7', u'\u0bb1', u'\u0bcd', u'\u0bb1', u'\u0bc1] How can I convert…
7
votes
2 answers

Display Kannada Text in Android application

I have a requirement to display kannada text in GCM alerts sent to an android app. I followed these tutorials Install Read / Write Indian Language Fonts on Android and How to Write & Read in Indian Regional Language Fonts on Android. Just to brief,…
Ayaz Pasha
  • 1,045
  • 6
  • 13
  • 28
7
votes
4 answers

Regex to get list of all words with specific letters (unicode graphemes)

I'm writing a Python script for a FOSS language learning initiative. Let's say I have an XML file (or to keep it simple, a Python list) with a list of words in a particular language (in my case, the words are in Tamil, which uses a Brahmi-based…
Ashwin Balamohan
  • 3,303
  • 2
  • 25
  • 47
7
votes
4 answers

devanagari i18n in java

I am trying to use i18n in java for devanagari/hindi using sample ttf files from internet. I am able to load resource bundle entries and also load the ttf and set font but it will not render jlabel as desired. It shows blocks in place of…
Miten
  • 356
  • 7
  • 23
6
votes
1 answer

Flutter/Dart: Changing colors of specific unicode characters in a string

I'm new to Flutter, trying a code to change the color of specific Unicode characters in a string. Color coding \u0951, \u0952, and \u1cda to say blue, red and green. The output string is not matching the expected format. I'm seeing some characters…
user2901219
  • 371
  • 1
  • 3
  • 15
5
votes
1 answer

render bengali(and possibly other indic) font in vim

I am trying to write in bengali(http://en.wikipedia.org/wiki/Bengali_alphabet) using gvim. The font rendering is not correct. I have tried gedit, and rendering is correct there. To compare, see this image for gedit and this image for gvim (all the…
BaRud
  • 3,055
  • 7
  • 41
  • 89
4
votes
0 answers

Count Number of Character in Indic Language (Hindi,Tamil support all indian language)

Is there any optimal way to implement character count for indic language like Hindi Tamil For example, if we take the word "Mother" in English, it is a 6 letter word. But if you type the same word(माता) in Hindi, it is a two letter word(मा + ता) but…
4
votes
1 answer

Are there technologies-S/w framework/tools/sites that do Translation from Indic(Indian) languages to English?

Are there any tools/sites which will allow me to enter some text in one of the many Indian Languages(Kannada,Marathi,Tamil,Hindi,..) and get translated it to English. Google translate currently only supports Hindi among the Indian languages.
goldenmean
  • 18,376
  • 54
  • 154
  • 211
4
votes
0 answers

Language detection for pinyin, translit etc?

Real-world user-generated text in non-Latin alphabet languages is often not in canonical form but in translit, shlyokavitsa, arabizi, pinyin and so on. Language detection software is starting to handle it smartly, but usually it doesn't work, even…
Adam Bittlingmayer
  • 1,169
  • 9
  • 22
4
votes
1 answer

indic-text-renderer using harfbuzz and freetype

I am developing an Android application which should support Devanagari Fonts. So I downloaded indic-text-renderer and tried to run it as per the instruction stated here and managed to successfully compile the NDK part. But when I try to run the…
dd619
  • 5,910
  • 8
  • 35
  • 60
4
votes
0 answers

Transliteration API alternative to google API for Android

What are the other alternatives for doing transliteration ? since google transliteration api was deprecated previously. do suggest any services / api to perform translation. Any indic language to English is highly appreciated.
user894795
  • 145
  • 1
  • 2
  • 11
4
votes
1 answer

Drawing complex text in android ics in native c

NB: My whole senario is for only android version ICS. My Goal is to render text having complex script/indic script. In ICS, this feature has been added in WebView (and so Browser). If any indic text is rendered in Browser or WebView it renders…
Sarim
  • 3,124
  • 2
  • 20
  • 20
1
2 3 4 5