Questions tagged [non-latin]

70 questions
28
votes
10 answers

Are there programming languages that rely on non-latin alphabets?

Every programming language I have ever seen has been based on the Latin alphabet. This is not surprising, considering I live in Canada... But it only really makes sense that there would be programming languages based on other alphabets, or else…
Jaxsun
  • 526
  • 6
  • 11
11
votes
5 answers

Does MySQL full-text search work reasonably with non-Latin languages (Hebrew, Arabic, Japanese...)?

Does MySQL full-text search work reasonably with non-Latin languages (Hebrew, Arabic, Japanese...)? Addition: I did some tests... It has some problems with Hebrew. Example: The name מוסינזון is pronounced the same as מושינזון but searching one…
Itay Moav -Malimovka
  • 52,579
  • 61
  • 190
  • 278
10
votes
2 answers

Output one character per two keys in Android Keyboard

I am designing a custom keyboard for the Amharic language on Android, but the following is applicable to many other non-English languages. A combination of two or more keys translates to one character. So, if the user types 'S', the keyboard will output…
Shahid Thaika
  • 2,133
  • 5
  • 23
  • 59
4
votes
1 answer

Non-Latin characters in URL (Google App Engine)

In my project I have an HTML file with non-Latin characters in its name: Кондиционер.html. When I make a request to www.myDomain.com/Кондиционер.html, the server sends a 404 error: Error: Not Found The requested URL…
Antoxa
  • 43
  • 2
  • 6
3
votes
0 answers

MSysGit console and non-Latin filenames

I have installed msysgit, but when I run the Git Bash console I cannot input non-Latin characters, and non-Latin filenames are displayed as question marks. Is there any way to make the console understand non-Latin characters? Thank you in advance.
Art Spasky
  • 1,635
  • 2
  • 17
  • 30
3
votes
1 answer

Regex to match non-Latin chars with ASCII 0-31 and 128-255

I wanted to match a non-Latin character and tried the following. As I understand it, if (a.matches("[\\x8A-\\xFF]+")) should return true, but it returns false. String a = "Ž"; if (a.matches("[\\x8A-\\xFF]+")) { }
Romi
  • 4,833
  • 28
  • 81
  • 113
3
votes
0 answers

wkhtmltopdf - Chinese characters slanting

I tried converting http://book.kanunu.org/book4/10406/231373.html to PDF. I used the following command: wkhtmltopdf --encoding gbk http://book.kanunu.org/book4/10406/231373.html test.pdf And I got the following…
3
votes
2 answers

How to write web.config for WordPress on IIS with non-Latin URLs

I am using WordPress with the permalink structure (/%year%/%monthnum%/%day%/%postname%/) on an IIS server. The issue is with non-Latin characters, such as Malayalam. I wrote the web.config as follows,
Vidhu
  • 79
  • 1
  • 5
3
votes
2 answers

Keep non-Latin characters when scraping page in python

I have a program that scrapes a page, parses it for any links, then downloads the pages linked to (sounds like a crawler, but it's not) and saves each one in a separate file. The file name used to save is part of the url of the page. So for…
bsg
  • 825
  • 2
  • 14
  • 34
2
votes
1 answer

In Python (or any language) what does an "upper" function do to Hindi, Amharic and other non-Latin character sets?

Subject says it all. I have been looking for an answer, but cannot seem to find it. I am writing a web app that will store data in a database and also have language files translated into a wide variety of character sets. At various moments, the text will…
horace
  • 938
  • 9
  • 20
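
A minimal sketch related to the question above, assuming Python 3 and its built-in `str.upper()`. The sample words and the `upper_report` helper are illustrative choices, not taken from the question. It suggests that scripts with no case distinction (Devanagari, Ethiopic) pass through unchanged, while cased scripts such as Cyrillic and Greek are converted:

```python
# Illustrative sample words (assumed, not from the original question).
samples = {
    "Hindi (Devanagari)": "नमस्ते",
    "Amharic (Ethiopic)": "ሰላም",
    "Russian (Cyrillic)": "привет",
    "Greek": "γεια",
}


def upper_report(texts):
    """Map each label to (uppercased text, whether upper() changed it)."""
    return {
        name: (text.upper(), text.upper() != text)
        for name, text in texts.items()
    }


report = upper_report(samples)
for name, (upper, changed) in report.items():
    print(f"{name}: {upper} (changed: {changed})")
```

Comparing the result of `upper()` with the original string is a simple way to check whether a given piece of text has any cased characters at all.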
2
votes
1 answer

GitHub and non-English Windows?

As you probably know, Git Bash doesn't support non-Latin characters. That results in multiple failures while creating and using SSH keys (since the paths contain Russian characters). Is there a workaround?
Daniel
  • 95
  • 1
  • 4
2
votes
1 answer

Write non-Latin characters to CSV using Java

We have modified the application to support non-Latin characters (Chinese & Thai), but we faced some issues in writing to CSV and resolved them using the approach below. String line = "שלום, hello, привет"; OutputStream os = new…
Vijay
  • 21
  • 1
2
votes
2 answers

Remove all special chars, but not non-Latin characters

I'm using this PHP function for SEO URLs. It works fine with Latin words, but my URLs are in Cyrillic. This regex, /[^a-z0-9_\s-]/, does not work with Cyrillic chars. Please help me make it work with non-Latin chars. function…
Vesselina Taneva
  • 115
  • 2
  • 10
2
votes
1 answer

Greek language support for lunr.js

Registering a new stemmer function in lunr for Greek words doesn't work as expected. Here is my code on CodePen. I am not receiving any errors; the function stemWord() works fine when used separately, but it fails to stem the words in lunr. Below is…
2
votes
3 answers

Regex validation on UTF8 / multi-byte 'language' characters (incl. Chinese etc.) but not special characters such as {/*

Using PHP / MySQL, all encoded as UTF-8, we have recently had to start capturing non-Latin characters, such as Chinese. We have PHP validation that checks the string length and that the string is alphanumeric, such as: if…
megaSteve4
  • 1,760
  • 1
  • 17
  • 24