Questions tagged [urdu]

The Urdu tag is for issues relating to displaying user interfaces in the Urdu language.

95 questions
12
votes
1 answer

How to join Urdu alphabets while converting text into image in PHP

I use TIC to convert text into images. I have searched a lot on this but it seems like Unicode problem (unicodes of initial medial and final letters) or may be content type as image is in PNG. If I echo without image conversion with content type…
Zehra Noor
  • 121
  • 3
9
votes
2 answers

how to properly concatenate bidi strings in r?

I want to add markup to (Urdu language) text that is written right to left. I am trying to use gsub for the purpose but everything I have tried so far does not produce the desired output text <- "یہ جملہ ایک مثال کے لیے استعمال کیا جا رہا…
Imran Ali
  • 2,223
  • 2
  • 28
  • 41
6
votes
2 answers

How to convert variable (text/string) to utf8mb4 in php

Hi I'm looking for a encode function for utf8mb4, $var = = "نور"; echo utf8mb4_encode($string); output = نور // its $var output in UTFMB4 The output should be "نور" this, its a conversion of $var in utfmb4
Mukhyyar
  • 115
  • 1
  • 1
  • 11
6
votes
2 answers

Making a website in urdu

I want to make a website in urdu, I have googled it and found some unicode method, Now what my question is that how these unicodes are working or how can I extract these codes from the urdu sentences. Following is the code snippet which is used to…
Ali Sajid
  • 3,964
  • 5
  • 18
  • 33
5
votes
3 answers

How to know text is Arabic or in Urdu

I want to know is text contain any letter in Urdu or Arabic..using this condition which produce false results when special characters comes.what is right way to do it .any library or what is right regex for this ? if (cap.replaceAll("\\s+",…
Usman Saeed
  • 843
  • 3
  • 9
  • 21
5
votes
1 answer

Split Urdu words based on nonexistent space

I have a Urdu word "لاعلم" and more similar words. How can I split the word that I get "لا" and "علم" separately in an array? I have tried converting the words to unicode characters, but I can,t detect the break between "لا" and "علم". English words…
4
votes
1 answer

Urdu in WPF Application

I am developing a WPF desktop application in urdu language. I have mutiple ext blocks with urdu text in them. I require using some custom urdu fonts in my application. One way to do so is to intall the required font on the desktop machine but I want…
4
votes
0 answers

Tesseract Custom trained Model for Urdu language left to right output and no space recognized

I have trained my own model for Urdu language using jtessboxeditor to create tiff/box file and then used Serak tesseract trainer for creating trainedata file, Model is recognizing urdu language but there are 2 issues mainly other than…
4
votes
1 answer

Urdu file reading in java

I am trying to read a file which have urdu data. When I view the file in Notepad++ it has data in urdu. But when I view it in eclipse then it shows some type of encoding (may be it has get some default). Original Urdu Data (Notepad++): "10","کراچی…
Hammad Hassan
  • 1,192
  • 17
  • 29
4
votes
1 answer

How to join Urdu alphabets in c#

i am developing an urdu based application using c#. i have done with segmentation now problem is after segmentation i got letters of Urdu language any one give me idea how to join them to make words from letters like ب ڑ ی = بڑی ب ا ت = بات
mashal
  • 37
  • 12
3
votes
1 answer

I want to display urdu text on image using python

from PIL import Image, ImageFont, ImageDraw from matplotlib import pyplot as plt import numpy as np import cv2 text_string = u'تصوير' img = Image.new('RGB', (200, 150)) draw = ImageDraw.Draw(img) font = ImageFont.truetype('C:\\Windows\\Nafees…
fatima
  • 31
  • 2
3
votes
0 answers

Config for pytesseract (Urdu language)

I am having some problems with pytesseract. With this line of code pytesseract works poorly with Urdu language: text = pytesseract.image_to_string(img, lang="urd") What configuration should I use to improve the accuracy for Urdu language? And what…
Samee Arif
  • 61
  • 3
2
votes
0 answers

Saving Urdu (RTL) in a text file from API to preserve the format

I am transcribing a few chunks of Urdu audio using an API - when I open in notepad - the order of text is messed up. Specifically the sentence that comes AFTER a period/punctuation is looped around the line. For e.g if the text is "ABCDE.FGH" it…
2
votes
0 answers

Missing several characters for Urdu and Sindhi languages

I'm trying to generate pdf with wicked-pdf gem. Pdf is generated through Qt WebKit rendering engine, so I had to install fonts on the server-side for non-utf languages like Myanmar, Urdu, Pashto etc. It works in most cases, but there is a problem…
irondnb
  • 55
  • 8
2
votes
0 answers

Pytesseract return nothing in Urdu and Arabic text

Converting Id Card Image to text by using Pytesseract. Till yet I've break the image in section for name address Id card number and parse it using import pytesseract as tess from PIL import Image im = Image.open("Image.jpg") crop_rectangle = (20,…
ProgSMI
  • 79
  • 6
1
2 3 4 5 6 7