The Urdu tag is for issues relating to displaying user interfaces in the Urdu language.
Questions tagged [urdu]
95 questions
12
votes
1 answer
How to join Urdu alphabets while converting text into image in PHP
I use TIC to convert text into images.
I have searched a lot on this but it seems like Unicode problem (unicodes of initial medial and final letters) or may be content type as image is in PNG.
If I echo without image conversion with content type…

Zehra Noor
- 121
- 3
9
votes
2 answers
how to properly concatenate bidi strings in r?
I want to add markup to (Urdu language) text that is written right to left. I am trying to use gsub for the purpose but everything I have tried so far does not produce the desired output
text <- "یہ جملہ ایک مثال کے لیے استعمال کیا جا رہا…

Imran Ali
- 2,223
- 2
- 28
- 41
6
votes
2 answers
How to convert variable (text/string) to utf8mb4 in php
Hi I'm looking for a encode function for utf8mb4,
$var = = "نور";
echo utf8mb4_encode($string);
output = نور // its $var output in UTFMB4
The output should be "نور" this, its a conversion of $var in utfmb4

Mukhyyar
- 115
- 1
- 1
- 11
6
votes
2 answers
Making a website in urdu
I want to make a website in urdu, I have googled it and found some unicode method, Now what my question is that how these unicodes are working or how can I extract these codes from the urdu sentences.
Following is the code snippet which is used to…

Ali Sajid
- 3,964
- 5
- 18
- 33
5
votes
3 answers
How to know text is Arabic or in Urdu
I want to know is text contain any letter in Urdu or Arabic..using this condition which produce false results when special characters comes.what is right way to do it .any library or what is right regex for this ?
if (cap.replaceAll("\\s+",…

Usman Saeed
- 843
- 3
- 9
- 21
5
votes
1 answer
Split Urdu words based on nonexistent space
I have a Urdu word "لاعلم" and more similar words. How can I split the word that I get "لا" and "علم" separately in an array? I have tried converting the words to unicode characters, but I can,t detect the break between "لا" and "علم".
English words…

user3699181
- 61
- 6
4
votes
1 answer
Urdu in WPF Application
I am developing a WPF desktop application in urdu language.
I have mutiple ext blocks with urdu text in them. I require using some custom urdu fonts in my application. One way to do so is to intall the required font on the desktop machine but I want…

RatedBinary
- 43
- 3
4
votes
0 answers
Tesseract Custom trained Model for Urdu language left to right output and no space recognized
I have trained my own model for Urdu language using jtessboxeditor to create tiff/box file and then used Serak tesseract trainer for creating trainedata file, Model is recognizing urdu language but there are 2 issues mainly other than…

Muhammad Moinuddin
- 135
- 1
- 11
4
votes
1 answer
Urdu file reading in java
I am trying to read a file which have urdu data. When I view the file in Notepad++ it has data in urdu. But when I view it in eclipse then it shows some type of encoding (may be it has get some default).
Original Urdu Data (Notepad++):
"10","کراچی…

Hammad Hassan
- 1,192
- 17
- 29
4
votes
1 answer
How to join Urdu alphabets in c#
i am developing an urdu based application using c#. i have done with segmentation now problem is after segmentation i got letters of Urdu language any one give me idea how to join them to make words from letters like
ب ڑ ی = بڑی
ب ا ت = بات

mashal
- 37
- 12
3
votes
1 answer
I want to display urdu text on image using python
from PIL import Image, ImageFont, ImageDraw
from matplotlib import pyplot as plt
import numpy as np
import cv2
text_string = u'تصوير'
img = Image.new('RGB', (200, 150))
draw = ImageDraw.Draw(img)
font = ImageFont.truetype('C:\\Windows\\Nafees…

fatima
- 31
- 2
3
votes
0 answers
Config for pytesseract (Urdu language)
I am having some problems with pytesseract. With this line of code pytesseract works poorly with Urdu language:
text = pytesseract.image_to_string(img, lang="urd")
What configuration should I use to improve the accuracy for Urdu language? And what…

Samee Arif
- 61
- 3
2
votes
0 answers
Saving Urdu (RTL) in a text file from API to preserve the format
I am transcribing a few chunks of Urdu audio using an API - when I open in notepad - the order of text is messed up. Specifically the sentence that comes AFTER a period/punctuation is looped around the line.
For e.g if the text is "ABCDE.FGH" it…

Moughees Ahmed
- 41
- 3
2
votes
0 answers
Missing several characters for Urdu and Sindhi languages
I'm trying to generate pdf with wicked-pdf gem. Pdf is generated through Qt WebKit rendering engine, so I had to install fonts on the server-side for non-utf languages like Myanmar, Urdu, Pashto etc. It works in most cases, but there is a problem…

irondnb
- 55
- 8
2
votes
0 answers
Pytesseract return nothing in Urdu and Arabic text
Converting Id Card Image to text by using Pytesseract. Till yet I've break the image in section for name address Id card number and parse it using
import pytesseract as tess
from PIL import Image
im = Image.open("Image.jpg")
crop_rectangle = (20,…

ProgSMI
- 79
- 6