0

I use PDFBox library to read pdf files in arabic language but there are some characters come in strange format such this: ۇ ۇ ۆ ۆ ۈ ۈ ۇٴ ۋچ

please i want to know why pdfbox can't read them correctly?

  • What do you mean by "can't read them"? What output are your observing, and what did you expect? Additionally, how are you using PDFBox when you see the unexpected output? – Andrzej Doyle Jul 17 '13 at 11:59
  • sample input in Arabic language : "يُرِيدُ اللهُ بِكًُمُ اليُسرَ ولا يُرِيدُ بِكُمُ العُسرَ" – user2557800 Jul 18 '13 at 07:52
  • the output come in this form: "ۇ ۇ ۆ ۆ ۈ ۈ ۇٴ ۋچ " Expected output as render character by character from left to right: رَ س ع ل ا م ك ب....... – user2557800 Jul 18 '13 at 07:57
  • My code i am using in PDFBox: PDStream contents = page.getContents(); if (contents != null) { printer.processStream(page, page.findResources(), page.getContents().getStream()); } then i use TextPosition to render file character by character wait your reply!! – user2557800 Jul 18 '13 at 07:58
  • Possible duplicate of https://stackoverflow.com/questions/48284888/writing-arabic-with-pdfbox-with-correct-characters-presentation-form-without-bei – Ioannis Brant-Ioannidis May 22 '21 at 19:54

0 Answers0