
I need to read the content of a .pdf file using iTextSharp (.NET), and I have three problems:

1- Arabic words are extracted in reverse order. For example, احمد is extracted as دمحا (the English equivalent would be "Ahmad" extracted as "damhA"). If my file contains both Arabic and English, how can I extract each language in its correct direction?

2- Sometimes the glyphs are not defined as characters, so they appear as symbols. How can I add my own definitions for these glyphs?

3- Can I extract the text together with its formatting, so that I can convert it to HTML and display the file in a web page as is?
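
To make problem 1 concrete, here is a minimal sketch of the kind of post-processing I have in mind: reverse each run of Arabic characters in the extracted string and leave Latin text untouched. It is written in Python rather than C# only to show the idea. `fix_visual_order` is a hypothetical helper, not part of iTextSharp, and the sketch assumes the extractor returns Arabic in visual (left-to-right) order. It does not reorder the runs relative to each other, and it would wrongly reverse Arabic-Indic digits, so it is not a full implementation of the Unicode bidirectional algorithm.

```python
import re

# Arabic block (U+0600-U+06FF) plus the Arabic presentation forms
# (U+FB50-U+FDFF, U+FE70-U+FEFF) that extracted PDF text may contain.
_ARABIC = r"\u0600-\u06FF\uFB50-\uFDFF\uFE70-\uFEFF"

# A run is one or more Arabic words separated by single spaces.
_ARABIC_RUN = re.compile(rf"[{_ARABIC}]+(?: [{_ARABIC}]+)*")


def fix_visual_order(line: str) -> str:
    """Reverse every Arabic run in the line; other text is left as is."""
    return _ARABIC_RUN.sub(lambda m: m.group(0)[::-1], line)
```

Calling `fix_visual_order` on each extracted line would turn دمحا back into احمد while keeping any English words on the same line unchanged.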
