I am using iText Java API to extract text from a PDF.
String text = PdfTextExtractor.getTextFromPage(reader,i);
Src PDF content:
1.2 SUBMITTALS
Generated Text:
SUBMITTALS
1.2
Extracted Text is split into 2 separate lines and order of the text is also messed up.
Can someone please help me understand what am I doing wrong?
Src pdf file link - https://www.dropbox.com/s/vc9it3c7856ejli/testPDF.pdf?dl=0
Target text file generated from iText - https://www.dropbox.com/s/ps2l9yz5ufuup01/test.txt?dl=0
But when I test with other PDF APIs like PDFClown, OCROnline it is working as expected.
Please help
Thanks