0

Answer https://stackoverflow.com/a/62174368/8117673

Further question is - will it affect the accuracy of text detection by Amazon Textract?

Do I need to pre-process the image to get better result from Amazon Textract?

John Rotenstein
  • 241,921
  • 22
  • 380
  • 470

1 Answers1

0

I converted PDF to PNG using command pdftoppm. In Python -> subprocess.Popen(['pdftoppm -png Sample.pdf Sample'])

The accuracy of Amazon Textract on PDF file was more than the PNG format. Because PDF is the original document.