Answer https://stackoverflow.com/a/62174368/8117673
Further question is - will it affect the accuracy of text detection by Amazon Textract?
Do I need to pre-process the image to get better result from Amazon Textract?
Answer https://stackoverflow.com/a/62174368/8117673
Further question is - will it affect the accuracy of text detection by Amazon Textract?
Do I need to pre-process the image to get better result from Amazon Textract?
I converted PDF to PNG using command pdftoppm. In Python -> subprocess.Popen(['pdftoppm -png Sample.pdf Sample'])
The accuracy of Amazon Textract on PDF file was more than the PNG format. Because PDF is the original document.