0

I am trying to automate scientific PDF to XML using Python. I would like to know: is there any Python library that can generate XML from text or from PDF documents?

1 Answers1

2

You can use textract to read a pdf file, for details: Extracting text from a PDF file using Python once you have your text in a variable use ElementTrees to write properly formatted XML.

Hood Khizer
  • 115
  • 7