I am trying to automate scientific PDF to XML using Python. I would like to know: is there any Python library that can generate XML from text or from PDF documents?
Asked
Active
Viewed 1,022 times
1 Answers
2
You can use textract to read a pdf file, for details: Extracting text from a PDF file using Python once you have your text in a variable use ElementTrees to write properly formatted XML.

Hood Khizer
- 115
- 7