Questions tagged [document-processing]
5 questions
1
vote
0 answers
Automated validation of Extractions from Intelligent Document Processing
I'm currently working on a project that involves using XtractEdge (which is similar to Amazon Textract, Microsoft Intelligent
Document Processing) to extract valuable information from a variety of documents. One of the major challenges I'm facing is…

Vishal
- 13
- 3
1
vote
1 answer
How to properly close Word documents after Documents.Open
I have the following code for a C# console app. It parses a Word document for textboxes and inserts the same text into the document at the textbox anchor point with markup. This is so I can convert to Markdown using pandoc, including textbox content…

DLT
- 139
- 8
0
votes
1 answer
Docx4j - Long image is missing content at page break and the missing content is not visible at next page in HTML to Docx4j migration
I migrate data from HTML document pages to docx via docx4j, basically everything else is working fine and normal images and contents are getting added in docx perfectly but in rare cases for long images the image isn't breaking at page break and the…

Faizan Shaikh Sarkar
- 173
- 1
- 2
- 9
0
votes
0 answers
Create bounding box with nearest horizontal value in Python
i have a PDF below
i have to create bounding box for retrieving values of date, Due Date , Invoice # , Amount Due , Address and some other field . but the bounding box i create is not extracting me correct values . The code should work for any type…

bay bay
- 11
- 1
0
votes
0 answers
how to make processor component in Power Apps to scan first page only out of multipage pages PDF?
I am trying to use processor component in Power Apps to scan first page only for the OCR Model (Document Processing Model).
when I upload PDF, it takes too much time to process it while I need only first page processing not the whole PDF…

HAJJAJ
- 3,667
- 14
- 42
- 70