I am trying extract all the words(chunks) / characters with coordinate from a searchable text PDF invoice / statement by iTextSharp using C# program , after getting coordinate, create an XML file, then read the XML file plot the data to DataGridView. I have tried some methods like iTestSharp. iTextSharp extract each character and getRectangle anyone could suggest a method to create an XML file with the following format XML :
<PDFExtract>
<PageLayout>Style</PageLayout>
<Page>
<Zone>
<Line>
<LOCX>298</LOCX>
<LOCY>199</LOCY>
<LOCW>1859</LOCW>
<LOCH>138</LOCH>
<WD>
<LOCX>298</LOCX>
<LOCY>199</LOCY>
<LOCW>139</LOCW>
<LOCH>69</LOCH>
<T>Start</T>
</WD>
<WD>
<LOCX>476</LOCX>
<LOCY>216</LOCY>
<LOCW>63</LOCW>
<LOCH>55</LOCH>
<T>Bucks</T>
</WD>
</Zone>
</Page>