Problem :- I can parse the PDF position operators in content stream. If the cooridnates are started from the left bottom to my calculations are getting correct and able to tag the content properly.
q1) In case the starting coordinates are changed i.e(top left, or top right or bottom right). The parsed coordinates are not matching to tag content , in this case how exactly to calculation will happen.
q2) If the starting points are changed how the content stream will represent it ?
for example "0 7.98 -7.98 0 90.8898 715.4183 Tm".
To Give you more explaination i am sharing 2 pdf which will help for better understanding.
We can look into this file the file Coordinates i.e (0,0) stats from bottom left and we are able to tag all the data in this file.
The Page coordinates(0,0) starts from top left. similarly there might be scenarios the coordinates might start from top right and bottom right , now the question is how to tag this kinds of files .
Thanks Tejas