Is it possible to extract graphs from a PDF using PDFBox?

Asked Sep 16 '19 at 07:54

Active Sep 16 '19 at 09:16

Viewed 87 times

I have a PDF file that contains graphs which are not embedded as images. Is there any way to process the PDF's content streams and extract the graphs from it?

edited Sep 16 '19 at 09:16

Tilman Hausherr

asked Sep 16 '19 at 07:54

Deepak Dhaka


Your question is about `pdfbox`, so why tag `itext`? – Amedee Van Gasse Sep 16 '19 at 08:15
I was unsure about it; it seemed related to PDF creation and extraction, which is why I used it. – Deepak Dhaka Sep 16 '19 at 08:24
Possible duplicate of [pdfbox 2.0.2 > Calling of PageDrawer.processPage method caught exceptions](https://stackoverflow.com/questions/38931422/pdfbox-2-0-2-calling-of-pagedrawer-processpage-method-caught-exceptions) - see the answer, it shows how to collect lines. – Tilman Hausherr Sep 16 '19 at 09:16
@TilmanHausherr, I'm new to manipulating PDF objects; can you explain how it is related? I need only the graphs that are present in the PDF, not anything else like tables, text, or images. – Deepak Dhaka Sep 16 '19 at 12:04
There is no such concept as a "graph" in PDF. The answer I linked to describes how to collect line coordinates. So if your graph is a vector graphic within a table, you'll end up with the table and the graph details and will have to sort it out. – Tilman Hausherr Sep 16 '19 at 12:27
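
To illustrate what "collecting line coordinates" means at the content-stream level, here is a minimal sketch in plain Java. It is not the code from the linked answer and it does not use PDFBox. It assumes the content stream has already been decoded to text and contains only numeric operands and operators (no text strings, inline images, or transformation matrices). The class name `PathSketch` and the method `collectSegments` are hypothetical names chosen for this example. With a real PDF you would more likely subclass PDFBox's `PDFGraphicsStreamEngine` and record the points passed to its `moveTo`, `lineTo`, and `appendRectangle` callbacks.

```java
import java.util.ArrayList;
import java.util.List;

public class PathSketch {

    /**
     * Collects straight segments {x1, y1, x2, y2} from a decoded content
     * stream fragment, looking only at the path operators m, l, re and h.
     * Assumption: tokens are whitespace-separated numbers and operators.
     */
    public static List<double[]> collectSegments(String content) {
        List<double[]> segments = new ArrayList<>();
        List<Double> ops = new ArrayList<>();   // operands seen since the last operator
        double curX = 0, curY = 0;              // current point
        double startX = 0, startY = 0;          // start of the current subpath

        for (String token : content.trim().split("\\s+")) {
            try {
                ops.add(Double.parseDouble(token));
                continue;                       // numeric operand, keep reading
            } catch (NumberFormatException e) {
                // not a number, so treat the token as an operator
            }
            int n = ops.size();
            switch (token) {
                case "m":                       // x y m: begin a new subpath
                    if (n >= 2) {
                        curX = startX = ops.get(n - 2);
                        curY = startY = ops.get(n - 1);
                    }
                    break;
                case "l":                       // x y l: straight line from the current point
                    if (n >= 2) {
                        double x = ops.get(n - 2), y = ops.get(n - 1);
                        segments.add(new double[] {curX, curY, x, y});
                        curX = x;
                        curY = y;
                    }
                    break;
                case "re":                      // x y w h re: rectangle as four segments
                    if (n >= 4) {
                        double x = ops.get(n - 4), y = ops.get(n - 3);
                        double w = ops.get(n - 2), h = ops.get(n - 1);
                        segments.add(new double[] {x, y, x + w, y});
                        segments.add(new double[] {x + w, y, x + w, y + h});
                        segments.add(new double[] {x + w, y + h, x, y + h});
                        segments.add(new double[] {x, y + h, x, y});
                        curX = startX = x;
                        curY = startY = y;
                    }
                    break;
                case "h":                       // close the subpath back to its start
                    if (curX != startX || curY != startY) {
                        segments.add(new double[] {curX, curY, startX, startY});
                    }
                    curX = startX;
                    curY = startY;
                    break;
                case "c":
                case "v":
                case "y":                       // curves: only track the new current point
                    if (n >= 2) {
                        curX = ops.get(n - 2);
                        curY = ops.get(n - 1);
                    }
                    break;
                default:                        // painting and other operators are ignored
                    break;
            }
            ops.clear();
        }
        return segments;
    }

    public static void main(String[] args) {
        // A line followed by a filled rectangle, e.g. one axis and one bar.
        for (double[] s : collectSegments("10 20 m 30 40 l S 0 0 5 5 re f")) {
            System.out.printf("(%.1f, %.1f) -> (%.1f, %.1f)%n", s[0], s[1], s[2], s[3]);
        }
    }
}
```

The output is just a flat list of segments. As the comment above says, nothing in it marks which segments belong to a graph and which to a table border, so any grouping (for example by position or by proximity to axis labels) has to be done afterwards.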

0 Answers