I'm parsing PDFs with pdfMiner, using it as a library in my python script.
In most of these PDFs there is a table, where one of the columns is named "company".
Is there a way to:
- detect the existence of that table in the PDF.
- get all the company names (i.e. all the entries in the 2nd column of the table).