i have a problem with some PDF files which i need to extract text from. The PDFs are generated by the same institution. I found a topic from Stack Overflow on how to do the mappings manually. I tried that, but the problem is that each file i look at, has slight differences in the CIDs/GIDs.
For example:
Is there a way to fix the font somehow or the only option would be to use OCR?