Questions tagged [poppler]

Poppler is a GPL'd PDF rendering library based on the xpdf-3.0 code base.

290 questions
61
votes
15 answers

How to install Poppler on Windows?

The most recent version of ScraperWiki depends on Poppler (or so the GitHub says). Unfortunately, it only specifies how to get it on macOS and Linux, not Windows. A quick googling turned up nothing too promising. Does anyone know how to get Poppler…
akkatracker
  • 1,397
  • 3
  • 14
  • 25
22
votes
2 answers

poppler pdfunite cannot merge encrypted PDF files, how to remove encryption? (no password needed to open)

I have some PDF files which I want to merge into one. I typically use pdfunite from the Poppler PDF toolkit to do this. However, now I'm getting this error: Could not merge encrypted files ('foo.pdf'). When I run exiftool foo.pdf to see its…
RocketNuts
  • 9,958
  • 11
  • 47
  • 88
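
For the pdfunite question above, one commonly suggested workaround is to rewrite each input without encryption and then merge the rewritten copies. The following is a minimal sketch under stated assumptions, not a confirmed fix for the asker's files: it assumes the separate qpdf tool is installed, that the files have no user (open) password, and that input file names are distinct. The helper names (`decrypt_command`, `unite_command`, `merge_encrypted`) and the `decrypted` work directory are hypothetical.

```python
import shutil
import subprocess
from pathlib import Path


def decrypt_command(src, dst):
    """Build the qpdf command that writes an unencrypted copy of src to dst."""
    return ["qpdf", "--decrypt", src, dst]


def unite_command(inputs, output):
    """Build the pdfunite command: all inputs first, the output file last."""
    return ["pdfunite", *inputs, output]


def merge_encrypted(inputs, output, workdir="decrypted"):
    """Decrypt each input into workdir, then merge the copies with pdfunite."""
    for tool in ("qpdf", "pdfunite"):
        if shutil.which(tool) is None:
            raise RuntimeError(f"{tool} not found on PATH")
    Path(workdir).mkdir(exist_ok=True)
    plain = []
    for src in inputs:
        # Assumes distinct base names; identical names would overwrite each other.
        dst = str(Path(workdir) / Path(src).name)
        result = subprocess.run(decrypt_command(src, dst))
        # qpdf uses exit status 3 for "completed with warnings"; treat that as usable.
        if result.returncode not in (0, 3):
            raise RuntimeError(f"qpdf failed on {src}")
        plain.append(dst)
    subprocess.run(unite_command(plain, output), check=True)
```

A call such as `merge_encrypted(["foo.pdf", "bar.pdf"], "merged.pdf")` would leave the originals untouched and write the merged result to `merged.pdf`.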
20
votes
4 answers

How to install poppler in ubuntu 15.04?

Poppler is a PDF rendering library based on the xpdf-3.0 code base. I have already downloaded the tar.xz file from the official site, http://poppler.freedesktop.org/, but I do not know what to do with this file. Is there any command to install or…
18
votes
4 answers

How to extract images from PDF using Ghostscript or ImageMagick?

I need to render or fetch all the images from a specific PDF file. How can I achieve this using Ghostscript or ImageMagick?
mmoghrabi
  • 1,233
  • 1
  • 14
  • 23
15
votes
4 answers

Install Poppler for Python on Mac

I need to extract PDF annotations in Python. All my searches indicate I need Poppler to do this, e.g. to run this. But I am having a lot of trouble installing Poppler. I'm using Python 3.4 on OS X (Yosemite). Here's what I've tried so far: 1.…
simmons
  • 835
  • 1
  • 9
  • 17
14
votes
5 answers

How to install Poppler to be used on AWS Lambda

I have to run pdf2image on my Python Lambda Function in AWS, but it requires poppler and poppler-utils to be installed on the machine. I have tried to search in many different places how to do that but could not find anything or anyone that have…
DaviRod
  • 143
  • 1
  • 6
14
votes
5 answers

How to figure out the resolution (DPI) of images embedded in a PDF document?

I have a PDF document that also contains images. Now I want to know the resolution of these images. A first step would be to somehow get the images out of the PDF document. But how? Is that even possible with something provided in Cocoa?
Daniel
  • 1,473
  • 3
  • 33
  • 63
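
The resolution asked about above follows from simple arithmetic once an image's pixel size and its placed size on the page are known, because PDF's default page unit is the point (1/72 inch). The sketch below shows only that calculation; the function name `effective_dpi` and the sample numbers are illustrative assumptions, and obtaining the two sizes from a real file is left to whatever PDF tool is used.

```python
POINTS_PER_INCH = 72.0  # default PDF user-space unit: 1 point = 1/72 inch


def effective_dpi(pixel_width, pixel_height, placed_width_pt, placed_height_pt):
    """Return (horizontal, vertical) DPI of an image as placed on the page.

    pixel_*  : size of the embedded bitmap in pixels
    placed_* : size the bitmap occupies on the page, in points
    """
    x_dpi = pixel_width / (placed_width_pt / POINTS_PER_INCH)
    y_dpi = pixel_height / (placed_height_pt / POINTS_PER_INCH)
    return x_dpi, y_dpi


# Hypothetical example: a 1200 x 900 pixel bitmap drawn at 288 x 216 points
# (4 x 3 inches) works out to 300 DPI in both directions.
print(effective_dpi(1200, 900, 288, 216))
```

Recent versions of Poppler's pdfimages tool also have a `-list` option whose output includes x-ppi and y-ppi columns, which may save doing this calculation by hand.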
13
votes
3 answers

read, highlight, save PDF programmatically

I'd like to write a small script (which will run on a headless Linux server) that reads a PDF, highlights text that matches anything in an array of strings that I pass, then saves the modified PDF. I imagine I'll end up using something like the…
Jake
  • 5,379
  • 6
  • 19
  • 19
11
votes
1 answer

how do you install poppler on google colab

I was using a Python package, pdf2img, but I got an error asking me to check whether I had installed Poppler, so I ran "pip install python-poppler-qt5" in a code box in Colab, but then I got the following error: Collecting python-poppler-qt5…
Akash
  • 195
  • 1
  • 3
  • 14
9
votes
2 answers

How to extract images from a pdf using the poppler library in Python?

I've got a pdf from which I want to extract some images using Python. I can easily extract images from the Linux command line using the pdfimages from the poppler-utils library like this: pdfimages my_file.pdf /tmp/image Next I found a Python…
kramer65
  • 50,427
  • 120
  • 308
  • 488
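
The command line quoted in the question above (`pdfimages my_file.pdf /tmp/image`) can also be driven from Python without any Poppler bindings by calling the tool as a subprocess. This is a minimal sketch, assuming the pdfimages binary from poppler-utils is on PATH; the helper names are hypothetical, and the `-png` flag is only available in reasonably recent Poppler versions.

```python
import shutil
import subprocess


def pdfimages_command(pdf_path, output_prefix, as_png=False):
    """Build the pdfimages command line: optional flag, input PDF, output prefix."""
    cmd = ["pdfimages"]
    if as_png:
        # Assumption: the installed pdfimages supports -png (newer Poppler releases).
        cmd.append("-png")
    cmd += [pdf_path, output_prefix]
    return cmd


def extract_images(pdf_path, output_prefix, as_png=False):
    """Run pdfimages; extracted files are written next to output_prefix."""
    if shutil.which("pdfimages") is None:
        raise RuntimeError("pdfimages not found; install poppler-utils first")
    subprocess.run(pdfimages_command(pdf_path, output_prefix, as_png), check=True)
```

Calling `extract_images("my_file.pdf", "/tmp/image")` mirrors the shell command from the question; passing a list of arguments rather than a shell string avoids quoting problems with unusual file names.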
9
votes
2 answers

convert pdf with 300dpi bitmaps to svg

I'm creating a tool to convert PDFs into SVG. These PDFs contain graphical data, including large bitmaps at 300 dpi and a bunch of vectors as well. Poking around here on Stack Overflow, I've found pdf2svg, which is great (works like a charm), and the…
Paul Degnan
  • 1,972
  • 1
  • 12
  • 28
8
votes
1 answer

Is the MuPDF library faster than xpdf/poppler at rendering images from PDF pages?

Is the MuPDF library faster at rendering images from PDF pages than xpdf/poppler? They say it is high-performance.
P5music
  • 3,197
  • 2
  • 32
  • 81
8
votes
3 answers

Pdf2Image library failing to read pdf signed using DocuSign

I'm trying to convert a PDF signed using DocuSign to image format. We are facing an error in the convert_from_path method. Code and error are shown below: import pdf2image data=pdf2image.convert_from_path('name.pdf') PDFPageCountError: Unable to get…
Karthik B.G
  • 81
  • 1
  • 2
8
votes
3 answers

script does not continue after command line

I have a problem with a script that runs a command line. The PHP script never continues. I have tried calling the command line directly through PuTTY, and it outputs a lot of errors but returns/completes instantly. Why doesn't it then return back to PHP? It…
clarkk
  • 27,151
  • 72
  • 200
  • 340
8
votes
2 answers

error while loading shared libraries: libpoppler.so.58: cannot open shared object file

I compiled and installed poppler-0.39.0 as per the instructions. By default, header files went into /usr/local/include and lib files went into /usr/local/lib. pdftohtml is installed in /usr/local/bin. Now when I tried to run pdftohtml it gives following…
user1428258
  • 101
  • 1
  • 4