Questions tagged [pdftoppm]

pdftoppm stands for Portable Document Format (PDF) to Portable Pixmap (PPM) converter. It is a tool in linux that converts Portable Document Format (PDF) files to color image files in Portable Pixmap (PPM) format.

pdftoppm stands for Portable Document Format () to Portable Pixmap () converter. It is a tool in that converts Portable Document Format (PDF) files to color image files in Portable Pixmap (PPM) format, grayscale image files in Portable Graymap (PGM) format, or monochrome image files in Portable Bitmap (PBM) format.

Usage

pdftoppm [options] PDF-file PPM-root

Pdftoppm reads the PDF file PDF-file, and writes one PPM file for each page PPM-root-nnnnnn.ppm, where nnnnnn is the page number.

You should use this tag if your question is related to the usage of pdftoppm tool.

18 questions
8
votes
3 answers

Pdf2Image library failing to read pdf signed using DocuSign

I'm trying to convert a pdf signed using DocuSign to image format. We are facing the error in convert_from_path method. Code and error are shown below: import pdf2image data=pdf2image.convert_from_path('name.pdf') PDFPageCountError: Unable to get…
Karthik B.G
  • 81
  • 1
  • 2
8
votes
1 answer

pdftoppm "No display font" errors

I'm using pdftoppm to extract pages from a pdf file, so I can later convert the resulting pbm files into multi-page tiffs with ImageMagick. I've got it to work using the following code: os.system('pdftoppm -f %i -l %i -aa no -mono -q "%s" %sx' %…
CCKx
  • 1,303
  • 10
  • 22
3
votes
2 answers

pdftoppm fails to create output

I'm running pdftoppm from command prompt but it fails to create any output. here is my input c:\test>pdftoppm -r 600 png W10475.pdf test here is what files I have in the associated 'test' directory c:\test>dir Volume in drive C is OS Volume…
attackgnome
  • 33
  • 1
  • 8
2
votes
3 answers

Why is pdf2image giving me a blank image file?

I trying to perform OCR using Tesseract OCR on multiple big pdf files (~400-600 pages). I don't necessarily want to extract text from all pages, but I just want a few pages (page numbers are known). The PDF file seems to have some sort of OCR…
Vedant Jumle
  • 133
  • 2
  • 11
1
vote
0 answers

How do I install pdftoppm as dependency for textract?

I am trying to run textract on mac and receive the following error message: ShellError: The command pdftoppm Frome11.pdf /var/folders/h0/7cg0dqzd32gc8wt0j_5wkjsm0000gn/T/tmpplv3r07t/conv failed because the executable pdftoppm is not installed on…
dmc
  • 21
  • 4
1
vote
1 answer

Converting a PDF to PNG with anti-aliasing and transparent background

I am trying to convert a LaTeX-generated PDF file to a PNG file with anti-aliasing and a transparent background (white text on a black background). After having read the answer to this post and one of the comments to the answer, I compared the…
niran90
  • 248
  • 1
  • 10
1
vote
1 answer

subprocess.run simple scenario fails

I am trying to run python subprocess.run function to execute following command: pdftoppm -jpeg -f 1 -scale-to 200 data/andromeda.pdf and-page pdftoppm - is part of poppler utility and it generates images from pdf files. File data/andromeda.pdf…
eugenci
  • 173
  • 1
  • 7
1
vote
1 answer

pdftoppm converts only one first page of pdf

I need to convert pdf to pgm, and when I run the (example)command pdftoppm -f 5 -l 10 -gray input.pdf > output.pgm I am getting the first page of the pdf as output. This is even though I am clearly specifying first page as page 5. I am not…
Kalpit
  • 891
  • 1
  • 8
  • 24
1
vote
1 answer

Convert pdf to image with pdftoppm in php without writing files on disk

I need to convert pdf to png in php. Because of quality reasons we don't want to use Imagemagick but prefer to use pdftoppm. For performance we prefer not to use the filesystem, but the memory. pdftoppm is properly installed on Ubuntu and works. For…
1
vote
1 answer

PDF to ppm save progressive jpg

I'm trying to convert a PDF to jpg images, but I want the images saved as progressive jpgs. I'm using pdftoppm to do this. This is the command : $imageExecCommand = 'pdftoppm -jpeg -cropbox -r 100 -f ' . $pageToCopy . ' -l ' . $pageToCopy . ' '…
gabitzish
  • 9,535
  • 7
  • 44
  • 65
0
votes
0 answers

pdftoppm giving libpng error: Write Error before core dumping

Sometimes when I run pdftoppm -png orig.pdf orig- I get a "libpng error: Write Error" and then I get a core dump. I'm running pdftoppm v0.48.0. Regrettably I am unable to provide the source PDFs. libpng error: Write Error Internal Error: error…
neubert
  • 15,947
  • 24
  • 120
  • 212
0
votes
0 answers

Pdf2image library of Python throws "No font in show/space" error

For a few pdfs the pdf2image library throws this error. I have set strict=True in the convert_from_path method and hence I am getting these syntax errors I think it means that my system doesn't have the required fonts but is there any other way to…
Jay
  • 1
  • 1
0
votes
0 answers

How parse a PDF document to array of images directly to RAM buffer

I'm trying to parse a huge PDF-document to list of images (each image has a bmp-format). I use ghostscript and python to parse PDF to list of numpy arrays but use very unuseful approach: def get_imgs_gs(path_to_pdf): cpu_number =…
andkot
  • 48
  • 6
0
votes
1 answer

Error in converting a PDF file page to PPM using pdftoppm

I am having trouble converting a PDF file to PPM using pdftoppm.exe. The PDF file has multiple pages, and I want to convert only page 6 of the file. I have tried the following but it inexplicably fails to do the conversion: pdftoppm…
JAT86
  • 997
  • 12
  • 24
0
votes
1 answer

Solved: Python multiprocessing imap BrokenPipeError: [Errno 32] Broken pipe pdftoppm

Let me first say that this is not a duplicate of the other similar questions, where people tend to manage more closely the pool of workers. I have been struggling with the following exception thrown by my code when using multiprocessing.Pool.imap: …
1
2