7

Can anyone recommend a good component for reading PDFs from .Net? Specifically, I want to read all the metadata of the document, including all the document properties (title, author, etc.). I really don't need to read the actual content of the document.

Commercial products are fine.

I read about PDFBox (for Java), but that seems painful. I've also been to this page, but nothing great jumped out at me.

tshepang
Deane

6 Answers

4

What about iTextSharp? As far as I know, it's the definitive PDF library.

BC.
3

The Docotic.Pdf library (I work for the company that makes it) can be used for this.

Here is sample code:

using BitMiracle.Docotic.Pdf;

// Open the PDF and read its document properties from the Info dictionary.
using (PdfDocument document = new PdfDocument("file.pdf"))
{
    string author = document.Info.Author;
    string subject = document.Info.Subject;
    string title = document.Info.Title;
    string keywords = document.Info.Keywords;
    // ...
}

Beyond that, the library can do many other things. Samples for common tasks are available online.

Bobrovsky
3

For .Net there is the PDF Lib (current version 1.3). It looks fine, but I haven't tested it yet.

mapache
1

Have you looked at ABC.pdf? It definitely does what you want, reading PDF document properties - see here

Christopher Edwards
0

You could look at CoherentPDF.

MichaelGG
0

I would like to recommend Amyuni PDF Creator ActiveX, a PDF editor ActiveX control (32-bit and 64-bit versions are available), and Amyuni PDF Creator .Net.
Disclaimer: I am part of the development team of this product.

yms
  • Does Amyuni webkit honor page breaks in HTML when generating a PDF? I have a huge HTML document with multiple pages, but when I create a PDF with Amyuni it produces one long page without any page breaks. – user428747 Feb 02 '15 at 16:02
  • @user428747 Did you try changing the options in the method WKPDFOpenURL? The parameter called pageRenderingOptions controls whether the library will generate several pages or just one. – yms Feb 02 '15 at 18:47