0

I want to build an application that read and extract information from a pdf or word file. For example I have a file either in pdf format or in word format and I want to extract information like contact, address, email etc. How I can do it using c#?

Rafael Regh
  • 443
  • 5
  • 17
Muhammad Hannan
  • 2,389
  • 19
  • 28

1 Answers1

2

For Word just have a look at the Open XML SDK and the Power Tools for Open XML. They give you the possibility to work with any Open XML File very smoothly. For PDFs just have a look at iTextSharp which is an awesome PDF library for .Net.

Rafael Regh
  • 443
  • 5
  • 17
  • I have tried iTextSharp but not working for me :(. @Rafeel Regh – Muhammad Hannan Aug 04 '15 at 07:09
  • 2
    Then be more specific. What have you tried? What is not working? – Rafael Regh Aug 04 '15 at 07:10
  • I have tried this [link](http://stackoverflow.com/a/5003230/4913418) – Muhammad Hannan Aug 04 '15 at 07:17
  • 1
    I don't need to see the link, I need to see what your **actual** problem is. Tell me what is not working, what you have tried and where it stuck. – Rafael Regh Aug 04 '15 at 07:18
  • Kindly check the link I am using that code. It's not opening my pdf file. – Muhammad Hannan Aug 04 '15 at 07:20
  • 1
    I did and it is running **perfectly** without any changes. That's why **you** have to provide more detailed information. What is your setup (win forms, wf, etc)? What is the **exact** problem? Do you get an exception? Is it not compiling etc. pp – Rafael Regh Aug 04 '15 at 07:31
  • My setup is win forms. I am passing the filename from textbox text and when I ran it game exception no file found. But file is there I selected it through open file dialogue box. @Rafeel Regh – Muhammad Hannan Aug 04 '15 at 07:37
  • Let us [continue this discussion in chat](http://chat.stackoverflow.com/rooms/85078/discussion-between-muhammad-hannan-and-rafael-regh). – Muhammad Hannan Aug 04 '15 at 07:41