0

I am trying to extract text for pdf document in my iOS application, I am using CGPDFScanner.. I used the reference from these links:

The methods specified in the above two links work for some simple (Programming in Objective-c) type of pdf.. but when I try to extract text from pdf such as those in "Head First" series I get garbage value. Please suggest me some solution which will work for all types of pdf

Cœur
  • 37,241
  • 25
  • 195
  • 267
ViruMax
  • 1,216
  • 3
  • 16
  • 41

2 Answers2

0

If you are using PDFKitten, it can be CMap parsing issue in that. I have resolved this issue and found that searching is going well in almost every pdf.

Swaroop
  • 501
  • 4
  • 18
  • If you using PDFKitten, it have many problems with PDF having ToUnicodeMapping. It's parsing of CMap is totally wrong. It throws away most of the mapping while parsing. If you can fix that you will be able to search in PDF properly as you will be availed with proper text. – Swaroop May 08 '15 at 08:06
-1

Try using http://fastpdfkit.com/ , this is one of the best pdf reader library that i have used.

Rahul Mathur
  • 872
  • 1
  • 7
  • 20
  • I have seen fastpdfkit, but I can't use it because it does not make use of CGPDFDictionary, and I want to do by using it. I have also tried PDFKitten but this library also not able to scan pdf text properly. Do suggest some alternative way (code snippet) to this problem. – ViruMax Feb 18 '14 at 12:48
  • Hmmm, I didint faced this issue. Currently i dont have any alternative, but will let you know if i find any. – Rahul Mathur Feb 18 '14 at 13:08