I am using Apache PDFBox to extract text page by page. While doing so, I need to drop any strike-through text that appears within the extracted content.
I tried the approach from "Detect Bold, Italic and Strike Through text using PDFBox with VB.NET", but it fails for most of my scenarios.
I also tried the approach from "PDFBox delete comment maintain strikethrough".
Are there any libraries that can do this?
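
To make the requirement concrete, here is a minimal sketch of the geometric check I have in mind, written in plain Java with no PDFBox calls. It assumes the strike-through is drawn as a thin horizontal line (vector graphics) rather than stored as an annotation, and that glyph boxes and line segments have already been collected for the page. The `Glyph` and `HLine` classes and the 25%-75% height band are hypothetical choices for illustration, not anything PDFBox provides; with PDFBox the glyph boxes would presumably come from the `TextPosition` objects that `PDFTextStripper` hands out, and that wiring is not shown.

```java
import java.util.List;

public class StrikeThroughFilter {

    /** Hypothetical glyph box; y is the bottom edge, with y growing upward. */
    static final class Glyph {
        final String text;
        final double x, y, width, height;
        Glyph(String text, double x, double y, double width, double height) {
            this.text = text; this.x = x; this.y = y;
            this.width = width; this.height = height;
        }
    }

    /** Hypothetical horizontal line segment, reduced to its centre line. */
    static final class HLine {
        final double x1, x2, y;
        HLine(double x1, double x2, double y) {
            this.x1 = x1; this.x2 = x2; this.y = y;
        }
    }

    /** True if some line crosses the middle band of the glyph box. */
    static boolean isStruck(Glyph g, List<HLine> lines) {
        // Assumed band: 25% to 75% of the glyph height, so underlines are ignored.
        double bandLow = g.y + 0.25 * g.height;
        double bandHigh = g.y + 0.75 * g.height;
        double centreX = g.x + g.width / 2;
        for (HLine line : lines) {
            double left = Math.min(line.x1, line.x2);
            double right = Math.max(line.x1, line.x2);
            boolean inBand = line.y >= bandLow && line.y <= bandHigh;
            boolean coversCentre = centreX >= left && centreX <= right;
            if (inBand && coversCentre) {
                return true;
            }
        }
        return false;
    }

    /** Concatenates the text of every glyph that is not struck through. */
    static String visibleText(List<Glyph> glyphs, List<HLine> lines) {
        StringBuilder sb = new StringBuilder();
        for (Glyph g : glyphs) {
            if (!isStruck(g, lines)) {
                sb.append(g.text);
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        List<Glyph> glyphs = List.of(
                new Glyph("A", 0, 100, 10, 10),
                new Glyph("B", 10, 100, 10, 10),
                new Glyph("C", 20, 100, 10, 10));
        // One line through the middle of "B", one underline below all three.
        List<HLine> lines = List.of(
                new HLine(8, 22, 105),
                new HLine(0, 30, 99));
        System.out.println(visibleText(glyphs, lines));
    }
}
```

In this sketch a glyph counts as struck only when a line covers its horizontal centre, so a line that merely clips the edge of a neighbouring character does not remove it. Real PDFs may also draw the strike as a thin filled rectangle or use a flipped y-axis, which this does not handle.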