I'm suddenly getting a few hundred thousand log messages like this. "Operator cm has too few operands". I understand that this is usually from a corrupt PDF file. I'm interested in limiting the number of errors logged
Will PDFBox produce this many errors from processing one file? Or should it abort after the first one? It looks like this is from processing a single file.
If this is all from one file, is there a handy way to limit logging from one file? Or cause an abort when it sees this?
I have read Disabling logging on PDFBox, and I could turn off all logging for org.apache.pdfbox.contentstream.PDFStreamEngine, but that might shut out useful messages.
This message is logged at level ERROR. If PDFBox continues processing, should PDFBox log this message as WARN?
If this is an error in the file, but it does not fail processing, is there a way to detect that this error occurred during processing, so I can flag the file for review? I'm just calling Tika.parseToString
Thanks!