pdfgrep
works like grep
except that it acts on pages instead of lines. How can I craft a regular expression with a newline character?
I want to look for a
, followed by any number of characters except linebreaks, followed by b
, but pdfgrep 'a[^\n]*b'
doesn't work, whereas pdfgrep 'a.*b'
returns results that span multiple lines. (I've examined the output with xxd
to confirm that these newlines are indeed \x0A
.)