I have scanned a PDF with Tika which contains the text in the following format, having multiple line breaks
Some non Interview text
interview with Mr.XYZ
Question: How are you?
Answer: I am fine.
Question: What do you do?
Answer: Nothing
Some non Interview text
How do I apply regex?I can match words and spaces but it is not going multiline. I tried the following regex:
https://regex101.com/r/sekUyT/1
What all I want is interview related text which starts with interview with and is considered end when the text does not contain any more Question: and Answer: