I am trying to make a script named letter.sh to unzip a word-file, extract the text and images, and save them in a directory. I think I managed to unzip and extract the images, but I am struggling with extracting the text from the document.xml file.
The sentences I want to extract are in this formatet:
<w:t>text</w:t>
I have tried using grep, but it doesn't work.
grep "<w:t>*</w:t>" ~word/document.xml < touch letter.txt
I would appreciate it if anyone could guide me onto the right path.
Thank you.