I have searched across a number of Q&As and can't find a solution specific enough to help.
I have a large xml file and need to do a conditional 'remove' in one field depending on the value in another field.
For example:
<vehicle>...<manufacturer>JCB</manufacturer>....<item_category>JCB Tractors</item_category>...</vehicle><vehicle>...<manufacturer>Caterpillar</manufacturer>....<item_category>Digger</item_category>...</vehicle><vehicle>...<manufacturer>Caterpillar</manufacturer>....<item_category>Caterpillar Digger</item_category>...</vehicle>
needs to become
<vehicle>...<manufacturer>JCB</manufacturer>...<item_category>Tractors</item_category>...</vehicle><vehicle>...<manufacturer>Caterpillar</manufacturer>...<item_category>Digger</item_category>...</vehicle><vehicle>...<manufacturer>Caterpillar</manufacturer>....<item_category>Digger</item_category>...</vehicle>
Ideally the solution would be something I could apply using find and replace functionality in textpad set to POSIX extended regex.
Really appreciate help on this one as I have been banging my head against it for a while!
If I use a parser, I can isolate the variable string I want to 'remove' using
(?<=<manufacturer>)(.*?)(?=<\/manufacturer>)
Is it possible to use that pattern to isolate the string I actually want to remove
e.g.,
(?<=<item_category>)(?<=<manufacturer>)(.*?)(?=<\/manufacturer>)(\s)