I am using Python to read HTML data, but I have difficulties finding all substrings between "d:Title>Good To Great</d:Title>" from this HTML.
data = "<html><head></head><body><pre style='word-wrap': break-word; white-space: pre-wrap;
d:Title>Good To Great</d:Title>d:ComplianceAssetId m:null='true'/>
d:Title>War and Peace</d:Title>/d:ComplianceAssetId m:null='false'/>
d:Title>The Great Gatsby</d:Title>/entry></feed></pre></body></html>"
Expected output:
['Good To Great', 'War and Peace', 'The Great Gatsby']
I suspect regex could be a solution but I have limited knowledge about the regex (still learning), can anyone help me with the problem?
Thanks in advance for your help.