I am trying to parse open office XML. I am doing fine parsing with lxml in Python, but data i need to grab is located within a tag who's structure looks a bit funky to me.
<w:sdt Content> Dataaaaa </w:sdt>
Normally this would be fine, as there are many sdt tags. But the "Content" part is throwing me off. This code:
for element in tree.iter('{http://schemas.openxmlformats.org/wordprocessingml/2006/main}sdt'):
print(element.tag, element.attrib)
returns the below for multiple tags, so i don't know which is which:
{http://schemas.openxmlformats.org/wordprocessingml/2006/main}sdt {}
Also, don't worry about the namespacing, as i have this figured out. I am specifically just trying to access the tag above and the data within. :)