Is it possible to access the data-* portion of an html element from python? I'm using scrapy and the data-* is not available in a selector object, though the raw data is available in a Request object.
If I dump the html using wget -O page http://page.com
then I can see the data in the file. It's something like <a href="blah" data-mine="a;slfkjasd;fklajsdfl;ahsdf">blahlink</a>
I can edit the data-mine
portion in an editor, so I know it's there ... it just seems like well-behaved parsers are dropping it.
As you can see, I'm confused.