So I have this HTML:
div class="price" itemprop="offers" itemscope itemtype="http://schema.org Offer"
And I'm trying to split it in a list something like this:
[class="price", itemprop="offers", itemscope, itemtype="http://schema.org Offer"]
But I'm nost sure how to split the part of itemscope
.
My current regex it looks like this (\s.*?\"\s*.*?\s*\")
, but the problem with this one is that when I will split it into a list, the itemscope and itemtype="http://schema.org Offer" will be just one element, so my list will be something like this:
[class="price", itemprop="offers", itemscope itemtype="http://schema.org Offer"]
Any idea how can I fix this?