I have HTML string retrieved from Discourse API with some few elements (p, span, div
etc.) and some of them have attributes like data-time, data-timezone, data-email-preview
etc. I want values that are on attributes data-email-preview
and these values are timestamps in format enter code here
. These values are always between first two span elements inside HTML string. Example of HTML string:
<p><span data-date="2019-05-10" data-time="19:00:00" class="discourse-local-date" data-timezones="Europe/Brussels" data-timezone="Europe/Berlin" data-email-preview="2019-05-10T17:00:00Z UTC">2019-05-10T17:00:00Z</span> → <span data-date="2019-05-10" data-time="22:00:00" class="discourse-local-date" data-timezones="Europe/Brussels" data-timezone="Europe/Berlin" data-email-preview="2019-05-10T20:00:00Z UTC">2019-05-10T20:00:00Z</span><br>
<div class="lightbox-wrapper"><div class="meta">
<span class="filename">HackSpace_by_Sugar_Ray_Banister.jpg</span><span class="informations">1596×771 993 KB</span><span class="expand"></span>
</div></a></div></p>
I need these two dates between span
elements extracted:
2019-05-10T17:00:00Z
and 2019-05-10T20:00:00Z