I have a site from which I want to get specific content from 7 posts. Those all 7 seven posts have same HTML layout (See Below)
<div class="eventInfo">
<h3>Z's(矢沢永吉)</h3>
<h4>Z's TOUR 2015</h4>
<dl>
<dt><img src="/event/img/btn_day.png" alt="公演日時" width="92" height="20"> </dt>
<dd>
<table width="99%" border="0" cellpadding="0" cellspacing="0">
<tbody><tr>
<td width="9%" nowrap="nowrap">2015年6月</td>
<td width="74%">4日 (木) 19:00開演</td>
</tr>
</tbody></table>
</dd>
<dt><img src="/event/img/btn_price.png" alt="料金" width="92" height="20"> </dt>
<dd>S¥10,500 A¥7,500 (全席指定・消費税込)<br><span class="attention">※</span>注意事項の詳細を<a href="http://www.siteurl.com/info/live/guidelines.html" target="_blank">矢沢永吉公式サイト</a>より必ずご確認ください</dd>
<dt><img src="/event/img/btn_ticket.png" alt="一般発売" width="92" height="20"> </dt>
<dd>
<table width="99%" border="0" cellpadding="0" cellspacing="0">
<tbody><tr>
<td width="9%" nowrap="nowrap">2015年5月</td>
<td width="74%">16日(土)</td>
</tr>
</tbody></table>
</dd>
<dt><img src="/event/img/btn_contact.png" alt="お問合わせ" width="92" height="20"> </dt>
<dd><a href="http://www.siteurl.com/" target="_blank">ソーゴー大阪</a> 06-6344-3326</dd>
<dt><img src="/event/img/btn_info.png" alt="公演詳細" width="92" height="20"> </dt>
<dd><a href="http://www.siteurl.com/zs/index_pc.html" target="_blank">http://www.siteurl.com</a> </dd>
</dl>
</div>
I just want to fetch the H3 from this layout and the first table in the code. What regex method should I use to get the desired results?
Also these are 7 posts just like the code above and I have to get H3 and the first table from each of it.
I have tested it but not sure that is it a correct way or not: https://regex101.com/r/sO6tJ8/1
But as you can see that I have to add unwanted data too like H4 DT IMG :(