PHP regular expression: pick up matches only after a certain word in the text

Question

This is my first question here. :) I have been searching for a solution for a few days, but the problem is not fully solved yet. I have a block of text containing price data, split in two by the exact phrase "promoted-after". Here is my regex:

'/price-([\d $гр€\.]*)/i'

It works well and finds ALL the prices, including the ones before the divider. But when I modify it to:

'/promoted-after.*price-([\d $гр€\.]*)/is'

it correctly skips the top part, but then captures only the last price in the data. How can it be modified to capture all the prices AFTER the "promoted-after" marker, and only those? Here is an example of the input:

price- 2680 $
a lot of some random html code here
price- 3250 $
a lot of some good html code here
price- 3450 $
promoted-after
price- 400 $
a lot of some strange html code here
price- 401 $
a lot of some awesome html code here
price- 402 $
a lot of some ugly html code here
price- 403 $
a lot of some nice html code here
price- 404 $
a lot of some best html code here

P.S. I use preg_match_all

EDIT: OK, let's ignore that it's HTML and treat it as plain text. What should the overall logic for such a task be?

https://stackoverflow.com/questions/1732348/regex-match-open-tags-except-xhtml-self-contained-tags#1732454 — delboy1978uk, Jun 14 '18 at 15:34
Do it in two steps: get the text after your "marker", then scan that text for your targets. — Niet the Dark Absol, Jun 14 '18 at 16:02
@revo it's not valid HTML of course. I scraped pieces just for a quick example. Ok, I'll edit it so that it will be just a plain text. — Tim Yoshi, Jun 14 '18 at 16:40
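
The two-step approach suggested in the comments can be sketched roughly as below. This is only an illustration, not the commenter's own code: the variable names ($data, $marker, $tail, $prices) are made up for the example, and the input is the sample from the question. It also shows why the modified pattern returns a single price: with the s flag, the greedy .* runs from "promoted-after" to the end of the text and backtracks only as far as the last "price-", so the whole tail is consumed by one match.

```php
<?php
// Sample input copied from the question.
$data = <<<'TXT'
price- 2680 $
a lot of some random html code here
price- 3250 $
a lot of some good html code here
price- 3450 $
promoted-after
price- 400 $
a lot of some strange html code here
price- 401 $
a lot of some awesome html code here
price- 402 $
a lot of some ugly html code here
price- 403 $
a lot of some nice html code here
price- 404 $
a lot of some best html code here
TXT;

$marker = 'promoted-after';
$prices = [];

// Step 1: locate the marker and keep only the text that follows it.
$pos = strpos($data, $marker);
if ($pos !== false) {
    $tail = substr($data, $pos + strlen($marker));

    // Step 2: run the original, unmodified pattern on that tail only.
    preg_match_all('/price-([\d $гр€\.]*)/i', $tail, $matches);

    // The capture group includes surrounding spaces, so trim each value.
    $prices = array_map('trim', $matches[1]);
}

print_r($prices);
```

If the marker is missing, $prices stays empty instead of falling back to the whole text; whether that is the right behaviour depends on the data.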

score 1 · Answer 1 · answered Jun 14 '18 at 16:12

As an alternative, you could use DOMDocument and DOMXPath with an XPath expression that finds the div with the id promoted-after and then selects the strong elements inside all of its following sibling p elements.

You can read their values using nodeValue.

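// $data holds the HTML to search (the scraped page from the question).
$dom = new DOMDocument();
$dom->loadHTML($data);
$xpath = new DOMXPath($dom);
// Select every <strong> inside a <p> that follows the marker div as a sibling.
$items = $xpath->query('//div[@id="promoted-after"]/following-sibling::p/strong');
foreach ($items as $item) {
    echo $item->nodeValue . "<br>";
}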
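
To try this without the original page, a self-contained sketch might look like the following. The markup in $data is hypothetical: the question does not show the real page structure, so the example simply assumes a div with the id promoted-after followed by sibling p elements that wrap each price in strong, which is what the XPath expression expects. The libxml_use_internal_errors() call is an addition that keeps parser warnings about imperfect HTML out of the output.

```php
<?php
// Hypothetical markup, assumed only for this example.
$data = <<<'HTML'
<div id="top"></div>
<p><strong>price- 3450 $</strong></p>
<div id="promoted-after"></div>
<p><strong>price- 400 $</strong></p>
<p><strong>price- 401 $</strong></p>
HTML;

// Collect parser warnings internally instead of printing them.
libxml_use_internal_errors(true);

$dom = new DOMDocument();
$dom->loadHTML($data);
libxml_clear_errors();

$xpath = new DOMXPath($dom);

// Only <p> siblings that come after the marker div are selected.
$items = $xpath->query('//div[@id="promoted-after"]/following-sibling::p/strong');

$prices = [];
foreach ($items as $item) {
    $prices[] = trim($item->nodeValue);
}

print_r($prices);
```

The price before the marker div is skipped because following-sibling only looks forward in document order from the matched div.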
