How to extract the part of a string between the first (variable) HTML tag?

Question

I have some strings:

$string1 = '<p><strong>Extract me</strong></p><p>Leave me</p>';
$string2 = '<strong>Extract me</strong>Leave me';
$string3 = '<span style="font-weight: bold">Extract me</span><br /><span>Leave me</span>';

Let's check $string3:

The first tag of the string is `<span>`. So the text between the first `<span>` and the first `</span>` should be extracted.

"Extracted" means: remove it from $stringX and save it in $extractedX.

How would I do this?

I tried many things with regex but failed. And I say shame on the guy who wrote the wikipedia-article. — iceteea, Apr 27 '12 at 07:03
In general I'd advise against using regex to parse html/xml structures. There are better ways (e.g. [Dom*](http://php.net/manual/book.dom.php), [SimpleXml](http://php.net/manual/book.simplexml.php)). — Yoshi, Apr 27 '12 at 07:04
Sounds plausible. But I have no idea how to solve my problem in general. — iceteea, Apr 27 '12 at 07:06

Jack · Accepted Answer · 2012-04-28T03:34:18.513

2

[^>]*?(?=<\/.*>)

What you should do is use a lookahead assertion. `[^>]*?` lazily matches any run of characters that are NOT a `>`. This should be fine, since a literal `>` in the text would need to be escaped as `&gt;`. The pattern then looks for the first closing tag, as denoted by `<\/.*>`. The `(?=...)` around it tells the regex engine to check for the closing tag without including it in the match.

http://regexr.com?30pkm
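
As a rough sketch of how this pattern could be applied in PHP: preg_match with PREG_OFFSET_CAPTURE returns the matched text together with its byte offset, and substr_replace can then cut that text out of the original string. The helper name extractFirstText and its return shape are assumptions made for illustration; they are not part of the answer above. Note that only the text is removed here, so the now-empty tags stay in the string.

```php
<?php
// Hypothetical helper (name and return shape are illustrative assumptions).
// Returns [extracted text, remaining string].
function extractFirstText(string $html): array
{
    // The pattern from the answer: lazily match non-'>' characters that are
    // immediately followed by a closing tag (lookahead, not consumed).
    if (!preg_match('/[^>]*?(?=<\/.*>)/', $html, $m, PREG_OFFSET_CAPTURE)) {
        return ['', $html]; // nothing found: leave the input untouched
    }
    [$text, $offset] = $m[0];

    // Remove the matched text from the original string (byte offsets).
    $rest = substr_replace($html, '', $offset, strlen($text));

    return [$text, $rest];
}

$string3 = '<span style="font-weight: bold">Extract me</span><br /><span>Leave me</span>';
[$extracted3, $string3] = extractFirstText($string3);
// $extracted3 is 'Extract me'
// $string3 is '<span style="font-weight: bold"></span><br /><span>Leave me</span>'
```

If the surrounding tags should disappear as well, the pattern would have to be widened to cover them; that is a different requirement from what this sketch handles.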

edited Apr 28 '12 at 03:34

answered Apr 27 '12 at 07:02

score 1 · Answer 2 · edited May 23 '17 at 11:59

1

You should search first and then post your question here.
Anyway, here is a related question for your answer: Click here to get the related question

You can do it with preg_replace.

edited May 23 '17 at 11:59

Community

answered Apr 27 '12 at 07:00

chhameed

score 1 · Answer 3 · answered Apr 27 '12 at 07:07

You can use PHP's preg_match and regular expressions.

This online editor is useful for regex:

http://regexr.com?30pkp:

You'll need something like this to get started:

(.*)|<span.+font-weight:\ ?bold.+>(.*)

If you need to do more advanced parsing, you could look at parsing the DOM in PHP, e.g. using DOMDocument::loadHTML.
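
A minimal sketch of the DOM route, assuming that "extract" may remove the whole first element (tags included) and keep only its text. The helper name extractFirstElement and the wrapping `<div>` are illustrative assumptions, not something this answer specifies. The serialized remainder can differ slightly from the input (for example `<br />` may come back as `<br>`), and non-ASCII input would need extra encoding handling that is left out here.

```php
<?php
// Hypothetical helper (illustrative assumption, not from the answer).
// Returns [text of the first element, remaining HTML].
function extractFirstElement(string $html): array
{
    $doc = new DOMDocument();
    // Wrap the fragment in a <div> so it has a single root element. The two
    // flags ask libxml not to add <html>/<body> wrappers or a doctype.
    $doc->loadHTML(
        '<div>' . $html . '</div>',
        LIBXML_HTML_NOIMPLIED | LIBXML_HTML_NODEFDTD
    );
    $root = $doc->documentElement;

    // Find the first child that is an element, whatever its tag name is.
    $extracted = '';
    foreach ($root->childNodes as $node) {
        if ($node instanceof DOMElement) {
            $extracted = $node->textContent; // text without nested tags
            $root->removeChild($node);       // drop the element from the tree
            break;
        }
    }

    // Serialize whatever is left inside the wrapper.
    $rest = '';
    foreach ($root->childNodes as $node) {
        $rest .= $doc->saveHTML($node);
    }

    return [$extracted, $rest];
}

[$extracted1, $string1] = extractFirstElement('<p><strong>Extract me</strong></p><p>Leave me</p>');
// $extracted1 is 'Extract me'; $string1 holds the remaining markup
```

Because the tag name is never inspected, the same call works whether the first tag is `<p>`, `<strong>` or `<span>`.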

Rainulf · Answer 4 · 2012-04-27T07:10:12.167

0

You can use strip_tags with some use of preg_match if you only want the first occurrence.
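
One way this suggestion might look in practice, as a sketch only: preg_match captures the first element via a backreference to its tag name, strip_tags removes any nested markup from the captured content, and preg_replace with a limit of 1 deletes that first element from the string. The helper name extractWithStripTags is a hypothetical choice, and the pattern assumes the first element does not contain a nested element with the same tag name.

```php
<?php
// Hypothetical helper (illustrative assumption, not from the answer).
// Returns [extracted text, remaining string].
function extractWithStripTags(string $html): array
{
    // <(\w+)\b[^>]*> : an opening tag, tag name captured in group 1
    // (.*?)          : shortest possible content, captured in group 2
    // <\/\1>         : closing tag with the same name as group 1
    $pattern = '/<(\w+)\b[^>]*>(.*?)<\/\1>/s';

    if (!preg_match($pattern, $html, $m)) {
        return ['', $html]; // no matching element: leave the input untouched
    }

    // Drop nested tags such as <strong> from the captured content.
    $extracted = strip_tags($m[2]);

    // Remove only the first occurrence (limit = 1) from the original string.
    $rest = preg_replace($pattern, '', $html, 1);

    return [$extracted, $rest];
}

[$extracted1, $string1] = extractWithStripTags('<p><strong>Extract me</strong></p><p>Leave me</p>');
// $extracted1 is 'Extract me'
// $string1 is '<p>Leave me</p>'
```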

edited Apr 27 '12 at 07:10

answered Apr 27 '12 at 07:02
