preg_match_all between two sentences

Question

From the string:

 <div class="latestf"> <a href="http://www.x.ro/anamaria/"
 rel="nofollow"

I want to extract `anamaria`. How can I do that with preg_match_all?

I tried:

preg_match_all("'<div class=\"latestf\">
<a href=\"http://www.x.ro/(.*?)\" rel=\"nofollow\"'si", $source, $match);

but it didn't work...

Thank you in advance!

**What does "doesn't work" mean?** "Doesn't work" is an inadequate description for us to understand the problem. What happened when you tried it? Did you get an error message? Did you get incorrect results? Did you get *no* results? If the results were incorrect, what made them incorrect? What were you expecting instead? Did you get *any* correct results? If so, what were they? Don't make us guess. — Andy Lester, Sep 09 '13 at 12:17
http://stackoverflow.com/questions/1732348/regex-match-open-tags-except-xhtml-self-contained-tags — faino, Sep 09 '13 at 12:20
Is the newline in the regex intentional? Also, you should escape your periods: `.` -> `\.` — Jerry, Sep 09 '13 at 12:21

score 1 · Answer 1 · answered Sep 09 '13 at 12:22

Try this:

$source = '<div class="latestf"> <a href="http://www.x.ro/anamaria/" rel="nofollow"';


preg_match_all('#<div\s*class="latestf">\s*<a\s*href="http://www\.x\.ro/(.*?)/?"\s*rel="nofollow"#i', $source, $match);

print_r($match);

Output:

Array
(
    [0] => Array
        (
            [0] => <div class="latestf"> <a href="http://www.x.ro/anamaria/" rel="nofollow"
        )

    [1] => Array
        (
            [0] => anamaria
        )

)
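The answer gives no explanation, so here is a minimal sketch of what probably makes the difference. It assumes the line break shown inside the question's pattern is really part of the pattern string (Jerry's comment asks about exactly that) and that the input contains the line break before `rel=` as posted. The variable names are illustrative only. Whitespace written literally in a pattern has to match the input character for character, whereas `\s*` accepts any run of spaces or newlines:

```php
<?php
// Assumed input: the fragment from the question, keeping the line break before rel=.
$source = "<div class=\"latestf\"> <a href=\"http://www.x.ro/anamaria/\"\n rel=\"nofollow\"";

// The question's pattern, with its embedded line break written as \n.
$literal = "'<div class=\"latestf\">\n<a href=\"http://www.x.ro/(.*?)\" rel=\"nofollow\"'si";

// The pattern from this answer, unchanged.
$flexible = '#<div\s*class="latestf">\s*<a\s*href="http://www\.x\.ro/(.*?)/?"\s*rel="nofollow"#i';

$literalCount  = preg_match_all($literal, $source, $literalMatch);
$flexibleCount = preg_match_all($flexible, $source, $flexibleMatch);

var_dump($literalCount);        // int(0): the newline in the pattern meets a space in the input
var_dump($flexibleCount);       // int(1)
var_dump($flexibleMatch[1][0]); // string(8) "anamaria"
```

The optional `/?` after the lazy group is what keeps the trailing slash out of the captured value.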

score 1 · Answer 2 · answered Sep 09 '13 at 12:25

Don't try to parse HTML with regex. Use a DOM parser instead:

$html = '<div class="latestf"> <a href="http://www.x.ro/anamaria/"
 rel="nofollow"';

$dom = new DOMDocument;
@$dom->loadHTML($html);
foreach ($dom->getElementsByTagName('a') as $node)
{
    $link = $node->getAttribute("href");
}

$parsed = parse_url($link);

echo substr($parsed['path'], 1, -1);

Output:

anamaria

answered Sep 09 '13 at 12:25

Amal Murali


Don't use `@`. If there's an error, you should know something is wrong. Blundering along, hoping for the best can be harmful, a notice isn't (not really) – Elias Van Ootegem Sep 09 '13 at 12:41
@EliasVanOotegem: I agree that error messages shouldn't be suppressed, but in this case, `@` sounds like a good idea since the HTML isn't properly formatted. – Amal Murali Sep 09 '13 at 12:46
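One possible middle ground between the two comments, offered as a sketch and not as what the answer author used: `libxml_use_internal_errors(true)` tells libxml to collect parse errors in a buffer, so nothing is printed but nothing is silently discarded either. The `$slugs` array and the use of `trim()` in place of `substr()` are choices made for this example.

```php
<?php
// Fragment from the question; it is truncated HTML, so libxml is expected to complain.
$html = '<div class="latestf"> <a href="http://www.x.ro/anamaria/"
 rel="nofollow"';

// Buffer libxml errors instead of emitting warnings or hiding them with @.
$previous = libxml_use_internal_errors(true);

$dom = new DOMDocument();
$dom->loadHTML($html);

// Keep the collected errors for inspection, then restore the previous setting.
$errors = libxml_get_errors();
libxml_clear_errors();
libxml_use_internal_errors($previous);

// Collect the path of every link, without leading or trailing slashes.
$slugs = [];
foreach ($dom->getElementsByTagName('a') as $node) {
    $path = parse_url($node->getAttribute('href'), PHP_URL_PATH);
    if (is_string($path)) {
        $slugs[] = trim($path, '/');
    }
}

printf("%d parse issue(s) reported\n", count($errors));
print_r($slugs);
```

Gathering the values into an array also avoids reading `$link` after the loop, which would be undefined if the document contained no `a` elements.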

wuya · Answer 3 · 2013-09-09T12:53:19.427

`/` should be escaped like this: `\/`

<?php

  $source = '<div class="latestf"> <a href="http://www.x.ro/anamaria/" rel="nofollow"';

  preg_match_all('/<div class="latestf"> <a href="http:\/\/www.x.ro\/(.*?)\/" rel="nofollow"/', $source, $match);

  var_dump($match);exit;

edited Sep 09 '13 at 12:53

answered Sep 09 '13 at 12:43

wuya


Wrong. OP uses single quote `'` as regex delimiter. – Toto Sep 09 '13 at 13:24
That '.' should be escaped as '\.' – wuya Sep 09 '13 at 14:11
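Both comments can be folded into one pattern. The sketch below is only an illustration, not any poster's code: it picks `~` as the delimiter so the slashes need no escaping, and lets `preg_quote()` escape the periods in the URL prefix. The `$base` and `$pattern` names and the `[^/"]+` capture group are assumptions made for this example.

```php
<?php
$source = '<div class="latestf"> <a href="http://www.x.ro/anamaria/" rel="nofollow"';

// preg_quote() escapes regex metacharacters such as the periods; passing the
// delimiter as second argument would also escape any ~ in the string.
$base = preg_quote('http://www.x.ro/', '~');

// With ~ as delimiter, / is an ordinary character inside the pattern.
$pattern = '~<div class="latestf">\s*<a href="' . $base . '([^/"]+)/?"\s+rel="nofollow"~i';

$count = preg_match_all($pattern, $source, $match);

print_r($match[1]); // the captured path segment(s), here a single entry: anamaria
```

A slash only has to be escaped when `/` is itself the delimiter, which is why the question's pattern (delimited by `'`) was not failing for that reason.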
