How to remove a link from content using php?

Question

$text = file_get_contents('http://www.example.com/file.php?id=name');
echo preg_replace('#<a.*?>.*?</a>#i', '', $text)

the link contains this content:

text text text. <br><a href='http://www.example.com' target='_blank' title='title' style='text-decoration:none;'>name</a>

what is the problem at this script?

it seems that it works.... i don't know what was the problem... — Adrian, Sep 30 '10 at 14:43

score 3 · Answer 1 · edited May 23 '17 at 11:43

3

You can't parse HTML with regular expressions. Use an XML/HTML parser.

edited May 23 '17 at 11:43

Community

1
1

answered Sep 30 '10 at 14:31

Williham Totland

28,471
6
52
68

score 1 · Answer 2 · answered Sep 30 '10 at 14:34

Tempted to flag your question, but there's no option for "Report user for summoning Cthulhu"

I'd recommend reading: http://www.codinghorror.com/blog/2009/11/parsing-html-the-cthulhu-way.html

RegEx is very poor and not at all intended to parse HTML. That's why there are HTML parsing libraries. Find and use one for PHP. :)

score 0 · Answer 3 · answered Apr 21 '13 at 01:29

USE strip_tags this way

$t = 'http://yoururl.com/test1.php';
$t1 = file_get_contents($t);
$text = strip_tags($t1);

it should work getting rid of all the links inside the page you are reading, visit the reference anyway, it may not work for complicated elements http://php.net/manual/en/function.strip-tags.php

score 0 · Answer 4 · answered Sep 30 '10 at 14:35

0

use <a[^>]+>[^<]*</a> (works fine as long as theres just text and no tags inside the a element)

answered Sep 30 '10 at 14:35

Hannes

8,147
4
33
51

How to remove a link from content using php?

4 Answers4