getting parts of html file in php

Question

i need to get two things out of an html file:

text between <title> and </title>
text between <body> and </body>

does anybody know how to do this? this is what i have so far:

$contents = file_get_contents($_GET['file']);
$title = preg_replace("/.*<title[^>]*>|<\/title>.*/si", "", $file);
$body = preg_replace("/.*<body[^>]*>|<\/body>.*/si", "", $file);

i need to echo the title in a textbox and the body in a textarea.

*(related)* [Best Methods to parse HTML](http://stackoverflow.com/questions/3577641/best-methods-to-parse-html/3577662#3577662) — Gordon, Dec 15 '10 at 20:15
Read [Parsing Html The Cthulhu Way](http://www.codinghorror.com/blog/2009/11/parsing-html-the-cthulhu-way.html) — AlexV, Dec 15 '10 at 20:33

score 5 · Accepted Answer · edited May 23 '17 at 11:47

5

Do not use regex to parse HTML. See this answer. Instead, use DOMDocument::LoadHTML.

edited May 23 '17 at 11:47

Community

1
1

answered Dec 15 '10 at 19:16

asthasr

9,125
1
29
43

getting parts of html file in php

1 Answers1