I'm a Delphi developer and was tasked to do crawling of <title>
and meta
description and keywords for public facing websites.
It was fine until I encountered a website that self redirects and did not redirect with HTTP 302/301.
For example, if I type example.com
in the url, it will automatically jump to example-b.com
- but on the client side - not via HTTP 301 or 302.
My goal is to get title, description, and keywords of example-b.com
.
I'm using TidHttp in delphi if that helps.
Edit
I tried this answer but it was stated that will only work with HTTP 301 and 302. I have handled those redirects already. I'm trying to figure out how to handle <meta>
refersh tags or other html commands that do redirects.
Edit 2
just found this commands :
<meta http-equiv="refresh" content="5;url=http://thisinterestsme.com/detecting-ajax-requests-with-php/">
header( "refresh:5;url=http://thisinterestsme.com/php-forcing-https-over-http/" );
header('Location: http://thisinterestsme.com/php-forcing-https-over-http/');
window.location.href= 'http://thisinterestsme.com/php-forcing-https-over-http/';
let me know if I missed other commands.