Hi want to extract all the text in the url patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&Sect2=HITOFF&p=1&u=/netahtml/PTO/srchnum.html&r=1&f=G&l=50&d=PALL&s1=6700867.PN.
The text on this page are after "br /" tag
But when i try to extract text using tag name br it returns empty strings.
here is part of the code that i tried
Set HTMLbrs = HTMLDoc.getElementsByTagName("br")
For Each HTMLbr In HTMLbrs
Debug.Print htmlbr.innertext
Next HTMLbr
The final aim of the code is to check whether a paragraph present in excel belongs to this website or not. the paragraphs could be from any section and its exact location cannot be known. the code aims to proofread that the text was taken from this website.
The response text that is receive has all the paragraphs but they have line breaks and
tags in the middle, due to which instr cannot be used.
When i tried to remove linefeeds by replace(responsetext,vblr," ") whole response text got divided into paragraphs each of 1023 characters, because of which i could not use instr again.
I have used application.worksheet.clean(responsetext). replace(responsetext,vbcr," ") and replace( responsetext,vbcrlr," ") but none of them gave desired result.
` but the closing tag ` is nowhere to be found. Which **specific** part of the webpage do you need? – Stavros Jon Jun 11 '20 at 10:30