I'm trying to reading an XML file which is an export of a website. When I run the following:
result <- xmlParse(file = "~/Desktop/export.xml")
I get:
PCDATA invalid Char value 8
PCDATA invalid Char value 1
PCDATA invalid Char value 8
PCDATA invalid Char value 1
PCDATA invalid Char value 8
PCDATA invalid Char value 1
PCDATA invalid Char value 8
PCDATA invalid Char value 1
PCDATA invalid Char value 8
PCDATA invalid Char value 1
PCDATA invalid Char value 8
PCDATA invalid Char value 1
Error: 1: PCDATA invalid Char value 8
Is there any way I can skip these invalid characters and read it anyway? Or do I have to somehow remove them? I simply want to parse the XML to find URLs within it containing a specific string.