I want to parse an HTML file in Java, and I am using the DocumentBuilder
class for it. My HTML contains an <img src="xyz"> tag without a closing
</img> tag, which browsers allow. But when I pass it to DocumentBuilder
for parsing, I get this error:
The element type "img" must be terminated by the matching end-tag </img>.
Java:
DocumentBuilderFactory docBuilderFactory = DocumentBuilderFactory.newInstance();
DocumentBuilder docBuilder = docBuilderFactory.newDocumentBuilder();
Document document = docBuilder.parse(is);
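For reference, here is a minimal self-contained sketch that reproduces the behavior. The class name `ImgParseRepro`, the helper `parseError`, and the inline HTML strings are hypothetical stand-ins for the real file and stream. It assumes the default JDK parser, which treats the input as XML: the unclosed `<img>` is rejected, while a self-closed `<img/>` parses without complaint.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.xml.sax.SAXParseException;

public class ImgParseRepro {
    // Hypothetical helper: returns the parser's error message,
    // or null if the markup parses as well-formed XML.
    public static String parseError(String markup) throws Exception {
        DocumentBuilderFactory docBuilderFactory = DocumentBuilderFactory.newInstance();
        DocumentBuilder docBuilder = docBuilderFactory.newDocumentBuilder();
        // Stand-in for the real input stream of the HTML file.
        InputStream is = new ByteArrayInputStream(markup.getBytes(StandardCharsets.UTF_8));
        try {
            docBuilder.parse(is);
            return null;
        } catch (SAXParseException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) throws Exception {
        // Unclosed <img>: fine for a browser, but not well-formed XML.
        System.out.println(parseError("<html><body><img src=\"xyz\"></body></html>"));
        // Self-closed <img/>: well-formed XML, so no error is reported.
        System.out.println(parseError("<html><body><img src=\"xyz\"/></body></html>"));
    }
}
```

The first call should report the end-tag error quoted above; the second should print `null`.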
What should I do to get rid of this error?