HTML Agility Pack is an open-source HTML parser that builds a read/write DOM and supports Linq, plain XPATH or XSLT.
HTML Agility Pack is an open-source HTML parser that builds a read-and-write DOM and supports Linq, plain XPath or XSLT.
It is a .NET code library that allows parsing out of the web HTML files. The parser is very tolerant to malformed HTML. The object model is very similar to what proposes System.Xml, but for HTML documents or streams.
Installing HTML Agility Pack can most easily be done using its NuGet package:
Install-Package HtmlAgilityPack
Latest stable release: 1.11.3 / 18 April 2019
GitHub page: https://github.com/zzzprojects/html-agility-pack