Looking for C# HTML parser

Asked Sep 19 '08 at 07:57

Active Sep 12 '11 at 14:15

Possible Duplicate:
What is the best way to parse html in C#?

I would like to extract the structure of an HTML document, so the tags are more important than the content. Ideally, the parser would also cope reasonably well with badly formed HTML.

Does anyone know of a reliable and efficient parser?

c# .net html parsing

edited May 23 '17 at 12:22

Community

asked Sep 19 '08 at 07:57

benefactual

Linked

Adding http:// to all links without a protocol

HTML Parser for C#

Parsing HTML/CSS/PHP File(s)

HTML and JavaScript parser in .NET

C# - Processing html tag attributes

Inlining CSS in C#

How do you parse a poorly formatted HTML file?

Getting a substring of text containing HTML tags

Regex Grouping in C#

How to find a matching closing tag in html string?

How to parse an XHTML file that is not 100% valid?

Good way to navigate the DOM

How do I validate a html file with C#?

C# | Regex Dotall and Result Match

Parse a HTML combox in C#

i want to capture all tags not having a specific tag

Regex to get src value from an img tag

Read tables (content) from Wikipedia using C#

C# - Best Approach to Parsing Webpage?

Library for reading HTML files as XML (.NET)

Open HTML Document in C#

Using jQuery on a string containing HTML

How do I scrape only the <body> tag off of a website

Non mshtml c# parsing html and javascript

Download HTML file and convert it to TXT

Read specific data from XML file

RegEx - HTML between two values

Remove all HTML tags and do a carriage return on <BR> in C#

strip out everything out side of <img src=random.jpg> and <p>random text</p> in html

Is there a jQuery-like CSS/HTML selector that can be used in C#?

How to get Links on a web-page using C#?

Get text from HTML

Extract data webpage

Regular expression to get html without comments

C# Is there a LINQ to HTML, or some other good .Net HTML manipulation API?

Regexp that matches all the text content of a HTML input

I need a Powerful Web Scraper library

Regex to match if string DOES NOT have more than one period . Matching URL paths that are NOT fully qualified

How to remove all <a></a> tags from a large html string in C#?

How do I get content from a table using its ID with a regex?

How to parse this piece of HTML?

C# How to i store website list in xmlnode

How do I read the file content and sent those content to the functions parameter

Taking too long to load a page with HttpWebResponse

HttpParsing for hypertext

Get a value from a webpage c#

How to read a line of HTML using C#

Parsing MHTML File using C#
