HTML Parser is a Java HTML parsing library. It features filters, visitors, custom tags and easy to use JavaBeans.
Questions tagged [html-parser]
211 questions
33
votes
4 answers
Parsing HTML to get text inside an element
I need to get the text inside the two elements into a string:
source_code = """Martin Elias"""
>>> text
'Martin Elias'
How could I achieve this?

Martin Eliáš
- 387
- 1
- 5
- 11
32
votes
4 answers
Method to parse HTML document in Ruby?
like DOMDocument class in PHP, is there any class in RUBY (i.e the core RUBY), to parse and get node elements value from a HTML Document.

Harish Kurup
- 7,257
- 19
- 65
- 94
28
votes
3 answers
How do I convert a document made in Jsoup (the Java html parser) into a string
I have a document that was made in jsoup that looks like this
Document doc = Jsoup.connect("http://en.wikipedia.org/").get();
How do i convert that doc into a string.

Hudson Hughes
- 343
- 1
- 3
- 9
13
votes
3 answers
Extend HTML file with script and override/extend some section tags
There is open source (client side) which I can use to extend HTML,
for example I need to add scripts to it or change some of the src values and add additional tags, etc.
I found the following: https://www.npmjs.com/package/gulp-html-extend
but I'm…
user4209821
9
votes
1 answer
HTMLParser for Python 3.4
I have some code written in Python (2.7) which uses HTMLParser.
I am using Python 3.4 currently.
I can not find the HTMLParser download module.
If it exists, can anyone share the link?
If not, what should I do?

Arpi Shah
- 109
- 2
- 5
8
votes
1 answer
Handling htmlParse error (failed to load HTTP resource)
I'm trying to web-scrape a page. However, from time to time my loop doesn`t work, because the parser "fail to load HTTP resource" . The problem is that the page doesn't load in my browser, so it's not a problem with the code.
However, it's quite…

Manoel Galdino
- 2,376
- 6
- 27
- 40
7
votes
2 answers
PHP DOMDocument parentNode->replaceChild causing foreach to skip next item
I am parsing html in the $content variable with the DOMDocument to replace all iframes with images. The foreach is only replacing the ODD iframes. I have removed all the code in the foreach and found the piece of code causing this is:…

9ete
- 3,692
- 1
- 34
- 32
7
votes
3 answers
Tool to Convert External CSS to Inline CSS in Ruby?
I am messing around with GoogleDocs and it has very very primitive CSS support. If you upload an MSWord .doc or an HTML file and export it as HTML, all styles are applied inline. They have a style block in the html file, but when you upload the…

Lance
- 75,200
- 93
- 289
- 503
6
votes
2 answers
HTML parsing in FLUTTER for android / iOS development
We know there is a Jsoup library for android developers to parse html text, code etc. As I am new in flutter mobile app development I want to know if there is any library like Jsoup to parse html text,code from a web site in flutter.

N. I. Md. Ashafuddula
- 111
- 1
- 1
- 9
5
votes
1 answer
How i can use htmlparser2 to parse html files?
I am using Node.js and i need to parse a html file. Now i have used htmlparser2 and it parses string in parser.write("String") method. Can i parse a html file using html parser? If yes then how?
Help is appreciated?

Saras Arya
- 3,022
- 8
- 41
- 71
5
votes
0 answers
C++ Html Parser as .NET HtmlAgilityPack
I'm porting my application over to C++ from C# and I need a library which can parse HTML and read html elements+their attributes and such, basicly everything that HtmlAgility pack has in C#.
I don't want to use HtmlAgilityPack or any other .NET…

user2920222
- 197
- 1
- 2
- 8
4
votes
1 answer
Is there a Standard Java SE HTML Parser? If so, why use non-standard ones?
I need to parse a simple HTML page with a simple form in it. The answers to similar questions on StackOverflow suggest using one of a large variety of non-standard Java libraries such as TagSoup, JSoup, HTMLParser and many others.
However, a web…

necromancer
- 23,916
- 22
- 68
- 115
4
votes
1 answer
.NET HTML Parser that supports Form posting
I know there are libraries that are used to parse html like HTMLAgilityPack but unable to find a library that supports form posting or executing javascript.
I want to post html form from my Windows Form application but don't want to use WebBrowser…

Software Engineer
- 3,906
- 1
- 26
- 35
4
votes
2 answers
HTML Parser validate tags
I need html parse which have capability to identify error in generated html and if tags are not closed then close it and return the valid html.
More detail: i am getting data from database and break that record to show partial detail on my website…

Xulfee
- 958
- 3
- 13
- 43
4
votes
3 answers
Is there a handy html-parser can be used in Nativescript
I had tried Jquery, Parse5, JsDom, and found that they can't work in nativescript. Jquery is Dom-dependent, and Parse5 and JsDom depend on Node.js which is not supported by nativescript now. What I want is only a html-parser, is it possible to…

yoshiyan
- 41
- 3