1

Possible Duplicates:
What HTML parsing libraries do you recommend in Java
Need a good HTML parser on php
What are the pros and cons of the leading Java HTML parsers?

Hello, I have used http://simplehtmldom.sourceforge.net/ for parsing html in php and is there is any thing similar to that....

Community
  • 1
  • 1
  • 3
    I think at least 5 relevant questions were shown in "Related questions" for this title. – Bozho Aug 02 '10 at 11:14

5 Answers5

2

IMHO,Jsoup is the best HTML Parser

chedine
  • 2,384
  • 3
  • 19
  • 24
0

There are a few listed here: http://java-source.net/open-source/html-parsers However JTidy is very popular among Java developers.

Behrang
  • 46,888
  • 25
  • 118
  • 160
0

The Mozilla Java Html Parser may be of interest to you

Colin Pickard
  • 45,724
  • 13
  • 98
  • 148
0

you can use Jericho HTML Parser or Mozilla Java Html Parser

Neel
  • 429
  • 7
  • 17
0

The TagSoup parser was a real good choice for me because HTML is, depending on it's author, not always well-formed. Since most parsers expect well-formed xml (or html, for that matter), I needed to use TagSoup in order to be able to parse all html that was fed to my application.

f1sh
  • 11,489
  • 3
  • 25
  • 51