I want to parse an HTML file for these reasons:
- To the get the content between tags. For example, between a pair of para tags
- To find the occurrence of break tags
- To get the attributes of a tag. For example, to get the value of COLOR in FONT COLOR="red">
I need to do this in Java. I know the basics of the Jehrico parser. How can I do it?