How to parse HTML without a library in Java?

Question

I need to parse an HTML document, extract all the URLs and the content of the page, and save them to a database. I don't want to use any library. I can identify link tags by looking for <a, but how can I extract all the content or useful text from the HTML tags?

If you can't use a library. Copy paste everything it did? lmao — papaya, Feb 09 '20 at 08:15

score 0 · Answer 1 · answered Feb 09 '20 at 08:15

You can try the HTML parser that ships with the JDK: https://docs.oracle.com/javase/8/docs/api/javax/swing/text/html/parser/Parser.html

For a usage sample, see: How to extract info from HTML with Java's own Parser?
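
As a rough illustration of that approach, here is a minimal sketch that collects the href values of <a> tags and the text the parser reports, using only classes from the JDK's javax.swing.text.html packages. The names HtmlExtractor, Result and extract are made up for this example and are not part of any API. The sketch assumes the input is already available as a Reader, and it makes no attempt to filter out script or style content or to normalize whitespace.

```java
import java.io.IOException;
import java.io.Reader;
import java.util.ArrayList;
import java.util.List;
import javax.swing.text.MutableAttributeSet;
import javax.swing.text.html.HTML;
import javax.swing.text.html.HTMLEditorKit;
import javax.swing.text.html.parser.ParserDelegator;

// Hypothetical helper class; the name and shape are assumptions for this sketch.
public class HtmlExtractor {

    // Simple holder for what was found in the document.
    public static class Result {
        public final List<String> links = new ArrayList<>();
        public final StringBuilder text = new StringBuilder();
    }

    public static Result extract(Reader html) throws IOException {
        final Result result = new Result();

        HTMLEditorKit.ParserCallback callback = new HTMLEditorKit.ParserCallback() {
            @Override
            public void handleStartTag(HTML.Tag tag, MutableAttributeSet attrs, int pos) {
                // Record the href attribute of every <a> tag that has one.
                if (tag == HTML.Tag.A) {
                    Object href = attrs.getAttribute(HTML.Attribute.HREF);
                    if (href != null) {
                        result.links.add(href.toString());
                    }
                }
            }

            @Override
            public void handleText(char[] data, int pos) {
                // Append each text chunk, separated by a single space.
                if (result.text.length() > 0) {
                    result.text.append(' ');
                }
                result.text.append(data);
            }
        };

        // The third argument (ignoreCharSet = true) tells the parser not to
        // react to a charset declared inside the document, since the Reader
        // has already decoded the bytes.
        new ParserDelegator().parse(html, callback, true);
        return result;
    }
}
```

Calling HtmlExtractor.extract(new StringReader(pageSource)) would then give you result.links and result.text, which you can store in the database however you like. Keep in mind that this parser targets old HTML (3.2) and can be lenient or lossy on modern pages.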

Alex Chernyshev