To practice my skills, I'm going to write an HTML parser. The idea I have in mind:
- Define what I want to tokenize via regex.
- Accept some HTML as a string.
- Loop through the HTML string.
- Save information about each token, such as its content and position, as an object.
- If a token contains another token, the inner token becomes a child object of the parent token.
- Finish the object graph.
- Create appropriate getters and setters.
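
To make the steps above concrete, here is a rough sketch of what I have in mind, written in Python. The names `TOKEN_RE`, `Node`, and `parse` are placeholders I made up, and the regex is only an assumption about what I would tokenize: it recognizes simple opening tags, closing tags, and text, and it does not handle attribute values containing `>`, comments, or void tags such as `<br>`.

```python
import re
from dataclasses import dataclass, field
from typing import List

# Hypothetical token pattern: a closing tag, an opening tag, or a run of text.
TOKEN_RE = re.compile(r"</(?P<close>\w+)\s*>|<(?P<open>\w+)[^>]*>|(?P<text>[^<]+)")


@dataclass
class Node:
    kind: str      # "root", "element", or "text"
    content: str   # tag name for elements, raw text for text nodes
    start: int     # offset of the first character in the input
    end: int       # offset just past the last character
    children: List["Node"] = field(default_factory=list)


def parse(html: str) -> Node:
    root = Node("root", "", 0, len(html))
    stack = [root]  # the innermost open element is always stack[-1]
    for m in TOKEN_RE.finditer(html):
        if m.group("open"):
            # An opening tag becomes a child of the current element
            # and then becomes the current element itself.
            node = Node("element", m.group("open"), m.start(), m.end())
            stack[-1].children.append(node)
            stack.append(node)
        elif m.group("close"):
            # Close only when the name matches the innermost open element;
            # mismatched closing tags are silently ignored in this sketch.
            if len(stack) > 1 and stack[-1].content == m.group("close"):
                stack[-1].end = m.end()
                stack.pop()
        elif m.group("text").strip():
            # Whitespace-only text is skipped; other text is stored as a leaf.
            stack[-1].children.append(
                Node("text", m.group("text"), m.start(), m.end())
            )
    return root


tree = parse("<div><p>Hello</p> world</div>")
div = tree.children[0]
print(div.content, [child.content for child in div.children])
```

The stack is what turns the flat token stream into parent and child objects. Since the fields of `Node` are plain attributes, explicit getters and setters may not be needed in Python.
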
Would you say this makes sense?