I have a specific HTML source file that I need to scan and parse, and I am having trouble. I understand that HTML isn't supposed to be parsed with regular expressions, but this is part of the assignment, so I do not have any choice.
So far the regex I have is
<[^/!].*?>
for start tags
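To make the behaviour of that pattern concrete, here is a minimal sketch that runs it with Python's re module. The language is an assumption (the post does not say which one the assignment uses), and the sample string is made up rather than taken from the assignment file.

```python
import re

# Start-tag pattern from the post: "<", then one character that is
# neither "/" nor "!", then the shortest run of characters up to ">".
START_TAG = re.compile(r"<[^/!].*?>")

# Hypothetical sample input, not the real assignment file.
sample = '<html><!-- note --><body class="x"><p>Hello</p></body></html>'

# End tags ("</...") and comments ("<!...") are skipped by [^/!].
start_tags = START_TAG.findall(sample)
print(start_tags)  # ['<html>', '<body class="x">', '<p>']
```

Note that "." does not match a newline by default in Python, so a start tag that spans several lines would probably be missed unless the pattern is compiled with re.DOTALL.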
I have other regexes for end tags and comments, which work fine, but I cannot seem to type them here.
I am having trouble coming up with a regex to detect all the text between tags or in the body.
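One possible direction, sketched here in Python as an assumption about the language, is to strip comments first and then capture every run of characters that sits between a ">" and the next "<". The names COMMENT, TEXT and text_between_tags are hypothetical, and the sample string is invented. This is only a rough sketch: it would likely break on attribute values that contain ">" or on script and style content.

```python
import re

# Comments are removed first so that their contents are not reported as text.
COMMENT = re.compile(r"<!--.*?-->", re.DOTALL)

# A run of non-bracket characters between a closing ">" and the next "<".
TEXT = re.compile(r">([^<>]+)<")

def text_between_tags(html):
    """Return the non-empty text chunks found between tags."""
    without_comments = COMMENT.sub("", html)
    chunks = (chunk.strip() for chunk in TEXT.findall(without_comments))
    return [chunk for chunk in chunks if chunk]

# Hypothetical sample input, not the real assignment file.
sample = '<html><!-- note --><body class="x"><p>Hello</p> world</body></html>'
print(text_between_tags(sample))  # ['Hello', 'world']
```

Whitespace-only chunks (for example, line breaks between adjacent tags) are dropped by the final filter; keeping them would only require removing the strip and the filter.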
I would greatly appreciate any help possible.