After seeing the famous reply about parsing HTML with regex: RegEx match open tags except XHTML self-contained tags
I've tried to figure out how to do this craziness. Is there a tool that assists with finding these strange characters? How was this done?
Months later, I found this useful as well: http://www.marlborotech.com/Zalgo.html