I was working on a parser that could read HTML however the code that splits it causes "l"s to be inserted in every other entry of the produced array.
The regexp is this:
textarea.value.split(/(?=<(.|\n)+>)/)
What it's supposed to do is split entry/exit/single HTML/XML tags while ignoring tabs and line terminators (it just appends them to tags they were split with)
May I have some insite as to what's happening? You can view code in action and edit here: http://jsfiddle.net/termtm/ew7Mt/2/ Just look in console for result it produces.
EDIT: MaxArt is right the l in last <html>
causes the anomalies to be "l"s