I want to replace every <span...> tag (including <span id="... and <span class="...) in an HTML document with a plain <span>, except when the tag starts with <span id="textmarker (for example, I don't want to keep this span: <span attr="blah" id="textmarker">).
I've tried the regexes proposed here and here, and finally came up with the regex below. It never returns a <span id="textmarker tag, but it sometimes misses the other spans:
<span(?!.*? id="textmarker).*?">
You can see my (simplified) HTML here: https://regex101.com/r/yT9jG2/2
Strangely, if I run the regex in Notepad++ it returns 3 matches (the three spans in the second paragraph), but regex101 returns only 1 match. Both Notepad++ and regex101 miss the span in the first paragraph.
This regex also doesn't return every span it should (cf. the spans with gray highlights here):
<span(?![^>]*? id="textmarker)[^>]*?>
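To reproduce the difference between the two patterns outside Notepad++ and regex101, here is a minimal sketch using Python's re module. The HTML sample is made up for illustration (it is not the HTML behind the regex101 link), and the variable names are arbitrary.

```python
import re

# Hypothetical sample, NOT the HTML from the regex101 link.
html = (
    '<p><span class="a">one</span> <span id="textmarker1">keep</span></p>\n'
    '<p><span id="x">two</span> <span attr="blah" id="textmarker">three</span></p>\n'
    '<p><span class="b">four</span></p>'
)

# First pattern: `.` also matches `>`, so the lookahead can run past the end
# of the current tag and see a textmarker id in a later tag on the same line.
# The pattern also requires the tag to end in `">`.
line_wide = re.compile(r'<span(?!.*? id="textmarker).*?">')

# Second pattern: [^>] keeps both the lookahead and the match inside one tag.
tag_scoped = re.compile(r'<span(?![^>]*? id="textmarker)[^>]*?>')

first_matches = line_wide.findall(html)
second_matches = tag_scoped.findall(html)

# Replace every tag the second pattern finds with a plain <span>.
cleaned = tag_scoped.sub('<span>', html)

print(first_matches)   # ['<span class="b">']
print(second_matches)  # ['<span class="a">', '<span id="x">', '<span class="b">']
print(cleaned)
```

On this sample the first pattern only finds the span in the last paragraph: in the first two paragraphs a textmarker id appears later on the same line, so the lookahead rejects the earlier spans too. The second pattern finds all three non-textmarker spans here. Note that it also leaves <span attr="blah" id="textmarker"> untouched, so if that tag should be replaced as well, the lookahead would have to be anchored directly after <span.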