I need to extract links and images from HTML pages using C++ and regex, and I came up with this expression:
<\s*(a.*?href|img.*?src)\s*=\s*\"(.*?)\".*?\s*>
Unfortunately, this also matches links and images inside comments, which it shouldn't. I tried some negative lookaheads without success.
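
One way to sidestep the lookahead problem, sketched here with std::regex, is to strip the <!-- ... --> comments first and only then apply the expression above to what is left. The helper names strip_comments and extract_urls are made up for this illustration, and the sketch assumes that every "<!--" really starts a comment (it does not account for that sequence appearing inside scripts or attribute values).

```cpp
#include <cstddef>
#include <regex>
#include <string>
#include <vector>

// Hypothetical helper: copy the input while skipping every <!-- ... --> block.
// An unterminated comment is assumed to run to the end of the input.
std::string strip_comments(const std::string& html) {
    std::string out;
    std::size_t pos = 0;
    while (true) {
        const std::size_t open = html.find("<!--", pos);
        if (open == std::string::npos) {
            out.append(html, pos, std::string::npos);  // no more comments
            break;
        }
        out.append(html, pos, open - pos);             // text before the comment
        const std::size_t close = html.find("-->", open + 4);
        if (close == std::string::npos) break;         // unterminated comment
        pos = close + 3;                               // resume after "-->"
    }
    return out;
}

// Hypothetical helper: apply the expression from the question to the
// comment-free text and collect the second capture group (the URL).
std::vector<std::string> extract_urls(const std::string& html) {
    static const std::regex link_re(
        R"re(<\s*(a.*?href|img.*?src)\s*=\s*"(.*?)".*?\s*>)re",
        std::regex::icase);
    const std::string clean = strip_comments(html);
    std::vector<std::string> urls;
    for (std::sregex_iterator it(clean.begin(), clean.end(), link_re), end;
         it != end; ++it) {
        urls.push_back((*it)[2].str());
    }
    return urls;
}
```

Called on a page that contains a link, a commented-out image, and a normal image, extract_urls should return only the two URLs outside the comment. Removing comments with plain string searches rather than a second regex keeps that step linear and avoids relying on "." matching newlines, which it does not do in the default ECMAScript grammar of std::regex.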