I recently posted an answer with the following link:
https://cache-elastic-pandora.ecn.cl/emol/noticia/_search?q=publicada:true+AND+ultimoMinuto:true+AND+seccion:+AND+temas.id:&sort=fechaModificacion:desc&size=15&from=45
Actual link here:
https://cache-elastic-pandora.ecn.cl/emol/noticia/_search?q=publicada:true+AND+ultimoMinuto:true+AND+seccion:*+AND+temas.id:*&sort=fechaModificacion:desc&size=15&from=45
And was surprised that StackOverflow isn't able to accurately mark-up this hyperlink.
I know this isn't comprehensive (and missing quite a bit), but even a very crude regex up until a space, with a negative lookbehind to remove ending punctuations, is able to capture this:
https?:\/\/[^\s]+(?<![,.)\]?!])
https://regex101.com/r/9ZblaL/2/
Does anyone know what the StackExchange link-markup uses? And what might be a better regex that can be used to parse basic web links?
Update: I think the link itself has characters that are being interpreted as markup and then stripped before the link itself is being constructed (for example, the character *
).