get only x.x.x from x.x.x-SNAPSHOT from this expression

Question

INPUT- cat my.txt

The version of the current file

<version>x.x.x-SNAPSHOT</version>

Desired output:

x.x.x

(which are digits and dynamic values)

Tried multiple grep and awk commands but no luck.

[Don't Parse XML/HTML With Regex.](https://stackoverflow.com/a/1732454/3776858) I suggest to use an XML/HTML parser (xmlstarlet, xmllint ...). — Cyrus, Jun 28 '20 at 19:11

Gilles Quénot · Answer 1 · 2020-06-29T08:15:16.777

1

Like this:

xmllint --xpath '
    substring-before(//*[contains(text(), "-SNAPSHOT")]/text(), "-SNAPSHOT")
' file.xml

From a pipe:

curl -s 'http://example.com/query_string' |
    xmllint --xpath '
        substring-before(//*[contains(text(), "-SNAPSHOT")]/text(), "-SNAPSHOT")
' -

You can replace trailing - by /dev/stdin.

Output

x.x.x

Note

Don't parse XML/HTML with regex, use a proper XML/HTML parser and a powerful xpath query.

You can use one of the following :

xmllint often installed by default with libxml2-utils, xpath1

xmlstarlet can edit, select, transform... Not installed by default, xpath1

xpath installed via perl's module XML::XPath, xpath1

xidel xpath3

saxon-lint my own project, wrapper over @Michael Kay's Saxon-HE Java library, xpath3

or you can use high level languages and proper libs, I think of :

python's lxml (from lxml import etree)

perl's XML::LibXML, XML::XPath, XML::Twig::XPath, HTML::TreeBuilder::XPath

ruby nokogiri, check this example

php DOMXpath, check this example

Check: Using regular expressions with HTML tags

edited Jun 29 '20 at 08:15

answered Jun 28 '20 at 19:00

Gilles Quénot

173,512
41
224
223

Could you please suggest any grep or awk commands, xmllint is not working for me – codecracker Jun 28 '20 at 19:11
1

No, regex tools are not the way to parse XML. Which OS/distro/version ? – Gilles Quénot Jun 28 '20 at 19:11
I am getting this as output of a curl command, i want to use pipe "|" grep or awk to cut the x.x.x as an output, Please suggest something similar command – codecracker Jun 29 '20 at 05:39
I thought that was obvious, anyway, post edited accordingly – Gilles Quénot Jun 29 '20 at 08:15

get only x.x.x from x.x.x-SNAPSHOT from this expression

1 Answers1

Output

Note

or you can use high level languages and proper libs, I think of :