Regex to grab number in line

Question

I have an html file that I am reading the below line from. I would like to grab only the number that appears after the ':' and before the ',' using REGEX... THANKS IN ADVANCE

"totalPages":15,"bloodhoundHtml"

Does it have to be a regular expression? – wwii Jul 30 '14 at 02:03 — wwii, Jul 30 '14 at 02:03

Jan Moritz · Accepted Answer · 2014-07-30T02:14:26.537

1

"totalPages":([0-9]*),

You can see the Demo here

Then the python code is

import re

p = re.compile('"totalPages":([0-9]*),')
print p.findall('"totalPages":15,"bloodhoundHtml"')

edited Jul 30 '14 at 02:14

answered Jul 30 '14 at 02:03

Jan Moritz

2,145
4
23
33

this is good but what if the number 15 is not always going to be 15? – Jonathan Scialpi Jul 30 '14 at 02:23
but arent we searching for a line specific to 15 in the findall? – Jonathan Scialpi Jul 30 '14 at 02:25
No the regex is defined in the line above. In findall we define the text we want to analyze. – Jan Moritz Jul 30 '14 at 02:27
I suggest you to read the official python doc about how to use regex https://docs.python.org/2/howto/regex.html – Jan Moritz Jul 30 '14 at 02:31

score 0 · Answer 2 · answered Jul 30 '14 at 02:02

you can try :\d+, to get the ':15,' then you can trim first':' and trim end ',' to get the pure numbers, I don't know if python can use variable in the regex, I'm a c# programe, in c#, I can use :(?<id>\d+), to match this string, and get the number directly by result.group["id"]

score 0 · Answer 3 · edited May 23 '17 at 12:11

0

:\d{1,},

Also works for parsing the line you gave. According to this post, you might run into some trouble parsing the HTML

edited May 23 '17 at 12:11

Community

1
1

answered Jul 30 '14 at 02:07

Mark S.

301
1
4
10

so would it be something like re.compile('"totalPages":\d{1,},"bloodhoundHtml"' – Jonathan Scialpi Jul 30 '14 at 02:10

Regex to grab number in line

3 Answers3