How to use * or + with brackets in regular expressions in Python?

Question

The input contains multiple space-separated characters, e.g. string = "a b c d a s e "

What should the pattern be so that, after calling re.search on the input, .group(j) returns the j-th character together with the space that follows it?

I tried something like "^(([a-zA-Z])\s)+", but it does not work. What should I do?

EDIT: My actual question is the one in the title; the body describes only a special case of it. Here is the general version: if I need to extract every occurrence of a specific pattern (the initial question used "[a-zA-Z]\s") from a string, what should I do?

score 6 · Accepted Answer · answered Mar 03 '16 at 17:04

Use findall() instead and get the j-th match by index:

>>> import re
>>> string = "a b c d a s e "
>>> j = 2
>>> re.findall(r"[a-zA-Z]\s", string)[j]
'c '

where [a-zA-Z]\s matches a lower- or upper-case letter followed by a single whitespace character (\s also matches tabs and newlines, not only a space). Note that j is a 0-based list index here.
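To illustrate why the pattern from the question does not give per-item groups, here is a minimal sketch. It assumes the sample string from the question; the variable names (whole, last_pair, pairs, spans) are made up for illustration. A quantified group such as (...)+ keeps only its last iteration, whereas findall() and finditer() return every match.

```python
import re

string = "a b c d a s e "

# A group under + (or *) is overwritten on each repetition,
# so after the match it holds only the final iteration.
m = re.search(r"^(([a-zA-Z])\s)+", string)
whole = m.group(0)        # the entire matched run
last_pair = m.group(1)    # only the final "letter + whitespace"
last_letter = m.group(2)  # only the final letter

# findall() returns every non-overlapping match as a list of strings.
pairs = re.findall(r"[a-zA-Z]\s", string)

# finditer() yields match objects, handy when positions are needed as well.
spans = [(mo.group(), mo.start()) for mo in re.finditer(r"[a-zA-Z]\s", string)]

print(last_pair, pairs[2], spans[2])
```

So for the general version of the question (collect all occurrences of a pattern), findall() or finditer() is usually the simpler tool than a repeated capture group.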

alecxe


uhm, what does 'r' mean in .findall(r"[a-zA-Z]\s",..) ? – Soham Mar 04 '16 at 08:58
@LucyferZedd this just means the string is "raw" (http://stackoverflow.com/questions/2241600/python-regex-r-prefix). – alecxe Mar 04 '16 at 14:58
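As a small sketch of what the raw-string prefix changes (the example strings below are made up for illustration): with r"...", backslashes reach the regex engine untouched, while in a normal string literal Python processes escapes such as \b first.

```python
import re

# These two literals produce the same string; the raw form just avoids
# doubling the backslash.
raw = r"[a-zA-Z]\s"
escaped = "[a-zA-Z]\\s"
same = raw == escaped

# "\b" in a normal string is a backspace character, so the pattern looks
# for literal backspaces; r"\b" is the regex word boundary.
plain_b = re.findall("\bcat\b", "cat concat")
raw_b = re.findall(r"\bcat\b", "cat concat")

print(same, plain_b, raw_b)
```

Writing regex patterns as raw strings by default avoids this class of surprise.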

score 5 · Answer 2 · answered Mar 03 '16 at 17:04

Why use a regex when you can simply call str.split() and access the characters by index? Note that split() discards the spaces, so each item is the bare character:

>>> string = "a b c d a s e "
>>> new = string.split()
>>> new
['a', 'b', 'c', 'd', 'a', 's', 'e']
>>> new[2]
'c'
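A brief sketch of how the split() approach compares with the regex one. The second input (mixed) is a hypothetical string added for illustration: split() returns every whitespace-separated token, while the pattern [a-zA-Z]\s keeps only letters that are followed by whitespace.

```python
import re

string = "a b c d a s e "
tokens = string.split()

# split() drops the separators; re-attach a space if the
# "character + space" form from the question is needed.
j = 2
pair = tokens[j] + " "

# split() does no filtering, the regex does.
mixed = "a 1 b "
split_tokens = mixed.split()
regex_tokens = re.findall(r"[a-zA-Z]\s", mixed)

print(pair, split_tokens, regex_tokens)
```

If the input is known to contain only single letters separated by spaces, split() is enough; the regex is only needed when the tokens have to be validated or filtered.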

Mazdak


score 1 · Answer 3 · answered Mar 03 '16 at 17:11

You could do:

>>> import re
>>> string = "a b c d a s e "
>>> j = 2
>>> re.search(r'([a-zA-Z]\s){%i}' % j, string).group(1)
'b '

Explanation:

With the pattern ([a-zA-Z]\s) you capture a letter followed by a whitespace character.
With the repetition {2} added, the group is matched twice but retains only its last iteration, in this case the second pair (so j is 1-based here, unlike 0-based list indexing).
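The repetition trick above can be wrapped in a small helper. This is only a sketch: jth_pair is a hypothetical name, and it uses re.match instead of re.search (a deliberate variation) so that counting always starts at the beginning of the string. It returns None when fewer than j pairs are available, rather than failing on .group().

```python
import re

def jth_pair(text, j):
    """Return the j-th (1-based) letter+whitespace pair at the start of text, or None."""
    if j < 1:
        raise ValueError("j must be >= 1")
    # The group is repeated exactly j times; it keeps only the last iteration,
    # which is the j-th pair.
    m = re.match(r"([a-zA-Z]\s){%d}" % j, text)
    return m.group(1) if m else None

string = "a b c d a s e "
print(jth_pair(string, 2))
print(jth_pair(string, 8))
```

Because the pattern is rebuilt for every j, this suits one-off lookups; for repeated access, a single findall() call followed by list indexing is likely cheaper.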

edited Mar 03 '16 at 21:23

answered Mar 03 '16 at 17:11

dawg

