Python re find all combinations print as string

Question

Using Python regular expressions, how can I find all instances of a combination and print each one on a new line?

Example:

import re

x = "a=123,b=123,c=123,d=123,a=456,b=456...etc"
y = re.search('a=(.*?),', x)
print(y)

Trying to get:

123
456

Have you read the `re` documentation? It has examples of how to do this. — Code-Apprentice, Sep 19 '17 at 23:55
Possible duplicate of [How can I find all matches to a regular expression in Python?](https://stackoverflow.com/questions/4697882/how-can-i-find-all-matches-to-a-regular-expression-in-python) — Code-Apprentice, Sep 19 '17 at 23:56
@JSimonsen do you want to use regex for any particular reason? findall is much easier – Chetan_Vasudevan, Sep 19 '17 at 23:58
@ChetanVasudevan Essentially pulling info from Logs and displaying just the parts I want to see. Just looking for a clean way to do that and present it — JSimonsen, Sep 20 '17 at 00:15
re.findall() should probably be enough I guess as Ajax1234 answered — Chetan_Vasudevan, Sep 20 '17 at 00:41

hyper-neutrino · Accepted Answer · 2017-09-20T00:05:19.590

The regular expression

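First of all, neither your pattern nor the function you call fits what you want. a=(.*?), matches a= and then lazily captures everything up to the next comma, so it only ever looks at the key a. More importantly, re.search stops at the first match and returns a match object, so print(y) shows that object rather than the captured text. If you want the value for every key, match one or more letters, an equal sign, and then one or more digits.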

[A-Za-z]+=(\d+)  Regular Expression
        +        At least one
[A-Za-z]         (English) letter
         =       An equals sign
          (   )  Group 1
             +   At least one
           \d    digit

Also, use re.findall not re.search.

Then, doing re.findall(r"[A-Za-z]+=(\d+)", x) will give the list of strings, which you can print, parse, whatever.
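As a rough sketch of the difference, here is the question's sample string with the elided "...etc" tail dropped. The variable names are only illustrative, and the last pattern assumes that only the values of the key a are wanted, as the question's expected output suggests.

```python
import re

# Sample data from the question, without the elided "...etc" tail.
x = "a=123,b=123,c=123,d=123,a=456,b=456"

# re.search stops at the first match and returns a Match object (or None).
first = re.search(r"a=(.*?),", x)
first_value = first.group(1)

# re.findall returns the captured group of every non-overlapping match.
all_values = re.findall(r"[A-Za-z]+=(\d+)", x)

# Assumption: only values of the key "a" are wanted. The \b word boundary
# keeps the pattern from matching keys that merely end in "a".
a_values = re.findall(r"\ba=(\d+)", x)

# Print each value on its own line.
print("\n".join(a_values))
```

With the letter-class pattern every value is returned regardless of its key, so narrowing to one key means naming that key in the pattern.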

Also, there might be a better way of doing this: if the data is exactly as you format it, you can just use regular string operations:

a = "a=123,b=456,c=789"
b = a.split(",") # gets ["a=123", "b=456", "c=789"]
c = [E.split("=") for E in b] # gets [["a", "123"], ["b", "456"], ["c", "789"]]

Then, if you want to turn this into a dictionary, you can use dict(c). If you want to print the values, do for E in c: print(E[1]). Etc.
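One caveat with dict(c): a dictionary keeps only the last value per key, so with repeated keys such as the question's sample (a=123 ... a=456) the earlier values are lost. Below is a minimal sketch of one way around that, grouping values per key with collections.defaultdict. It assumes every item contains an equals sign, and the names pairs, last_seen, and grouped are illustrative.

```python
from collections import defaultdict

# Sample data from the question, without the elided "...etc" tail.
data = "a=123,b=123,c=123,d=123,a=456,b=456"

# Split into [key, value] pairs; maxsplit=1 keeps any "=" inside a value intact.
pairs = [item.split("=", 1) for item in data.split(",")]

# dict() keeps only the last value seen for each key.
last_seen = dict(pairs)

# Grouping keeps every value, in order of appearance.
grouped = defaultdict(list)
for key, value in pairs:
    grouped[key].append(value)

# Print each value of key "a" on its own line.
for value in grouped["a"]:
    print(value)
```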

Ajax1234 · Answer 2 · 2017-09-20T00:02:16.547

Just use re.findall:

import re
x = "a=123,b=123,c=123,d=123,a=456,b=456...etc"
# Raw string so the backslashes reach the regex engine unchanged.
final_data = re.findall(r"(?<=a\=)\d+", x)
for i in final_data:
    print(i)

Output:

123
456

This regular expression uses a positive lookbehind to make sure that the digits come directly after a=:

\d+: matches one or more consecutive digits, stopping at the first non-digit character (here, the comma before the next pair).

(?<=a\=): asserts that a= appears immediately before the current position without making it part of the match, so only the digits are returned.
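For comparison, a capturing group gives the same result as the lookbehind on the question's sample data (again minus the elided tail). The second string, with its key ba, is made up for illustration: it shows that the lookbehind on its own also accepts keys that merely end in a, which a \b word boundary avoids.

```python
import re

# Sample data from the question, without the elided "...etc" tail.
x = "a=123,b=123,c=123,d=123,a=456,b=456"

# Lookbehind: "a=" must precede the digits but is not part of the match.
lookbehind = re.findall(r"(?<=a=)\d+", x)

# Capturing group: "a=" is matched, but findall returns only the group.
captured = re.findall(r"a=(\d+)", x)

# Hypothetical input with a key that ends in "a".
tricky = "ba=1,a=2"
loose = re.findall(r"(?<=a=)\d+", tricky)   # accepts both "ba=" and "a="
strict = re.findall(r"\ba=(\d+)", tricky)   # accepts only the key "a"

print("\n".join(lookbehind))
```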

edited Sep 20 '17 at 00:02

answered Sep 19 '17 at 23:56

I'd recommend you explain the regular expression for those (including me) who might not understand it / are new to regex – hyper-neutrino Sep 19 '17 at 23:59
@HyperNeutrino thank you for your suggestion. Please see my recent edit. – Ajax1234 Sep 20 '17 at 00:02
