81

I have a list of regexes in python, and a string. Is there an elegant way to check if the at least one regex in the list matches the string? By elegant, I mean something better than simply looping through all of the regexes and checking them against the string and stopping if a match is found.

Basically, I had this code:

list = ['something','another','thing','hello']
string = 'hi'
if string in list:
  pass # do something
else:
  pass # do something else

Now I would like to have some regular expressions in the list, rather than just strings, and I am wondering if there is an elegant solution to check for a match to replace if string in list:.

starball
  • 20,030
  • 7
  • 43
  • 238
houbysoft
  • 32,532
  • 24
  • 103
  • 156

5 Answers5

112
import re

regexes = [
    "foo.*",
    "bar.*",
    "qu*x"
    ]

# Make a regex that matches if any of our regexes match.
combined = "(" + ")|(".join(regexes) + ")"

if re.match(combined, mystring):
    print "Some regex matched!"
Ned Batchelder
  • 364,293
  • 75
  • 561
  • 662
105
import re

regexes = [
    # your regexes here
    re.compile('hi'),
#    re.compile(...),
#    re.compile(...),
#    re.compile(...),
]

mystring = 'hi'

if any(regex.match(mystring) for regex in regexes):
    print 'Some regex matched!'
nosklo
  • 217,122
  • 57
  • 293
  • 297
  • If working in python 2.4, you won't have `any` - see http://stackoverflow.com/questions/3785433/python-backports-for-some-methods – Sam Heuck Sep 12 '13 at 19:42
  • 4
    How is this *"something better than simply looping through all of the regexes and checking them against the string and stopping if a match is found"*? I guess the combination of Ned's and this answer could be a winner though... – johndodo Jan 21 '14 at 15:26
13

Here's what I went for based on the other answers:

raw_list = ["some_regex","some_regex","some_regex","some_regex"]
reg_list = map(re.compile, raw_list)

mystring = "some_string"

if any(regex.match(mystring) for regex in reg_list):
    print("matched")
user5056973
  • 387
  • 5
  • 16
7

A mix of both Ned's and Nosklo's answers. Works guaranteed for any length of list... hope you enjoy

import re   
raw_lst = ["foo.*",
          "bar.*",
          "(Spam.{0,3}){1,3}"]

reg_lst = []
for raw_regex in raw_lst:
    reg_lst.append(re.compile(raw_regex))

mystring = "Spam, Spam, Spam!"
if any(compiled_reg.match(mystring) for compiled_reg in reg_lst):
    print("something matched")
Anderas
  • 630
  • 9
  • 20
2

If you loop over the strings, the time complexity would be O(n). A better approach would be combine those regexes as a regex-trie.

ospider
  • 9,334
  • 3
  • 46
  • 46