I need help with a regex that finds the first two words at the start then takes only the first two sentences after, despite how many instances occur in the text.
text = "The Smithsonian museum is home to a variety of different art displays. According various reports art appreciation is on the rise. Blah blah blah blah. The Smithsonian museum blah blah blah. Blah blah blah blah."
My code looks something like this:
(re.findall(r"""((The Smithsonian|The Metropolitan)[^\.]*\.[^\.]*\.)""", text))
However, this is returning multiple instances instead of just the first two sentences, and oftentimes it returns junk like "The Smithsonian, " at the end. Can you please help? Thanks!