<p>I'd like to find the string between the two paragraph tags.</p><br><p>And also this string</p>
How would I get the string between the first two paragraph tags? And then, how would I get the string between the 2nd paragraph tags?
<p>I'd like to find the string between the two paragraph tags.</p><br><p>And also this string</p>
How would I get the string between the first two paragraph tags? And then, how would I get the string between the 2nd paragraph tags?
import re
matches = re.findall(r'<p>.+?</p>',string)
The following is your text run in console.
>>>import re
>>>string = """<p>I'd like to find the string between the two paragraph tags.</p><br><p>And also this string</p>"""
>>>re.findall('<p>.+?</p>',string)
["<p>I'd like to find the string between the two paragraph tags.</p>", '<p>And also this string</p>']
If you want the string between the p tags (excluding the p tags) then add parenthesis to .+? in the findall method
import re
string = """<p>I'd like to find the string between the two paragraph tags.</p><br><p>And also this string</p>"""
subStr = re.findall(r'<p>(.+?)</p>',string)
print subStr
Result
["I'd like to find the string between the two paragraph tags.", 'And also this string']
In between <p>
and </p>
In [7]: content = "<p>I'd like to find the string between the two paragraph tags.</p><br><p>And also this string</p>"
In [8]: re.findall(r'<p>(.+?)</p>', content)
Out[8]:
["I'd like to find the string between the two paragraph tags.",
'And also this string']
.+?
',content)` ...... `second = split[1]`......it gives me an `index out of range` error. How do I get the 2nd element? – Zorgan May 30 '17 at 10:01(.+?)
',string) >>> split[1] 'And also this string'` – Isdj May 30 '17 at 10:21