I'm trying to grab paragraphs of text off a website with RegEx to put into a Python list, but for this particular website I'm having difficulty with formatting the RegEx to capture all the events. Can anyone help with gathering results from all instances? Or at least tell me if it's not practical and I'll find an alternate website.
from re import *
from urllib.request import urlopen
## Create Empty List
EventInfoListBEC = []
## Asign Website to a Variable
WebsiteBEC = 'https://www.brisent.com.au/Event-Calendar'
## Search for Event Info
EventInfoBEC = findall('<p class="event-description">(.+?)</p>', WebsiteBEC)
## Add Event Info to Event Info List and Print Details
print('Event Info appears', len(EventInfoBEC), 'times (BEC).')
for EventInfo in EventInfoBEC:
EventInfoListBEC.append(EventInfo)
print(EventInfoListBEC)
## There are Three Styles of Input from the HTML File
# One
<p class="event-description"><p>This is a sport where 8 seconds can cost you everything. Welcome to the world of the PBR.</p>
</p>
# Two
<p class="event-description"><p style="text-align: justify; color: rgb(0, 0, 0); font-family: sans-serif; font-size: 12px;">Fresh off the back of winning a Brit Award for ‘British Artist Video of the Year’ for ‘Woman Like Me’, and two Global Awards for ‘Best Group’ and ‘Best Song’; pop superstars Little Mix today announce that five new Australian shows have been added to 'LM5 - The Tour' for 2019!</p>
</p>
#Three
<p class="event-description"><p style="font-family: sans-serif; font-size: 12px; color: rgb(0, 0, 0); text-align: center;"><strong>OPENING NIGHT PERFORMANCE ADDED!</strong></p>
<p style="font-family: sans-serif; font-size: 12px; color: #000000; text-align: justify;">The world’s most beloved movie-musical comes to life on the arena stage like you’ve never seen it before! From the producers of GREASE - THE ARENA EXPERIENCE comes this lavish new arena production of THE WIZARD OF OZ.</p>