I'm trying to extract list of movies(titles), dateTime and whether the movie is "MX4D-2D" or "VIP" etc... from this website The website is using javascript to dynamically load content, so I used file get contents and strip tags on the website and now i'm left with the javascript in a plaint text format with the list of movies information, it's formatted so:
movieData = {
'2019-11-16': [
/*will have to change nowPlaying to have separate dates everywhere */
{
'url': 'the-addams-family',
'image-portrait': 'https://d10u9ygjms7run.cloudfront.net/0009/1573662077853_HO00002023.jpeg',
'image-landscape': 'https://d10u9ygjms7run.cloudfront.net/0009/1573662079231_h-HO00002023.jpeg',
'title': 'The Addams Family',
'releaseDate': '2019-10-17',
'endpoint': 'HO00002023',
'duration': '87 mins',
'rating': 'Rated PG',
'director': 'Greg Tiernan, Conrad Vernon',
'actors': 'Charlize Theron, Oscar Isaac, Chloë Grace Moretz, Allison Janney, Elsie Fisher, Nick Kroll, Bette Midler, Finn Wolfhard, Aimee Garcia',
'times': [
{ 'type': '','time': '12:45pm', 'bookingLink': 'https://themoviesticketing.com/ticketing/visSelectTickets.aspx?cinemacode=0009&txtSessionId=41264&visLang=1', 'attributes': [] },
]
},
{
'url': 'black-and-blue',
'image-portrait': 'https://d10u9ygjms7run.cloudfront.net/0009/1573662057611_HO00002024.jpeg',
'image-landscape': 'https://d10u9ygjms7run.cloudfront.net/0009/1573662058845_h-HO00002024.jpeg',
'title': 'Black and Blue',
'releaseDate': '2019-10-24',
'endpoint': 'HO00002024',
'duration': '108 mins',
'rating': 'Rated R',
'director': 'Deon Taylor',
'actors': 'Naomie Harris, Frank Grillo, Tyrese Gibson, Mike Colter, Reid Scott, Beau Knapp, Nafessa Williams',
'times': [
{ 'type': '','time': '10:00pm', 'bookingLink': 'https://themoviesticketing.com/ticketing/visSelectTickets.aspx?cinemacode=0009&txtSessionId=41257&visLang=1', 'attributes': [] },
{ 'type': '','time': '11:15pm', 'bookingLink': 'https://themoviesticketing.com/ticketing/visSelectTickets.aspx?cinemacode=0009&txtSessionId=41229&visLang=1', 'attributes': [] },
]
},
It also contains additional js that I don't need, is there an easy way to remove them and grab only the information i need? My end goal is store this in a db so i can keep track of movies from different cinema's. The full code is here: https://pastebin.com/TA0rfSB8