I'm trying to grab the title out with of a webpage using the following statement:
titl1 = re.findall(r'<title>(.*?)</title>',the_webpage)
Using that, I get ['random webpage example1']
. How do I remove the quotes and brackets?
I'm also trying to grab a set of links that change hourly (which is why I need the wildcard) using this: links = re.findall(r'(file=(.*?).mp3)',the_webpage)
.
I get
[('file=http://media.kickstatic.com/kickapps/images/3380/audios/944521.mp3',
'http://media.kickstatic.com/kickapps/images/3380/audios/944521'),
('file=http://media.kickstatic.com/kickapps/images/3380/audios/944521.mp3',
'http://media.kickstatic.com/kickapps/images/3380/audios/944521'),
('file=http://media.kickstatic.com/kickapps/images/3380/audios/944521.mp3',
'http://media.kickstatic.com/kickapps/images/3380/audios/944521')]
How do I get the mp3 links without the file=
?
I also want to download the mp3 files and append them with the title of the website so it will show
random webpage example1.mp3
How would I do this? I'm still learning Python and regex and this is kinda stumping me.