Python urlopen "expected string or buffer"

Question

I am getting the "expected string or buffer" error in my simple Python script. I am trying to collect and print the titles of reddit articles.

from urllib import urlopen
import re


worldNewsPage = urlopen("https://www.reddit.com/r/worldnews/")

collectTitle = re.compile('<p class="title"><a.*>(.*)</a>')

findTitle = re.findall(collectTitle, worldNewsPage)

listIterator = []
listIterator[:] = range(1,3)

for i in listIterator:
    print findTitle
    print

score 1 · Answer 1 · edited May 23 '17 at 10:27


Change

worldNewsPage = urlopen("https://www.reddit.com/r/worldnews/")

to

worldNewsPage = urlopen("https://www.reddit.com/r/worldnews/").read()

Also, don't use regex to parse HTML. Use an HTML parser such as BeautifulSoup instead.
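To illustrate the parser-based approach, here is a minimal sketch. It is not BeautifulSoup itself: to stay dependency-free it swaps in the standard library's html.parser module, and it assumes Python 3 (in Python 2 the module is named HTMLParser). The class name TitleCollector and the sample markup are hypothetical, made up for the example; real reddit markup may differ.

```python
from html.parser import HTMLParser


class TitleCollector(HTMLParser):
    """Collects the text of <a> tags nested inside <p class="title">."""

    def __init__(self):
        super().__init__()
        self.titles = []
        self._in_title_p = False
        self._in_link = False

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) tuples
        if tag == "p" and ("class", "title") in attrs:
            self._in_title_p = True
        elif tag == "a" and self._in_title_p:
            self._in_link = True
            self.titles.append("")

    def handle_endtag(self, tag):
        if tag == "a":
            self._in_link = False
        elif tag == "p":
            self._in_title_p = False

    def handle_data(self, data):
        # Text may arrive in several chunks, so accumulate it
        if self._in_link:
            self.titles[-1] += data


# Hypothetical sample markup standing in for the downloaded page
sample_html = (
    '<p class="title"><a href="/a">First headline</a></p>'
    '<p class="other"><a href="/x">Not a title</a></p>'
    '<p class="title"><a href="/b">Second headline</a></p>'
)

collector = TitleCollector()
collector.feed(sample_html)
titles = collector.titles
print(titles)
```

Unlike the greedy regex in the question, the parser tracks which tags are open, so links outside a title paragraph are skipped.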

edited May 23 '17 at 10:27

Community


answered Oct 15 '16 at 04:19

jamylak


score 0 · Answer 2 · answered Oct 15 '16 at 05:57


urlopen returns a file-like object, not a string, so you have to call its read() method to get the contents you downloaded (just as you would with a file).
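A small sketch of that difference, runnable without network access. It assumes Python 3 and uses io.BytesIO as a stand-in for the object urlopen returns; the sample markup and variable names are made up for illustration.

```python
import io
import re

# Stand-in for the file-like object urlopen() returns (hypothetical markup)
fake_response = io.BytesIO(
    b'<p class="title"><a href="/a">First headline</a></p>\n'
    b'<p class="title"><a href="/b">Second headline</a></p>\n'
)

title_pattern = re.compile(r'<p class="title"><a[^>]*>(.*?)</a>')

# Passing the object itself to the regex fails: it is not a string
try:
    title_pattern.findall(fake_response)
    passed_object_failed = False
except TypeError:
    passed_object_failed = True

# read() returns the contents; in Python 3 they are bytes, so decode first
html = fake_response.read().decode("utf-8")
titles = title_pattern.findall(html)
print(titles)
```

In Python 2, read() already returns a str, so the decode step is only needed if you want unicode text.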

answered Oct 15 '16 at 05:57

marcomg

