I am writing a crawler to get certain parts of a html file. But I cannot figure out how to use re.findall().
Here is an example, when I want to find all ... part in the file, I may write something like this:
re.findall("<div>.*\</div>", result_page)
if result_page is a string "<div> </div> <div> </div>"
, the result will be
['<div> </div> <div> </div>']
Only the entire string. This is not what I want, I am expecting the two divs separately. What should I do?