During my crawling, some pages return a response with partial html body and status 200, after I compare the response body with the one I open in browser, the former one miss something. How can I catch this unexpected partial response body case in spider or in download middleware?
Below is about the log example:
2014-01-23 16:31:53+0100 [filmweb_multi] DEBUG: Crawled (408) http://www.filmweb.pl/film/Labirynt-2013-507169/photos> (referer: http://www.filmweb.pl/film/Labirynt-2013-507169) ['partial']