The official tutorial specifies the way on how to call scrapy within python script
By changing the following setting attributes:
settings.overrides['FEED_URI'] = output_path
settings.overrides['FEED_FORMAT'] = 'json'
I am able to store the data scraped in a json file.
However, I'm trying to process and return the data scraped immediately within the function I defined. Hence, other functions can call this wrapper function in order to scrap some websites.
I figure there must be some settings I can play with FEED_URI
, but I am not sure. Any advice will be appreciated deeply!