Scrapy Error: 'NotSupported: Unsupported URL scheme '': no handler available for that scheme'

Question

I am trying to scrap a site but while running the script, I'm getting following error

'NotSupported: Unsupported URL scheme '': no handler available for that scheme'

If the rule is not wrong, why does it occur and what's your suggestion, please help me. Thanks a lot.

code is here:

from scrapy.spiders import CrawlSpider, Rule, BaseSpider
from scrapy.linkextractors import LinkExtractor 
class FellowSearch(CrawlSpider):
    name ='fellow'
    allowed_domains = ['emma.cam.ac.uk']
    start_urls = [' https://www.emma.cam.ac.uk/']

    rules =(Rule(LinkExtractor(allow=(r'\?id=\d+$')),callback='parse_obj', follow=True),)

    def parse_obj(self, response):
        print response.url

I see a space before`https`? – Joop Eggen Apr 03 '17 at 20:45 — Joop Eggen, Apr 03 '17 at 20:45

vold · Accepted Answer · 2017-04-03T20:54:51.317

6

You need to remove space before https in your start_urls change to start_urls = ['https://www.emma.cam.ac.uk/'].

edited Apr 03 '17 at 20:54

answered Apr 03 '17 at 20:49

vold

1,549
1
13
19

Could you please check my scrapy rules ? its scrapy only 31 url but there is more than 100 – Samsul Islam Apr 04 '17 at 14:43
You should open a new question and specify what urls you want to extract and I'd glad to help you. – vold Apr 04 '17 at 14:54

Scrapy Error: 'NotSupported: Unsupported URL scheme '': no handler available for that scheme'

1 Answers1