0

Working on Storm Crawler 1.12.1 and Elastic search 6.5.x. My Crawler is running on http://localhost:8080 and Elastic search running on https://localhost:9200.I am trying to crawl a website.During URL injection process I am facing javax.net.ssl.SSLHandshakeException: General SSLEngine problem error and check the detailed error here.

I tried for OKHttp and added https.protocol.implementation: "com.digitalpebble.stormcrawler.protocol.okhttp.HttpProtocol" in crawler-conf.yaml.

How can I turn off the certificate verification temporarily.

rzo1
  • 5,561
  • 3
  • 25
  • 64
an__snatcher
  • 121
  • 2
  • 12
  • 1
    There is no fetching during the injection so switching to OKHttp is not going to have any effect. This is more likely to be an issue with the connection to Elasticsearch. – Julien Nioche Jan 03 '19 at 08:06

1 Answers1

1

As @julien-nioche indicated in his comment: StormCrawler does not fetch in the process of URL injection.

The SSLHandshakeException most likely originates from your ElasticSearch instance. Is your ElasticSearch really running on HTTPS as indicated by https://localhost:9200?

The stacktrace indicates that the certificate path could not be validated.

Caused by: sun.security.provider.certpath.SunCertPathBuilderException: unable to find valid certification path to requested target

To resolve this issue, you could add the certificate to your local key-store. For details on this subject, you can take a look at this answer. Another possibility would be to disable HTTPS on the ElasticSearch instance for your local development setup.

MWiesner
  • 8,868
  • 11
  • 36
  • 70
rzo1
  • 5,561
  • 3
  • 25
  • 64