I'm trying to load some data from an Amazon S3 bucket by:
SparkConf sparkConf = new SparkConf().setAppName("Importer");
JavaSparkContext ctx = new JavaSparkContext(sparkConf);
HiveContext sqlContext = new HiveContext(ctx.sc());
DataFrame magento = sqlContext.read().json("https://s3.eu-central-1.amazonaws.com/*/*.json");
This last line however throws an error:
Exception in thread "main" java.io.IOException: No FileSystem for scheme: https
The same line has been working in another project, what am I missing? I'm running Spark on a Hortonworks CentOS VM.