I have a Django application running against a MySQL database. I want to add big data analytics to it by connecting the same application to Spark, where the data is stored as Hive tables. I can already run queries and access the data through beeline. I want to run similar queries from the Django app and display the result sets in the app.

My environment is:

Spark 1.5.2
Thriftserver 2
Hive metastore on MySQL

Any help will be appreciated.

Thanks

Aditya S
  • It's not really clear what exactly your question is. To use the pyspark API you'd have to put pyspark and py4j on your `PYTHONPATH` and then read [this](http://spark.apache.org/docs/latest/quick-start.html#self-contained-applications). See also [here](http://stackoverflow.com/questions/26533169/why-cant-pyspark-find-py4j-java-gateway) – karlson Jan 18 '16 at 16:22
  • Thanks for your help. – Aditya S Mar 22 '16 at 11:06
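
The `PYTHONPATH` setup suggested in the comment above can be sketched roughly as follows. This is only an illustration under assumptions, not a tested recipe for this environment: it assumes a `SPARK_HOME` environment variable pointing at the Spark installation, and that the bundled py4j archive sits under `python/lib` with a name matching `py4j-*-src.zip`. The helper names `spark_python_paths` and `add_spark_to_sys_path` are hypothetical and exist only for this example.

```python
import glob
import os
import sys


def spark_python_paths(spark_home):
    """Return the entries that must be importable before `import pyspark` works."""
    python_dir = os.path.join(spark_home, "python")
    # Assumption: py4j ships as a versioned zip under python/lib. The exact
    # file name depends on the Spark release, so glob for it instead of
    # hard-coding a version.
    py4j_zips = sorted(glob.glob(os.path.join(python_dir, "lib", "py4j-*-src.zip")))
    return [python_dir] + py4j_zips


def add_spark_to_sys_path(spark_home=None):
    """Prepend the pyspark and py4j locations to sys.path (hypothetical helper)."""
    # Assumption: SPARK_HOME is set in the environment of the Django process.
    spark_home = spark_home or os.environ.get("SPARK_HOME")
    if not spark_home:
        raise RuntimeError("SPARK_HOME is not set")
    paths = spark_python_paths(spark_home)
    for path in paths:
        if path not in sys.path:
            sys.path.insert(0, path)
    return paths
```

Calling `add_spark_to_sys_path()` early, for example at the top of the Django settings module, should make a later `import pyspark` resolve. Exporting the same two paths in `PYTHONPATH` before starting the app has the same effect.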

0 Answers