Apache Zeppelin is a web-based notebook for interactive, data-driven analytics. You can make beautiful, interactive, and collaborative documents with SQL, Python, Scala, and more. It also supports Markdown syntax.
Questions tagged [apache-zeppelin]
1460 questions
39
votes
2 answers
What is the difference between yarn and yarn install?
I am trying to install an additional plugin on Apache Zeppelin using Helium. The installation command used by Helium to download this package is below:
com.github.eirslett.maven.plugins.frontend.lib.TaskRunnerException: 'yarn install…

gnish9
- 537
- 1
- 5
- 10
32
votes
7 answers
Spark + s3 - error - java.lang.ClassNotFoundException: Class org.apache.hadoop.fs.s3a.S3AFileSystem not found
I have a Spark EC2 cluster where I submit a PySpark program from a Zeppelin notebook. I have loaded hadoop-aws-2.7.3.jar and aws-java-sdk-1.11.179.jar and placed them in the /opt/spark/jars directory of the Spark instances. I get a…

peterlandis
- 645
- 1
- 7
- 17
30
votes
4 answers
Unrecognized VM option 'MaxPermSize=512m' when running Zeppelin
When I try to run Zeppelin by either
bin/zeppelin.sh
or
bin/zeppelin-daemon.sh start
I get the following error message:
Unrecognized VM option 'MaxPermSize=512m'
Error: Could not create the Java Virtual Machine.
Error: A fatal exception…

sbbs
- 1,450
- 2
- 13
- 20
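For the MaxPermSize question above: the permanent generation was removed from the JVM in Java 8, and some later JVM releases reject the flag outright instead of ignoring it. One way to work around this is to drop the PermGen flags from the JVM options before launching. The sketch below assumes the options are available as a single string; the function name `drop_permgen_flags` and the sample value are hypothetical, and where Zeppelin reads its JVM options from depends on the installation and is not shown here.

```python
import shlex


def drop_permgen_flags(java_opts):
    """Return java_opts with the PermGen sizing flags removed."""
    # Flags that only made sense while the JVM had a permanent generation.
    permgen_prefixes = ("-XX:MaxPermSize=", "-XX:PermSize=")
    tokens = shlex.split(java_opts)
    kept = [t for t in tokens if not t.startswith(permgen_prefixes)]
    # Re-quote each token so the result is safe to paste into a shell script.
    return " ".join(shlex.quote(t) for t in kept)


# Hypothetical option string, used only for illustration.
sample_opts = "-Xms1024m -Xmx1024m -XX:MaxPermSize=512m"
print(drop_permgen_flags(sample_opts))
```

The cleaned string can then be written back to whichever configuration file supplied the original options.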
27
votes
1 answer
No interpreters available in Zeppelin
I have just installed the following on my Mac (Yosemite 10.10.3):
oracle java 1.8 update 45
scala 2.11.6
spark 1.4 (precompiled release: http://d3kbcqa49mib13.cloudfront.net/spark-1.4.0-bin-hadoop2.6.tgz)
zeppelin from source…

Samuel Kerrien
- 6,965
- 2
- 29
- 32
26
votes
4 answers
Remove Temporary Tables from Apache SQL Spark
I have registered a temporary table (registerTempTable) in Apache Spark using Zeppelin, as below:
val hvacText = sc.textFile("...")
case class Hvac(date: String, time: String, targettemp: Integer, actualtemp: Integer, buildingID: String)
val hvac = hvacText.map(s =>…

Softwaremaker
- 401
- 1
- 5
- 4
25
votes
2 answers
How can I pretty print a data frame in Zeppelin/Spark/Scala?
I am using Spark 2 and Scala 2.11 in a Zeppelin 0.7 notebook. I have a dataframe that I can print like this:
dfLemma.select("text", "lemma").show(20,false)
and the output looks…

schoon
- 2,858
- 3
- 46
- 78
25
votes
1 answer
What to set `SPARK_HOME` to?
Installed apache-maven-3.3.3, scala 2.11.6, then ran:
$ git clone git://github.com/apache/spark.git -b branch-1.4
$ cd spark
$ build/mvn -DskipTests clean package
Finally:
$ git clone https://github.com/apache/incubator-zeppelin
$ cd…

A T
- 13,008
- 21
- 97
- 158
22
votes
4 answers
Why does SparkContext randomly close, and how do you restart it from Zeppelin?
I am working in Zeppelin writing spark-sql queries and sometimes I suddenly start getting this error (after not changing code):
Cannot call methods on a stopped SparkContext.
Then the output says further down:
The currently active SparkContext was…

The Puma
- 1,352
- 2
- 14
- 27
20
votes
1 answer
How to check version of Spark and Scala in Zeppelin?
When I run the interactive spark-shell, it shows the Spark version (2.2.0) and the Scala version (2.11.8):
Welcome to
      ____              __
     / __/__  ___ _____/ /__
    _\ \/ _ \/ _ `/ __/  '_/
   /___/ .__/\_,_/_/ /_/\_\   version 2.2.0
      /_/
Using…

Haha TTpro
- 5,137
- 6
- 45
- 71
20
votes
4 answers
Zeppelin: How to restart sparkContext in zeppelin
I am using the isolated mode of Zeppelin's Spark interpreter; in this mode it starts a new job in the Spark cluster for each notebook. I want to kill the job via Zeppelin when the notebook execution completes. For this I called sc.stop, which stopped the…

eatSleepCode
- 4,427
- 7
- 44
- 93
20
votes
6 answers
How to suppress printing of variable values in zeppelin
Given the following snippet:
val data = sc.parallelize(0 until 10000)
val local = data.collect
println(s"${local.size}")
Zeppelin prints out the entire value of local to the notebook screen. How may that behavior be changed?

WestCoastProjects
- 58,982
- 91
- 316
- 560
17
votes
3 answers
Interact with Jupyter Notebooks via API
The problem: I want to interact with Jupyter from another application via the Jupyter API. In particular, I want at least to run my notebooks from the app (ideally, I would also edit some paragraphs before running them). I've read the API…

Alexander Yakovlev
- 305
- 1
- 2
- 5
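For the Jupyter question above, the "edit some paragraphs before running" step does not strictly need a live server: a notebook file is a JSON document whose `cells` list holds each cell's `cell_type` and `source`. The sketch below edits that structure directly, which is a different technique from the Jupyter REST API the question asks about. The function name `set_cell_source` and the sample notebook are hypothetical.

```python
import copy
import json


def set_cell_source(notebook, cell_index, new_source):
    """Return a copy of the notebook dict with one cell's source replaced."""
    updated = copy.deepcopy(notebook)  # leave the caller's notebook untouched
    cell = updated["cells"][cell_index]
    cell["source"] = new_source
    if cell.get("cell_type") == "code":
        # Stale outputs no longer match the edited source, so clear them.
        cell["outputs"] = []
        cell["execution_count"] = None
    return updated


# Hypothetical minimal nbformat 4 notebook, used only for illustration.
notebook = {
    "nbformat": 4,
    "nbformat_minor": 4,
    "metadata": {},
    "cells": [
        {
            "cell_type": "code",
            "metadata": {},
            "execution_count": 1,
            "outputs": [],
            "source": "limit = 10",
        },
        {"cell_type": "markdown", "metadata": {}, "source": "# Report"},
    ],
}

edited = set_cell_source(notebook, 0, "limit = 500")
print(json.dumps(edited["cells"][0], indent=2))
```

The edited dict can be written back with `json.dump` and then executed by a separate tool, for example `jupyter nbconvert --to notebook --execute`.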
17
votes
10 answers
Getting NullPointerException when running Spark Code in Zeppelin 0.7.1
I have installed Zeppelin 0.7.1. When I tried to execute the example Spark program (available in the Zeppelin Tutorial notebook), I got the following error:
java.lang.NullPointerException
at…

Raj
- 2,368
- 6
- 34
- 52
17
votes
3 answers
converting pandas dataframes to spark dataframe in zeppelin
I am new to Zeppelin. I have a use case in which I have a pandas DataFrame. I need to visualize the collections using Zeppelin's built-in charts, but I do not have a clear approach. My understanding is that with Zeppelin we can visualize the data if it is a…

Bala
- 675
- 2
- 7
- 23
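For the visualization question above, one option that avoids converting to a Spark DataFrame is Zeppelin's display system. Paragraph output that starts with `%table`, followed by tab-separated columns and newline-separated rows, is rendered with the built-in table and chart views. The sketch below builds such a string from plain Python data. The function name `to_zeppelin_table` and the sample rows are hypothetical, and it assumes the data is small enough to print in full.

```python
def to_zeppelin_table(columns, rows):
    """Build a string that Zeppelin's display system renders as a table."""

    def clean(value):
        # Tabs and newlines delimit cells and rows, so replace them in values.
        return str(value).replace("\t", " ").replace("\n", " ")

    header = "\t".join(clean(c) for c in columns)
    body = "\n".join("\t".join(clean(v) for v in row) for row in rows)
    return "%table " + header + "\n" + body


# Hypothetical sample data. With pandas, the same inputs could come from
# list(df.columns) and df.itertuples(index=False).
columns = ["name", "size"]
rows = [("sun", 100), ("moon", 10)]
print(to_zeppelin_table(columns, rows))
```

Printing the result from a Python paragraph in Zeppelin should show the table view with its chart toggles, since the output begins with `%table`.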
17
votes
2 answers
Does Apache Zeppelin have intellisense?
Does Apache Zeppelin have IntelliSense/auto-completion support in its notebook UI?
If so, how do I use it?
I searched the internet but could not find a source that works. This https://github.com/NFLabs/zeppelin/issues/248 says…

DesirePRG
- 6,122
- 15
- 69
- 114