Questions tagged [spark-connect]
6 questions
1
vote
0 answers
The proper way to run Spark Connect in Anaconda - error '$HOME' is not recognized as an internal or external command, operable program or batch file
I try to learn this lesson https://spark.apache.org/docs/latest/api/python/getting_started/quickstart_connect.html
Method 1: from anaconda - window
by download the JP notebook to my Downloads folder, then start the jupyter notebook via…

Tom Tom
- 328
- 4
- 15
0
votes
0 answers
Possible to add own method to the Spark Connect Thin Client API?
we would like to add a method to the Spark Connect Server and expose on the Thin Client API. Here is why:
In our organization, we have built a timeseries database using Spark and HDFS storage for hundreds of thousands of signals and a few Petabyte…

Vito B.
- 1
0
votes
0 answers
Running Spark-Connect Server on kubernetes in cluster mode/high availability mode
I am trying to figure out how to effectively use the new Spark-Connect feature of Spark version >= 3.4.0. Specifically, I want so set up a kubernetes Spark cluster where various applications (mainly pyspark) will connect and submit their…

scienceseba
- 61
- 6
0
votes
0 answers
Pandas API support on Spark Connect
I am trying to use Spark PANDAS API on Spark Connect but I am getting assertion erorr
assert isinstance(spark_frame, SparkDataFrame)
AssertionError
I dont get any error if I use the spark Dataframe API.
Are Pandas-Spark API supported on Spark…
0
votes
1 answer
pyspark-connect can't show all hive databases
I'm using pyspark3.4.0 feature spark-connet module to connect remote hive 3.1.3.
When create sparksession in local mode with hive supported, all data base in hive can be viewed;
spark =…

leon
- 1
0
votes
1 answer
Using Spark Connect with Scala
I would like to use the new Spark Connect feature within a Scala program.
I started the Connect server and I am able to connect to it from Pyspark and also when submitting Python script, e.g., with spark-submit --remote sc://localhost…

hage
- 5,966
- 3
- 32
- 42