7

I am using a Databricks Spark cluster and want to add a customized Spark configuration.
There is Databricks documentation on this, but I am not getting any clue about how and what changes I should make. Can someone please share an example of how to configure the Databricks cluster?
Is there any way to see the default Spark configuration for a Databricks cluster?

Stark
  • I have yet to see any documentation of the Databricks-specific config options. Hopefully someone can chime in with that documentation. – Foxhound013 Jan 25 '23 at 15:16

2 Answers

2
  1. You can set the cluster config in the Compute section of your Databricks workspace. Go to Compute (and select your cluster) > Configuration > Advanced options > Spark. (Screenshot: cluster config under Advanced options)

  2. Or, you can set configs via a notebook:

    %python
    spark.conf.set("spark.sql.<name-of-property>", <value>)
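To the second part of the question (seeing what the cluster is currently configured with), a minimal notebook sketch like the one below should work; the property name spark.sql.shuffle.partitions is just an illustrative example, and spark / spark.sparkContext are the objects Databricks predefines in a notebook session:

    %python
    # Set a session-level Spark SQL property (illustrative property name)
    spark.conf.set("spark.sql.shuffle.partitions", "64")

    # Read a single property back to confirm it took effect
    print(spark.conf.get("spark.sql.shuffle.partitions"))

    # List all (key, value) pairs the cluster was started with --
    # one way to inspect the cluster's default Spark configuration
    for key, value in sorted(spark.sparkContext.getConf().getAll()):
        print(key, "=", value)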

Joey Gomes
0

There are several ways to set the cluster's Spark configs:

  1. Manually in the Compute tab (as mentioned before): go to Compute > select a cluster > Advanced Options > Spark. (Screenshot: Spark config field under Advanced Options)

  2. Via a notebook (as mentioned before): in a cell of your Databricks notebook, you can set a Spark configuration for that session/job by running the spark.conf.set command, e.g. spark.conf.set("spark.executor.memory", "4g"). Note that cluster-level properties such as executor memory only take effect if they are set before the cluster starts, so those belong in the cluster config rather than the session.

  3. Using the Jobs API / CLI: if you are aiming to deploy jobs programmatically in a multi-environment fashion (e.g. dev, staging, production), you can pass the Spark configs in the job's cluster spec; see the sketch after this list. (Screenshot: Databricks Jobs API example)
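As a rough sketch of option 3, the request below assumes the Jobs API 2.1 jobs/create endpoint; the workspace URL, token, notebook path, Spark version, node type, and the specific Spark properties are all illustrative placeholders to adapt to your environment:

    import requests

    # Illustrative values -- replace with your workspace URL and token
    DATABRICKS_HOST = "https://<your-workspace>.cloud.databricks.com"
    TOKEN = "<personal-access-token>"

    job_spec = {
        "name": "example-job-with-spark-conf",
        "tasks": [
            {
                "task_key": "main",
                "notebook_task": {"notebook_path": "/Repos/project/main_notebook"},
                "new_cluster": {
                    "spark_version": "13.3.x-scala2.12",
                    "node_type_id": "i3.xlarge",
                    "num_workers": 2,
                    # Custom Spark configuration applied when the job cluster starts
                    "spark_conf": {
                        "spark.sql.shuffle.partitions": "64",
                        "spark.executor.memory": "4g",
                    },
                },
            }
        ],
    }

    resp = requests.post(
        f"{DATABRICKS_HOST}/api/2.1/jobs/create",
        headers={"Authorization": f"Bearer {TOKEN}"},
        json=job_spec,
    )
    resp.raise_for_status()
    print(resp.json())  # response contains the new job_id

Keeping one such spark_conf block per environment (dev/staging/prod) in whatever tooling generates these payloads is what makes this approach convenient for multi-environment deployments.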

Useful links!