Where do you start when tuning the above-mentioned parameters? Do we start with executor memory and derive the number of executors, or do we start with cores and derive the executor count? I followed the link and got a high-level idea, but I am still not sure how or where to start and arrive at a final conclusion.
2 Answers
The following answer covers the 3 main aspects mentioned in the title: number of executors, executor memory and number of cores. There may be other parameters, like driver memory, that I have not addressed as of this answer but would like to add in the near future.
Case 1 Hardware: 6 nodes, each node with 16 cores and 64 GB RAM
Each executor is a JVM instance, so we can have multiple executors in a single node.
First, 1 core and 1 GB are needed for the OS and Hadoop daemons, so 15 cores and 63 GB RAM are available on each node.
Start with how to choose the number of cores:
Number of cores = number of concurrent tasks an executor can run
So we might think that more concurrent tasks per executor will give better performance. But experience shows that any application with more than 5 concurrent tasks per executor tends to perform badly (often attributed to limits on HDFS client throughput), so stick to 5.
This number comes from an executor's ability to run tasks concurrently, not from how many cores a system has. So the number 5 stays the same even if you have double (32) the cores in the CPU.
Number of executors:
Coming to the next step: with 5 cores per executor and 15 total available cores per node, we arrive at 3 executors per node.
So with 6 nodes and 3 executors per node, we get 18 executors. Out of those 18 we need 1 executor (Java process) for the YARN ApplicationMaster, which leaves 17 executors.
This 17 is the number we give to Spark with --num-executors when running from the spark-submit shell command.
Memory for each executor:
From the step above, we have 3 executors per node, and the available RAM is 63 GB.
So memory for each executor is 63/3 = 21 GB.
However, a small memory overhead also counts toward the full memory request made to YARN for each executor.
The formula for that overhead is max(384 MB, 0.07 * spark.executor.memory).
Calculating that overhead: 0.07 * 21 GB (the 21 GB computed above as 63/3) = 1.47 GB.
Since 1.47 GB > 384 MB, the overhead is 1.47 GB.
Subtracting that from each 21 GB gives 21 - 1.47 ≈ 19 GB.
So executor memory is ~19 GB.
Final numbers: executors = 17, cores per executor = 5, executor memory = 19 GB
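As a rough sketch of how these numbers map onto a spark-submit invocation (the application class and jar names below are placeholders, and cluster deploy mode on YARN is assumed):

    # Case 1: 17 executors, 5 cores each, 19 GB heap per executor.
    # The application class and jar are placeholders for illustration.
    spark-submit \
      --master yarn \
      --deploy-mode cluster \
      --num-executors 17 \
      --executor-cores 5 \
      --executor-memory 19G \
      --class com.example.MyApp \
      my-app.jar

YARN then adds the memory overhead discussed above on top of the 19 GB heap when it sizes each container.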
Case 2 Hardware: same 6 nodes, but 32 cores and 64 GB RAM each
Cores per executor stays at 5 for good concurrency.
Number of executors per node = 32/5 ≈ 6
So total executors = 6 executors * 6 nodes = 36. The final number is then 36 - 1 for the AM = 35.
Executor memory: with 6 executors per node, 63/6 ≈ 10 GB. Overhead is 0.07 * 10 GB = 700 MB. Rounding the overhead up to 1 GB, we get 10 - 1 = 9 GB.
Final numbers: executors = 35, cores per executor = 5, executor memory = 9 GB
Case 3
The above scenarios start by accepting the number of cores as fixed and then deriving the number of executors and memory.
Now for the first case, if we think we don't need 19 GB and just 10 GB is sufficient, then the numbers work out as follows:
Cores 5, number of executors per node = 3
At this stage, this would lead to 21 GB, and then 19 GB, as per our first calculation. But since we decided 10 GB is OK (allowing a little overhead), we can't simply switch the number of executors per node to 6 (as in 63/10), because with 6 executors per node and 5 cores each, that comes to 30 cores per node, when we only have 16 cores. So we also need to change the number of cores per executor.
So calculating again:
The magic number 5 comes down to 3 (any number less than or equal to 5 works). With 3 cores per executor and 15 available cores per node, we get 5 executors per node, and (5 * 6) - 1 = 29 executors.
Memory is then 63/5 ≈ 12 GB. Overhead is 12 * 0.07 = 0.84 GB, so executor memory is 12 - 1 = 11 GB.
Final numbers: 29 executors, 3 cores per executor, executor memory = 11 GB
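As with Case 1, a hedged sketch of the corresponding flags (same placeholder class and jar; only the resource flags change):

    # Case 3: fewer cores and less memory per executor, more executors overall.
    spark-submit --master yarn --deploy-mode cluster \
      --num-executors 29 --executor-cores 3 --executor-memory 11G \
      --class com.example.MyApp my-app.jar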
Dynamic Allocation:
Note: set an upper bound for the number of executors if dynamic allocation is enabled; otherwise a Spark application can eat up all the cluster's resources if needed. So in a cluster where other applications are also running and they also need cores to run their tasks, make sure the limits are enforced at the cluster level. That is, you can allocate a specific number of cores to YARN based on user access: for example, create a spark_user and give min/max cores to that user. These limits are for sharing resources between Spark and the other applications that run on YARN.
spark.dynamicAllocation.enabled - when this is set to true, we need not specify the number of executors. The reason is below:
The static parameter numbers we give at spark-submit hold for the entire job duration. However, if dynamic allocation comes into the picture, there are different stages, such as the following.
What to start with:
The initial number of executors (spark.dynamicAllocation.initialExecutors) to start with.
How many:
Then, based on load (pending tasks), how many executors to request. This would eventually be the kind of number we otherwise give at spark-submit in the static way. So once the initial executor count is set, we move within the min (spark.dynamicAllocation.minExecutors) and max (spark.dynamicAllocation.maxExecutors) bounds.
When to ask or give:
When do we request new executors (spark.dynamicAllocation.schedulerBacklogTimeout)? When there have been pending tasks for this duration, executors are requested. The number of executors requested in each round increases exponentially from the previous round: for instance, an application will add 1 executor in the first round, then 2, 4, 8 and so on in the subsequent rounds. At some point, the max above comes into the picture.
When do we give away an executor (spark.dynamicAllocation.executorIdleTimeout)? When an executor has been idle for this duration, it is released.
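As an illustrative sketch of how these settings fit together on spark-submit (the property names are Spark's standard dynamic-allocation settings; the specific counts and timeouts are assumptions chosen for the Case 1 cluster, not recommendations, and on YARN the external shuffle service typically also has to be enabled; the class and jar are placeholders):

    # Dynamic allocation: the executor count is left to Spark, but executor size is still fixed.
    # The initial/min/max values and timeouts are example assumptions, not recommendations.
    spark-submit \
      --master yarn \
      --deploy-mode cluster \
      --executor-cores 5 \
      --executor-memory 19G \
      --conf spark.shuffle.service.enabled=true \
      --conf spark.dynamicAllocation.enabled=true \
      --conf spark.dynamicAllocation.initialExecutors=2 \
      --conf spark.dynamicAllocation.minExecutors=2 \
      --conf spark.dynamicAllocation.maxExecutors=17 \
      --conf spark.dynamicAllocation.schedulerBacklogTimeout=1s \
      --conf spark.dynamicAllocation.executorIdleTimeout=60s \
      --class com.example.MyApp \
      my-app.jar

Note that --num-executors is omitted here; with dynamic allocation enabled, Spark grows and shrinks the executor count between the min and max bounds.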
Please correct me if I missed anything. The above is my understanding based on the blog I shared in the question and some online resources. Thank you.
- I read somewhere there is only one executor per node in standalone mode, any idea on that? I don't see it covered in your answer. – jangorecki Aug 24 '16 at 11:46
- In a standalone cluster, by default we get one executor per worker. We need to play with spark.executor.cores, and the worker needs enough cores to get more than one executor. – Ramzy Aug 24 '16 at 18:10
- I found my worker utilizes all 32 cores without setting `spark.executor.cores`, so it may use all available cores by default. – jangorecki Aug 24 '16 at 18:17
- Yeah, the default for cores is infinite, as they say. So Spark can use all the available cores unless you specify otherwise. – Ramzy Aug 24 '16 at 18:19
- @Ramzy I think it should be noted that even with dynamic allocation, you should still specify spark.executor.cores to determine the size of each executor that Spark is going to allocate. Otherwise, whenever Spark is going to allocate a new executor to your application, it is going to allocate an entire node (if available), even if all you need is just five more cores. – Dan Markhasin Dec 19 '16 at 21:25
- During dynamic allocation, Spark is expected to use resources based on requirements. Even for executors there is an initial number to start with. I did not find one for cores, though. @Dan Markhasin please clarify if I am missing something. – Ramzy Dec 21 '16 at 04:35
- The point I was trying to make is that even when using dynamic allocation, you need to specify the *size* of the executors, not their number. – Dan Markhasin Dec 21 '16 at 07:45
- Do we not need to mention the driver cores or memory for every application? How does it work if we use all the resources for the executors? – passionate Jan 26 '18 at 00:18
- How to tune driver memory and driver-cores? – Anchika Agarwal Sep 12 '18 at 09:12
- Is it suitable for both CPU-intensive and memory-intensive jobs? Or do we have separate logic for them? – Manindar Jan 03 '19 at 15:27
- @Manindar It's the same logic for all. But based on your job complexity, the numbers might vary. – Ramzy Jan 03 '19 at 17:48
- I have three 40-core, 64 GB machines in my YARN Spark cluster; what would be an optimum config? In my PySpark I am using ```'--master yarn --deploy-mode client --num-executors 17 --executor-cores 5 --executor-memory 1g --driver-memory 5g --conf "spark.driver.maxResultSize=5g" --conf "spark.executor.memoryOverhead=600"'``` is that good? – Watt Apr 18 '19 at 05:40
- How do we calculate these Spark settings when we have multiple nodes with various configurations (RAM, cores etc.)? For example, many EC2 instance-type nodes are spun up when we request nodes in AWS EMR. – Shubham Meshram Jul 18 '22 at 11:43
- How did they end up deciding on 5 as the number of cores? Is there any related documentation for that? – nirmal Jan 03 '23 at 16:57
Also, it depends on your use case; an important config parameter is:
spark.memory.fraction
(the fraction of (heap space - 300 MB) used for execution and storage), from http://spark.apache.org/docs/latest/configuration.html#memory-management.
If you don't use cache/persist, set it to 0.1 so you have all the memory for your program.
If you use cache/persist, you can check the memory taken by:
sc.getExecutorMemoryStatus.map(a => (a._2._1 - a._2._2)/(1024.0*1024*1024)).sum
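Either way, spark.memory.fraction can be passed at submit time; for example, the 0.1 value suggested above for jobs that do no caching (a sketch; the class and jar names are placeholders):

    # This job does no caching, so shrink the unified execution/storage region and
    # leave more of the heap for user data structures, per the suggestion above.
    spark-submit --conf spark.memory.fraction=0.1 \
      --class com.example.MyApp my-app.jar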
Do you read data from HDFS or from HTTP?
Again, tuning depends on your use case.

- Do you know what the map command would look like when using PySpark? I use sc._jsc.sc().getExecutorMemoryStatus() to get the executor status, but can't do anything with what it returns... – Thomas Mar 16 '18 at 08:43
- Sorry **@Thomas Decaux**, but did you mean setting `spark.memory.storageFraction=0.1`? Because AFAIK `spark.memory.fraction` decides the amount of memory used by `Spark` for both *execution & storage* (you already mentioned that) and only `spark.memory.storageFraction` of memory (that is available for storage + execution) is immune from *cache eviction*. Please see [this link](https://spark.apache.org/docs/latest/tuning.html#memory-management-overview) – y2k-shubham Mar 22 '18 at 09:02
- @Thomas If in my application I only use persist(StorageLevel.DISK_ONLY), is this option applicable as well? It affects only the memory fraction, but does not affect any disk spill? – jk1 Jun 28 '18 at 07:17