I am running Spark jobs on EMR with YARN and don't understand how memory is provisioned and reported in the UIs. I have a master node and one core node of instance type r4.8xlarge, which should have 32 cores and 244 GB of memory. According to this doc, 241 GB of that should be allocated to YARN; in the UI the number is 236 GB, probably due to additional overheads. Based on best practices, I have configured the job with the settings below.
--executor-cores 5 --executor-memory 35GB --num-executors 6 --conf spark.dynamicAllocation.enabled=false
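For completeness, here is the same sizing expressed programmatically. This is only a minimal PySpark sketch, assuming the job is written in Python (the actual job code isn't shown here, and the app name is made up); the same keys can equally be set via --conf on spark-submit or in spark-defaults:

```python
from pyspark.sql import SparkSession

# Equivalent of the spark-submit flags above. These must be set before the
# Spark context starts (e.g. when this script is launched with spark-submit
# on YARN); they have no effect on an already-running session.
spark = (
    SparkSession.builder
    .appName("executor-sizing-test")                      # hypothetical app name
    .config("spark.executor.cores", "5")                  # --executor-cores 5
    .config("spark.executor.memory", "35g")               # --executor-memory 35GB
    .config("spark.executor.instances", "6")              # --num-executors 6
    .config("spark.dynamicAllocation.enabled", "false")
    .getOrCreate()
)
```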
Calculation for executor memory: (236 GB / 6 executors) * 0.9 ≈ 35 GB
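Just to show my arithmetic explicitly (a throwaway Python snippet, nothing Spark-specific):

```python
# Reproduce the executor-memory sizing from above.
yarn_memory_gb = 236      # memory the UI reports as available to YARN on the core node
num_executors = 6
safety_factor = 0.9       # leave ~10% headroom, per the tuning guides I followed

executor_memory_gb = yarn_memory_gb / num_executors * safety_factor
print(f"{executor_memory_gb:.1f} GB per executor")   # -> 35.4, which I rounded down to 35
```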
When I submit a Spark job and look at the executor metrics in the Spark UI or the console, the numbers are very different, and I am confused about how they are calculated and provisioned. Instead of 6 executors there are only 4, so the job uses only 20 cores instead of the 30 available. Each executor shows 22.2 GB of memory instead of 35 GB, which is only about 88 GB of the 236 GB available.
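To make the gap concrete, here is what I requested versus what the Spark UI reports (same throwaway Python, just the arithmetic from the numbers above):

```python
# Requested resources vs what the Spark UI actually shows.
expected = {"executors": 6, "cores": 6 * 5, "memory_gb": 6 * 35}     # 30 cores, 210 GB
observed = {"executors": 4, "cores": 4 * 5, "memory_gb": 4 * 22.2}   # 20 cores, ~88.8 GB

print("expected:", expected)
print("observed:", observed)
```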
I have looked at many resources, but they only discuss how to tune Spark jobs by setting YARN and Spark configuration, which I have followed, yet the results are unexpected.
Can someone help explain?
edit: The only applications installed on the cluster are Spark and Hadoop.