Tuning Spark Data Serialization Memory Tuning Memory Management Overview Determining Memory Consumption Tuning Data Structures Serialized RDD Storage Garbage Collection Tuning Other Considerations Level of Parallelism Parallel Listing on Input Paths Memory Usage of Reduce Tasks Broadcasting Large Variables Data Locality Summary Because of the in-memory nature of most Spark computations, Spark prog