Spark Partitions with Coalesce and Repartition

Jerry Su, Jul 19, 2019, 1 min read

Spark splits data into partitions and executes computations on the partitions in parallel. You should understand how data is partitioned and when you need to manually adjust the partitioning to keep your Spark computations running efficiently.
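
To make the difference between the two operations in the title concrete, here is a toy model in plain Python. It is a sketch for intuition only, not Spark's implementation: the helper names `coalesce` and `repartition` merely mirror the real `DataFrame.coalesce(n)` and `DataFrame.repartition(n)` methods, and the sample data is made up. The assumption being illustrated is that coalescing merges existing partitions without a full shuffle (so sizes can stay uneven), while repartitioning shuffles every row and spreads rows roughly evenly.

```python
from typing import List

Partition = List[int]


def coalesce(partitions: List[Partition], n: int) -> List[Partition]:
    """Toy coalesce: merge neighbouring partitions into n groups.

    Rows never move between groups, so the result can be unbalanced.
    Asking for more partitions than exist leaves the layout unchanged.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    if n >= len(partitions):
        return [list(p) for p in partitions]
    merged: List[Partition] = [[] for _ in range(n)]
    for i, part in enumerate(partitions):
        # Map source partition i onto one of the n target groups.
        merged[i * n // len(partitions)].extend(part)
    return merged


def repartition(partitions: List[Partition], n: int) -> List[Partition]:
    """Toy repartition: a full shuffle that deals rows out round-robin."""
    if n < 1:
        raise ValueError("n must be at least 1")
    rows = [row for part in partitions for row in part]
    shuffled: List[Partition] = [[] for _ in range(n)]
    for i, row in enumerate(rows):
        shuffled[i % n].append(row)
    return shuffled


# Four skewed partitions holding ten rows in total (illustrative data).
parts = [[1, 2, 3, 4, 5, 6], [7], [8], [9, 10]]

coalesced = coalesce(parts, 2)
repartitioned = repartition(parts, 2)

print([len(p) for p in coalesced])      # sizes after merging
print([len(p) for p in repartitioned])  # sizes after a full shuffle
```

In this model the coalesced layout keeps the skew of its inputs, while the repartitioned layout is balanced at the cost of touching every row. In real Spark code the current partition count of a DataFrame `df` can be checked with `df.rdd.getNumPartitions()` before deciding which of the two to call.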

https://medium.com/@mrpowers/managing-spark-partitions-with-coalesce-and-repartition-4050c57ad5c4
