PySpark Data Skew in 5 Minutes
big-data
May 28, 2022

In Spark, data is split into chunks of rows (partitions), which are then stored on worker nodes, as shown in Figure 1.
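
To make the idea of chunked rows concrete, here is a minimal plain-Python sketch. It is not Spark itself: `split_into_partitions` is a hypothetical helper, and assigning a row to a chunk by `key % num_partitions` is an assumption that only mimics key-based partitioning for integer keys. It shows how one frequent key can leave a single chunk much larger than the others, which is what data skew means.

```python
def split_into_partitions(rows, num_partitions):
    """Hypothetical helper: place each (key, value) row in a chunk chosen by its key."""
    partitions = [[] for _ in range(num_partitions)]
    for key, value in rows:
        # Assumption: integer keys, assigned by modulo (a stand-in for hash partitioning).
        partitions[key % num_partitions].append((key, value))
    return partitions


# Eight rows share key 0; keys 1, 2 and 3 appear once each.
rows = [(0, f"order-{i}") for i in range(8)] + [
    (1, "order-8"),
    (2, "order-9"),
    (3, "order-10"),
]

partitions = split_into_partitions(rows, num_partitions=4)
sizes = [len(p) for p in partitions]
print(sizes)  # prints [8, 1, 1, 1]
```

In this toy setup the worker holding the first chunk has eight times the rows of any other worker, so it would finish last while the rest sit idle.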