PySpark Data Skew in 5 Minutes

In Spark, data is split into chunks of rows, called partitions, which are then stored on worker nodes, as shown in Figure 1.
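To make the idea concrete, here is a minimal sketch in plain Python (not Spark itself) of splitting rows into chunks and placing those chunks on workers. The helper names `split_into_partitions` and `assign_to_workers`, the worker names, and the round-robin placement are all assumptions made for illustration; Spark's real partitioning and scheduling depend on the data source and configuration.

```python
def split_into_partitions(rows, num_partitions):
    """Split rows into contiguous chunks of near-equal size (toy model)."""
    size, extra = divmod(len(rows), num_partitions)
    partitions, start = [], 0
    for i in range(num_partitions):
        # The first `extra` partitions each take one additional row.
        end = start + size + (1 if i < extra else 0)
        partitions.append(rows[start:end])
        start = end
    return partitions


def assign_to_workers(partitions, workers):
    """Place partitions on workers round-robin (hypothetical placement rule)."""
    placement = {worker: [] for worker in workers}
    for i, partition in enumerate(partitions):
        placement[workers[i % len(workers)]].append(partition)
    return placement


rows = list(range(10))                       # ten rows of example data
partitions = split_into_partitions(rows, 4)  # four chunks of rows
placement = assign_to_workers(partitions, ["worker-1", "worker-2"])

for worker, parts in placement.items():
    print(worker, parts)
```

In this toy model every chunk ends up with two or three rows, so the work is evenly spread. In a real PySpark job, `df.rdd.getNumPartitions()` reports how many partitions a DataFrame currently has.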
