The lineage is stored as a DAG (directed acyclic graph). Each new transformation is added to the lineage, and the DAG is updated automatically.
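To make the idea concrete, here is a toy sketch in plain Python. It is not Spark's API: `ToyRDD` and its `lineage` method are hypothetical names invented for illustration, the graph is a simple chain rather than a general DAG, and the data is computed eagerly whereas Spark evaluates lazily. The only point is that every transformation returns a new object that remembers its parent.

```python
class ToyRDD:
    """Toy stand-in for an RDD (hypothetical, not Spark): data plus a parent pointer."""

    def __init__(self, data, op="source", parent=None):
        self.data = list(data)
        self.op = op          # name of the transformation that produced this node
        self.parent = parent  # edge back to the parent node in the lineage graph

    def map(self, fn):
        # A transformation never mutates self; it returns a new node linked to self.
        return ToyRDD((fn(x) for x in self.data), op="map", parent=self)

    def filter(self, pred):
        return ToyRDD((x for x in self.data if pred(x)), op="filter", parent=self)

    def lineage(self):
        """Walk parent pointers back to the source; return op names oldest first."""
        ops, node = [], self
        while node is not None:
            ops.append(node.op)
            node = node.parent
        return list(reversed(ops))


rdd = ToyRDD(range(10)).map(lambda x: x * 2).filter(lambda x: x % 3 == 0)
print(rdd.lineage())  # ['source', 'map', 'filter']
print(rdd.data)       # [0, 6, 12, 18]
```

In Spark itself, `toDebugString` on an RDD prints the real lineage.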
Example: suppose I have an RDD with 4 partitions and 4 nodes. Each node applies the transformation to its own partitions. This is called a narrow operation.
Narrow Operation → 4 input partitions, 4 output partitions. Each output partition depends on a single input partition.
Wide Operation → e.g. 3 input partitions, 2 output partitions, because a shuffle redistributes records across partitions.
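The difference can be simulated in plain Python, with a list of lists standing in for partitions. This is only a sketch and not Spark code: `narrow_map` and `wide_repartition_by_key` are hypothetical helper names, and assigning records by `key % num_out` is an assumed partitioning rule chosen to keep the example deterministic.

```python
def narrow_map(partitions, fn):
    """Narrow: each output partition is built from exactly one input partition."""
    return [[fn(x) for x in part] for part in partitions]


def wide_repartition_by_key(partitions, num_out):
    """Wide: any output partition may receive records from any input partition (a shuffle)."""
    out = [[] for _ in range(num_out)]
    for part in partitions:
        for key, value in part:
            # Assumed rule: the key alone decides the destination partition.
            out[key % num_out].append((key, value))
    return out


# Narrow: 4 partitions in, 4 partitions out, no data moves between partitions.
four_parts = [[1, 2], [3, 4], [5, 6], [7, 8]]
narrowed = narrow_map(four_parts, lambda x: x * 10)
print(len(narrowed), narrowed)  # 4 [[10, 20], [30, 40], [50, 60], [70, 80]]

# Wide: 3 partitions in, 2 partitions out, records are regrouped by key.
three_parts = [[(1, "a"), (2, "b")], [(3, "c"), (4, "d")], [(5, "e"), (6, "f")]]
shuffled = wide_repartition_by_key(three_parts, num_out=2)
print(len(shuffled), shuffled)
# 2 [[(2, 'b'), (4, 'd'), (6, 'f')], [(1, 'a'), (3, 'c'), (5, 'e')]]
```

In the wide case both output partitions contain records from all three input partitions, which is why the data has to move between nodes.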