What’s the difference between Spark and MapReduce?

How does Spark work?

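  1. Spark can read from many storage systems (HDFS, the local file system, Amazon S3, and others). It is not tied to HDFS, which is the default storage layer for Hadoop MapReduce.
  2. Spark runs on several cluster resource managers (its own standalone manager, YARN, Mesos, Kubernetes).
  3. Spark Core is the execution engine: it schedules tasks across the cluster, and the higher-level libraries are built on top of it.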
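
As a rough sketch of the first two points, the storage system is usually picked by the URI scheme of the input path, and the resource manager by the master URL given to the session. Every host name, port, bucket, and file path below is a placeholder, and `count_rows` is a hypothetical helper rather than part of PySpark. Reading `s3a://` paths also assumes the Hadoop AWS connector is on the classpath.

```python
# Illustrative master URLs; host names and ports are placeholders.
MASTER_URLS = {
    "local": "local[*]",                        # all cores on this machine
    "standalone": "spark://master-host:7077",   # Spark's built-in cluster manager
    "yarn": "yarn",                             # cluster is found via the Hadoop config
    "mesos": "mesos://mesos-host:5050",         # deprecated since Spark 3.2
    "kubernetes": "k8s://https://k8s-apiserver:6443",
}

# Illustrative input paths; the URI scheme selects the storage system.
EXAMPLE_PATHS = {
    "local file": "file:///tmp/events.csv",
    "HDFS": "hdfs://namenode:8020/data/events.csv",
    "Amazon S3": "s3a://example-bucket/data/events.csv",
}


def count_rows(path, master=MASTER_URLS["local"]):
    """Hypothetical helper: read a CSV from `path` and return its row count."""
    # Imported lazily so this file still loads where PySpark is not installed.
    from pyspark.sql import SparkSession

    spark = (
        SparkSession.builder
        .master(master)
        .appName("storage-demo")
        .getOrCreate()
    )
    try:
        # The same read call is used regardless of where the file lives.
        return spark.read.csv(path, header=True).count()
    finally:
        spark.stop()
```

With PySpark installed and a CSV at the placeholder path, `count_rows(EXAMPLE_PATHS["local file"])` would run on the local machine. Passing a different entry from `MASTER_URLS` is meant to show that the reading code itself does not change when the resource manager does.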

PySpark Libraries: