MapReduce: Simplified Data Processing on Large Clusters
http://static.googleusercontent.com/media/research.google.com/en/us/archive/mapreduce-osdi04.pdf
MapReduce: The programming model and practice
https://ai.google/research/pubs/pub36249
Google’s MapReduce programming model — Revisited
https://www.sciencedirect.com/science/article/pii/S0167642307001281
Exploring Wikipedia with Apache Spark: A Live Coding Demo
https://www.infoq.com/presentations/wikipedia-apache-spark
Apache Spark Architecture – Spark Cluster Architecture Explained
https://www.edureka.co/blog/spark-architecture/
Putting Apache Spark Into Action with Jean Georges Perrin
https://www.dataengineeringpodcast.com/putting-apache-spark-into-action-with-jean-georges-perrin-episode-60/