MapReduce: Simplified Data Processing on Large Clusters

http://static.googleusercontent.com/media/research.google.com/en/us/archive/mapreduce-osdi04.pdf

MapReduce: The programming model and practice

https://ai.google/research/pubs/pub36249

Google’s MapReduce programming model — Revisited

https://www.sciencedirect.com/science/article/pii/S0167642307001281

Exploring Wikipedia with Apache Spark: A Live Coding Demo

https://www.infoq.com/presentations/wikipedia-apache-spark


Besides in-memory data, Apache Spark draws on ideas from functional programming (immutable data, operations on data expressed as functional transformations…), which explains why Scala is, for the moment, the lingua franca of Apache Spark and Big Data.
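As a rough illustration of the MapReduce programming model from the OSDI'04 paper (not Spark's actual API), the map, shuffle, and reduce phases can be sketched in plain Python; the function names here are illustrative, and each phase treats its input as immutable, in the functional style mentioned above:

```python
from itertools import groupby
from operator import itemgetter

def map_phase(records, mapper):
    """Apply the user-supplied mapper to every record, yielding (key, value) pairs."""
    for record in records:
        yield from mapper(record)

def shuffle_phase(pairs):
    """Group intermediate pairs by key, as the MapReduce runtime would."""
    for key, group in groupby(sorted(pairs, key=itemgetter(0)), key=itemgetter(0)):
        yield key, [value for _, value in group]

def reduce_phase(grouped, reducer):
    """Apply the user-supplied reducer to each key and its list of values."""
    return {key: reducer(key, values) for key, values in grouped}

# The classic word-count example from the paper.
def word_mapper(line):
    for word in line.split():
        yield word, 1

def count_reducer(word, counts):
    return sum(counts)

docs = ["the quick brown fox", "the lazy dog", "the fox"]
counts = reduce_phase(shuffle_phase(map_phase(docs, word_mapper)), count_reducer)
# counts["the"] == 3, counts["fox"] == 2
```

In the real system the shuffle is a distributed sort across machines; this single-process version only shows the shape of the user-facing model.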


Matei Zaharia, the creator of Spark, explains the history of Spark, starting from MapReduce and Hadoop.

The driving force behind Big Data is Machine Learning: MapReduce -> Hadoop -> Big Data (Spark) -> Machine Learning