Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing

The Case Against MapReduce

Resilient Distributed Datasets (RDDs)

Discussion

References

[1] Dryad

[2] Making Sense of Performance in Data Analytics Frameworks