Spark is a fast, in-memory cluster computing framework with a language-integrated interface in Scala. It shines at iterative MapReduce (e.g. machine learning) and interactive data mining, where keeping data in memory provides substantial speedups.
C. Hong, D. Chen, W. Chen, W. Zheng, and H. Lin. Proceedings of the 19th International Conference on Parallel
Architectures and Compilation Techniques, page 217--226. New York, NY, USA, ACM, (2010)
T. Elsayed, J. Lin, and D. Oard. Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers, page 265--268. Stroudsburg, PA, USA, Association for Computational Linguistics, (2008)
C. Liu, H. Yang, J. Fan, L. He, and Y. Wang. Proceedings of the 19th international conference on World wide web, page 681--690. New York, NY, USA, ACM, (2010)