Almost everyone has heard of Google's MapReduce framework, but very few have ever hacked around with the idea of map and reduce. These two idioms are borrowed from functional programming, and form the basis of Google's framework. Although Python is not a functional programming language, it has built-in support for both of these concepts. A…
Disco is an open-source implementation of the Map-Reduce framework for distributed computing. As the original framework, Disco supports parallel computations over large data sets on unreliable cluster of computers.
MRQL (the Map-Reduce Query Language) is an SQL-like query language for map-reduce computations. It is implemented on top of Apache's Hadoop. MRQL is powerful enough to express most common data analysis tasks over many different kinds of raw data, including hierarchical data and nested collections, such as XML data. It is more powerful than other current languages, such as Hive and Pig Latin, since it can operate on more complex data and supports more powerful query constructs, thus eliminating the need for using explicit map-reduce code.
F. Chierichetti, R. Kumar, and A. Tomkins. WWW '10: Proceedings of the 19th international conference on World wide web, page 231--240. New York, NY, USA, ACM, (2010)
H. chih Yang, A. Dasdan, R. Hsiao, and D. Parker. SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data, page 1029--1040. New York, NY, USA, ACM, (2007)
H. chih Yang, A. Dasdan, R. Hsiao, and D. Parker. SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data, page 1029--1040. New York, NY, USA, ACM, (2007)
C. Chu, S. Kim, Y. Lin, Y. Yu, G. Bradski, A. Ng, and K. Olukotun. Advances in Neural Information Processing Systems 19, Proceedings of the Twentieth Annual Conference on Neural Information Processing Systems Vancouver, British Columbia, Canada, December 4-7, 2006, page 281-288. MIT Press, (2006)
R. Cordeiro, C. Jr., A. Traina, J. López, U. Kang, and C. Faloutsos. Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA, August 21-24, 2011, page 690-698. ACM, (2011)