Apache's Hadoop project aims to solve these problems by providing a framework for running large data processing applications on clusters of commodity hardware. Combined with Amazon EC2 for running the application and Amazon S3 for storing the data, it lets us run large jobs very economically. This paper describes how to use Amazon Web Services and Hadoop to run an ad hoc analysis on a large collection of web access logs that would otherwise have been prohibitively expensive in time or money.
P. Sethia, and K. Karlapalem. Engineering Applications of Artificial Intelligence, 24 (7): 1120--1127 (2011). Infrastructures and Tools for Multiagent Systems.
G. Sadasivam, and G. Baktavatchalam. MDAC '10: Proceedings of the 2010 Workshop on Massive Data Analytics on the Cloud, page 1--7. New York, NY, USA, ACM, (2010)
D. Knoell, M. Atzmueller, C. Rieder, and K. Scherer. Proc. GWEM 2017, co-located with 9th Conference Professional Knowledge Management (WM 2017), Karlsruhe, Germany, KIT, (2017, in press)
D. Knoell, M. Atzmueller, C. Rieder, and K. Scherer. Proc. GWEM 2017, co-located with 9th Conference Professional Knowledge Management (WM 2017), Karlsruhe, Germany, KIT, (2017)
J. Lin. SIGIR '09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, page 155--162. New York, NY, USA, ACM, (2009)
G. Limaye, J. Chaudhary, and P. Punjabi. International Journal on Recent and Innovation Trends in Computing and Communication, 3 (3): 1699--1703 (March 2015)
T. Tran, and T. Nguyen. Poster & System Demonstration Proceedings of the 13th International Semantic Web Conference (ISWC 2014), Riva del Garda, Italy, October 2014, (2014)
A. Goyal, F. Bonchi, and L. Lakshmanan. WSDM '10: Proceedings of the third ACM international conference on Web search and data mining, page 241--250. New York, NY, USA, ACM, (2010)
H.-C. Yang, A. Dasdan, R. Hsiao, and D. Parker. SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data, page 1029--1040. New York, NY, USA, ACM, (2007)
T. Sandholm, and K. Lai. SIGMETRICS '09: Proceedings of the eleventh international joint conference on Measurement and modeling of computer systems, page 299--310. New York, NY, USA, ACM, (2009)
F. Chierichetti, R. Kumar, and A. Tomkins. WWW '10: Proceedings of the 19th international conference on World wide web, page 231--240. New York, NY, USA, ACM, (2010)
T. Elsayed, J. Lin, and D. Oard. Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers, page 265--268. Stroudsburg, PA, USA, Association for Computational Linguistics, (2008)
C. Olston, B. Reed, U. Srivastava, R. Kumar, and A. Tomkins. SIGMOD '08: Proceedings of the 2008 ACM SIGMOD international conference on Management of data, page 1099--1110. New York, NY, USA, ACM, (2008)
C. Schmitz, G. Peled, and O. Koren. Proceedings of the International Conference on Information Integration and Web-Based Applications & Services (IIWAS 2021), (2021)
M. Bayir, I. Toroslu, A. Cosar, and G. Fidan. WWW '09: Proceedings of the 18th international conference on World wide web, page 161--170. New York, NY, USA, ACM, (2009)