Research Article | Open Access
Volume 16 | Number 1 | Year 2014 | Article Id. IJCTT-V16P128 | DOI: https://doi.org/10.14445/22312803/IJCTT-V16P128
Data Science: Bigtable, MapReduce and Google File System
Karan B. Maniar, Chintan B. Khatri
Citation:
Karan B. Maniar, Chintan B. Khatri, "Data Science: Bigtable, MapReduce and Google File System," International Journal of Computer Trends and Technology (IJCTT), vol. 16, no. 1, pp. 115-118, 2014. Crossref, https://doi.org/10.14445/22312803/IJCTT-V16P128
Abstract
Data science is the practice of extracting research findings and drawing conclusions from data[1]. Bigtable is built on several other Google technologies[2]. MapReduce is a programming model and an associated implementation for processing and generating large data sets with a parallel, distributed algorithm on a cluster[3]. Google File System is designed to provide efficient, reliable access to data using large clusters of commodity hardware[4]. This paper discusses Bigtable, MapReduce and Google File System, and briefly covers the top 10 algorithms in data mining.
Keywords
Data Science, Bigtable, MapReduce, Google File System, Top 10 algorithms in data mining.
References
1. en.wikipedia.org/wiki/Data_science
2. en.wikipedia.org/wiki/BigTable
3. en.wikipedia.org/wiki/MapReduce
4. en.wikipedia.org/wiki/Google_File_System
5. Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, Robert E. Gruber. “Bigtable: A Distributed Storage System for Structured Data”, 2006.
6. Xiao Chen. “Google Big Table”, 2010.
7. Jeffrey Dean and Sanjay Ghemawat. “MapReduce: Simplified Data Processing on Large Clusters”, 2004.
8. Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung. “The Google File System”, 2003.
9. http://www-01.ibm.com/software/data/infosphere/hadoop/mapreduce/
10. http://computer.howstuffworks.com/internet/basics/google-file-system.htm
11. Xindong Wu, Vipin Kumar, J. Ross Quinlan, Joydeep Ghosh, Qiang Yang, Hiroshi Motoda, Geoffrey J. McLachlan, Angus Ng, Bing Liu, Philip S. Yu, Zhi-Hua Zhou, Michael Steinbach, David J. Hand, Dan Steinberg. “Top 10 algorithms in data mining”, 2008.