Downloads: 108
M.Tech / M.E / PhD Thesis | Computer Science & Engineering | India | Volume 4 Issue 8, August 2015
Hadoop Distributed File System and Map Reduce Processing on Multi-Node Cluster
Dr. G. Venkata Rami Reddy [3] | CH. V. V. N. Srikanth Kumar
Abstract: Big Data relates to large-volume of growing data which are stored at multiple and autonomous sources. It is a collection of both structured and unstructured data that is too large, fast and distinct to be managed by traditional database management tools or traditional data processing application models. The most fundamental challenges for Big Data applications is to store the large volumes of data on multiple sources and to process it for extracting useful information or knowledge for future actions. Apache Hadoop [1] is a framework that provides reliable shared storage and distributed processing of large data sets across clusters of commodity computers using a programming model. Data Storage is provided by Hadoop Distributed File System (HDFS) [3] and data processing is provided by Map Reduce [2]. The main goal of the project is to implement the core components of Hadoop by designing a multimode cluster and build a common base platform HDFS for storing of huge data at multiple sources and perform Map Reduce processing model on data stored at these multiple nodes.
Keywords: Apache Hadoop, Hadoop Distributed File System, Map Reduce
Edition: Volume 4 Issue 8, August 2015,
Pages: 1424 - 1430
Similar Articles with Keyword 'Apache Hadoop'
Downloads: 109
Survey Paper, Computer Science & Engineering, India, Volume 5 Issue 6, June 2016
Pages: 2219 - 2223Performance Analysis for Optimizing Hadoop MapReduce Execution
Samiksha Misal | P. S Desai
Downloads: 111 | Weekly Hits: ⮙1 | Monthly Hits: ⮙1
Research Paper, Computer Science & Engineering, India, Volume 3 Issue 11, November 2014
Pages: 1183 - 1187Memory Management in BigData
Yanish Pradhananga | Shridevi C. Karande