Downloads: 108

M.Tech / M.E / PhD Thesis | Computer Science & Engineering | India | Volume 4 Issue 8, August 2015

Hadoop Distributed File System and Map Reduce Processing on Multi-Node Cluster

Dr. G. Venkata Rami Reddy ^[3] | CH. V. V. N. Srikanth Kumar

Abstract: Big Data relates to large-volume of growing data which are stored at multiple and autonomous sources. It is a collection of both structured and unstructured data that is too large, fast and distinct to be managed by traditional database management tools or traditional data processing application models. The most fundamental challenges for Big Data applications is to store the large volumes of data on multiple sources and to process it for extracting useful information or knowledge for future actions. Apache Hadoop [1] is a framework that provides reliable shared storage and distributed processing of large data sets across clusters of commodity computers using a programming model. Data Storage is provided by Hadoop Distributed File System (HDFS) [3] and data processing is provided by Map Reduce [2]. The main goal of the project is to implement the core components of Hadoop by designing a multimode cluster and build a common base platform HDFS for storing of huge data at multiple sources and perform Map Reduce processing model on data stored at these multiple nodes.

Keywords: Apache Hadoop, Hadoop Distributed File System, Map Reduce

Edition: Volume 4 Issue 8, August 2015,

Pages: 1424 - 1430

Performance Analysis for Optimizing Hadoop MapReduce Execution

Samiksha Misal | P. S Desai

Share this Article

Downloads: 111 | Weekly Hits: ⮙1 | Monthly Hits: ⮙1

Research Paper, Computer Science & Engineering, India, Volume 3 Issue 11, November 2014

Pages: 1183 - 1187

Memory Management in BigData

Yanish Pradhananga | Shridevi C. Karande

Share this Article

Hadoop Distributed File System and Map Reduce Processing on Multi-Node Cluster

Similar Articles with Keyword 'Apache Hadoop'

Performance Analysis for Optimizing Hadoop MapReduce Execution

Memory Management in BigData