Downloads: 122 | Views: 158
Survey Paper | Computer Science & Engineering | India | Volume 4 Issue 1, January 2015
A Survey on Hierarchical Merge for Hadoop-A
Vaibhav Dhore [2] | Sonali R. Jagtap
Abstract: Hadoop is a popular open source implementation of the Map Reduce programming model for cloud computing. However, it faces a number of issues to achieve the best performance from the underlying systems. These include a serialization barrier that delays the reduce phase, repetitive merges, and disk accesses, and the lack of portability to different interconnects. We describe Hadoop-A, an acceleration framework that optimizes Hadoop with plug-in components for fast data movement, overcoming the existing limitations. A Hierarchical merge algorithm is introduced to merge data without repetition and disk access. In addition, we are using virtual shuffling to reduce disk access.
Keywords: Hadoop, MapReduce, virtual shuffling, hierarchical merge, Hadoop acceleration
Edition: Volume 4 Issue 1, January 2015,
Pages: 591 - 593