
Survey Paper | Computer Science & Engineering | India | Volume 3 Issue 10, October 2014

Big Data Processing Using Hadoop: Survey on Scheduling

Harshawardhan S. Bhosale | Devendra P. Gadekar

Abstract: The term Big Data describes techniques and technologies for capturing, storing, distributing, managing, and analyzing datasets of petabyte scale or larger that arrive at high velocity and in varied structures. Such data can be structured, semi-structured, or unstructured, which makes conventional data management methods inadequate. Its scale, diversity, and complexity require new architectures, techniques, algorithms, and analytics to manage it and to extract value and hidden knowledge from it. Hadoop, an open-source software framework, is used to process large amounts of data inexpensively and efficiently: it enables the distributed processing of large data sets across clusters of commodity servers. By default, Hadoop schedules jobs with a FIFO (first in, first out) algorithm, and its performance can be improved by using an appropriate scheduling algorithm. The objective of this research is to study and analyze the scheduling algorithms that can be used in Hadoop to improve performance.

Keywords: Big data, Hadoop, Map Reduce, Locality, Job Scheduling

Edition: Volume 3 Issue 10, October 2014

Pages: 272 - 277

An Investigation of the Applications of Artificial Intelligence and Other New Technologies in Smart Energy Infrastructure

Karan Chawla


Informative Article, Computer Science & Engineering, India, Volume 9 Issue 9, September 2020

Pages: 1607 - 1610

Comprehensive Review on Automated Suspicious Activity Report Generation (SAR)

Ankur Mahida
