International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064


Downloads: 124

Research Paper | Computer Science & Engineering | India | Volume 4 Issue 4, April 2015


Implementation of Hadoop Based Framework for Parallel Processing of Biological Data

Praveen Kumar B | Nirmala Bariker


Abstract: Bioinformatics is challenged by the fact that traditional analysis tools have difficulty in processing large-scale datafrom high-throughput sequencing. Hadoop is designed to process large data sets (petabytes). It becomes a bottleneck, when handling massive small files because the name node utilize more memory to store the metadata of files and the data nodes consumes more CPU time to process massive small files. The open source Apache Hadoop project, which in this paper, presenting the Optimized Hadoop, consists of Merge Model to merge massive small files into a single large file and introduced the efficient indexing mechanism and adopts the MapReduce frame-work using decision classification rule for analysis and Diagnosis of Iris Plants data through a distributed file system to achieve scalable, efficient and reliable computing performance on Linux clusters of low cost commodity machines. Our experimental result shows that Optimized Hadoop improves performance of processing small files drastically up to 90.83 % and effectively reduces the memory utilization of the name node to store the metadata of files.


Keywords: Hadoop, Hadoop Distributed File System, Map, Reduce, Small Files, Iris plants data, Decision Tree, Classification rule


Edition: Volume 4 Issue 4, April 2015,


Pages: 1087 - 1091


How to Download this Article?

You Need to Register Your Email Address Before You Can Download the Article PDF


How to Cite this Article?

Praveen Kumar B, Nirmala Bariker, "Implementation of Hadoop Based Framework for Parallel Processing of Biological Data", International Journal of Science and Research (IJSR), Volume 4 Issue 4, April 2015, pp. 1087-1091, https://www.ijsr.net/get_abstract.php?paper_id=SUB153232

Similar Articles with Keyword 'Hadoop'

Downloads: 1

Research Paper, Computer Science & Engineering, India, Volume 10 Issue 6, June 2021

Pages: 1188 - 1193

Profit Contribution of Bank Customer from Different Business Liabilities

Vinod Desai | Shalini B Ullagaddi | Vittal A Odeyar

Share this Article

Downloads: 1

Research Paper, Computer Science & Engineering, India, Volume 11 Issue 1, January 2022

Pages: 1229 - 1231

Big Data in Healthcare

Pratiksha Patil

Share this Article
Top