International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
www.ijsr.net | Most Trusted Research Journal Since Year 2012

ISSN: 2319-7064



M.Tech / M.E / PhD Thesis | Computer Science & Engineering | India | Volume 5 Issue 7, July 2016

Implementing K-Means Clustering Algorithm Using MapReduce Paradigm

Botcha Chandrasekhara Rao, Medara Rambabu

Clustering is a useful data mining technique which groups data points such that the points within a single group have similar characteristics, while the points in different groups are dissimilar. Partitioning algorithm methods such as k-means algorithm is one kind of widely used clustering algorithms. As there is an increasing trend of applications to deal with vast amounts of data, clustering such big data is a challenging problem. Recently, partitioning clustering algorithms on a large cluster of commodity machines using the MapReduce framework have received a lot of attention. Traditional way of clustering text documents is Vector space model, in which tf-idf is used for k-means algorithm with supportive similarity measure. This project exhibits an approach to cluster text documents in which results obtained by executing map reduce k-means algorithm on single node cluster show that the performance of the algorithm increases as the text corpus increases.

Keywords: Vector space model, map reduce, text clustering, map reduce k-means, Hadoop

Edition: Volume 5 Issue 7, July 2016

Pages: 1240 - 1244

Share this Article

How to Cite this Article?

Botcha Chandrasekhara Rao, Medara Rambabu, "Implementing K-Means Clustering Algorithm Using MapReduce Paradigm", International Journal of Science and Research (IJSR), https://www.ijsr.net/search_index_results_paperid.php?id=14071601, Volume 5 Issue 7, July 2016, 1240 - 1244

34 PDF Views | 31 PDF Downloads

Download Article PDF

Similar Articles with Keyword 'Vector space model'

M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 5 Issue 7, July 2016

Pages: 1240 - 1244

Implementing K-Means Clustering Algorithm Using MapReduce Paradigm

Botcha Chandrasekhara Rao, Medara Rambabu

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 3 Issue 3, March 2014

Pages: 410 - 412

Multi Keyword Searching Techniques over Encrypted Cloud Data

P. Shanmuga Priya, R. Sugumar

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 5 Issue 6, June 2016

Pages: 2044 - 2048

Multi-keyword Ranked Search Over Encrypted Cloud Data Supporting Synonym Query

Siddheshwar S. Metkari, Dr. S. B. Sonkamble

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 3 Issue 6, June 2014

Pages: 496 - 501

Development of Secure Multikeyword Retrieval Methodology for Encrypted Cloud Data

Deepak I M, K.R. Shylaja, Ravinandan M E

Share this Article

Similar Articles with Keyword 'map reduce'

Research Paper, Computer Science & Engineering, India, Volume 3 Issue 11, November 2014

Pages: 619 - 625

Customized Travel Itinerary Mining for Tourism Services

Bonuguntla Saranya, Miryala Venkatesh

Share this Article

Survey Paper, Computer Science & Engineering, India, Volume 3 Issue 12, December 2014

Pages: 1510 - 1513

A Survey on Optimal Data Storage of Cache Manager for Big Data Using Map Reduce Framework

Rupali Pashte, Ritesh Thakur

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 6, June 2015

Pages: 2762 - 2766

Large Scale Data Shared by Peer to Peer Based System in Shared Network

Bhavsar Harshada V., Dr. S. V. Gumaste, Prof. Deokate Gajanan S.

Share this Article

Survey Paper, Computer Science & Engineering, India, Volume 3 Issue 12, December 2014

Pages: 1916 - 1919

Data Anonymization Using Map Reduce On Cloud by Using Scalable Two - Phase Top-Down Specialization Approach

Rahul.S Ransing, M. S. Patole

Share this Article

Review Papers, Computer Science & Engineering, India, Volume 3 Issue 12, December 2014

Pages: 2112 - 2115

Approach to Solve NP Complete Problem Using Game Theoretic Scheduling Algorithm and Map-Reduce on Clouds

V. Mogal, Shekhar H. Pingale

Share this Article

Similar Articles with Keyword 'text clustering'

Review Papers, Computer Science & Engineering, India, Volume 3 Issue 10, October 2014

Pages: 2135 - 2136

Text Clustering and Classification on the Use of Side Information

Shilpa S. Raut, Prof. V. B. Maral

Share this Article

Survey Paper, Computer Science & Engineering, India, Volume 3 Issue 12, December 2014

Pages: 1787 - 1791

Document Clustering Approach for Forensic Analysis: A Survey

Prachi K. Khairkar, D. A. Phalke

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 12, December 2015

Pages: 1420 - 1423

Text Clustering With Using Side Information

Shubhangi V. Airekar, Dhanshree S. Kulkurni

Share this Article

M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 5 Issue 7, July 2016

Pages: 1240 - 1244

Implementing K-Means Clustering Algorithm Using MapReduce Paradigm

Botcha Chandrasekhara Rao, Medara Rambabu

Share this Article

Review Papers, Computer Science & Engineering, India, Volume 3 Issue 5, May 2014

Pages: 1735 - 1738

Text Document Clustering Approach: A Brief Review of Literature

Ruchika Mavis Daniel, Arun Kumar Shukla

Share this Article

Similar Articles with Keyword 'Hadoop'

Survey Paper, Computer Science & Engineering, India, Volume 4 Issue 10, October 2015

Pages: 1707 - 1708

Survey Paper on Twitter Sentiment Analysis Using Portar Stemming Algorithm

Nishad Patil, Tingre Sayali, Thorat Kalyani, Shivshetty Swapnil, Patil Shwetal

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 6 Issue 6, June 2017

Pages: 1711 - 1716

Live Data Stream Classification for Reducing Query Processing Time: Design and Analysis

Spraha Kamriya, Vandana Kate

Share this Article

M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 4 Issue 4, April 2015

Pages: 3150 - 3153

Enhancing the Hadoop Performance through Data Placement in Heterogeneous Hadoop Cluster

A Ankita Poovaiah, Gopal B

Share this Article

Survey Paper, Computer Science & Engineering, India, Volume 4 Issue 5, May 2015

Pages: 1445 - 1448

Impala: Open Source, Native Analytic Database for Apache Hadoop - A Review Paper

Prof. Pramod Patil, Amit Patange

Share this Article

Survey Paper, Computer Science & Engineering, India, Volume 4 Issue 11, November 2015

Pages: 1249 - 1251

Survey on Resource Allocation in Phase-Level using MapReduce in Hadoop

Suryakant S. Bhalke

Share this Article



Top