On the Utilization Aspect of Document Data for Mining the Side Information
International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
www.ijsr.net | Open Access | Fully Refereed | Peer Reviewed International Journal

ISSN: 2319-7064



Downloads: 104

Research Paper | Computer Science & Engineering | India | Volume 4 Issue 4, April 2015

On the Utilization Aspect of Document Data for Mining the Side Information

N.S. Krishna Prasad, S. Dhana Sekaran

In text mining applications, side-information is also available along with the text documents. This side-information can be like document provenance information, links existing inside the document, web logs based on user-access behavior, or non-textual attributes which exist in the text document. Such attributes will contain remarkable amount of information for clustering purposes. Usually it-s difficult to estimate the importance of this side-information when they are noisy. In these scenarios, there is a huge amount of risk involved in incorporating this side-information into the mining process, since they can add noise to the process rather than improving the quality of the mining process. We need a standard way to perform the mining process, so that we make best use of the advantages based on this side information. In this paper, we propose an algorithm to create an effective clustering approach, based on the combination of traditional partitioning algorithms with probabilistic models. We also show how to illustrate methodology to the classification problem.

Keywords: Data mining, clustering, Text documents, partitioning algorithm

Edition: Volume 4 Issue 4, April 2015

Pages: 3069 - 3074

Share this Article

How to Cite this Article?

N.S. Krishna Prasad, S. Dhana Sekaran, "On the Utilization Aspect of Document Data for Mining the Side Information", International Journal of Science and Research (IJSR), https://www.ijsr.net/search_index_results_paperid.php?id=SUB153923, Volume 4 Issue 4, April 2015, 3069 - 3074

Enter Your Email Address




Similar Articles with Keyword 'Data mining'

Downloads: 65 | Monthly Hits: ⮙1

Research Paper, Computer Science & Engineering, India, Volume 10 Issue 2, February 2021

Pages: 1669 - 1672

Random Forest Based Heart Disease Prediction

Adeen, Preeti Sondhi

Share this Article

Downloads: 96

Research Paper, Computer Science & Engineering, Kenya, Volume 7 Issue 5, May 2018

Pages: 1409 - 1411

Students Performance Prediction Using FP-Tree Data Mining Techniques

Eliakim Ombati Akama

Share this Article

Similar Articles with Keyword 'clustering'

Downloads: 1 | Monthly Hits: ⮙1

Survey Paper, Computer Science & Engineering, India, Volume 10 Issue 5, May 2021

Pages: 948 - 951

Survey on Various Image Segmentation Techniques

Babita Chauhan

Share this Article

Downloads: 1 | Weekly Hits: ⮙1 | Monthly Hits: ⮙1

Research Paper, Computer Science & Engineering, India, Volume 10 Issue 7, July 2021

Pages: 421 - 424

Comparative Analysis of AI Techniques in the Prediction of Heart Disease

Irtiqa Dhar

Share this Article

Similar Articles with Keyword 'Text documents'

Downloads: 106

M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 5 Issue 5, May 2016

Pages: 2398 - 2403

Text Document Annotation and Retrieval Based on Content of the Document and Query Workload

Arunima P V, Ravinarayana B

Share this Article

Downloads: 108

M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 5 Issue 7, July 2016

Pages: 1240 - 1244

Implementing K-Means Clustering Algorithm Using MapReduce Paradigm

Botcha Chandrasekhara Rao, Medara Rambabu

Share this Article

Similar Articles with Keyword 'partitioning algorithm'

Downloads: 107

Review Papers, Computer Science & Engineering, India, Volume 3 Issue 10, October 2014

Pages: 2135 - 2136

Text Clustering and Classification on the Use of Side Information

Shilpa S. Raut, Prof. V. B. Maral

Share this Article

Downloads: 108

M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 5 Issue 7, July 2016

Pages: 1240 - 1244

Implementing K-Means Clustering Algorithm Using MapReduce Paradigm

Botcha Chandrasekhara Rao, Medara Rambabu

Share this Article

Similar Articles with Keyword 'Data'

Downloads: 502 | Monthly Hits: ⮙7

Research Paper, Computer Science & Engineering, India, Volume 9 Issue 7, July 2020

Pages: 1454 - 1458

Heart Disease Prediction with Machine Learning Approaches

Megha Kamboj

Share this Article

Downloads: 83

Research Paper, Computer Science & Engineering, India, Volume 9 Issue 10, October 2020

Pages: 1626 - 1628

Image Noise Reduction with Autoencoder using Tensor Flow

Jai Sehgal, Dr Yojna Arora

Share this Article

Similar Articles with Keyword 'mining'

Downloads: 1

Survey Paper, Computer Science & Engineering, India, Volume 3 Issue 11, November 2014

Pages: 2205 - 2207

A Survey of Generating Multi-Document Summarizations

Patil Ajita S., P. M. Mane

Share this Article

Downloads: 65 | Monthly Hits: ⮙1

Research Paper, Computer Science & Engineering, India, Volume 10 Issue 2, February 2021

Pages: 1669 - 1672

Random Forest Based Heart Disease Prediction

Adeen, Preeti Sondhi

Share this Article



Top