On the Utilization Aspect of Document Data for Mining the Side Information
International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
www.ijsr.net | Open Access | Fully Refereed | Peer Reviewed International Journal

ISSN: 2319-7064

Research Paper | Computer Science & Engineering | India | Volume 4 Issue 4, April 2015

On the Utilization Aspect of Document Data for Mining the Side Information

N.S. Krishna Prasad, S. Dhana Sekaran

In text mining applications, side-information is also available along with the text documents. This side-information can be like document provenance information, links existing inside the document, web logs based on user-access behavior, or non-textual attributes which exist in the text document. Such attributes will contain remarkable amount of information for clustering purposes. Usually it-s difficult to estimate the importance of this side-information when they are noisy. In these scenarios, there is a huge amount of risk involved in incorporating this side-information into the mining process, since they can add noise to the process rather than improving the quality of the mining process. We need a standard way to perform the mining process, so that we make best use of the advantages based on this side information. In this paper, we propose an algorithm to create an effective clustering approach, based on the combination of traditional partitioning algorithms with probabilistic models. We also show how to illustrate methodology to the classification problem.

Keywords: Data mining, clustering, Text documents, partitioning algorithm

Edition: Volume 4 Issue 4, April 2015

Pages: 3069 - 3074

Share this Article

How to Cite this Article?

N.S. Krishna Prasad, S. Dhana Sekaran, "On the Utilization Aspect of Document Data for Mining the Side Information", International Journal of Science and Research (IJSR), https://www.ijsr.net/search_index_results_paperid.php?id=SUB153923, Volume 4 Issue 4, April 2015, 3069 - 3074

109 PDF Views | 79 PDF Downloads

Download Article PDF



Similar Articles with Keyword 'Data mining'

Research Paper, Computer Science & Engineering, Kenya, Volume 7 Issue 5, May 2018

Pages: 1409 - 1411

Students Performance Prediction Using FP-Tree Data Mining Techniques

Eliakim Ombati Akama

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 11, November 2015

Pages: 1165 - 1168

Privacy Preserving Closed Frequent Pattern Mining

Anju Vijayan

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 10, October 2015

Pages: 2041 - 2047

Adaptive Analysis of Knowledge Engineering and Pattern Recognition on Medical Data

Jani Basha, Dasari. Rajesh

Share this Article

Survey Paper, Computer Science & Engineering, India, Volume 3 Issue 12, December 2014

Pages: 2374 - 2376

A Survey of Novel Clustering and Knowledge Extraction from Log

Vasim Dilawar Mujawar, Prof. Pratima Bhati

Share this Article

M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 4 Issue 4, April 2015

Pages: 2790 - 2794

Risk Distribution and Validation of Data in Passport Data Analysis Using Cluster Analysis

Sucheta Gulia, Dr. Rajan Vohra

Share this Article

Similar Articles with Keyword 'clustering'

Review Papers, Computer Science & Engineering, India, Volume 3 Issue 11, November 2014

Pages: 1936 - 1938

A Mining Method to Predict Patients DOSH

Ruchi Rathor, Pankaj Agarkar

Share this Article

Review Papers, Computer Science & Engineering, India, Volume 3 Issue 10, October 2014

Pages: 2253 - 2256

Survey on Hubness - Based Clustering Algorithms

Nikita Dhamal, Antara Bhatttacharya

Share this Article

M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 3 Issue 12, December 2014

Pages: 2275 - 2278

Data Collection from Clusters in Wireless Sensor Network with Help of Mobile Nodes

Suraj Borge, Mayura Kinikar

Share this Article

Survey Paper, Computer Science & Engineering, India, Volume 3 Issue 5, May 2014

Pages: 1742 - 1745

Enhancement of Leach Protocol in Wireless Sensor Network

Bipin Patel, Hardik Kadiya

Share this Article

Research Paper, Computer Science & Engineering, Egypt, Volume 3 Issue 11, November 2014

Pages: 1128 - 1132

Keyword Extraction using Clustering and Semantic Analysis

Dr. Mohamed H. Haggag, Dr.Amal Abutabl, Ahmed Basil

Share this Article

Similar Articles with Keyword 'Text documents'

Survey Paper, Computer Science & Engineering, India, Volume 3 Issue 11, November 2014

Pages: 2366 - 2368

A Survey on Effective Quality Enhancement of Text Clustering & Classification Using METADATA

Padmaja Shivane, Rakesh Rajani

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 4, April 2015

Pages: 3069 - 3074

On the Utilization Aspect of Document Data for Mining the Side Information

N.S. Krishna Prasad, S. Dhana Sekaran

Share this Article

M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 5 Issue 5, May 2016

Pages: 2398 - 2403

Text Document Annotation and Retrieval Based on Content of the Document and Query Workload

Arunima P V, Ravinarayana B

Share this Article

M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 5 Issue 7, July 2016

Pages: 1240 - 1244

Implementing K-Means Clustering Algorithm Using MapReduce Paradigm

Botcha Chandrasekhara Rao, Medara Rambabu

Share this Article

Review Papers, Computer Science & Engineering, India, Volume 4 Issue 7, July 2015

Pages: 821 - 823

A Review on Technique Used for Text and Image Categorization Using Feature Clustering

Dipak R. Pardhi, Charushila D. Patil

Share this Article

Similar Articles with Keyword 'partitioning algorithm'

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 4, April 2015

Pages: 3069 - 3074

On the Utilization Aspect of Document Data for Mining the Side Information

N.S. Krishna Prasad, S. Dhana Sekaran

Share this Article

M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 5 Issue 7, July 2016

Pages: 1240 - 1244

Implementing K-Means Clustering Algorithm Using MapReduce Paradigm

Botcha Chandrasekhara Rao, Medara Rambabu

Share this Article

Review Papers, Computer Science & Engineering, India, Volume 3 Issue 10, October 2014

Pages: 2135 - 2136

Text Clustering and Classification on the Use of Side Information

Shilpa S. Raut, Prof. V. B. Maral

Share this Article

M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 4 Issue 11, November 2015

Pages: 809 - 811

An Adaptive Partitioning Technique to Improve the Performance of Bigdata

R. Siva Kumar, K. Nageswara Rao

Share this Article

Research Paper, Computer Science & Engineering, India, Volume 6 Issue 7, July 2017

Pages: 572 - 575

Detecting Overlapping Nodes in MLM Chain Network

Sukhada Vader, Mugdha Kirkire, Rajvardhan Babar

Share this Article
Top