On the Utilization Aspect of Document Data for Mining the Side Information
International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
www.ijsr.net | Open Access | Fully Refereed | Peer Reviewed International Journal

ISSN: 2319-7064

Views: 138 , Downloads: 101 | CTR: 73 % | Weekly Popularity: ⮙1

Research Paper | Computer Science & Engineering | India | Volume 4 Issue 4, April 2015

On the Utilization Aspect of Document Data for Mining the Side Information

N.S. Krishna Prasad, S. Dhana Sekaran

In text mining applications, side-information is also available along with the text documents. This side-information can be like document provenance information, links existing inside the document, web logs based on user-access behavior, or non-textual attributes which exist in the text document. Such attributes will contain remarkable amount of information for clustering purposes. Usually it-s difficult to estimate the importance of this side-information when they are noisy. In these scenarios, there is a huge amount of risk involved in incorporating this side-information into the mining process, since they can add noise to the process rather than improving the quality of the mining process. We need a standard way to perform the mining process, so that we make best use of the advantages based on this side information. In this paper, we propose an algorithm to create an effective clustering approach, based on the combination of traditional partitioning algorithms with probabilistic models. We also show how to illustrate methodology to the classification problem.

Keywords: Data mining, clustering, Text documents, partitioning algorithm

Edition: Volume 4 Issue 4, April 2015

Pages: 3069 - 3074

Share this Article

How to Cite this Article?

N.S. Krishna Prasad, S. Dhana Sekaran, "On the Utilization Aspect of Document Data for Mining the Side Information", International Journal of Science and Research (IJSR), https://www.ijsr.net/search_index_results_paperid.php?id=SUB153923, Volume 4 Issue 4, April 2015, 3069 - 3074

138 PDF Views | 101 PDF Downloads

Download Article PDF



Similar Articles with Keyword 'Data mining'

Views: 73 , Downloads: 51 | CTR: 70 % | Weekly Popularity: ⮙2

Research Paper, Computer Science & Engineering, India, Volume 10 Issue 2, February 2021

Pages: 1669 - 1672

Random Forest Based Heart Disease Prediction

Adeen, Preeti Sondhi

Share this Article

Views: 156 , Downloads: 94 | CTR: 60 %

Research Paper, Computer Science & Engineering, Kenya, Volume 7 Issue 5, May 2018

Pages: 1409 - 1411

Students Performance Prediction Using FP-Tree Data Mining Techniques

Eliakim Ombati Akama

Share this Article

Views: 144 , Downloads: 97 | CTR: 67 % | Weekly Popularity: ⮙1

Survey Paper, Computer Science & Engineering, India, Volume 3 Issue 12, December 2014

Pages: 528 - 532

Knowledge Fusion Technique Using Classifier Ensemble by Combining the Sets of Classification Rules

Jaydeep B. Patil, Vaishali Nandedkar

Share this Article

Views: 135 , Downloads: 98 | CTR: 73 % | Weekly Popularity: ⮙4

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 11, November 2015

Pages: 1165 - 1168

Privacy Preserving Closed Frequent Pattern Mining

Anju Vijayan

Share this Article

Views: 135 , Downloads: 98 | CTR: 73 % | Weekly Popularity: ⮙3

Survey Paper, Computer Science & Engineering, India, Volume 4 Issue 11, November 2015

Pages: 2507 - 2509

A Survey on Extended MI technique for Edit Recommendation using Hybrid History Mining and Relevance Feedback

Shradha P. Patil, B. Padmavathi

Share this Article

Similar Articles with Keyword 'clustering'

Views: 133 , Downloads: 72 | CTR: 54 %

Review Papers, Computer Science & Engineering, India, Volume 3 Issue 11, November 2014

Pages: 1936 - 1938

A Mining Method to Predict Patients DOSH

Ruchi Rathor, Pankaj Agarkar

Share this Article

Views: 144 , Downloads: 95 | CTR: 66 % | Weekly Popularity: ⮙2

Review Papers, Computer Science & Engineering, India, Volume 3 Issue 10, October 2014

Pages: 2253 - 2256

Survey on Hubness - Based Clustering Algorithms

Nikita Dhamal, Antara Bhatttacharya

Share this Article

Views: 155 , Downloads: 98 | CTR: 63 % | Weekly Popularity: ⮙4

Review Papers, Computer Science & Engineering, India, Volume 4 Issue 2, February 2015

Pages: 2461 - 2466

A Review of Text Mining Techniques Associated with Various Application Areas

Dr. Shilpa Dang, Peerzada Hamid Ahmad

Share this Article

Views: 143 , Downloads: 99 | CTR: 69 % | Weekly Popularity: ⮙6

Survey Paper, Computer Science & Engineering, India, Volume 3 Issue 5, May 2014

Pages: 1742 - 1745

Enhancement of Leach Protocol in Wireless Sensor Network

Bipin Patel, Hardik Kadiya

Share this Article

Views: 128 , Downloads: 99 | CTR: 77 % | Weekly Popularity: ⮙4

Research Paper, Computer Science & Engineering, India, Volume 3 Issue 7, July 2014

Pages: 872 - 877

Ranking and Clustering of Software Cost Estimation Models

Vijaya Wable, S. M. Shinde

Share this Article

Similar Articles with Keyword 'Text documents'

Views: 138 , Downloads: 101 | CTR: 73 % | Weekly Popularity: ⮙1

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 4, April 2015

Pages: 3069 - 3074

On the Utilization Aspect of Document Data for Mining the Side Information

N.S. Krishna Prasad, S. Dhana Sekaran

Share this Article

Views: 125 , Downloads: 104 | CTR: 83 % | Weekly Popularity: ⮙3

M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 5 Issue 5, May 2016

Pages: 2398 - 2403

Text Document Annotation and Retrieval Based on Content of the Document and Query Workload

Arunima P V, Ravinarayana B

Share this Article

Views: 134 , Downloads: 106 | CTR: 79 % | Weekly Popularity: ⮙2

M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 5 Issue 7, July 2016

Pages: 1240 - 1244

Implementing K-Means Clustering Algorithm Using MapReduce Paradigm

Botcha Chandrasekhara Rao, Medara Rambabu

Share this Article

Views: 124 , Downloads: 107 | CTR: 86 % | Weekly Popularity: ⮙1

Survey Paper, Computer Science & Engineering, India, Volume 3 Issue 11, November 2014

Pages: 2366 - 2368

A Survey on Effective Quality Enhancement of Text Clustering & Classification Using METADATA

Padmaja Shivane, Rakesh Rajani

Share this Article

Views: 123 , Downloads: 110 | CTR: 89 % | Weekly Popularity: ⮙1

Research Paper, Computer Science & Engineering, India, Volume 5 Issue 5, May 2016

Pages: 2046 - 2050

Text Categorization using Jaccard Coefficient for Text Messages

Ankita Jadhao, Dr. A. J. Agrawal

Share this Article

Similar Articles with Keyword 'partitioning algorithm'

Views: 138 , Downloads: 101 | CTR: 73 % | Weekly Popularity: ⮙1

Research Paper, Computer Science & Engineering, India, Volume 4 Issue 4, April 2015

Pages: 3069 - 3074

On the Utilization Aspect of Document Data for Mining the Side Information

N.S. Krishna Prasad, S. Dhana Sekaran

Share this Article

Views: 136 , Downloads: 104 | CTR: 76 % | Weekly Popularity: ⮙2

Review Papers, Computer Science & Engineering, India, Volume 3 Issue 10, October 2014

Pages: 2135 - 2136

Text Clustering and Classification on the Use of Side Information

Shilpa S. Raut, Prof. V. B. Maral

Share this Article

Views: 134 , Downloads: 106 | CTR: 79 % | Weekly Popularity: ⮙2

M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 5 Issue 7, July 2016

Pages: 1240 - 1244

Implementing K-Means Clustering Algorithm Using MapReduce Paradigm

Botcha Chandrasekhara Rao, Medara Rambabu

Share this Article

Views: 144 , Downloads: 119 | CTR: 83 % | Weekly Popularity: ⮙1

M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 4 Issue 11, November 2015

Pages: 809 - 811

An Adaptive Partitioning Technique to Improve the Performance of Bigdata

R. Siva Kumar, K. Nageswara Rao

Share this Article

Views: 172 , Downloads: 147 | CTR: 85 % | Weekly Popularity: ⮙2

Research Paper, Computer Science & Engineering, India, Volume 6 Issue 7, July 2017

Pages: 572 - 575

Detecting Overlapping Nodes in MLM Chain Network

Sukhada Vader, Mugdha Kirkire, Rajvardhan Babar

Share this Article
Top