International Journal of Science and Research (IJSR)

International Journal of Science and Research (IJSR)
Call for Papers | Fully Refereed | Open Access | Double Blind Peer Reviewed

ISSN: 2319-7064

Downloads: 107 | Views: 145

Review Papers | Computer Science & Engineering | India | Volume 3 Issue 10, October 2014

Text Clustering and Classification on the Use of Side Information

Shilpa S. Raut | Prof. V. B. Maral

Abstract: Side-information is present with the text document in many text mining applications. An user-access behavior from web logs, or other non-textual attributes embedded into the text document, the links in the document, document provenance information etc are nothing but side information. These attributes contains a vast amount of information for clustering purposes. But it is difficult to estimate the relative importance when some information is noisy. In that case, it will be risky to incorporate side-information into mining process as there is possibility that it will increase the quality of the representation for the mining process or may add a noise to process. Thus a proper way to carry out the mining process is needed such that it will maximize the advantages form using side information. So in this topic, an algorithm is designed, in order to give an effective clustering algorithm. This algorithm combines classical partitioning algorithms with probabilistic models, then show how to extend the approach to the classification problem.

Keywords: clustering, classifiers information, text mining, text collection, clustering methods

Edition: Volume 3 Issue 10, October 2014,

Pages: 2135 - 2136

How to Download this Article?

Type Your Email Address below to Receive the Article PDF Link

Verification Code will appear in 2 Seconds ... Wait