Downloads: 107 | Views: 119
Review Papers | Computer Science & Engineering | India | Volume 3 Issue 10, October 2014
Text Clustering and Classification on the Use of Side Information
Shilpa S. Raut | Prof. V. B. Maral
Abstract: Side-information is present with the text document in many text mining applications. An user-access behavior from web logs, or other non-textual attributes embedded into the text document, the links in the document, document provenance information etc are nothing but side information. These attributes contains a vast amount of information for clustering purposes. But it is difficult to estimate the relative importance when some information is noisy. In that case, it will be risky to incorporate side-information into mining process as there is possibility that it will increase the quality of the representation for the mining process or may add a noise to process. Thus a proper way to carry out the mining process is needed such that it will maximize the advantages form using side information. So in this topic, an algorithm is designed, in order to give an effective clustering algorithm. This algorithm combines classical partitioning algorithms with probabilistic models, then show how to extend the approach to the classification problem.
Keywords: clustering, classifiers information, text mining, text collection, clustering methods
Edition: Volume 3 Issue 10, October 2014,
Pages: 2135 - 2136
Similar Articles with Keyword 'clustering'
Downloads: 0
Student Project, Computer Science & Engineering, India, Volume 11 Issue 6, June 2022
Pages: 1875 - 1880Microclustering with Outlier Detection for DADC
Aswathy Priya M.
Downloads: 2
Research Paper, Computer Science & Engineering, India, Volume 10 Issue 9, September 2021
Pages: 649 - 652Image Segmentation using Biogeography based Optimization and its Comparison with K Means Clustering
Babita Chauhan [2] | Preeti Sondhi [7]