Downloads: 108
Research Paper | Computer Science & Engineering | India | Volume 4 Issue 4, April 2015
An Improved Hierarchical Technique for Document Clustering
Priti B. Kudal [2] | Prof. Manisha Naoghare [2]
Abstract: Data mining is the process of non-trivial discovery from implied, previously unknown, and potentially useful information from data in large databases. Hence it is a core element in knowledge discovery, often used synonymously. Clustering, one of technique for data mining used for grouping similar terms together. Earlier statistical analysis used in text mining depends on term frequency. Then, new concept based text mining model was introduced which analyses terms. Clustering of document is useful for the purpose of document organization, summarization, and information retrieval in an efficient way. Initially, clustering is applied for enhancing the information retrieval techniques. Of late, clustering techniques have been applied in the areas which involve browsing the gathered data or in categorizing the outcome provided by the search engines for the reply to the query raised by the users. In this paper, we are providing a comprehensive survey over the document clustering.
Keywords: Data Mining, Clustering, Classification, Similarity Measure, Term Frequency
Edition: Volume 4 Issue 4, April 2015,
Pages: 1983 - 1986
Similar Articles with Keyword 'Data Mining'
Downloads: 0
Survey Paper, Computer Science & Engineering, India, Volume 11 Issue 8, August 2022
Pages: 947 - 949COVID-19 Prediction using Machine Learning Algorithms
Saily Suresh Patil
Downloads: 0
Review Papers, Computer Science & Engineering, India, Volume 13 Issue 3, March 2024
Pages: 1036 - 1039