Informative Article | Computer Science & Engineering | India | Volume 3 Issue 5, May 2014
Wordnet Based Document Clustering
Madhavi Katamaneni | Ashok Cheerala
Abstract: Document clustering is an important tool in the era of rapidly growing information. It is the process of grouping text documents into categories and has found applications in domains such as information retrieval and web information systems. Ontology-based computing is emerging as a natural evolution of existing technologies for dealing with the information onslaught. In this dissertation work, background knowledge derived from WordNet, used as an ontology, is applied during document preprocessing for clustering. Document vectors constructed from WordNet synsets are used as input for clustering. A comparative analysis is carried out between clustering with k-means and clustering with bisecting k-means. A document categorization tool is developed that summarizes the hierarchy of concepts obtained from WordNet during the clustering phase. The GUI tool shows the association between WordNet concepts and the documents belonging to each concept.
Keywords: Document clustering, Ontology, BOW, POS Tagging, Stemming, Labeling
Edition: Volume 3 Issue 5, May 2014
Pages: 1610 - 1618
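The pipeline outlined in the abstract (concept-based document vectors, then k-means versus bisecting k-means) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration, not the authors' implementation: `TOY_SYNSETS` is a hypothetical hand-written stand-in for a real WordNet lookup, the six documents are invented, the initialization is a deterministic farthest-first heuristic, and the bisecting variant splits the largest cluster at each step (other criteria, such as splitting the cluster with the highest error, are also common).

```python
import math
from collections import Counter

# Hypothetical stand-in for a WordNet lookup: word -> concept label.
# A real system would resolve words to WordNet synsets instead.
TOY_SYNSETS = {
    "car": "vehicle", "truck": "vehicle", "bus": "vehicle",
    "dog": "animal", "cat": "animal", "horse": "animal",
    "apple": "fruit", "banana": "fruit", "mango": "fruit",
}
CONCEPTS = ["vehicle", "animal", "fruit"]


def concept_vector(text):
    """Count concept occurrences in a document and L2-normalize the result."""
    counts = Counter(TOY_SYNSETS[w] for w in text.lower().split() if w in TOY_SYNSETS)
    vec = [float(counts[c]) for c in CONCEPTS]
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def kmeans(vectors, k, iters=20):
    """Plain k-means; returns clusters as lists of indices into `vectors`."""
    # Deterministic farthest-first initialization (an illustrative choice).
    centroids = [vectors[0]]
    while len(centroids) < k:
        centroids.append(
            max(vectors, key=lambda v: min(sq_dist(v, c) for c in centroids))
        )
    clusters = [[] for _ in range(k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for i, v in enumerate(vectors):
            nearest = min(range(k), key=lambda j: sq_dist(v, centroids[j]))
            clusters[nearest].append(i)
        # Keep the old centroid if a cluster ends up empty.
        centroids = [
            mean([vectors[i] for i in members]) if members else centroids[j]
            for j, members in enumerate(clusters)
        ]
    return clusters


def bisecting_kmeans(vectors, k):
    """Repeatedly split the largest cluster in two with 2-means until k clusters."""
    clusters = [list(range(len(vectors)))]
    while len(clusters) < k:
        clusters.sort(key=len)
        target = clusters.pop()  # largest cluster
        halves = [h for h in kmeans([vectors[i] for i in target], 2) if h]
        if len(halves) < 2:  # cluster cannot be split further
            clusters.append(target)
            break
        # Map local indices back to indices in the full collection.
        clusters.extend([target[i] for i in half] for half in halves)
    return clusters


docs = [
    "car truck bus", "bus car",
    "dog cat", "horse dog cat",
    "apple banana", "mango apple banana",
]
vectors = [concept_vector(d) for d in docs]
flat = kmeans(vectors, 3)
bisected = bisecting_kmeans(vectors, 3)
print("k-means:          ", sorted(sorted(c) for c in flat))
print("bisecting k-means:", sorted(sorted(c) for c in bisected))
```

On this toy data the two methods are expected to agree, since the concept groups are cleanly separated; on real collections the results generally differ, which is the kind of comparison the abstract refers to.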
Similar Articles with Keyword 'Document clustering'
M.Tech / M.E / PhD Thesis, Computer Science & Engineering, India, Volume 4 Issue 6, June 2015, Pages: 2114 - 2117
Enhanced Document Clustering for Forensic Analysis
Rahul D. Kopulwar | Fazeel Irshad Zama
Research Paper, Computer Science & Engineering, India, Volume 4 Issue 4, April 2015, Pages: 1983 - 1986
An Improved Hierarchical Technique for Document Clustering
Priti B. Kudal | Prof. Manisha Naoghare