M.Tech / M.E / PhD Thesis | Information Technology | Burma | Volume 8 Issue 1, January 2019
Comparison of Keyword-based and Semantic-based Web Page Clustering Systems
Ei Ei Moe, Hnin Hnin Htun
Abstract: Today, web page clustering is useful for many applications such as categorization, cleaning, schema detection and automatic extractions. Web page clustering is classified into different categories that are hierarchical and flat clustering, online and offline clustering, soft and hard clustering, and document-based and keywords-based clustering. Among them, keyword-based web page clustering uses the single words or compounds words occurring in the web page set as the features for clustering. In this situation, these words cant precisely represent the content of the web page because the synonyms and polysemous of the word can lead the ambiguity problems. Semantic analysis is useful to solve this ambiguity problem. So, this system proposes both keyword-based and semantic-based web page clustering system, and then compares the performance between them. In the semantic analysis, words in each web page are first mapped to word senses by using supervised based word sense disambiguation method. Then, semantic-based web page clustering system uses both keywords and semantic features for clustering. After performing each cluster process, this system points out the semantic-based web page clustering system is more precise and effective than the keyword-based clustering system.
Keywords: Semantic, Word Sense Disambiguation, Clustering
Edition: Volume 8 Issue 1, January 2019,
Pages: 1511 - 1516
How to Cite this Article?
Ei Ei Moe, Hnin Hnin Htun, "Comparison of Keyword-based and Semantic-based Web Page Clustering Systems", International Journal of Science and Research (IJSR), https://www.ijsr.net/get_abstract.php?paper_id=ART20194450, Volume 8 Issue 1, January 2019, 1511 - 1516
How to Share this Article?
Similar Articles with Keyword 'Semantic'
Information Retrieval Using Semantic Distance between WordNet
Rahul Shirbhate, Vishal Mogal
Performance Comparison between Keyword-based and WQCA-based Information Retrieval System
Naw Thiri Wai Khin, Nyo Nyo Yee
Similar Articles with Keyword 'Clustering'
Improving Stability, Smoothing and Diversifying of Recommender Systems
Sagar Sontakke, Pratibha Chavan
Inverse Problem with Solution Using Data Mining
Ashmikumari Shah, Pooja Jardosh